UNIX Fault Management: A Guide for System Administrators

UNIX Fault Management: A Guide for System Administrators, 9780130265258 (013026525X), Prentice Hall, 1999

If you're responsible for maintaining the integrity and availability of a mission-critical UNIX system, then you need UNIX Fault Management: A Guide for System Administrators, the first book that brings together all of the monitoring and fault management information. Expert UNIX system management engineers Brad Stone and Julie Symons show you exactly how to implement appropriate, cost-effective system monitoring on any UNIX server -- including systems configured as high availability clusters. You'll learn how to:

Plan for -- and establish -- cost-effective, reliable system monitoring procedures
Monitor systems, disks, networks, applications, and databases
Detect, investigate, and recover from server problems
Implement best practices for high availability in enterprise-class UNIX installations -- including clusters
Take advantage of key fault management trends, new standards, and new technologies

This book contains detailed descriptions of fault monitoring tools and monitoring frameworks to help you make better purchasing decisions. You'll also find a handy quick reference of monitoring tasks and techniques for operators -- including specific, step-by-step recovery solutions. If you can't afford one nanosecond more downtime than necessary, you can't afford to be without UNIX Fault Management.

This book is intended for system administrators and operators who are responsible for maintaining the integrity and availability of mission-critical UNIX systems. The book provides a description of the fault monitoring tools and techniques available for UNIX servers, including systems that are configured as high availability clusters. It can therefore serve as a handy quick reference for an operator troubleshooting a problem in the customer environment: it points out where to find key diagnostic messages and describes how to take recovery actions.

A system administrator responsible for the initial configuration and administration of UNIX systems will also find this book useful because it describes the procedures to follow to set up the appropriate levels of system monitoring. The product descriptions can also help in making purchasing decisions as the customer determines the appropriate amount of event monitoring needed in their environment.

An overview of the tasks performed by an operator is provided, with details on how events are received and processed. The remainder of the book focuses on the types of events that can be received, how they are detected, how operators receive event notifications, and how problems can be investigated and recovery performed. The goal is to introduce the necessary tools, but not to show how every possible problem can be solved.

This book provides numerous descriptions of how fault management tools and products can be used to solve a variety of problems. Many of the chapters are focused on specific computer components, such as disks or databases, to be helpful to operators with specific roles.
