ShaoLin InfiniCluster 2.0 User Guide

2007-01-06

Revision History
Revision 1.002005-05-24
Release 1.00
Revision 1.012007-01-06
Release 1.01
Revision 1.022007-01-25
Release 1.02
Revision 1.032007-04-27
Release 1.03

Table of Contents
I. Basic Concept and Terminology
1. Overview of Cluster Technology
1.1. What is a Cluster?
1.1.1. Failure Detection
1.1.1.1. Application Failure Detection
1.1.1.2. Node Failure Detection
1.1.2. Switchover, Failover and Failback
1.1.2.1. Switchover
1.1.2.2. Failover
1.1.2.3. Failback
1.2. How to make Application "Clustered"?
1.2.1. Start, Stop and monitor procedures
1.2.1.1. Defined Start Procedure
1.2.1.2. Defined Stop Procedure
1.2.1.3. Defined Monitor Procedure
1.2.2. Known state
1.2.3. Store required data on shared disks
1.2.4. Adherence to system
2. General hardware configuration
2.1. General hardware configuration
2.2. Configuring Storage
2.3. Configuring SCSI storage
2.4. Configuring single initiator SCSI
2.5. Configuring multi initiator SCSI
2.6. Configure SCSI Host Bus Adapters (HBAs)
2.7. Configuring SCSI IDs
2.8. Configuring Fibre Channel
2.9. Configuring power supply for storage system
2.10. Internal Storage
2.10.1. Advantages
2.10.2. Disadvantages
3. Introduction to ShaoLin InfiniCluster
3.1. Terminology
3.2. Cluster Features
3.3. Understanding ShaoLin InfiniCluster Components
3.3.1. Resources
3.3.1.1. Resource Group
3.3.2. Service Monitoring Agents (SMA)
3.3.2.1. Agent Architecture
3.3.2.2. Agent Types
3.3.3. Cluster Infrastructure and Control
3.3.3.1. The ShaoLin General Parallel Cluster Infrastructure (SGPCI)
3.3.3.2. Load Balanced Communication
3.3.3.3. Reliable Data Communication
3.3.3.4. Kernel Watch Dog and Kernel Kill
3.3.4. Cluster Manager
3.3.4.1. Cluster Configuration Database (CCDB)
3.3.4.2. Cluster Administration Daemon (CAD)
3.3.4.3. Group Communication Service (GCS)
3.3.4.4. System Resource Management Service
3.3.4.5. SIGKILL and Cluster Manager
3.4. Cluster Administration and User Interface
3.4.1. Command Line Interface
3.4.2. Graphical User Interface (GUI)
II. Installation and configurations
4. Installing ShaoLin InfiniCluster
4.1. Preparation
4.2. System Requirement
4.2.1. Minimum System Requirement
4.2.1.1. Software Requirement
4.3. Installation Procedures
4.3.1. Installation ShaoLin InfiniCluster
4.3.2. Hardware Installation
4.3.3. Installation ShaoLin InfiniCluster (SIC)
4.4. Setup Node and Cluster
4.5. Join a Node to an Existing Cluster
4.6. About ShaoLin Administration System
4.6.1. Configuration of ShaoLin Administration System
4.6.2. Preparation
4.6.3. Installation ShaoLin Administration System Client (SASC)
4.6.4. Accessing ShaoLin Administration System Client
4.7. Tips About Installation and setup ShaoLin InfiniCluster
4.7.1. Installation
4.7.2. Setup
5. Registration and Support
5.1. Getting Support
5.2. Getting Update
III. Administration and Operations
6. Graphic User Interface
6.1. ShaoLin Administration System
6.1.1. SAS Client Main Window
6.1.1.1. ShortCut Bar Basics
6.1.2. Login Dialog Box
6.1.3. SAS Host Information
6.2. SAS Client Main Window
6.2.1. ShortCut Bar Basics
6.3. Login Dialog Box
6.4. SAS Host Information
6.5. SIC - SAS Client Administration Module (CAM)
6.5.1. Display Event Log
6.5.2. Logger Configuration
6.6. ICON Representations
6.6.1. Component Display
6.6.2. Operational status Display
6.6.3. Resources Types
6.7. Communication Channels
6.7.1. About Communication Channels
6.7.2. SIC - Communication Channel Configuration
6.7.2.1. SIC - Add a Communication Channel to Cluster
6.8. SIC - Cluster Member
6.8.1. SIC - Node Information
6.8.2. Member Status Information in Detail
6.8.3. Change Member Status
6.9. SIC - Resource Group and Resource
6.9.1. SIC - Resource Group List
6.9.1.1. Create Resource Group
6.9.2. Resource Group
6.9.3. Resource Monitoring
6.9.3.1. About Adding Resource
6.9.3.2. Resource
6.9.3.3. Overview of Different Resource
6.9.4. Enable Resource Group
7. SIC - Command Line Interface Administration
7.1. What is Command Line Interface (CLI)?
7.2. SIC Basic Operation Through CLI
7.3. Reference for SIC CLI Execution and Command
7.3.1. Global Configuration
7.3.2. Cluster Member Commands
7.3.3. Communication Channel Configurations
7.3.4. Resource Group Configurations
7.4. Operation With CLI
7.4.1. Verifying Cluster Environment
7.4.2. Add a Communication Channel to Cluster
7.4.3. Add Resource Group
7.4.4. Manual Takeover
IV. Advanced Clustering
8. ShaoLin InfiniCluster Service Monitoring Agents (SMA)
8.1. What is Service Monitoring Agent?
8.1.1. How SMA Work?
8.1.1.1. SMA Available
8.1.1.2. What are Resource Group and Resource?
8.2. Installation SMA
8.3. Multinic SMA
8.4. Configuration
8.5. Apache SMA
8.5.1. Release Note
8.5.2. Overview
8.5.3. Apache SMA Installation
8.5.3.1. Requirement
8.5.3.2. SMA Installation
8.5.3.3. SMA Configuration
8.5.3.4. Remove Apache SMA
9. Configure Multi-instant Parallel Applications
9.1. What is Parallel Application/Service?
9.2. How ShaoLin InfiniCluster Handles Parallel Services
9.2.1. Parallel Resource Group Operations
9.2.2. Sample Parallel Application
9.2.3. Considerations of Parallel Services
9.2.3.1. Cluster Awareness of Resources
9.2.3.2. Collision of resources
9.2.3.3. Licensing Issues
V. Troubleshooting and Recovery
10. Troubleshooting and Recovery
10.1. Troubleshooting
10.2. Recovery
11. Cluster Logs and Events
11.1. Event Log Code
11.1.1. Event Log Code start from 1000
11.1.2. Event Log Code start from 2000
11.1.3. Event Log Code start from 3000
11.1.4. Event Log Code start from 4000
11.1.5. Event Log Code start from 5000
12. Frequent Ask Questions
12.1. Cluster Setting
12.2. Resource and Resource Group
12.3. Split Brain Handling
13. Cluster Errors
13.1. Fatal Errors
13.2. Runtime Errors
Index
List of Tables
6-1. List of Functions
6-2. List of Functions
6-3. Items of Host Table
6-4. Items of Logger Form
6-5. List of Components
6-6. Operational Status
6-7. List of Resources
6-8. View format of Communication Channels
6-9. Different Types of Communication Channels
6-10. Communication Channels Main Panel
6-11. Change Member Status
6-12. Create Resource Group
6-13. Resource Window
6-14. Resources
8-1. List of Available Resources Type
List of Figures
1-1. Switchover, Failover and Failback
4-1. Use console with command "rpm -ivh ShaoLin-Administration_System-client-1.0-1.i586.rpm
4-2. Execute SASC by using command "/opt/slas/bin/slasc"
4-3. A screen for downloading the SAS client
4-4. To connect to SIC.
4-5. Shows the progress status.
4-6. SASC is connected to the cluster.
6-1. SAS Client Main Window
6-2. Login Dialog Box
6-3. Host information on "Working Area".
6-4. SAS Client Main Window
6-5. Login Dialog Box
6-6. Host information on "Working Area".
6-7. Event Log Display
6-8. Logger Configuration
6-9. Communication Channels Main Panel
6-10. Individual Communication Channel on Main Panel
6-11. Create New Communication Channel" Screen
6-12. Add Member Dialog Box
6-13. Finish Setup New Communication Channel
6-14. Viewing Member list as icon
6-15. Viewing Member list as list
6-16. Detail about member Information
6-17. Create Resource Group
6-18. Detail about Resource Group
6-19. Add Member to the Resource Group
6-20. Change Node Priority
6-21. Adding Resource
6-22. Applications
6-23. Filesystem
6-24. IP Address
6-25. Logical Volume Manager
6-26. Multiple Disk
6-27. Network Interface
6-28. System Health Monitoring
8-1. Create New Resource Groups Window
8-2. Create New Resource Groups Window
9-1. Sample Parallel Application Setup