T1: Towards Highly Available, Scalable, and Secure HPC Clusters with HA-OSCAR
09/14/2006, 2:00 PM - 5:45 PM

Speaker:
Ibrahim Haddad, Strategic Program Manager, Open Source Development Labs, Inc.

The HA-OSCAR open source project aims to combine high availability and performance for mission-critical applications. It enhances a Beowulf cluster system with high-availability mechanisms such as component redundancy, self-healing, and failure detection and recovery, and it supports automatic failover and fail-back. Version 1.0, released in March 2004, provides an installation wizard GUI and a Web-based administration tool for intuitive creation and configuration of a multi-head Beowulf cluster. A default set of monitoring services ensures the availability of critical services, hardware components, and cluster resources at the control node. HA-OSCAR also supports new tailored services, which can be configured and added through a Webmin-based HA-OSCAR administration tool. This tutorial addresses design and implementation issues in building highly available Linux Beowulf clusters. The discussion will cover the architecture of HA-OSCAR, new features of the latest release, HA and security features, and real-world tests of the software's performance and availability.

© 2008 IDG WORLD EXPO CORP. ALL RIGHTS RESERVED