Three of the most important features in compute node design
are reliability, availability, and serviceability (RAS). These RAS
features help to ensure the integrity of the data that is stored in
the compute node, the availability of the compute node when you need
it, and the ease with which you can diagnose and correct problems.
The compute node has the following RAS features:
- Advanced Configuration and Power Interface (ACPI)
- Automatic server restart (ASR)
- Built-in diagnostics using Dynamic System Analysis (DSA) Preboot
- Built-in monitoring for temperature, voltage, and hard disk drives
- Customer support center 24 hours per day, 7 days a week
- Customer upgrade of flash ROM-resident code and diagnostics
- Customer-upgradeable Unified Extensible Firmware Interface (UEFI)
code and diagnostics
- ECC-protected DDR3 DIMMs
- ECC protection on the L2 cache
- Error codes and messages
- Integrated management module II (IMM2) that communicates with
the Chassis Management Module to enable remote systems management
- Light path diagnostics
- Memory mirroring mode support
- Microprocessor built-in self-test (BIST) during power-on self-test
(POST)
- Microprocessor Intel QuickPath Interconnect bus and CRC checking
- Microprocessor serial number access
- PCI bus even parity checking
- POST
- Power policy
- Processor presence detection
- Rank-sparing mode support
- ROM-resident diagnostics
- System-error logging
- Vital product data (VPD) on memory
- Wake on LAN capability
- Wake on PCI (PCIe Expansion Node) capability
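
Several of the features listed above, such as even parity checking, rely on redundant check bits to detect corrupted data. The following is a minimal Python sketch of the general even-parity idea only. It does not describe how the compute node hardware implements the check, and the function names `even_parity_bit` and `check_even_parity` are hypothetical names chosen for illustration.

```python
def even_parity_bit(data: int) -> int:
    """Return the parity bit that makes the total count of 1 bits even.

    Assumes data is a non-negative integer.
    """
    return bin(data).count("1") % 2


def check_even_parity(data: int, parity: int) -> bool:
    """Return True when data plus its parity bit hold an even number of 1 bits."""
    return (bin(data).count("1") + parity) % 2 == 0


# Sender side: compute the parity bit for an 8-bit word.
word = 0b10110010                 # contains four 1 bits
parity = even_parity_bit(word)

# Receiver side: an intact word passes the check.
intact_ok = check_even_parity(word, parity)

# Flipping a single bit in transit makes the check fail.
corrupted = word ^ 0b00001000
corrupted_ok = check_even_parity(corrupted, parity)
```

A single parity bit detects any odd number of flipped bits but misses an even number of flips, which is one reason stronger schemes such as CRC and ECC are used alongside it.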