HPC Infrastructure Operations
Provides 24 hour/day, 7 day/week, and 365 day/year support of NCCS data centers.
The HPC Infrastructure Operations Group of the Systems Section provides 24 hour/day, 7 day/week, and 365 day/year support of NCCS data centers. The group monitors the status of key systems, receives system alarms, triages the alarms, and repairs/initiates repairs to the system. In addition, the group maintains the configuration management of the data center assets, oversees ongoing facility mechanical and hardware related work and ensures that all work is conducted in the data centers is performed in accordance with the safe conduct of research principles. Team members work with center manager, facility personnel, and the system admins on the flow of material in/out of the facility, storage, and maintenance. Finally, the group provides the first line of defense in the recognition of upset conditions and the response to those conditions, and strives to minimizes downtime or to promote rapid return to normal conditions after an adverse event occurs.