Design and Implementation of Broadcast Algorithms for Extreme-Scale Systems Conference Paper September, 2011
A case for Virtual Machine based Fault Injection in a High-Performance Computing Environment... Conference Paper August, 2011
Realization of User Level Fault Tolerant Policy Management through a Holistic Approach for Fault Correlation... Conference Paper June, 2011
User Application Monitoring through Assessment of Abnormal Behaviours Recorded in RAS Logs... Conference Paper May, 2011
Providing Runtime Clock Synchronization With Minimal Node-to-Node Time Deviation on XT4s and XT5s... Conference Paper May, 2011
ConnectX-2 CORE-Direct Enabled Asynchronous Broadcast Collective Communications... Conference Paper May, 2011