Elnozahy's research area is in systems, including high performance computing, power-aware computing, fault tolerance, operating systems, system architecture, and distributed systems. His work on rollback-recovery is now a standard component of graduate courses in fault-tolerant computing, and he has made seminal contributions in checkpoint/restart, and in general on the complex hardware-software interactions in resilience.
Selected Publications
Melhem R; Mosse D; Elnozahy E. "The Interplay of Power Management and Fault Recovery in Real-Time Systems", IEEE Transactions on Computers, vol. 53, no. 2, pp. 217—231, February 2004.
Elnozahy, E.N., Speight, E., Li, J., Rajamony, R., Zhang, L., Arimilli, L.B. "PERCS System Architecture", Encyclopedia of Parallel Computing, Springer Verlag, pp. 1506-1515, 2011.
Elnozahy EN; Plank JS. "Checkpointing for Peta-Scale Systems: A Look into the Future of Practical Rollback-Recovery", IEEE Transactions on Dependable and Secure Computing, vol. 1, no. 2, pp. 97—108, February 2004.
Elnozahy EN; Alvisi L; Wang YM; et al. "A Survey of Rollback-Recovery Protocols in Message Passing Systems", ACM Computing Surveys, vol. 34, no. 3, September 2002.
Elnozahy EN; Zwaenepoel W. "Manetho: Transparent Rollback-Recovery with Low Overhead, Limited Rollback and Fast Output Commit", IEEE Transactions on Computers, Special Issue on Fault-Tolerant Computing, 41(5): 526—531, May 1992.