Publications
A Generic and Efficient Framework for Estimating Lossy Compressibility of Scientific Data
Md Hasanur Rahman, Sheng Di, Guanpeng Li, and Franck Cappello
[MSST’24][Lossy Compression]: The 38th International Conference on Massive Storage Systems and Technology, 2024
[Paper] [Slides] [Code] [Cite]
Investigating The Impact of Transient Hardware Faults on Deep Learning Neural Network Inference
Md Hasanur Rahman, Sabuj Laskar, and Guanpeng Li
[STVR’24][Fault Tolerance]: Software Testing, Verification and Reliability, 2024
[Paper] [Slides] [Code] [Cite]
DRUTO: Upper-Bounding Silent Data Corruption Vulnerability in GPU Applications
Md Hasanur Rahman, Sheng Di, Shengjian Guo, Xiaoyi Lu, Guanpeng Li, and Franck Cappello
[IPDPS’24][Fault Tolerance]: The 38th IEEE International Parallel & Distributed Processing Symposium (IPDPS), 2024
[Paper] [Slides] [Code] [Cite]
A Feature-Driven Fixed-Ratio Lossy Compression Framework for Real-World Scientific Datasets
Md Hasanur Rahman, Sheng Di, Kai Zhao, Robert Underwood, Guanpeng Li, and Franck Cappello
[ICDE’23][Lossy Compression]: The 39th IEEE International Conference on Data Engineering, 2023 (Acceptance rate: 19.1% past years)
[Paper] [Slides] [Code] [Cite]
Characterizing Deep Learning Neural Network Failures between Algorithmic Inaccuracy and Transient Hardware Faults
Sabuj Laskar, Md Hasanur Rahman, Bohan Zhang, and Guanpeng Li
[PRDC’22][Fault Tolerance]: The 27th IEEE Pacific Rim International Symposium on Dependable Computing, 2022
[Paper] [Slides] [Code] [Cite]
TensorFI+: A Scalable Fault Injection Framework for Modern Deep Learning Neural Networks
Sabuj Laskar, Md Hasanur Rahman, and Guanpeng Li
[ISSRE-W’22][Fault Tolerance]: Workshop on Resiliency, Security, Defenses, and Attacks (RSDA) at IEEE International Symposium on Software Reliability Engineering, 2022
[Paper] [Slides] [Code] [Cite]
PEPPA-X: finding program test inputs to bound silent data corruption vulnerability in HPC applications
Md Hasanur Rahman, Aabid Shamji, Shengjian Guo, and Guanpeng Li
[SC’21][Fault Tolerance]: International Conference for High Performance Computing, Networking, Storage and Analysis, 2021 (Acceptance rate: 23.6%)
[Paper] [Slides] [Code] [Cite] Received all three artifacts badges
|