[Defense] Parallel I/O on Compressed Data Files
Friday, April 22, 2022
3:00 pm - 4:30 pm
In
Partial
Fulfillment
of
the
Requirements
for
the
Degree
of
Doctor
of
Philosophy
Siddhesh
Singh
will
defend
his
dissertation
Parallel
I/O
on
Compressed
Data
Files
Abstract
The increase in processing power of modern computing hardware has not been accompanied by a proportional increase in the performance of storage technology leading to an imbalance in cluster and parallel computing architectures where input-output (I/O) operations may bottleneck the overall performance of the system. This makes necessary the use of sophisticated software solutions to overcome limitations on I/O performance. One method is to apply specialized algorithms in parallel I/O to optimize data transfer. Another solution to this problem is to use data compression to effectively reduce the amount of data which is transferred between processing and storage units. An under examined area of research is the intersection of parallel I/O and data compression and how these two techniques can be combined in High Performance Computing (HPC) environments. This thesis presents a general model for incorporating data compression within existing parallel I/O algorithms and evaluates the performance benefits obtained through performing parallel I/O on compressed data files. In particular, the thesis presents an Open MPI-I/O (OMPIO) implementation which incorporates arbitrary compression libraries within the two phase I/O algorithm through a new file format. The results indicate significant performance and space saving benefits through this approach and the parallel compression semantics presented in this thesis provide a theoretical basis for future research in parallel I/O and data compression.
Friday,
April
22,
2022
3PM
-
4:30PM
CT
Online
via
Dr. Jaspal Subhlok (for Dr. Edgar Gabriel), dissertation advisor
Faculty, students and the general public are invited.
