Improving parallelism of scientific and engineering applications on heterogeneous supercomputers
dc.rights.license | CC BY-NC-ND. Users may download and share copies with attribution in accordance with a Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 license. No commercial use or derivatives are permitted without the explicit approval of the author. | |
dc.contributor | Slota, George M. | |
dc.contributor | Cutler, Barbara M. | |
dc.contributor | Sahni, Onkar | |
dc.contributor | Smith, Cameron W. | |
dc.contributor.advisor | Shephard, M. S. (Mark S.) | |
dc.contributor.author | Diamond, Gerrett | |
dc.date.accessioned | 2022-09-14T19:26:12Z | |
dc.date.available | 2022-09-14T19:26:12Z | |
dc.date.issued | 2021-08 | |
dc.identifier.uri | https://hdl.handle.net/20.500.13015/6110 | |
dc.description | August 2021 | |
dc.description | School of Science | |
dc.description.abstract | The rising usage of heterogeneous supercomputers introduces both opportunities for increased parallelism and challenges for efficient usage of the available hardware. Applications running on heterogeneous supercomputers must adopt new methods to achieve performance across two levels of parallelism. Inter-process parallelism defines coordination between processes and intra-process parallelism within each process. This thesis presents research towards improving inter-process and intra-process parallelism for applications that use complex data structures such as distributed unstructured meshes. Inter-process parallelism is defined by the coupled costs of the partition of load between processes and the communications between processes required as a result of the partition. To achieve optimal performance, partitions must divide computational load evenly between processes while minimizing the additional costs of communications. This thesis addresses improving inter-process parallelism using multicriteria partition improvement multicriteria methods on a generalized structure for a broad set of potential applications. The partition improvement methods are applied to different unstructured mesh setups with partitions up to half a million processes. In the case of heterogeneous supercomputers, intra-process parallelism is dictated by the parallel hardware available to each process for performing computations. For most of the current and next generation US systems, Graphic Processing Units (GPUs) are the parallel hardware available on each node. This thesis addresses methods for intra-process parallelism in the scope of particle-in-cell simulations with a novel approach to the storage of the unstructured mesh and the particles for optimized performance on GPUs while utilizing performance-portable methods for performance on future hardware. Scaling studies of these methods are presented up to 4096 nodes of the Summit supercomputer with over a trillion particles simulated. | |
dc.language | ENG | |
dc.language.iso | en_US | |
dc.publisher | Rensselaer Polytechnic Institute, Troy, NY | |
dc.relation.ispartof | Rensselaer Theses and Dissertations Online Collection | |
dc.rights | Attribution-NonCommercial-NoDerivs 3.0 United States | * |
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/3.0/us/ | * |
dc.subject | Computer science | |
dc.title | Improving parallelism of scientific and engineering applications on heterogeneous supercomputers | |
dc.type | Electronic thesis | |
dc.type | Thesis | |
dc.date.updated | 2022-09-14T19:26:15Z | |
dc.rights.holder | This electronic version is a licensed copy owned by Rensselaer Polytechnic Institute (RPI), Troy, NY. Copyright of original work retained by author. | |
dc.description.degree | PhD | |
dc.relation.department | Dept. of Computer Science |
Files in this item
This item appears in the following Collection(s)
-
RPI Theses Online (Complete)
Rensselaer theses from 2006; many restricted to current RPI Students, Faculty and Staff -
RPI Theses Open Access
Rensselaer Theses and Dissertations with Creative Commons Licenses
Except where otherwise noted, this item's license is described as CC BY-NC-ND. Users may download and share copies with attribution in accordance with a Creative Commons
Attribution-Noncommercial-No Derivative Works 3.0 license. No commercial use or derivatives
are permitted without the explicit approval of the author.