• Login
    View Item 
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • RPI Theses Online (Complete)
    • View Item
    •   DSpace@RPI Home
    • Rensselaer Libraries
    • RPI Theses Online (Complete)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Performance analysis and visualization tools to support the codesign of next generation computer systems

    Author
    Ross, Caitlin J.
    View/Open
    179847_Ross_rpi_0185E_11596.pdf (14.63Mb)
    Other Contributors
    Carothers, Christopher D.; Cutler, Barbara M.; Ross, Robert B., 1972-; Shephard, M. S. (Mark S.);
    Date Issued
    2019-08
    Subject
    Computer science
    Degree
    PhD;
    Terms of Use
    This electronic version is a licensed copy owned by Rensselaer Polytechnic Institute, Troy, NY. Copyright of original work retained by author.;
    Metadata
    Show full item record
    URI
    https://hdl.handle.net/20.500.13015/2459
    Abstract
    To better our understanding of optimistic PDES, a dynamic instrumentation layer was introduced into the ROSS simulation framework that allows model developers to collect a variety of metrics across the model and simulation engine software layers. Because the instrumentation has the potential to collect large amounts of data that is infeasible for either storing to disk or transferring over a network from the supercomputer running the simulation to another system for analysis, we also developed the ROSS In Situ Analysis system (RISA) that can perform data reduction while the simulation data resides in memory. We demonstrate the usefulness of our instrumentation and analysis tools by performing visual analyses of high performance computing (HPC) network models built on top of the ROSS framework. With the visual analysis, we are able to find load and communication imbalances in the simulation and determine their causes. In addition, we perform perturbation studies of both the ROSS instrumentation and RISA. This compares instrumented and non-instrumented simulations to ensure that these tools do not significantly affect simulation performance nor introduce new performance bottlenecks.; Discrete event simulation is a cost-effective tool for exploring the design space of next generation computer systems. Optimistic synchronization algorithms for PDES, such as Time Warp, allow for a model's inherent parallelism to be discovered using an out-of-order event detection and recovery scheme. When events are processed out of timestamp order, the simulation is rolled back to a prior state and events are re-executed in the correct order. Although optimistic protocols can be highly scalable, optimizing optimistic simulations to minimize time spent performing rolling backs is not a trivial task due to the number of factors that can affect the rollback behavior of the simulation.; In this work, we demonstrate the efficacy of discrete event simulation in evaluating and improving the performance of parallel and distributed scientific analysis systems, such as the MG-RAST metagenomics analysis service provided by Argonne National Laboratory. We propose hardware and job scheduling changes to their system that can improve scalability under increased user workloads that are anticipated in the future. We use event-driven simulation to evaluate the proposed changes and compare them to the current infrastructure and job scheduling policies. However, the simulation exhibits poor parallel performance, which limits the size of the workloads able to be simulated for MG-RAST. This highlights the need for scalable analysis and visualization tools for use in optimistic PDES that can be used to gain insights to their rollback behavior and performance.; Finally, we also explore the use of three-dimensional animations for understanding both the time series model data as well as optimistic PDES performance of the CODES network models. Typically these simulations are visualized using information visualization techniques such as parallel coordinates and radial diagrams. However, adding spatial data to the compute nodes and routers of the HPC networks enables the visualization of simulation data in a context familiar with simulation users, such as network architects. Replaying time series model data, such as network congestion, over the network visualizations has helped to provide insight to hotspots that occur in HPC networks during simulation, and enable a visual comparison of different networks.;
    Description
    August 2019; School of Science
    Department
    Dept. of Computer Science;
    Publisher
    Rensselaer Polytechnic Institute, Troy, NY
    Relationships
    Rensselaer Theses and Dissertations Online Collection;
    Access
    Restricted to current Rensselaer faculty, staff and students. Access inquiries may be directed to the Rensselaer Libraries.;
    Collections
    • RPI Theses Online (Complete)

    Browse

    All of DSpace@RPICommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsThis CollectionBy Issue DateAuthorsTitlesSubjects

    My Account

    Login

    DSpace software copyright © 2002-2022  DuraSpace
    Contact Us | Send Feedback
    DSpace Express is a service operated by 
    Atmire NV