Author
Gu, Deyang
Other Contributors
Carothers, Christopher D.; Shephard, M. S. (Mark S.); Anshelevich, Elliot; Xu, Xie George;
Date Issued
2014-05
Subject
Computer science
Degree
MS;
Terms of Use
This electronic version is a licensed copy owned by Rensselaer Polytechnic Institute, Troy, NY. Copyright of original work retained by author.;
Abstract
A vectorized implementation for Klein-Nishina equation together with a simple test program using assembly language is developed in this thesis. Using this implementation, we are able to observe great speedup on the Intel® Xeon Phi™ coprocessor. While our implementation keeps the algorithm as a time complexity of O(n), we successfully controlled the growth rate of the algorithm, thus gaining a speedup up to 14x. In the comparison with compiler-generated assembly code from C source code, we found out the automatically generated code is not able to vectorize the execution process while we are able to vectorize it using various instructions such as blending instructions and gather/scatter combination instructions. We believe this is the main reason for the speedup we have observed. In addition, the frequent data conversion performed by the compiler also slowed down the calculation.;
Description
May 2014; School of Science
Department
Dept. of Computer Science;
Publisher
Rensselaer Polytechnic Institute, Troy, NY
Relationships
Rensselaer Theses and Dissertations Online Collection;
Access
Restricted to current Rensselaer faculty, staff and students. Access inquiries may be directed to the Rensselaer Libraries.;