dc.contributor
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor
Barcelona Supercomputing Center
dc.contributor
Universitat Politècnica de Catalunya. CAP - Grup de Computació d'Altes Prestacions
dc.contributor.author
Criado, Joel
dc.contributor.author
Garcia Gasulla, Marta
dc.contributor.author
Labarta Mancho, Jesús José
dc.contributor.author
Chatterjee, Arghya
dc.contributor.author
Hernández, Óscar
dc.contributor.author
Sirvent, Raül
dc.contributor.author
Álvarez, Gonzalo
dc.identifier
Criado, J. [et al.]. Optimization of condensed matter physics application with OpenMP tasking model. A: International Workshop on OpenMP. "OpenMP: Conquering the Full Hardware Spectrum: 15th International Workshop on OpenMP, IWOMP 2019: Auckland, New Zealand: September 11-13, 2019: proceedings". Berlín: Springer, 2019, p. 291-305.
dc.identifier
978-3-030-28595-1
dc.identifier
https://hdl.handle.net/2117/186453
dc.identifier
10.1007/978-3-030-28596-8_20
dc.description.abstract
The Density Matrix Renormalization Group (DMRG++) is a condensed matter physics application used to study superconductivity properties of materials. It’s main computations consist of calculating hamiltonian matrix which requires sparse matrix-vector multiplications. This paper presents task-based parallelization and optimization strategies of the Hamiltonian algorithm. The algorithm is implemented as a mini-application in C++ and parallelized with OpenMP. The optimization leverages tasking features, such as dependencies or priorities included in the OpenMP standard 4.5. The code refactoring targets performance as much as programmability. The optimized version achieves a speedup of 8.0 × with 8 threads and 20.5 × with 40 threads on a Power9 computing node while reducing the memory consumption to 90 MB with respect to the original code, by adding less than ten OpenMP directives.
dc.description.abstract
This work is partially supported by the Spanish Government through Programa Severo Ochoa (SEV2015-0493), by the Spanish Ministry of Science and Technology (project TIN2015-65316-P), by the Generalitat de Catalunya (contract 2017-SGR-1414) and by the BSC-IBM Deep Learning Research Agreement, under JSA “Application porting, analysis and optimization for POWER and POWER AI”. This work was partially supported by the Scientific Discovery through Advanced Computing (SciDAC) program funded by U.S. Department of Energy, Office of Science, Advanced Scientific Computing Research and Basic Energy Sciences, Division of Materials Sciences and Engineering. This research used resources of the Oak Ridge Leadership Computing Facility at the Oak Ridge National Laboratory, which is supported by the Office of Science of the U.S. Department of Energy under Contract No. DE-AC05-00OR22725.
dc.description.abstract
Peer Reviewed
dc.description.abstract
Postprint (author's final draft)
dc.format
application/pdf
dc.relation
https://link.springer.com/chapter/10.1007/978-3-030-28596-8_20
dc.relation
info:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/
dc.relation
info:eu-repo/grantAgreement/AGAUR/2017 SGR 1414
dc.relation
info:eu-repo/grantAgreement/MINECO//SEV-2015-0493/ES/BARCELONA SUPERCOMPUTING CENTER - CENTRO. NACIONAL DE SUPERCOMPUTACION/
dc.subject
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject
Application program interfaces (Computer software)
dc.subject
Parallel programming (Computer science)
dc.subject
Condensed matter
dc.subject
Interfícies de programació d'aplicacions (Programari)
dc.subject
Programació en paral·lel (Informàtica)
dc.subject
Matèria condensada
dc.title
Optimization of condensed matter physics application with OpenMP tasking model
dc.type
Conference report