dc.contributor
Universitat Politècnica de Catalunya. Doctorat en Arquitectura de Computadors
dc.contributor
Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors
dc.contributor
Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors
dc.contributor.author
Joseph, Diya
dc.contributor.author
Aragón Alcaraz, Juan Luis
dc.contributor.author
Parcerisa Bundó, Joan Manuel
dc.contributor.author
González Colás, Antonio María
dc.identifier
Joseph, D. [et al.]. Boustrophedonic frames: Quasi-optimal L2 caching for textures in GPUs. A: International Conference on Parallel Architectures and Compilation Techniques. "2023 32nd International Conference on Parallel Architecture and Compilation Techniques, PACT 2023: Vienna, Austria, 21-25 October 2023: proceedings". Institute of Electrical and Electronics Engineers (IEEE), 2023, p. 124-136. ISBN 979-8-3503-4254-3. DOI 10.1109/PACT58117.2023.00019.
dc.identifier
979-8-3503-4254-3
dc.identifier
https://hdl.handle.net/2117/403438
dc.identifier
10.1109/PACT58117.2023.00019
dc.description.abstract
Literature is plentiful in works exploiting cache locality for GPUs. A majority of them explore replacement or bypassing policies. In this paper, however, we surpass this exploration by fabricating a formal proof for a no-overhead quasi-optimal caching technique for caching textures in graphics workloads. Textures make up a significant part of main memory traffic in mobile GPUs, which contributes to the total GPU energy consumption. Since texture accesses use a shared L2 cache, improving the L2 texture caching efficiency would decrease main memory traffic, thus improving energy efficiency, which is crucial for mobile GPUs. Our proposal reaches quasi-optimality by exploiting the frame-to-frame reuse of textures in graphics. We do this by traversing frames in a boustrophedonic 1 1 Boustrophedon is a style of writing in which alternate lines of writing are reversed in order. This is in contrast to most modern languages, where the order of lines is the same, usually left-to-right. manner w.r.t. the frame-to-frame tile order. We first approximate the texture access trace to a circular trace and then forge a formal proof for our proposal being optimal for such traces. We also complement the proof with empirical data that demonstrates the quasi-optimality of our no-cost proposal.
dc.description.abstract
This work has been supported by the CoCoUnit ERC Advanced Grant of the EU’s Horizon 2020 program (grant No 833057), the Spanish State Research Agency (MCIN/AEI) under grant PID2020-113172RB-I00, the ICREA Academia program and the AGAUR grant 2020-FISDU-00287.
dc.description.abstract
Peer Reviewed
dc.description.abstract
Postprint (author's final draft)
dc.format
application/pdf
dc.publisher
Institute of Electrical and Electronics Engineers (IEEE)
dc.relation
https://ieeexplore.ieee.org/document/10364604
dc.relation
info:eu-repo/grantAgreement/EC/H2020/833057/EU/CoCoUnit: An Energy-Efficient Processing Unit for Cognitive Computing/CoCoUnit
dc.relation
info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/PID2020-113172RB-I00/ES/ARQUITECTURAS DE DOMINIO ESPECIFICO PARA SISTEMAS DE COMPUTACION ENERGETICAMENTE EFICIENTES/
dc.subject
Àrees temàtiques de la UPC::Informàtica::Arquitectura de computadors
dc.subject
Memory management (Computer science)
dc.subject
Graphics processing units
dc.subject
Energy consumption
dc.subject
Gestió de memòria (Informàtica)
dc.subject
Unitats de processament gràfic
dc.subject
Energia -- Consum
dc.title
Boustrophedonic frames: Quasi-optimal L2 caching for textures in GPUs
dc.type
Conference report