Global productiveness propagation: A code optimization technique to speculatively prune useless narrow computations

Bhagat, Indu; Gibert Codina, Enric; Sanchez, Jesus; González Colás, Antonio María

Global productiveness propagation: A code optimization technique to speculatively prune useless narrow computations

dc.contributor

Universitat Politècnica de Catalunya. Departament d'Arquitectura de Computadors

dc.contributor

Universitat Politècnica de Catalunya. ARCO - Microarquitectura i Compiladors

dc.contributor.author

Bhagat, Indu

dc.contributor.author

Gibert Codina, Enric

dc.contributor.author

Sanchez, Jesus

dc.contributor.author

González Colás, Antonio María

dc.date.issued

2011

dc.identifier

Bhagat, I. [et al.]. Global productiveness propagation: A code optimization technique to speculatively prune useless narrow computations. A: ACM SIGPLAN/SIGBED Conference on Languages Compilers, Tools, and Theory for Embedded Systems. "2011 ACM SIGPLAN/SIGBED Conference on Languages Compilers, Tools, and Theory for Embedded Systems". ACM Press, NY, 2011, p. 161-170.

dc.identifier

https://hdl.handle.net/2117/13872

dc.description.abstract

This paper proposes a unique hardware-software collaborative strategy to remove useless work at 16-bit data-width granularity. The underlying motivation is to design a low power execution platform by exploiting ‘narrow’ computations. The proposal uses a strictly narrow bit-wide microarchitecture (16-bit integer datapath), which realizes the goal of a low cost, low hardware complexity, low power execution engine. Software dynamically maps the 64-bit computations by translating them into an equivalent 16-bit instruction stream and optimizing them. In this paper, we propose an optimization technique, called Global Productiveness Propagation (GPP), which is a dynamic, profile-based optimization technique that infers the minimum required dataflow by pruning narrow computations that are mostprobably useless (non-productive). More precisely, GPP speculatively prunes the static backward slices of selected narrow computations: computations that result in the same value (in their respective storage location) as that at the input of the region. This speculative optimization technique is formulated around the concept of ‘narrow’ computations because the same allow a finer granularity to distinguish between useful (productive) and useless (nonproductive) work. GPP has been evaluated on an in-order narrow bit-wide execution core, achieving an average dynamic instruction stream reduction of 6.6%, while improving overall performance by 4.2%.

dc.description.abstract

Peer Reviewed

dc.description.abstract

Postprint (published version)

dc.format

10 p.

dc.format

application/pdf

dc.language

eng

dc.publisher

ACM Press, NY

dc.rights

Restricted access - publisher's policy

dc.subject

Àrees temàtiques de la UPC::Informàtica::Hardware

dc.subject

Narrow bitwide computation

dc.subject

Profile-guided optimization

dc.subject

PGO

dc.subject

Compilers (Computer programs)

dc.subject

Compiladors (Programes d'ordinador)

dc.title

Global productiveness propagation: A code optimization technique to speculatively prune useless narrow computations

dc.type

Conference report

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

E-prints [72954]