HARPA - Harnessing Performance Variability
Ricerca UE FP7
Ruolo DEIB: Coordinatore
Data inizio: 01/09/2013
Durata: 36 mesi
Sommario
Application requirements, power, and technological constraints are driving the architectural convergence of future processors towards heterogeneous many-cores. This development is confronted with variability challenges, mainly the susceptibility to time-dependent variations in silicon devices. Increasing guard-bands to battle variations is not scalable, due to the too large worst-case cost impact for technology nodes around 10 nm. The goal of HARPA is to enable next-generation embedded and high-performance heterogeneous many-cores to cost-effectively confront variations by providing Dependable-Performance: correct functionality and timing guarantees throughout the expected lifetime of a platform under thermal, power, and energy constraints. The HARPA solution employs a cross-layer approach. A middleware implements a control engine that steers software/hardware knobs based on information from strategically dispersed monitors. This engine relies on technology models to identify/exploit various types of platform slack - performance, power/energy, thermal, lifetime, and structural (hardware) - to restore timing guarantees and ensure the expected lifetime amidst time-dependent variations. Dependable-Performance is critical for embedded applications to provide timing correctness; for high-performance applications, it is paramount to ensure load balancing in parallel phases and fast execution of sequential phases. The lifetime requirement has ramifications on the manufacturing process cost and the number of field-returns. The HARPA novelty is in seeking synergies in techniques that have been considered virtually exclusively in the embedded or high-performance domains (worst-case guaranteed partly proactive techniques in embedded, and dynamic best-effort reactive techniques in high-performance). HARPA will demonstrate the benefits of merging concepts from these two domains by evaluating key applications from both segments running on embedded and high-performance platforms.
Risultati del progetto ed eventuali pubblicazioni scientifiche/brevetti
Journals:
- D. Zoni, W. Fornaciari, "Modeling DVFS and power gating actuators for cycle accurate NoC-based simulators", in ACM Journal on Emerging Technologies in Computing Systems, 2015.
- Patrick Bellasi, Giuseppe Massari, and William Fornaciari. 2015. "Effective Runtime Resource Management Using Linux Control Groups with the BarbequeRTRM Framework". ACM Trans. Embed. Comput. Syst. 14, 2, Article 39 (March 2015), 17 pages.
- Dimitrios Rodopoulos, Georgia Psychou, Mohamed M. Sabry, Francky Catthoor, Antonis Papanikolaou, Dimitrios Soudris, Tobias G. Noll, and David Atienza. 2015. "Classification Framework for Analysis and Modeling of Physically Induced Reliability Violations". ACM Comput. Surv. 47, 3, Article 38 (February 2015), 33 pages.
- Rodopoulos, D.; Papanikolaou, A.; Catthoor, F.; Soudris, D., "Demonstrating HW-SW Transient Error Mitigation on the Single-Chip Cloud Computer Data Plane," IEEE Transactions on Very Large Scale Integration (VLSI) Systems, vol.23, no.3, pp.507-519, March 2015
- Rodopoulos, D.; Catthoor, F.; Soudris, D., "Tackling Performance Variability due to RAS Mechanisms with PID-Controlled DVFS," Computer Architecture Letters, vol.PP, no.99, pp.1, December 2014.