Georgia Tech > CoE > ECE > MARS [ MARS | People | Research | Publications | Software | Internal ]

POD: A Highly-Efficient 3D-Integrated
Heterogeneous Many-core Architecture


Faculty

Hsien-Hsin Sean Lee

Graduate Students

Dong Hyuk Woo

External Collaborators

Marsha Eng (Intel Corporation)
Joshua B. Fryman (Intel Corporation)
Allan D. Knies (Intel Research Berkeley)

Sponsor

NSF CPA

Description

To feature hundreds or even thousands of processing cores on a single-die many-core processor, the challenges of energy consumption and performance scalability must be addressed within a given die area budget. Current commercial designs focus on MIMD-style multicores built with rather complex cores. While such designs provide a degree of generality, they may not be the most efficient way to build processors for applications with inherently scalable parallelism.

In this research, we revisit the massively parallel SIMD processing paradigm with modern process technology and present a highly efficient on-chip accelerator architecture called Parallel-On-Die (or POD). Leveraging from emerging 3D integration, POD is designed to address the key challenges of overall chip power, on-chip communication bandwidth, area limitations, and energy consumed by routers by factoring out features necessary for normal MIMD-style many-core platform and focusing on architectures that match scalable workloads. Its deterministic computation and communication model offers much better energy efficiency while also providing a fully debuggable programming environment to application developers. Our single-chip POD is capable of best-in-class scalar performance as well as extremely efficient, scalable parallel performance up to 1.5 TFLOPS of single-precision oating-point arithmetic. Our experimental results show that POD can achieve nearly linear speedup on a large number of SIMD PEs in some application domains, and this speedup is much bigger than the maximum speedup a homogeneous MIMD many-core processor can offer. Using the metric of Performance per Joule, we show that POD also improves the computation-energy efficiency substantially over a homogeneous many-core processor. Furthermore, owing to synchronized computation and communication, POD can efficiently suppress energy consumption on the novel communication method in our proposed interconnection network.

Technical Presentation

An overview slide is available here.



Refereed Journal Articles

ACM TACODong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, and Hsien-Hsin S. Lee. "Chameleon: Virtualizing Idle Acceleration Cores of A Heterogeneous Multi-Core Processor for Caching and Prefetching." In ACM Transactions on Architecture and Code Optimization, Vol. 7, No. 1, pp.3:1-3:35, April, 2010.
[pdf]
IEEE ComputerDong Hyuk Woo and Hsien-Hsin S. Lee. "Extending Amdahl's Law for Energy-Efficient Computing in the Many-Core Era." In IEEE Computer, Vol. 41, No. 12, pp.24-31, December, 2008.
[pdf]
IEEE MICRODong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, Marsha Eng, and Hsien-Hsin S. Lee. "POD: A 3D-Integrated Broad-Purpose Acceleration Layer." In IEEE MICRO special issue on Accelerator Architectures, Vol. 28, No. 4, pp.28-40, July/August, 2008.
[pdf]



Refereed Conference Papers

ASPLOS XVDong Hyuk Woo and Hsien-Hsin S. Lee. "COMPASS: A Programmable Data Prefetcher Using Idle GPU Shaders." In Proceedings of the 15th International Conference on Architectural Support for Programming Languages and Operating Systems, pp.297-309, Pittsburgh, PA, March, 2010.
[pdf] [slides]



Refereed Workshop Papers

HPEC-07 Dong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, Marsha Eng, and Hsien-Hsin S. Lee. "POD: A Parallel-On-Die Architecture." In the 11th Annual Workshop on High Performance Embedded Computing, Lexington, Massachusetts, September, 2007. (One of four finalists for the Best Paper Award.)
[pdf] [slides]



Technical Report

CERCS Technical Report Dong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, and Hsien-Hsin S. Lee. "Chameleon: Virtualizing Idle Acceleration Cores of A Heterogeneous Multi-Core Processor for Caching and Prefetching." Technical Report GIT-CERCS-08-11, Georgia Institute of Technology, December 2008
[pdf]
CERCS Technical Report Dong Hyuk Woo, Joshua B. Fryman, Allan D. Knies, Marsha Eng, and Hsien-Hsin S. Lee. "POD: A Parallel-On-Die Architecture." Technical Report GIT-CERCS-07-09, Georgia Institute of Technology, May 2007
[pdf]
100 Binney Street
Cambridge, MA 02142

>
http://hsienhsinlee.github.io
650-709-9452