2008 |
9 | EE | Shane Ryoo,
Christopher I. Rodrigues,
Sam S. Stone,
Sara S. Baghsorkhi,
Sain-Zee Ueng,
John A. Stratton,
Wen-mei W. Hwu:
Program optimization space pruning for a multithreaded gpu.
CGO 2008: 195-204 |
8 | EE | Isaac Gelado,
John H. Kelm,
Shane Ryoo,
Steven S. Lumetta,
Nacho Navarro,
Wen-mei W. Hwu:
CUBA: an architecture for efficient CPU/co-processor data communication.
ICS 2008: 299-308 |
7 | EE | Shane Ryoo,
Christopher I. Rodrigues,
Sara S. Baghsorkhi,
Sam S. Stone,
David B. Kirk,
Wen-mei W. Hwu:
Optimization principles and application performance evaluation of a multithreaded GPU using CUDA.
PPOPP 2008: 73-82 |
6 | EE | Shane Ryoo,
Christopher I. Rodrigues,
Sam S. Stone,
John A. Stratton,
Sain-Zee Ueng,
Sara S. Baghsorkhi,
Wen-mei W. Hwu:
Program optimization carving for GPU computing.
J. Parallel Distrib. Comput. 68(10): 1389-1401 (2008) |
2007 |
5 | EE | Wen-mei W. Hwu,
Shane Ryoo,
Sain-Zee Ueng,
John H. Kelm,
Isaac Gelado,
Sam S. Stone,
Robert E. Kidd,
Sara S. Baghsorkhi,
Aqeel Mahesri,
Stephanie C. Tsao,
Nacho Navarro,
Steven S. Lumetta,
Matthew I. Frank,
Sanjay J. Patel:
Implicitly Parallel Programming Models for Thousand-Core Microprocessors.
DAC 2007: 754-759 |
4 | EE | Shane Ryoo,
Christopher I. Rodrigues,
Wen-mei W. Hwu:
Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications.
LCPC 2007: 110-124 |
3 | EE | Shane Ryoo,
Sain-Zee Ueng,
Christopher I. Rodrigues,
Robert E. Kidd,
Matthew I. Frank,
Wen-mei W. Hwu:
Automatic Discovery of Coarse-Grained Parallelism in Media Applications.
T. HiPEAC 1: 194-213 (2007) |
2006 |
2 | EE | Ronald D. Barnes,
Shane Ryoo,
Wen-mei W. Hwu:
Tolerating Cache-Miss Latency with Multipass Pipelines.
IEEE Micro 26(1): 40-47 (2006) |
2005 |
1 | EE | Ronald D. Barnes,
Shane Ryoo,
Wen-mei W. Hwu:
"Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense.
MICRO 2005: 319-330 |