2008 | ||
---|---|---|
87 | EE | Carlos Madriles, Carlos García Quiñones, F. Jesús Sánchez, Pedro Marcuello, Antonio González, Dean M. Tullsen, Hong Wang, John Paul Shen: Mitosis: A Speculative Multithreaded Processor Based on Precomputation Slices. IEEE Trans. Parallel Distrib. Syst. 19(7): 914-925 (2008) |
2006 | ||
86 | John Paul Shen, Margaret Martonosi: Proceedings of the 12th International Conference on Architectural Support for Programming Languages and Operating Systems, ASPLOS 2006, San Jose, CA, USA, October 21-25, 2006 ACM 2006 | |
85 | EE | Richard A. Hankins, Gautham N. Chinya, Jamison D. Collins, Perry H. Wang, Ryan Rakvic, Hong Wang, John Paul Shen: Multiple Instruction Stream Processor. ISCA 2006: 114-127 |
84 | EE | Bryan Black, Murali Annavaram, Ned Brekelbaum, John DeVale, Lei Jiang, Gabriel H. Loh, Don McCaule, Pat Morrow, Donald W. Nelson, Daniel Pantuso, Paul Reed, Jeff Rupley, Sadasivan Shankar, John Paul Shen, Clair Webb: Die Stacking (3D) Microarchitecture. MICRO 2006: 469-479 |
2005 | ||
83 | EE | Satish Narayanasamy, Hong Wang, Perry H. Wang, John Paul Shen, Brad Calder: A Dependency Chain Clustered Microarchitecture. IPDPS 2005 |
82 | EE | Murali Annavaram, Ed Grochowski, John Paul Shen: Mitigating Amdahl's Law through EPI Throttling. ISCA 2005: 298-309 |
2004 | ||
81 | EE | Perry H. Wang, Jamison D. Collins, Hong Wang, Dongkeun Kim, Bill Greene, Kai-Ming Chan, Aamir B. Yunus, Terry Sych, Stephen F. Moore, John Paul Shen: Helper threads via virtual multithreading on an experimental itanium® 2 processor-based platform. ASPLOS 2004: 144-155 |
80 | EE | Dongkeun Kim, Shih-Wei Liao, Perry H. Wang, Juan del Cuvillo, Xinmin Tian, Xiang Zou, Hong Wang, Donald Yeung, Milind Girkar, John Paul Shen: Physical Experimentation with Prefetching Helper Threads on Intel's Hyper-Threaded Processors. CGO 2004: 27-38 |
79 | EE | Tor M. Aamodt, Paul Chow, Per Hammarlund, Hong Wang, John Paul Shen: Hardware Support for Prescient Instruction Prefetch. HPCA 2004: 84-95 |
78 | EE | Ed Grochowski, Ronny Ronen, John Paul Shen, Hong Wang: Best of Both Latency and Throughput. ICCD 2004: 236-243 |
77 | EE | Perry H. Wang, Jamison D. Collins, Hong Wang, Dongkeun Kim, Bill Greene, Kai-Ming Chan, Aamir B. Yunus, Terry Sych, Stephen F. Moore, John Paul Shen: Helper Threads via Virtual Multithreading. IEEE Micro 24(6): 74-82 (2004) |
76 | EE | Partha Kundu, Murali Annavaram, Trung A. Diep, John Paul Shen: A case for shared instruction cache on chip multiprocessors running OLTP. SIGARCH Computer Architecture News 32(3): 11-18 (2004) |
2003 | ||
75 | EE | Richard A. Hankins, Trung A. Diep, Murali Annavaram, Brian Hirano, Harald Eri, Hubert Nueckel, John Paul Shen: Scaling and Charact rizing Database Workloads: Bridging the Gap between Research and Practice. MICRO 2003: 151-164 |
74 | EE | Tor M. Aamodt, Pedro Marcuello, Paul Chow, Antonio González, Per Hammarlund, Hong Wang, John Paul Shen: A framework for modeling and optimization of prescient instruction prefetch. SIGMETRICS 2003: 13-24 |
2002 | ||
73 | EE | Ryan Rakvic, Bryan Black, Deepak Limaye, John Paul Shen: Non-Vital Loads. HPCA 2002: 165- |
72 | EE | Perry H. Wang, Hong Wang, Jamison D. Collins, Ed Grochowski, Ralph-Michael Kling, John Paul Shen: Memory Latency-Tolerance Approaches for Itanium Processors: Out-of-Order Execution vs. Speculative Precomputation. HPCA 2002: 187-196 |
71 | EE | Murali Annavaram, Trung A. Diep, John Paul Shen: Branch Behavior of a Commercial OLTP Workload on Intel IA32 Processors. ICCD 2002: 242-248 |
70 | EE | R. David Weldon, Steven S. Chang, Hong Wang, Gerolf Hoflehner, Perry H. Wang, Daniel M. Lavery, John Paul Shen: Quantitative Evaluation of the Register Stack Engine and Optimizations for Future Itanium Processors. Interaction between Compilers and Computer Architectures 2002: 57-67 |
69 | EE | Shih-Wei Liao, Perry H. Wang, Hong Wang, John Paul Shen, Gerolf Hoflehner, Daniel M. Lavery: Post-Pass Binary Adaptation for Software-Based Speculative Precomputation. PLDI 2002: 117-128 |
2001 | ||
68 | EE | Perry H. Wang, Hong Wang, Ralph-Michael Kling, Kalpana Ramakrishnan, John Paul Shen: Register Renaming and Scheduling for Dynamic Execution of Predicated Code. HPCA 2001: 15-26 |
67 | Deepak Limaye, Ryan Rakvic, John Paul Shen: Parallel Cachelets. ICCD 2001: 284-292 | |
66 | John Paul Shen: Clear and Present Tensions in Microprocessor Design. ICCD 2001: 4-4 | |
65 | EE | Jamison D. Collins, Hong Wang, Dean M. Tullsen, Christopher J. Hughes, Yong-Fong Lee, Daniel M. Lavery, John Paul Shen: Speculative precomputation: long-range prefetching of delinquent loads. ISCA 2001: 14-25 |
64 | EE | Jamison D. Collins, Dean M. Tullsen, Hong Wang, John Paul Shen: Dynamic speculative precomputation. MICRO 2001: 306-317 |
2000 | ||
63 | EE | Yuan C. Chou, John Paul Shen: Instruction path coprocessors. ISCA 2000: 270-281 |
62 | EE | Ryan Rakvic, Bryan Black, John Paul Shen: Completion time multiple branch prediction for enhancing trace cache performance. ISCA 2000: 47-58 |
61 | EE | Yuan C. Chou, Pazhani Pillai, Herman Schmit, John Paul Shen: PipeRench implementation of the instruction path coprocessor. MICRO 2000: 147-158 |
60 | EE | Noppanunt Utamaphethai, R. D. (Shawn) Blanton, John Paul Shen: Effectiveness of Microarchitecture Test Program Generation. IEEE Design & Test of Computers 17(4): 38-49 (2000) |
59 | EE | Noppanunt Utamaphethai, R. D. (Shawn) Blanton, John Paul Shen: A Buffer-Oriented Methodology for Microarchitecture Validation. J. Electronic Testing 16(1-2): 49-65 (2000) |
1999 | ||
58 | EE | Jonathan Combs, Candice Bechem Combs, John Paul Shen: Mispredicted Path Cache Effects. Euro-Par 1999: 1322-1231 |
57 | EE | Alexander G. Dean, John Paul Shen: System-Level Issues for Software Thread Integration: Guest Triggering and Host Selection. IEEE Real-Time Systems Symposium 1999: 234 |
56 | EE | Bryan Black, Bohuslav Rychlik, John Paul Shen: The Block-Based Trace Cache. ISCA 1999: 196-207 |
55 | EE | Yuan C. Chou, Jason Fung, John Paul Shen: Reducing branch misprediction penalties via dynamic control independence detection. International Conference on Supercomputing 1999: 109-118 |
54 | EE | Noppanunt Utamaphethai, R. D. (Shawn) Blanton, John Paul Shen: Superscalar Processor Validation at the Microarchitecture Level. VLSI Design 1999: 300-305 |
1998 | ||
53 | EE | Alexander G. Dean, John Paul Shen: Hardware to Software Migration with Real-Time Thread Integration. EUROMICRO 1998: 10243- |
52 | EE | Bohuslav Rychlik, John Faistl, Bryon Krug, John Paul Shen: Efficacy and Performance Impact of Value Prediction. IEEE PACT 1998: 148- |
51 | EE | Alexander G. Dean, John Paul Shen: Techniques for Software Thread Integration in Real-Time Embedded Systems. IEEE Real-Time Systems Symposium 1998: 322-333 |
50 | EE | Bryan Black, Brian Mueller, Stephanie Postal, Ryan Rakvic, Noppanunt Utamaphethai, John Paul Shen: Load Execution Latency Reduction. International Conference on Supercomputing 1998: 29-36 |
49 | Bryan Black, John Paul Shen: Calibration of Microprocessor Performance Models. IEEE Computer 31(5): 59-65 (1998) | |
48 | Mikko H. Lipasti, John Paul Shen: Exploiting Value Locality to Exceed the Dataflow Limit. International Journal of Parallel Programming 26(4): 505-538 (1998) | |
1997 | ||
47 | Mikko H. Lipasti, John Paul Shen: The Performance Potential of Value and Dependence Prediction. Euro-Par 1997: 1043-1052 | |
46 | Yuan C. Chou, Daniel P. Siewiorek, John Paul Shen: A Realistic Study on Multithreaded Superscalar Processor Design. Euro-Par 1997: 1092-1101 | |
45 | EE | Derek B. Noonburg, John Paul Shen: A Framework for Statistical Modeling of Superscalar Processor Performance. HPCA 1997: 298-309 |
44 | Chris J. Newburn, John Paul Shen: Compiler Support for Low-Cost Synchronization Among Threads. PARCO 1997: 485-494 | |
43 | Mikko H. Lipasti, John Paul Shen: Superspeculative Microarchitecture for Beyond AD 2000. IEEE Computer 30(9): 59-66 (1997) | |
1996 | ||
42 | Andrew S. Huang, John Paul Shen: The Intrinsic Bandwidth Requirements of Ordinary Programs. ASPLOS 1996: 105-114 | |
41 | Mikko H. Lipasti, Christopher B. Wilkerson, John Paul Shen: Value Locality and Load Value Prediction. ASPLOS 1996: 138-147 | |
40 | EE | Bryan Black, Andrew S. Huang, Mikko H. Lipasti, John Paul Shen: Can Trace-Driven Simulators Accurately Predict Superscalar Performance? ICCD 1996: 478-485 |
39 | EE | Mikko H. Lipasti, John Paul Shen: Exceeding the Dataflow Limit via Value Prediction. MICRO 1996: 226-237 |
1995 | ||
38 | Trung A. Diep, John Paul Shen: Systematic Validation of Pipeline Interlock for Superscalar Microarchitectures. FTCS 1995: 100-109 | |
37 | EE | Trung A. Diep, Christopher Nelson, John Paul Shen: Performance Evaluation of the PowerPC 620 Microarchitecture. ISCA 1995: 163-175 |
36 | EE | Andrew S. Huang, John Paul Shen: A limit study of local memory requirements using value reuse profiles. MICRO 1995: 71-81 |
35 | Trung A. Diep, John Paul Shen: VMW: A Visualization-Based Microarchitecture Workbench. IEEE Computer 28(12): 57-64 (1995) | |
1994 | ||
34 | Chris J. Newburn, Derek B. Noonburg, John Paul Shen: A PDG-based Tool and its Use in Analyzing Program Control Dependences. IFIP PACT 1994: 157-168 | |
33 | Andrew S. Huang, Gert Slavenburg, John Paul Shen: Speculative Disambiguation: A Compilation Technique for Dynamic Memory Disambiguation. ISCA 1994: 200-210 | |
32 | EE | Derek B. Noonburg, John Paul Shen: Theoretical modeling of superscalar processor performance. MICRO 1994: 52-62 |
31 | Michael A. Schuette, John Paul Shen: Exploiting Instruction-Level Parallelism for Integrated Control-Flow Monitoring. IEEE Trans. Computers 43(2): 129-140 (1994) | |
1993 | ||
30 | Chris J. Newburn, Andrew S. Huang, John Paul Shen: Balancing Fine- and Medium-Grained Parallelism in Scheduling Loops for the XIMD Architecture. Architectures and Compilation Techniques for Fine and Medium Grain Parallelism 1993: 39-52 | |
29 | Trung A. Diep, Mikko H. Lipasti, John Paul Shen: Architecture-Compatible Code Boosting for Performance Enhancement of the IBM RS/6000. ICCD 1993: 86-93 | |
28 | EE | Trung A. Diep, John Paul Shen, Mike Phillip: EXPLORER: a retargetable and visualization-based trace-driven simulator for superscalar processors. MICRO 1993: 225-235 |
1992 | ||
27 | Scott H. Robinson, John Paul Shen: Direct Methods for Synthesis of Self-Monitoring State Machines. FTCS 1992: 306-315 | |
1991 | ||
26 | Andrew Wolfe, John Paul Shen: A Variable Instruction Stream Extension to the VLIW Architecture. ASPLOS 1991: 2-14 | |
25 | Michael A. Schuette, John Paul Shen: Exploiting Instruction-Level Resource Parallelism for Transparent, Integrated Control-Flow Monitoring. FTCS 1991: 318-325 | |
24 | EE | Chriss Stephens, Bryce Cogswell, John Heinlein, Gregory Palmer, John Paul Shen: Instruction Level Profiling and Evaluation of the IBM/6000. ISCA 1991: 180-189 |
23 | EE | Mauricio Breternitz Jr., John Paul Shen: Implementation Optimization Techniques for Architecture Synthesis of Application-Specific Processors. MICRO 1991: 114-123 |
22 | EE | Michael A. Schuette, John Paul Shen: An Instruction-Level Performance Analysis of the Multiflow TRACE 14/300. MICRO 1991: 2-11 |
1990 | ||
21 | EE | Mauricio Breternitz Jr., John Paul Shen: Architecture Synthesis of High-Performance Application-Specific Processors. DAC 1990: 542-548 |
20 | Scott H. Robinson, John Paul Shen: Evaluation and Synthesis of Self-Monitoring State Machines. ICCAD 1990: 276-279 | |
19 | EE | Kent D. Wilken, John Paul Shen: Continuous signature monitoring: low-cost concurrent detection of processor control errors. IEEE Trans. on CAD of Integrated Circuits and Systems 9(6): 629-641 (1990) |
1988 | ||
18 | Andrew Wolfe, Mauricio Breternitz Jr., Chriss Stephens, A. L. Ting, D. B. Kirk, Ronald P. Bianchini Jr., John Paul Shen: The White Dwarf: A High-Performance Application-Specific Processor. ISCA 1988: 212-222 | |
17 | John Paul Shen, F. Joel Ferguson: Extraction and Simulation of Realistic CMOS Faults Using Inductive Fault Analysis. ITC 1988: 475-484 | |
16 | Kent D. Wilken, John Paul Shen: Continuous Signature Monitoring: Efficient Concurrent-Detection of Processor Control Errors. ITC 1988: 914-925 | |
15 | EE | Andrew Wolfe, John Paul Shen: Flexible processors: a promising application-specific processor design approach. MICRO 1988: 30-39 |
14 | EE | Mauricio Breternitz Jr., John Paul Shen: Organization of array data for concurrent memory access. MICRO 1988: 97-99 |
13 | EE | F. Joel Ferguson, John Paul Shen: A CMOS fault extractor for inductive fault analysis. IEEE Trans. on CAD of Integrated Circuits and Systems 7(11): 1181-1194 (1988) |
1987 | ||
12 | Michael A. Schuette, John Paul Shen: Processor Control Flow Monitoring Using Signatured Instruction Streams. IEEE Trans. Computers 36(3): 264-276 (1987) | |
11 | Ronald P. Bianchini Jr., John Paul Shen: Interprocessor Traffic Scheduling Algorithm for Multiple-Processor Networks. IEEE Trans. Computers 36(4): 396-409 (1987) | |
1986 | ||
10 | John Paul Shen: Highlights of CMU Research on CAD, CAM, CAT of VLSI Circuits. FJCC 1986: 878-889 | |
9 | EE | John Paul Shen, John P. Hayes, Luigi Ciminiera, Angelo Serra: Fault-tolerance and performance analysis of beta-networks. Parallel Computing 3(3): 231-249 (1986) |
1985 | ||
8 | Patrick P. Fasang, Michael A. Schuette, John Paul Shen, William A. Gwaltney: Automated Design for Testability of Semicustom Integrated Circuits. ITC 1985: 558-564 | |
1984 | ||
7 | Wojciech Maly, F. Joel Ferguson, John Paul Shen: Systematic Characterization of Physical Defects for Fault Analysis of MOS IC Cells. ITC 1984: 390-399 | |
6 | John Paul Shen, John P. Hayes: Fault-Tolerance of Dynamic-Full-Access Interconnection Networks. IEEE Trans. Computers 33(3): 241-248 (1984) | |
5 | John Paul Shen, F. Joel Ferguson: The Design of Easily Tastabel VLSI Array Multipliers. IEEE Trans. Computers 33(6): 554-560 (1984) | |
1983 | ||
4 | David C. H. Lee, John Paul Shen: Easily-Testable (N, K) Shuffle/Exchange Networks. ICPP 1983: 65-70 | |
3 | Michael A. Schuette, John Paul Shen: On-Line Self-Monitoring Using Signatured Instruction Streams. ITC 1983: 275-282 | |
1982 | ||
2 | John Paul Shen: Fault tolerance analysis of several interconnection networks. ICPP 1982: 101-112 | |
1980 | ||
1 | John Paul Shen, John P. Hayes: Fault Tolerance of a Class of Connecting Networks. ISCA 1980: 61-71 |