2009 | ||
---|---|---|
126 | EE | Albert Sidelnik, I-Jui Sung, Wanmin Wu, María Jesús Garzarán, Wen-mei W. Hwu, Klara Nahrstedt, David A. Padua, Sanjay J. Patel: Optimization of tele-immersion codes. GPGPU 2009: 85-93 |
125 | EE | John E. Stone, Jan Saam, David J. Hardy, Kirby L. Vandivort, Wen-mei W. Hwu, Klaus Schulten: High performance computation and interactive display of molecular orbitals on GPUs and multi-core CPUs. GPGPU 2009: 9-18 |
2008 | ||
124 | EE | Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, Sara S. Baghsorkhi, Sain-Zee Ueng, John A. Stratton, Wen-mei W. Hwu: Program optimization space pruning for a multithreaded gpu. CGO 2008: 195-204 |
123 | EE | Sam S. Stone, Justin P. Haldar, Stephanie C. Tsao, Wen-mei W. Hwu, Zhi-Pei Liang, Bradley P. Sutton: Accelerating advanced mri reconstructions on gpus. Conf. Computing Frontiers 2008: 261-272 |
122 | EE | Christopher I. Rodrigues, David J. Hardy, John E. Stone, Klaus Schulten, Wen-mei W. Hwu: GPU acceleration of cutoff pair potentials for molecular modeling applications. Conf. Computing Frontiers 2008: 273-282 |
121 | EE | Isaac Gelado, John H. Kelm, Shane Ryoo, Steven S. Lumetta, Nacho Navarro, Wen-mei W. Hwu: CUBA: an architecture for efficient CPU/co-processor data communication. ICS 2008: 299-308 |
120 | EE | Sain-Zee Ueng, Melvin Lathara, Sara S. Baghsorkhi, Wen-mei W. Hwu: CUDA-Lite: Reducing GPU Programming Complexity. LCPC 2008: 1-15 |
119 | EE | John A. Stratton, Sam S. Stone, Wen-mei W. Hwu: MCUDA: An Efficient Implementation of CUDA Kernels for Multi-core CPUs. LCPC 2008: 16-30 |
118 | EE | Shane Ryoo, Christopher I. Rodrigues, Sara S. Baghsorkhi, Sam S. Stone, David B. Kirk, Wen-mei W. Hwu: Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. PPOPP 2008: 73-82 |
117 | EE | Alexandros Papakonstantinou, Deming Chen, Wen-mei W. Hwu: Application Acceleration with the Explicitly Parallel Operations System - the EPOS Processor. SASP 2008: 20-25 |
116 | EE | Sam S. Stone, Justin P. Haldar, Stephanie C. Tsao, Wen-mei W. Hwu, Bradley P. Sutton, Zhi-Pei Liang: Accelerating advanced MRI reconstructions on GPUs. J. Parallel Distrib. Comput. 68(10): 1307-1318 (2008) |
115 | EE | Shane Ryoo, Christopher I. Rodrigues, Sam S. Stone, John A. Stratton, Sain-Zee Ueng, Sara S. Baghsorkhi, Wen-mei W. Hwu: Program optimization carving for GPU computing. J. Parallel Distrib. Comput. 68(10): 1389-1401 (2008) |
2007 | ||
114 | EE | Lauren Sarno, Wen-mei W. Hwu, Craig Lund, Markus Levy, James R. Larus, James Reinders, Gordon Cameron, Chris Lennard, Takashi Yoshimori: Corezilla: Build and Tame the Multicore Beast? DAC 2007: 632-633 |
113 | EE | Wen-mei W. Hwu, Shane Ryoo, Sain-Zee Ueng, John H. Kelm, Isaac Gelado, Sam S. Stone, Robert E. Kidd, Sara S. Baghsorkhi, Aqeel Mahesri, Stephanie C. Tsao, Nacho Navarro, Steven S. Lumetta, Matthew I. Frank, Sanjay J. Patel: Implicitly Parallel Programming Models for Thousand-Core Microprocessors. DAC 2007: 754-759 |
112 | EE | Shane Ryoo, Christopher I. Rodrigues, Wen-mei W. Hwu: Iteration Disambiguation for Parallelism Identification in Time-Sliced Applications. LCPC 2007: 110-124 |
111 | EE | John H. Kelm, Isaac Gelado, Mark J. Murphy, Nacho Navarro, Steven S. Lumetta, Wen-mei W. Hwu: CIGAR: Application Partitioning for a CPU/Coprocessor Architecture. PACT 2007: 317-326 |
110 | EE | Shane Ryoo, Sain-Zee Ueng, Christopher I. Rodrigues, Robert E. Kidd, Matthew I. Frank, Wen-mei W. Hwu: Automatic Discovery of Coarse-Grained Parallelism in Media Applications. T. HiPEAC 1: 194-213 (2007) |
2006 | ||
109 | EE | Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu: Tolerating Cache-Miss Latency with Multipass Pipelines. IEEE Micro 26(1): 40-47 (2006) |
108 | EE | Ronald D. Barnes, John W. Sias, Erik M. Nystrom, Sanjay J. Patel, Jose (Nacho) Navarro, Wen-mei W. Hwu: Beating In-Order Stalls with "Flea-Flicker" Two-Pass Pipelining. IEEE Trans. Computers 55(1): 18-33 (2006) |
2005 | ||
107 | | Thomas M. Conte, Nacho Navarro, Wen-mei W. Hwu, Mateo Valero, Theo Ungerer: High Performance Embedded Architectures and Compilers, First International Conference, HiPEAC 2005, Barcelona, Spain, November 17-18, 2005, Proceedings. Springer 2005 |
106 | | Wen-mei W. Hwu, Sanjay J. Patel: The Future of Computer Architecture Research: An Industrial Perspective. HPCA 2005: 264 |
105 | EE | Ronald D. Barnes, Shane Ryoo, Wen-mei W. Hwu: "Flea-flicker" Multipass Pipelining: An Alternative to the High-Power Out-of-Order Offense. MICRO 2005: 319-330 |
104 | EE | Wen-mei W. Hwu, Krishna V. Palem: Guest Editors' Introduction. IEEE Trans. Computers 54(10): 1185-1187 (2005) |
2004 | ||
103 | EE | John W. Sias, Sain-Zee Ueng, Geoff A. Kent, Ian M. Steiner, Erik M. Nystrom, Wen-mei W. Hwu: Field-testing IMPACT EPIC research results in Itanium 2. ISCA 2004: 26-39 |
102 | EE | Lakshmi N. Chakrapani, John C. Gyllenhaal, Wen-mei W. Hwu, Scott A. Mahlke, Krishna V. Palem, Rodric M. Rabbah: Trimaran: An Infrastructure for Research in Instruction-Level Parallelism. LCPC 2004: 32-41 |
101 | EE | Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu: Importance of heap specialization in pointer analysis. PASTE 2004: 43-48 |
100 | EE | Erik M. Nystrom, Hong-Seok Kim, Wen-mei W. Hwu: Bottom-Up and Top-Down Context-Sensitive Summary-Based Pointer Analysis. SAS 2004: 165-180 |
2003 | ||
99 | EE | Ronald D. Barnes, Erik M. Nystrom, John W. Sias, Sanjay J. Patel, Nacho Navarro, Wen-mei W. Hwu: Beating in-order stalls with "flea-flicker" two-pass pipelining. MICRO 2003: 387-398 |
98 | EE | Jeffrey P. Monks, Jean-Pierre Ebert, Wen-mei W. Hwu, Adam Wolisz: Energy saving and capacity improvement potential of power control in multi-hop wireless networks. Computer Networks 41(3): 313-330 (2003) |
2002 | ||
97 | EE | Hillery C. Hunter, Wen-mei W. Hwu: Code coverage and input variability: effects on architecture and compiler research. CASES 2002: 79-87 |
96 | EE | Ronald D. Barnes, Erik M. Nystrom, Matthew C. Merten, Wen-mei W. Hwu: Vacuum packing: extracting hardware-detected program phases for post-link optimization. MICRO 2002: 233-244 |
2001 | ||
95 | EE | Erik M. Nystrom, Ronald D. Barnes, Matthew C. Merten, Wen-mei W. Hwu: Code Reordering and Speculation Support for Dynamic Optimization System. IEEE PACT 2001: 163-174 |
94 | EE | Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu: A Power Controlled Multiple Access Protocol for Wireless Packet Networks. INFOCOM 2001: 219-228 |
93 | EE | Jeffrey P. Monks, Jean-Pierre Ebert, Adam Wolisz, Wen-mei W. Hwu: A Study of the Energy Saving and Capacity Improvement Potential of Power Control in Multi-Hop Wireless Networks. LCN 2001: 550-559 |
92 | EE | Matthew C. Merten, Wen-mei W. Hwu: Modulo schedule buffers. MICRO 2001: 138-149 |
91 | EE | John W. Sias, Hillery C. Hunter, Wen-mei W. Hwu: Enhancing loop buffering of media and telecommunications applications using low-overhead predication. MICRO 2001: 262-273 |
90 | EE | Matthew C. Merten, Andrew R. Trick, Ronald D. Barnes, Erik M. Nystrom, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu: An Architectural Framework for Runtime Optimization. IEEE Trans. Computers 50(6): 567-589 (2001) |
2000 | ||
89 | EE | Daniel A. Connors, Hillery C. Hunter, Ben-Chung Cheng, Wen-mei W. Hwu: Hardware Support for Dynamic Management of Compiler-Directed Computation Reuse. ASPLOS 2000: 222-233 |
88 | EE | Matthew C. Merten, Andrew R. Trick, Erik M. Nystrom, Ronald D. Barnes, Wen-mei W. Hwu: A hardware mechanism for dynamic extraction and relayout of program hot spots. ISCA 2000: 59-70 |
87 | EE | Jeffrey P. Monks, Vaduvur Bharghavan, Wen-mei W. Hwu: Transmission Power Control for Multiple Access Wireless Packet Networks. LCN 2000: 12-21 |
86 | EE | John W. Sias, Wen-mei W. Hwu, David I. August: Accurate and efficient predicate analysis with binary decision diagrams. MICRO 2000: 112-123 |
85 | EE | Ben-Chung Cheng, Wen-mei W. Hwu: Modular interprocedural pointer analysis using access paths: design, implementation, and evaluation. PLDI 2000: 57-69 |
1999 | ||
84 | EE | Daniel A. Connors, Jean-Michel Puiatti, David I. August, Kevin M. Crozier, Wen-mei W. Hwu: An Architecture Framework for Introducing Predicated Execution into Embedded Microprocessors. Euro-Par 1999: 1301-1311 |
83 | EE | Matthew C. Merten, Andrew R. Trick, Christopher N. George, John C. Gyllenhaal, Wen-mei W. Hwu: A Hardware-Driven Profiling Scheme for Identifying Program Hot Spots to Support Runtime Optimization. ISCA 1999: 136-147 |
82 | EE | David I. August, John W. Sias, Jean-Michel Puiatti, Scott A. Mahlke, Daniel A. Connors, Kevin M. Crozier, Wen-mei W. Hwu: The Program Decision Logic Approach to Predicated Execution. ISCA 1999: 208-219 |
81 | EE | Ben-Chung Cheng, Wen-mei W. Hwu: An Empirical Study of Function Pointers Using SPEC Benchmarks. LCPC 1999: 490-493 |
80 | EE | Daniel A. Connors, Wen-mei W. Hwu: Compiler-Directed Dynamic Computation Reuse: Rationale and Initial Results. MICRO 1999: 158-169 |
79 | EE | Le-Chun Wu, Rajiv Mirani, Harish Patil, Bruce Olsen, Wen-mei W. Hwu: A New Framework for Debugging Globally Optimized Code. PLDI 1999: 181-191 |
78 | EE | Teresa L. Johnson, Daniel A. Connors, Matthew C. Merten, Wen-mei W. Hwu: Run-Time Cache Bypassing. IEEE Trans. Computers 48(12): 1338-1354 (1999) |
77 | | Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman: Editor's Introduction. International Journal of Parallel Programming 27(5): 325-326 (1999) |
76 | | David I. August, Wen-mei W. Hwu, Scott A. Mahlke: The Partial Reverse If-Conversion Framework for Balancing Control Flow and Predication. International Journal of Parallel Programming 27(5): 381-423 (1999) |
75 | | Thomas M. Conte, Wen-mei W. Hwu, Mark Smotherman: Editors' Introduction. International Journal of Parallel Programming 27(6): 425-426 (1999) |
1998 | ||
74 | EE | Wen-mei W. Hwu, Yale N. Patt: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. 25 Years ISCA: Retrospectives and Reprints 1998: 300-308 |
73 | EE | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. 25 Years ISCA: Retrospectives and Reprints 1998: 408-417 |
72 | EE | Wen-mei W. Hwu, Yale N. Patt: Retrospective: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. 25 Years ISCA: Retrospectives and Reprints 1998: 43-44 |
71 | EE | Wen-mei W. Hwu: Retrospective: IMPACT: An Architectural Framework for Multiple-Instruction Issue. 25 Years ISCA: Retrospectives and Reprints 1998: 77-79 |
70 | EE | Brian L. Deitrich, Ben-Chung Cheng, Wen-mei W. Hwu: Improving Static Branch Prediction in a Compiler. IEEE PACT 1998: 214-221 |
69 | EE | David I. August, Daniel A. Connors, Scott A. Mahlke, John W. Sias, Kevin M. Crozier, Ben-Chung Cheng, Patrick R. Eaton, Qudus B. Olaniran, Wen-mei W. Hwu: Integrated Predicated and Speculative Execution in the IMPACT EPIC Architecture. ISCA 1998: 227-237 |
68 | EE | Ben-Chung Cheng, Daniel A. Connors, Wen-mei W. Hwu: Compiler-Directed Early Load-Address Generation. MICRO 1998: 138-147 |
67 | | Wen-mei W. Hwu: Introduction to Predicate Execution. IEEE Computer 31: 49-50 (1998) |
66 | | Thomas M. Conte, Mary Ann Hirsch, Wen-mei W. Hwu: Combining Trace Sampling with Single Pass Methods for Efficient Cache Simulation. IEEE Trans. Computers 47(6): 714-720 (1998) |
65 | | Steve Beaty, Wen-mei W. Hwu: Foreword to the Special Issue. International Journal of Parallel Programming 26(4): 345-347 (1998) |
64 | | John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau: Optimization of Machine Descriptions for Efficient Use. International Journal of Parallel Programming 26(4): 417-447 (1998) |
1997 | ||
63 | EE | David I. August, Daniel A. Connors, John C. Gyllenhaal, Wen-mei W. Hwu: Architectural Support for Compiler-Synthesized Dynamic Branch Prediction Strategies: Rationale and Initial Results. HPCA 1997: 84-93 |
62 | EE | Teresa L. Johnson, Wen-mei W. Hwu: Run-Time Adaptive Cache Hierarchy Management via Reference Analysis. ISCA 1997: 315-326 |
61 | EE | Teresa L. Johnson, Matthew C. Merten, Wen-mei W. Hwu: Run-Time Spatial Locality Detection and Optimization. MICRO 1997: 57-64 |
60 | EE | David I. August, Wen-mei W. Hwu, Scott A. Mahlke: A Framework for Balancing Control Flow and Predication. MICRO 1997: 92-103 |
59 | | Cheng-Hsueh A. Hsieh, Marie T. Conte, Teresa L. Johnson, John C. Gyllenhaal, Wen-mei W. Hwu: Optimizing NET Compilers for Improved Java Performance. IEEE Computer 30(6): 67-75 (1997) |
1996 | ||
58 | EE | Daniel M. Lavery, Wen-mei W. Hwu: Modulo Scheduling of Loops in Control-intensive Non-numeric Programs. MICRO 1996: 126-137 |
57 | EE | John C. Gyllenhaal, Wen-mei W. Hwu, B. Ramakrishna Rau: Optimization of Machine Descriptions for Efficient Use. MICRO 1996: 349-358 |
56 | EE | Brian L. Deitrich, Wen-mei W. Hwu: Speculative Hedge: Regulating Compile-time Speculation Against Profile Variations. MICRO 1996: 70-79 |
55 | EE | Cheng-Hsueh A. Hsieh, John C. Gyllenhaal, Wen-mei W. Hwu: Java Bytecode to Native Code Translation: The Caffeine Prototype and Preliminary Results. MICRO 1996: 90-99 |
1995 | ||
54 | EE | Roger A. Bringmann, Scott A. Mahlke, Wen-mei W. Hwu: A study of the effects of compiler-controlled speculation on instruction and data caches. HICSS (1) 1995: 211-220 |
53 | EE | Scott A. Mahlke, Richard E. Hank, James E. McCormick, David I. August, Wen-mei W. Hwu: A Comparison of Full and Partial Predicated Execution Support for ILP Processors. ISCA 1995: 138-150 |
52 | EE | Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau: Region-based compilation: an introduction and motivation. MICRO 1995: 158-168 |
51 | EE | Daniel M. Lavery, Wen-mei W. Hwu: Unrolling-based optimizations for modulo scheduling. MICRO 1995: 327-337 |
50 | | Thomas M. Conte, Wen-mei W. Hwu: Advances in Benchmarking Techniques: New Standards and Quantitative Metrics. Advances in Computers 41: 231-253 (1995) |
49 | | Chung-Chi Jim Li, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: Compiler-Based Multiple Instruction Retry. IEEE Trans. Computers 44(1): 35-46 (1995) |
48 | | Pohua P. Chang, Daniel M. Lavery, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: The Importance of Prepass Code Scheduling for Superscalar and Superpipelined Processors. IEEE Trans. Computers 44(3): 353-370 (1995) |
47 | | Pohua P. Chang, Nancy J. Warter, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Three Architectural Models for Compiler-Controlled Speculative Execution. IEEE Trans. Computers 44(4): 481-494 (1995) |
46 | | Neal J. Alewine, Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: Compiler-Assisted Multiple Instruction Rollback Recovery Using a Read Buffer. IEEE Trans. Computers 44(9): 1096-1107 (1995) |
1994 | ||
45 | | David M. Gallagher, William Y. Chen, Scott A. Mahlke, John C. Gyllenhaal, Wen-mei W. Hwu: Dynamic Memory Disambiguation Using the Memory Conflict Buffer. ASPLOS 1994: 183-193 |
44 | | Shyh-Kwei Chen, W. Kent Fuchs, Wen-mei W. Hwu: An Analytical Approach to Scheduling Code for Superscalar and VLIW Architectures. ICPP (1) 1994: 285-292 |
43 | EE | Yoji Yamada, John C. Gyllenhaal, Grant Haab, Wen-mei W. Hwu: Data relocation and prefetching for programs with large data sets. MICRO 1994: 118-127 |
42 | EE | Scott A. Mahlke, Richard E. Hank, Roger A. Bringmann, John C. Gyllenhaal, David M. Gallagher, Wen-mei W. Hwu: Characterizing the impact of predicated execution on branch prediction. MICRO 1994: 217-227 |
41 | | Wen-mei W. Hwu, Thomas M. Conte: The Susceptibility of Programs to Context Switching. IEEE Trans. Computers 43(9): 994-1003 (1994) |
40 | | Sadun Anik, Wen-mei W. Hwu: Performance Implications of Synchronization Support for Parallel Fortran Programs. J. Parallel Distrib. Comput. 22(2): 202-215 (1994) |
39 | | Shyh-Kwei Chen, Neal J. Alewine, W. Kent Fuchs, Wen-mei W. Hwu: Incremental Compiler Transformations for Multiple Instruction Retry. Softw., Pract. Exper. 24(12): 1179-1198 (1994) |
1993 | ||
38 | | W. Kent Fuchs, Wen-mei W. Hwu, Neal J. Alewine: Application of Compiler-Assisted Rollback Recovery to Speculative Execution Repair. Hardware and Software Architectures for Fault Tolerance 1993: 45-65 |
37 | | Tokuzo Kiyohara, Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Sadun Anik, Wen-mei W. Hwu: Register Connection: A New Approach to Adding Registers into Instruction Set Architectures. ISCA 1993: 247-256 |
36 | EE | Roger A. Bringmann, Scott A. Mahlke, Richard E. Hank, John C. Gyllenhaal, Wen-mei W. Hwu: Speculative execution exception recovery using write-back suppression. MICRO 1993: 214-223 |
35 | EE | Richard E. Hank, Scott A. Mahlke, Roger A. Bringmann, John C. Gyllenhaal, Wen-mei W. Hwu: Superblock formation using static program analysis. MICRO 1993: 247-255 |
34 | | Nancy J. Warter, Scott A. Mahlke, Wen-mei W. Hwu, B. Ramakrishna Rau: Reverse If-Conversion. PLDI 1993: 290-299 |
33 | EE | Scott A. Mahlke, William Y. Chen, Roger A. Bringmann, Richard E. Hank, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. ACM Trans. Comput. Syst. 11(4): 376-408 (1993) |
32 | | William Y. Chen, Pohua P. Chang, Thomas M. Conte, Wen-mei W. Hwu: The Effect of Code Expanding Optimizations on Instruction Cache Design. IEEE Trans. Computers 42(9): 1045-1057 (1993) |
31 | | Aloke Gupta, Wen-mei W. Hwu: An execution Profiler for Window-oriented Applications. Softw., Pract. Exper. 23(5): 487-510 (1993) |
1992 | ||
30 | | Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu, B. Ramakrishna Rau, Michael S. Schlansker: Sentinel Scheduling for VLIW and Superscalar Processors. ASPLOS 1992: 238-247 |
29 | | Neal J. Alewine, Shyh-Kwei Chen, Chung-Chi Jim Li, W. Kent Fuchs, Wen-mei W. Hwu: Branch Recovery with Compiler-Assisted Multiple Instruction Retry. FTCS 1992: 66-73 |
28 | | William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu: Tolerating First Level Memory Access Latency in High-Performance Systems. ICPP (1) 1992: 36-43 |
27 | | Sadun Anik, Wen-mei W. Hwu: Executing Nested Parallel Loops on Shared-Memory Multiprocessors. ICPP (3) 1992: 241-244 |
26 | EE | William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu, Tokuzo Kiyohara, Pohua P. Chang: Tolerating data access latency with register preloading. ICS 1992: 104-113 |
25 | | William Y. Chen, Roger A. Bringmann, Scott A. Mahlke, Sadun Anik, Tokuzo Kiyohara, Nancy J. Warter, Daniel M. Lavery, Wen-mei W. Hwu, Richard E. Hank, John C. Gyllenhaal: Using Profile Information to Assist Advanced Compiler Optimization and Scheduling. LCPC 1992: 31-48 |
24 | | Scott A. Mahlke, William Y. Chen, John C. Gyllenhaal, Wen-mei W. Hwu: Compiler Code Transformations for Superscalar-Based High Performance Systems. SC 1992: 808-817 |
23 | | Aloke Gupta, Wen-mei W. Hwu: Xprof: Profiling the Execution of X Window Programs. SIGMETRICS 1992: 253-254 |
22 | | Wen-mei W. Hwu, Pohua P. Chang: Efficient Instruction Sequencing with Inline Target Insertion. IEEE Trans. Computers 41(12): 1537-1551 (1992) |
21 | | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Wen-mei W. Hwu: Profile-guided Automatic Inline Expansion for C Programs. Softw., Pract. Exper. 22(5): 349-369 (1992) |
1991 | ||
20 | | Scott A. Mahlke, Nancy J. Warter, William Y. Chen, Pohua P. Chang, Wen-mei W. Hwu: The Effect of Compiler Optimizations on Available Parallelism in Scalar Programs. ICPP (2) 1991: 142-145 |
19 | EE | Pohua P. Chang, Scott A. Mahlke, William Y. Chen, Nancy J. Warter, Wen-mei W. Hwu: IMPACT: An Architectural Framework for Multiple-Instruction-Issue Processors. ISCA 1991: 266-275 |
18 | EE | Pohua P. Chang, William Y. Chen, Scott A. Mahlke, Wen-mei W. Hwu: Comparing Static and Dynamic Code Scheduling for Multiple-Instruction-Issue Processors. MICRO 1991: 25-33 |
17 | EE | William Y. Chen, Scott A. Mahlke, Pohua P. Chang, Wen-mei W. Hwu: Data Access Microarchitectures for Superscalar Processors with Compiler-Assisted Data Prefetching. MICRO 1991: 69-73 |
16 | | Thomas M. Conte, Wen-mei W. Hwu: Benchmark Characterization. IEEE Computer 24(1): 48-56 (1991) |
15 | | Pohua P. Chang, Scott A. Mahlke, Wen-mei W. Hwu: Using Profile Information to Assist Classic Code Optimizations. Softw., Pract. Exper. 21(12): 1301-1321 (1991) |
1989 | ||
14 | EE | Pohua P. Chang, Wen-mei W. Hwu: Control flow optimization for supercomputer scalar processing. ICS 1989: 145-153 |
13 | EE | Wen-mei W. Hwu, Thomas M. Conte, Pohua P. Chang: Comparing Software and Hardware Schemes For Reducing the Cost of Branches. ISCA 1989: 224-233 |
12 | EE | Wen-mei W. Hwu, Pohua P. Chang: Achieving High Instruction Cache Performance with an Optimizing Compiler. ISCA 1989: 242-251 |
11 | EE | P.-H. Chang, Wen-mei W. Hwu: Forward semantic: a compiler-assisted instruction fetch method for heavily pipelined processors. MICRO 1989: 188-198 |
10 | | Wen-mei W. Hwu, Pohua P. Chang: Inline Function Expansion for Compiling C Programs. PLDI 1989: 246-257 |
9 | | Wen-mei W. Hwu, Thomas M. Conte: A Simulation Study of Simultaneous Vector Prefetch Performance in Multiprocessor Memory Subsystems (Extended Abstract). SIGMETRICS 1989: 227 |
1988 | ||
8 | | Wen-mei W. Hwu, Pohua P. Chang: Exploiting Parallel Microprocessor Microarchitectures With a Compiler Code Generator. ISCA 1988: 45-53 |
7 | EE | Pohua P. Chang, Wen-mei W. Hwu: Trace selection for compiling large C application programs to microcode. MICRO 1988: 21-29 |
1987 | ||
6 | | Wen-mei W. Hwu, Yale N. Patt: Checkpoint Repair for Out-of-order Execution Machines. ISCA 1987: 18-26 |
5 | EE | Wen-mei W. Hwu, Yale N. Patt: Exploiting horizontal and vertical concurrency via the HPSm microprocessor. MICRO 1987: 154-161 |
4 | EE | James E. Wilson, Stephen W. Melvin, Michael Shebanow, Wen-mei W. Hwu, Yale N. Patt: On tuning the microarchitecture of an HPS implementation of the VAX. MICRO 1987: 162-167 |
3 | | Wen-mei W. Hwu, Yale N. Patt: Checkpoint Repair for High-Performance Out-of-Order Execution Machines. IEEE Trans. Computers 36(12): 1496-1514 (1987) |
1986 | ||
2 | | Yale N. Patt, Wen-mei W. Hwu, Stephen W. Melvin, Michael Shebanow, Chein Chen, Jiajuin Wei: Experiments with HPS, a Restricted Data Flow Microarchitecture for High Performance Computers. COMPCON 1986: 254-258 |
1 | | Wen-mei W. Hwu, Yale N. Patt: HPSm, a High Performance Restricted Data Flow Architecture Having Minimal Functionality. ISCA 1986: 297-306 |