2003 |
92 | EE | Mainak Chaudhuri,
Mark Heinrich,
Chris Holt,
Jaswinder Pal Singh,
Edward Rothberg,
John L. Hennessy:
Latency, Occupancy, and Bandwidth in DSM Multiprocessors: A Performance Evaluation.
IEEE Trans. Computers 52(7): 862-880 (2003) |
2000 |
91 | EE | David Ofelt,
John L. Hennessy:
Efficient performance prediction for modern microprocessors.
SIGMETRICS 2000: 229-239 |
1999 |
90 | | John L. Hennessy:
The Future of Systems Research.
IEEE Computer 32(8): 27-33 (1999) |
89 | EE | Mark Heinrich,
Vijayaraghavan Soundararajan,
John L. Hennessy,
Anoop Gupta:
A Quantitative Analysis of the Performance and Scalability of Distributed Shared Memory.
IEEE Trans. Computers 48(2): 205-217 (1999) |
1998 |
88 | EE | Anant Agarwal,
Richard Simoni,
John L. Hennessy,
Mark Horowitz:
An Evaluation of Directory Schemes for Cache Coherence.
25 Years ISCA: Retrospectives and Reprints 1998: 353-362 |
87 | EE | Kourosh Gharachorloo,
Daniel Lenoski,
James Laudon,
Phillip B. Gibbons,
Anoop Gupta,
John L. Hennessy:
Memory Consistency and Event Ordering in Scalable Shared-Memory Multiprocessors.
25 Years ISCA: Retrospectives and Reprints 1998: 376-387 |
86 | EE | Daniel Lenoski,
James Laudon,
Truman Joe,
David Nakahira,
Luis Stevens,
Anoop Gupta,
John L. Hennessy:
The DASH Prototype: Implementation and Performance.
25 Years ISCA: Retrospectives and Reprints 1998: 418-429 |
85 | EE | Jeffrey Kuskin,
David Ofelt,
Mark Heinrich,
John Heinlein,
Richard Simoni,
Kourosh Gharachorloo,
John Chapin,
David Nakahira,
Joel Baxter,
Mark Horowitz,
Anoop Gupta,
Mendel Rosenblum,
John L. Hennessy:
The Stanford FLASH Multiprocessor.
25 Years ISCA: Retrospectives and Reprints 1998: 485-496 |
84 | EE | John L. Hennessy:
Retrospective: Evaluation of Directory Schemes for Cache Coherence.
25 Years ISCA: Retrospectives and Reprints 1998: 61-62 |
83 | EE | Vijayaraghavan Soundararajan,
Mark Heinrich,
Ben Verghese,
Kourosh Gharachorloo,
Anoop Gupta,
John L. Hennessy:
Flexible Use of Memory for Replication/Migration in Cache-Coherent DSM Multiprocessors.
ISCA 1998: 342-355 |
1997 |
82 | | David A. Patterson,
John L. Hennessy:
Computer Organization & Design: The Hardware/Software Interface, Second Edition
Morgan Kaufmann 1997 |
81 | EE | Radhika Thekkath,
Amit Pal Singh,
Jaswinder Pal Singh,
Susan John,
John L. Hennessy:
An Evaluation of a Commercial CC-NUMA Architecture - The CONVEX Exemplar SPP1200.
IPPS 1997: 8-17 |
80 | | Ken Kennedy,
Charles F. Bender,
John W. D. Connolly,
John L. Hennessy,
Mary K. Vernon,
Larry Smarr:
A Nationwide Parallel Computing Environment.
Commun. ACM 40(11): 62-72 (1997) |
1996 |
79 | | John L. Hennessy,
David A. Patterson:
Computer Architecture: A Quantitative Approach, 2nd Edition
Morgan Kaufmann 1996 |
78 | | Andrew Erlichson,
Neal Nuckolls,
Greg Chesson,
John L. Hennessy:
SoftFLASH: Analyzing the Performance of Clustered Distributed Virtual Shared Memory.
ASPLOS 1996: 210-220 |
77 | EE | Chris Holt,
Jaswinder Pal Singh,
John L. Hennessy:
Application and Architectural Bottlenecks in Large Scale Distributed Shared Memory Machines.
ISCA 1996: 134-145 |
1995 |
76 | | John L. Hennessy:
Position Paper.
PPSC 1995: 868-869 |
75 | EE | Jaswinder Pal Singh,
John L. Hennessy,
Anoop Gupta:
Implications of Hierarchical N-Body Methods for Multiprocessor Architectures.
ACM Trans. Comput. Syst. 13(2): 141-202 (1995) |
74 | | Jaswinder Pal Singh,
Chris Holt,
Takashi Totsuka,
Anoop Gupta,
John L. Hennessy:
Load Balancing and Data Locality in Adaptive Hierarchical N-Body Methods: Barnes-Hut, Fast Multipole, and Radiosity.
J. Parallel Distrib. Comput. 27(2): 118-141 (1995) |
1994 |
73 | | David A. Patterson,
John L. Hennessy:
Computer Organization & Design: The Hardware/Software Interface
Morgan Kaufmann 1994 |
72 | | Steven Cameron Woo,
Jaswinder Pal Singh,
John L. Hennessy:
The Performance Advantages of Integrating Block Data Transfer in Cache-Coherent Multiprocessors.
ASPLOS 1994: 219-229 |
71 | | Mark Heinrich,
Jeffrey Kuskin,
David Ofelt,
John Heinlein,
Joel Baxter,
Jaswinder Pal Singh,
Richard Simoni,
Kourosh Gharachorloo,
David Nakahira,
Mark Horowitz,
Anoop Gupta,
Mendel Rosenblum,
John L. Hennessy:
The Performance Impact of Flexibility in the Stanford FLASH Multiprocessor.
ASPLOS 1994: 274-285 |
70 | | Jeffrey Kuskin,
David Ofelt,
Mark Heinrich,
John Heinlein,
Richard Simoni,
Kourosh Gharachorloo,
John Chapin,
David Nakahira,
Joel Baxter,
Mark Horowitz,
Anoop Gupta,
Mendel Rosenblum,
John L. Hennessy:
The Stanford FLASH Multiprocessor.
ISCA 1994: 302-313 |
69 | | Truman Joe,
John L. Hennessy:
Evaluating the Memory Overhead Required for COMA Architectures.
ISCA 1994: 82-93 |
68 | | Rohit Chandra,
Anoop Gupta,
John L. Hennessy:
COOL: An Object-Based Language for Parallel Programming.
IEEE Computer 27(8): 13-26 (1994) |
67 | | Josep Torrellas,
Monica S. Lam,
John L. Hennessy:
False Sharing and Spatial Locality in Multiprocessor Caches.
IEEE Trans. Computers 43(6): 651-663 (1994) |
66 | | Robert P. Wilson,
Robert S. French,
Christopher S. Wilson,
Saman P. Amarasinghe,
Jennifer-Ann M. Anderson,
Steven W. K. Tjiang,
Shih-Wei Liao,
Chau-Wen Tseng,
Mary W. Hall,
Monica S. Lam,
John L. Hennessy:
SUIF: An Infrastructure for Research on Parallelizing and Optimizing Compilers.
SIGPLAN Notices 29(12): 31-37 (1994) |
1993 |
65 | | Rohit Chandra,
Anoop Gupta,
John L. Hennessy:
Data Locality and Load Balancing in COOL.
PPOPP 1993: 249-259 |
64 | EE | Jaswinder Pal Singh,
Truman Joe,
Anoop Gupta,
John L. Hennessy:
An empirical comparison of the Kendall Square Research KSR-1 and Stanford DASH multiprocessors.
SC 1993: 214-225 |
63 | EE | Jaswinder Pal Singh,
Chris Holt,
John L. Hennessy,
Anoop Gupta:
A parallel adaptive fast multipole method.
SC 1993: 54-65 |
62 | | Stephen R. Goldschmidt,
John L. Hennessy:
The Accuracy of Trace-Driven Simulations of Multiprocessors.
SIGMETRICS 1993: 146-157 |
61 | | Jaswinder Pal Singh,
John L. Hennessy,
Anoop Gupta:
Scaling Parallel Programs for Multiprocessors: Methodology and Examples.
IEEE Computer 26(7): 42-50 (1993) |
60 | EE | Aaron J. Goldberg,
John L. Hennessy:
Mtool: An Integrated System for Performance Debugging Shared Memory Multiprocessor Applications.
IEEE Trans. Parallel Distrib. Syst. 4(1): 28-40 (1993) |
59 | EE | Daniel Lenoski,
James Laudon,
Truman Joe,
David Nakahira,
Luis Stevens,
Anoop Gupta,
John L. Hennessy:
The DASH Prototype: Logic Overhead and Performance.
IEEE Trans. Parallel Distrib. Syst. 4(1): 41-61 (1993) |
58 | | Peter Schnorf,
Mahadevan Ganapathi,
John L. Hennessy:
Compile-time Copy Elimination.
Softw., Pract. Exper. 23(11): 1175-1200 (1993) |
1992 |
57 | | Josep Torrellas,
Anoop Gupta,
John L. Hennessy:
Characterizing the Caching and Synchronization Performance of a Multiprocessor Operating System.
ASPLOS 1992: 162-174 |
56 | | Kourosh Gharachorloo,
Anoop Gupta,
John L. Hennessy:
Hiding Memory Latency using Dynamic Scheduling in Shared-Memory Multiprocessors.
ISCA 1992: 22-33 |
55 | | Daniel Lenoski,
James Laudon,
Truman Joe,
David Nakahira,
Luis Stevens,
Anoop Gupta,
John L. Hennessy:
The DASH Prototype: Implementation and Performance.
ISCA 1992: 92-103 |
54 | | Steven W. K. Tjiang,
John L. Hennessy:
Sharlit - A Tool for Building Optimizers.
PLDI 1992: 82-93 |
53 | | Daniel Lenoski,
James Laudon,
Kourosh Gharachorloo,
Wolf-Dietrich Weber,
Anoop Gupta,
John L. Hennessy,
Mark Horowitz,
Monica S. Lam:
The Stanford Dash Multiprocessor.
IEEE Computer 25(3): 63-79 (1992) |
52 | | Jaswinder Pal Singh,
John L. Hennessy:
Finding and Exploiting Parallelism in an Ocean Simulation Program: Experience, Results, and Implications.
J. Parallel Distrib. Comput. 15(1): 27-48 (1992) |
51 | | Kourosh Gharachorloo,
Sarita V. Adve,
Anoop Gupta,
John L. Hennessy,
Mark D. Hill:
Programming for Different Memory Consistency Models.
J. Parallel Distrib. Comput. 15(4): 399-407 (1992) |
1991 |
50 | | Kourosh Gharachorloo,
Anoop Gupta,
John L. Hennessy:
Performance Evaluation of Memory Consistency Models for Shared Memory Multiprocessors.
ASPLOS 1991: 245-257 |
49 | | Kourosh Gharachorloo,
Anoop Gupta,
John L. Hennessy:
Two Techniques to Enhance the Performance of Memory Consistency Models.
ICPP (1) 1991: 355-364 |
48 | | Aaron J. Goldberg,
John L. Hennessy:
MTOOL: A Method for Isolating Memory Bottlenecks in Shared Memory Multiprocessor Programs.
ICPP (2) 1991: 251-257 |
47 | | Helen Davis,
Stephen R. Goldschmidt,
John L. Hennessy:
Multiprocessor Simulation and Tracing Using Tango.
ICPP (2) 1991: 99-107 |
46 | EE | Anoop Gupta,
John L. Hennessy,
Kourosh Gharachorloo,
Todd C. Mowry,
Wolf-Dietrich Weber:
Comparative Evaluation of Latency Reducing and Tolerating Techniques.
ISCA 1991: 254-263 |
45 | | Steven W. K. Tjiang,
Michael E. Wolf,
Monica S. Lam,
K. Pieper,
John L. Hennessy:
Integrating Scalar Optimization and Parallelization.
LCPC 1991: 137-151 |
44 | | Dror E. Maydan,
John L. Hennessy,
Monica S. Lam:
Efficient and Exact Data Dependence Analysis.
PLDI 1991: 1-14 |
43 | EE | Aaron J. Goldberg,
John L. Hennessy:
Performance debugging shared memory multiprocessor programs with MTOOL.
SC 1991: 481-490 |
42 | | Aaron J. Goldberg,
John L. Hennessy:
MTOOL: A Method for Detecting Memory Bottlenecks.
SIGMETRICS 1991: 210-211 |
41 | | John L. Hennessy,
Norman P. Jouppi:
Computer Technology and Architecture: An Evolving Interaction.
IEEE Computer 24(9): 18-29 (1991) |
1990 |
40 | | John L. Hennessy,
David A. Patterson:
Computer Architecture: A Quantitative Approach.
Morgan Kaufmann 1990 |
39 | | Josep Torrellas,
John L. Hennessy:
Estimating the Performance Advantages of Relaxing Consistency in a Shared Memory Multiprocessor.
ICPP (1) 1990: 26-34 |
38 | | Josep Torrellas,
Monica S. Lam,
John L. Hennessy:
Shared Data Placement Optimizations to Reduce Multiprocessor Cache Miss Rates.
ICPP (2) 1990: 266-270 |
37 | | Daniel Lenoski,
James Laudon,
Kourosh Gharachorloo,
Anoop Gupta,
John L. Hennessy:
The Directory-Based Cache Coherence Protocol for the DASH Multiprocessor.
ISCA 1990: 148-159 |
36 | | Kourosh Gharachorloo,
Daniel Lenoski,
James Laudon,
Phillip B. Gibbons,
Anoop Gupta,
John L. Hennessy:
Memory Consistency and Event Ordering in Scalable Shared-Memory Multiprocessors.
ISCA 1990: 15-26 |
35 | EE | Josep Torrellas,
John L. Hennessy,
Thierry Weil:
Analysis of Critical Architectural and Program Parameters in a Hierarchical Shared Memory Multiprocessor.
SIGMETRICS 1990: 163-172 |
34 | EE | Fred C. Chow,
John L. Hennessy:
The Priority-Based Coloring Approach to Register Allocation.
ACM Trans. Program. Lang. Syst. 12(4): 501-536 (1990) |
33 | | Yigal Brandman,
Alon Orlitsky,
John L. Hennessy:
A Spectral Lower Bound Technique for the Size of Decision Trees and Two Level AND/OR Circuits.
IEEE Trans. Computers 39(2): 282-287 (1990) |
1989 |
32 | EE | Steven A. Przybylski,
Mark Horowitz,
John L. Hennessy:
Characteristics of Performance-Optimal Multi-Level Cache Hierarchies.
ISCA 1989: 114-121 |
31 | | K. Gopinath,
John L. Hennessy:
Copy Elimination in Functional Languages.
POPL 1989: 303-314 |
30 | EE | Anant Agarwal,
Mark Horowitz,
John L. Hennessy:
An Analytical Cache Model.
ACM Trans. Comput. Syst. 7(2): 184-215 (1989) |
29 | EE | Peter Steenkiste,
John L. Hennessy:
A Simple Interprocedural Register Allocation Algorithm and Its Effectiveness for Lisp.
ACM Trans. Program. Lang. Syst. 11(1): 1-32 (1989) |
1988 |
28 | | Anant Agarwal,
Richard Simoni,
John L. Hennessy,
Mark Horowitz:
An Evaluation of Directory Schemes for Cache Coherence.
ISCA 1988: 280-289 |
27 | | Steven A. Przybylski,
Mark Horowitz,
John L. Hennessy:
Performance Tradeoffs in Cache Design.
ISCA 1988: 290-298 |
26 | EE | Kourosh Gharachorloo,
Vivek Sarkar,
John L. Hennessy:
A Simple and Efficient Implementation Approach for Single Assignment Languages.
LISP and Functional Programming 1988: 259-268 |
25 | | Helen Davis,
John L. Hennessy:
Characterizing the Synchronization Behavior of Parallel Programs.
PPOPP/PPEALS 1988: 198-211 |
24 | EE | Thomas R. Gross,
John L. Hennessy,
Steven A. Przybylski,
Christopher Rowen:
Measurement and Evaluation of the MIPS Architecture and Processor.
ACM Trans. Comput. Syst. 6(3): 229-257 (1988) |
23 | EE | Anant Agarwal,
John L. Hennessy,
Mark Horowitz:
Cache Performance of Operating System and Multiprogramming Workloads.
ACM Trans. Comput. Syst. 6(4): 393-431 (1988) |
22 | | Peter Steenkiste,
John L. Hennessy:
Lisp on a Reduced-Instruction-Set Processor: Characterization and Optimization.
IEEE Computer 21(7): 34-45 (1988) |
1987 |
21 | | Peter Steenkiste,
John L. Hennessy:
Tags and Type Checking in Lisp: Hardware and Software Approaches.
ASPLOS 1987: 50-59 |
1986 |
20 | | Scott McFarling,
John L. Hennessy:
Reducing the Cost of Branches.
ISCA 1986: 396-403 |
19 | | Peter Steenkiste,
John L. Hennessy:
LISP on a Reduced-Instruction-Set-Processor.
LISP and Functional Programming 1986: 192-201 |
18 | | Vivek Sarkar,
John L. Hennessy:
Partitioning Parallel Programs for Macro-Dataflow.
LISP and Functional Programming 1986: 202-211 |
17 | EE | Vivek Sarkar,
John L. Hennessy:
Compile-time partitioning and scheduling of parallel programs.
SIGPLAN Symposium on Compiler Construction 1986: 17-26 |
1985 |
16 | EE | Christopher Rowen,
John L. Hennessy:
SWAMI: a flexible logic implementation system.
DAC 1985: 169-175 |
1984 |
15 | EE | Fred C. Chow,
John L. Hennessy:
Register allocation by priority-based coloring (with retrospective)
Best of PLDI 1984: 91-103 |
14 | EE | Fred C. Chow,
John L. Hennessy:
Register allocation by priority-based coloring.
SIGPLAN Symposium on Compiler Construction 1984: 222-232 |
13 | | John L. Hennessy:
VLSI Processor Architecture.
IEEE Trans. Computers 33(12): 1221-1246 (1984) |
1983 |
12 | EE | John L. Hennessy,
Thomas R. Gross:
Postpass Code Optimization of Pipeline Constraints.
ACM Trans. Program. Lang. Syst. 5(3): 422-448 (1983) |
1982 |
11 | | John L. Hennessy,
Norman P. Jouppi,
Forest Baskett,
Thomas R. Gross,
John Gill:
Hardware/Software Tradeoffs for Increased Performance.
ASPLOS 1982: 2-11 |
10 | | John L. Hennessy,
Norman P. Jouppi,
John Gill,
Forest Baskett,
Alex Strong,
Thomas R. Gross,
Christopher Rowen,
Judson Leonard:
The MIPS Machine.
COMPCON 1982: 2-7 |
9 | | John L. Hennessy,
Thomas R. Gross:
Code Generation and Reorganization in the Presence of Pipeline Constraints.
POPL 1982: 120-127 |
8 | | Mahadevan Ganapathi,
Charles N. Fischer,
John L. Hennessy:
Retargetable Compiler Code Generation.
ACM Comput. Surv. 14(4): 573-592 (1982) |
7 | EE | John L. Hennessy:
Symbolic Debugging of Optimized Code.
ACM Trans. Program. Lang. Syst. 4(3): 323-344 (1982) |
6 | | John L. Hennessy,
Hilding Elmquist:
The Design and Implementation of Parametric Types in Pascal.
Softw., Pract. Exper. 12(2): 169-184 (1982) |
5 | | John L. Hennessy,
Noah Mendelsohn:
Compilation of the Pascal Case Statement.
Softw., Pract. Exper. 12(9): 879-882 (1982) |
1981 |
4 | | John L. Hennessy:
Program Optimization and Exception Handling.
POPL 1981: 200-206 |
3 | | Richard W. Carr,
John L. Hennessy:
WSClock - A Simple and Effective Algorithm for Virtual Memory Management.
SOSP 1981: 87-95 |
2 | | John L. Hennessy,
Richard B. Kieburtz:
The Formal Definition of a Real-Time Language.
Acta Inf. 16: 309-345 (1981) |
1980 |
1 | | Michael J. Flynn,
John L. Hennessy:
Parallelism and Representation Problems in Distributed Systems.
IEEE Trans. Computers 29(12): 1060-1086 (1980) |