2009 |
65 | EE | Nathan R. Tallent,
John M. Mellor-Crummey:
Effective performance measurement and analysis of multithreaded applications.
PPOPP 2009: 229-240 |
2008 |
64 | EE | Gabriel Marin,
John M. Mellor-Crummey:
Pinpointing and Exploiting Opportunities for Enhancing Data Reuse.
ISPASS 2008: 115-126 |
63 | EE | John M. Mellor-Crummey:
Where will all the threads come from?
PPOPP 2008: 259-260 |
2007 |
62 | | Katherine A. Yelick,
John M. Mellor-Crummey:
Proceedings of the 12th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, PPOPP 2007, San Jose, California, USA, March 14-17, 2007
ACM 2007 |
61 | EE | Cristian Coarfa,
John M. Mellor-Crummey,
Nathan Froyd,
Yuri Dotsenko:
Scalability analysis of SPMD codes using expectations.
ICS 2007: 13-22 |
60 | EE | Gabriel Marin,
John M. Mellor-Crummey:
Application Insight Through Performance Modeling.
IPCCC 2007: 65-74 |
2006 |
59 | EE | Yuri Dotsenko,
Cristian Coarfa,
Luay Nakhleh,
John M. Mellor-Crummey,
Usman Roshan:
PRec-I-DCM3: a parallel framework for fast and accurate large-scale phylogeny reconstruction.
IJBRA 2(4): 407-419 (2006) |
58 | EE | Cristian Coarfa,
Yuri Dotsenko,
John M. Mellor-Crummey:
Experiences with Sweep3D implementations in Co-array Fortran.
The Journal of Supercomputing 36(2): 101-121 (2006) |
57 | EE | Apan Qasem,
Ken Kennedy,
John M. Mellor-Crummey:
Automatic tuning of whole applications using direct search and a performance-based transformation system.
The Journal of Supercomputing 36(2): 183-196 (2006) |
2005 |
56 | | Guohua Jin,
John M. Mellor-Crummey:
Space-filling Curve Generation: A Table-based Approach.
AMCS 2005: 40-46 |
55 | EE | Luay Nakhleh,
Guohua Jin,
Fengmei Zhao,
John M. Mellor-Crummey:
Reconstructing Phylogenetic Networks Using Maximum Parsimony.
CSB 2005: 93-102 |
54 | EE | Cristian Coarfa,
Yuri Dotsenko,
John M. Mellor-Crummey,
Luay Nakhleh,
Usman Roshan:
PRec-I-DCM3: A Parallel Framework for Fast and Accurate Large Scale Phylogeny Reconstruction.
ICPADS (2) 2005: 346-350 |
53 | EE | Nathan Froyd,
John M. Mellor-Crummey,
Robert J. Fowler:
Low-overhead call path profiling of unmodified, optimized code.
ICS 2005: 81-90 |
52 | EE | Daniel G. Chavarría-Miranda,
Guohua Jin,
John M. Mellor-Crummey:
COTS Clusters vs. the Earth Simulator: An Application Study Using IMPACT-3D.
IPDPS 2005 |
51 | EE | Michelle Mills Strout,
John M. Mellor-Crummey,
Paul D. Hovland:
Representation-independent program analysis.
PASTE 2005: 67-74 |
50 | EE | Daniel G. Chavarría-Miranda,
John M. Mellor-Crummey:
Effective communication coalescing for data-parallel applications.
PPOPP 2005: 14-25 |
49 | EE | Cristian Coarfa,
Yuri Dotsenko,
John M. Mellor-Crummey,
François Cantonnet,
Tarek A. El-Ghazawi,
Ashrujit Mohanti,
Yiyi Yao,
Daniel G. Chavarría-Miranda:
An evaluation of global address space languages: Co-array Fortran and Unified Parallel C.
PPOPP 2005: 36-47 |
48 | EE | Guohua Jin,
John M. Mellor-Crummey:
SFCGen: A framework for efficient generation of multi-dimensional space-filling curves by recursion.
ACM Trans. Math. Softw. 31(1): 120-148 (2005) |
47 | EE | Francine Berman,
Henri Casanova,
Andrew A. Chien,
Keith D. Cooper,
Holly Dail,
Anshuman Dasgupta,
W. Deng,
Jack Dongarra,
L. Johnsson,
Ken Kennedy,
Charles Koelbel,
B. Liu,
Xin Liu,
Anirban Mandal,
Gabriel Marin,
Mark Mazina,
John M. Mellor-Crummey,
Celso L. Mendes,
Alex Olugbile,
M. Patel,
Daniel A. Reed,
Zhiao Shi,
Otto Sievert,
Huaxia Xia,
Asim YarKhan:
New Grid Scheduling and Rescheduling Methods in the GrADS Project.
International Journal of Parallel Programming 33(2-3): 209-229 (2005) |
2004 |
46 | EE | Anirban Mandal,
Anshuman Dasgupta,
Ken Kennedy,
Mark Mazina,
Charles Koelbel,
Gabriel Marin,
Keith D. Cooper,
John M. Mellor-Crummey,
B. Liu,
L. Johnsson:
Scheduling workflow applications in GrADS.
CCGRID 2004: 790-797 |
45 | EE | Yuri Dotsenko,
Cristian Coarfa,
John M. Mellor-Crummey:
A Multi-Platform Co-Array Fortran Compiler.
IEEE PACT 2004: 29-40 |
44 | EE | Keith D. Cooper,
Anshuman Dasgupta,
Ken Kennedy,
Charles Koelbel,
Anirban Mandal,
Gabriel Marin,
Mark Mazina,
John M. Mellor-Crummey,
Francine Berman,
Henri Casanova,
Andrew A. Chien,
Holly Dail,
Xin Liu,
Alex Olugbile,
Otto Sievert,
Huaxia Xia,
L. Johnsson,
B. Liu,
M. Patel,
Daniel A. Reed,
W. Deng,
Celso L. Mendes,
Zhiao Shi,
Asim YarKhan,
Jack Dongarra:
New Grid Scheduling and Rescheduling Methods in the GrADS Project.
IPDPS Next Generation Software Program - NSFNGS - PI Workshop 2004 |
43 | EE | Yuri Dotsenko,
Cristian Coarfa,
John M. Mellor-Crummey,
Daniel G. Chavarría-Miranda:
Experiences with Co-array Fortran on Hardware Shared Memory Platforms.
LCPC 2004: 332-347 |
42 | EE | Gabriel Marin,
John M. Mellor-Crummey:
Cross-architecture performance predictions for scientific applications using parameterized models.
SIGMETRICS 2004: 2-13 |
2003 |
41 | EE | Cristian Coarfa,
Yuri Dotsenko,
Jason Eckhardt,
John M. Mellor-Crummey:
Co-array Fortran Performance and Potential: An NPB Experimental Study.
LCPC 2003: 177-193 |
40 | EE | Daniel G. Chavarría-Miranda,
John M. Mellor-Crummey:
An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications.
J. Instruction-Level Parallelism 5 (2003) |
39 | EE | Alain Darte,
John M. Mellor-Crummey,
Robert J. Fowler,
Daniel G. Chavarría-Miranda:
Generalized multipartitioning of multi-dimensional arrays for parallelizing line-sweep computations.
J. Parallel Distrib. Comput. 63(9): 887-911 (2003) |
2002 |
38 | EE | Guohua Jin,
John M. Mellor-Crummey:
Experiences tuning SMG98: a semicoarsening multigrid benchmark based on the hypre library.
ICS 2002: 305-314 |
37 | EE | Daniel G. Chavarría-Miranda,
John M. Mellor-Crummey:
An Evaluation of Data-Parallel Compiler Support for Line-Sweep Applications.
IEEE PACT 2002: 7-17 |
36 | EE | Daniel G. Chavarría-Miranda,
Alain Darte,
Robert J. Fowler,
John M. Mellor-Crummey:
Generalized Multipartitioning for Multi-Dimensional Arrays.
IPDPS 2002 |
35 | EE | Ken Kennedy,
Mark Mazina,
John M. Mellor-Crummey,
Keith D. Cooper,
Linda Torczon,
Francine Berman,
Andrew A. Chien,
Holly Dail,
Otto Sievert,
Dave Angulo,
Ian T. Foster,
Ruth A. Aydt,
Daniel A. Reed,
Dennis Gannon,
S. Lennart Johnsson,
Carl Kesselman,
Jack Dongarra,
Sathish S. Vadhiyar,
Richard Wolski:
Toward a Framework for Preparing and Executing Adaptive Grid Programs.
IPDPS 2002 |
34 | | John M. Mellor-Crummey,
Vikram S. Adve,
Bradley Broom,
Daniel G. Chavarría-Miranda,
Robert J. Fowler,
Guohua Jin,
Ken Kennedy,
Qing Yi:
Advanced optimization strategies in the Rice dHPF compiler.
Concurrency and Computation: Practice and Experience 14(8-9): 741-767 (2002) |
33 | EE | John M. Mellor-Crummey,
Robert J. Fowler,
Gabriel Marin,
Nathan R. Tallent:
HPCVIEW: A Tool for Top-down Analysis of Node Performance.
The Journal of Supercomputing 23(1): 81-104 (2002) |
2001 |
32 | EE | Vikram S. Adve,
John M. Mellor-Crummey:
Advanced Code Generation for High Performance Fortran.
Compiler Optimizations for Scalable Parallel Systems Languages 2001: 553-596 |
31 | EE | Daniel G. Chavarría-Miranda,
John M. Mellor-Crummey,
Trushar Sarang:
Data-Parallel Compiler Support for Multipartitioning.
Euro-Par 2001: 241-253 |
30 | EE | John M. Mellor-Crummey,
Robert J. Fowler,
David B. Whalley:
Tools for application-oriented performance tuning.
ICS 2001: 154-165 |
29 | EE | Guohua Jin,
John M. Mellor-Crummey,
Robert J. Fowler:
Increasing temporal locality with skewing and recursive blocking.
SC 2001: 43 |
28 | EE | John M. Mellor-Crummey,
Robert J. Fowler,
David B. Whalley:
On providing useful information for analyzing and tuning applications.
SIGMETRICS/Performance 2001: 332-333 |
27 | | John M. Mellor-Crummey,
David B. Whalley,
Ken Kennedy:
Improving Memory Hierarchy Performance for Irregular Applications Using Data and Computation Reorderings.
International Journal of Parallel Programming 29(3): 217-247 (2001) |
26 | EE | Ken Kennedy,
Bradley Broom,
Keith D. Cooper,
Jack Dongarra,
Robert J. Fowler,
Dennis Gannon,
S. Lennart Johnsson,
John M. Mellor-Crummey,
Linda Torczon:
Telescoping Languages: A Strategy for Automatic Generation of Scientific Problem-Solving Systems from Annotated Libraries.
J. Parallel Distrib. Comput. 61(12): 1803-1826 (2001) |
2000 |
25 | EE | Kai Zhang,
John M. Mellor-Crummey,
Robert J. Fowler:
Compilation and Runtime-Optimizations for Software Distributed Shared Memory.
LCR 2000: 182-191 |
24 | EE | Daniel G. Chavarría-Miranda,
John M. Mellor-Crummey:
Toward Compiler Support for Scalable Parallelism Using Multipartitioning.
LCR 2000: 272-284 |
1999 |
23 | EE | John M. Mellor-Crummey,
David B. Whalley,
Ken Kennedy:
Improving memory hierarchy performance for irregular applications.
International Conference on Supercomputing 1999: 425-433 |
22 | EE | Collin McCurdy,
John M. Mellor-Crummey:
An Evaluation of Computing Paradigms for N-Body Simulations on Distributed Memory Architectures.
PPOPP 1999: 25-36 |
1998 |
21 | EE | Bo Lu,
John M. Mellor-Crummey:
Compiler-Optimization of Implicit Reductions for Distributed Memory Multiprocessors.
IPPS/SPDP 1998: 42-51 |
20 | | Vikram S. Adve,
John M. Mellor-Crummey:
Using Integer Sets for Data-Parallel Program Analysis and Optimization.
PLDI 1998: 186-198 |
19 | | John M. Mellor-Crummey,
Vikram S. Adve:
Simplifying Control Flow in Compiler-Generated Parallel Code.
International Journal of Parallel Programming 26(5): 613-638 (1998) |
1997 |
18 | | John M. Mellor-Crummey,
Vikram S. Adve:
Simplifying Control Flow in Compiler-Generated Parallel Code.
LCPC 1997: 235-239 |
1995 |
17 | | Ken Kennedy,
John M. Mellor-Crummey,
Gerald Roth:
Optimizing Fortran 90 Shift Operations on Distributed-Memory Multicomputers.
LCPC 1995: 161-175 |
16 | EE | Vikram S. Adve,
John M. Mellor-Crummey,
Mark Anderson,
Ken Kennedy,
Jhy-Chun Wang,
Daniel A. Reed:
An Integrated Compilation and Performance Analysis Environment for Data Parallel Programs.
SC 1995 |
1994 |
15 | EE | Seema Hiranandani,
Ken Kennedy,
John M. Mellor-Crummey,
Ajay Sethi:
Compilation techniques for block-cyclic distributions.
International Conference on Supercomputing 1994: 392-403 |
1993 |
14 | | Mary W. Hall,
John M. Mellor-Crummey,
Alan Carle,
René G. Rodríguez:
FIAT: A Framework for Interprocedural Analysis and Transformation.
LCPC 1993: 522-545 |
13 | | John M. Mellor-Crummey:
Compile-Time Support for Efficient Data Race Detection in Shared-Memory Parallel Programs.
Workshop on Parallel and Distributed Debugging 1993: 129-139 |
1992 |
12 | EE | Ervan Darnell,
John M. Mellor-Crummey,
Ken Kennedy:
Automatic software cache coherence through vectorization.
ICS 1992: 129-138 |
1991 |
11 | | John M. Mellor-Crummey,
Michael L. Scott:
Synchronization without Contention.
ASPLOS 1991: 269-278 |
10 | | John M. Mellor-Crummey,
Michael L. Scott:
Scalable Reader-Writer Synchronization for Shared-Memory Multiprocessors.
PPOPP 1991: 106-113 |
9 | EE | John M. Mellor-Crummey:
On-the-fly detection of data races for programs with nested fork-join parallelism.
SC 1991: 24-33 |
8 | EE | John M. Mellor-Crummey,
Michael L. Scott:
Algorithms for Scalable Synchronization on Shared-Memory Multiprocessors.
ACM Trans. Comput. Syst. 9(1): 21-65 (1991) |
1990 |
7 | EE | Robert Hood,
Ken Kennedy,
John M. Mellor-Crummey:
Parallel program debugging with on-the-fly anomaly detection.
SC 1990: 74-81 |
6 | | Thomas J. LeBlanc,
John M. Mellor-Crummey,
Robert J. Fowler:
Analyzing Parallel Program Executions Using Multiple Views.
J. Parallel Distrib. Comput. 9(2): 203-217 (1990) |
1989 |
5 | | John M. Mellor-Crummey,
Thomas J. LeBlanc:
A Software Instruction Counter.
ASPLOS 1989: 78-86 |
4 | | Thomas J. LeBlanc,
John M. Mellor-Crummey,
Neal M. Gafter,
Lawrence A. Crowl,
Peter C. Dibble:
The Elmwood Multiprocessor Operating System.
Softw., Pract. Exper. 19(11): 1029-1055 (1989) |
1988 |
3 | | John M. Mellor-Crummey:
Experience with the BBN Butterfly.
COMPCON 1988: 101-105 |
2 | | Robert J. Fowler,
Thomas J. LeBlanc,
John M. Mellor-Crummey:
An Integrated Approach to Parallel Program Debugging and Performance Analysis of Large-Scale Multiprocessors.
Workshop on Parallel and Distributed Debugging 1988: 163-173 |
1987 |
1 | | Thomas J. LeBlanc,
John M. Mellor-Crummey:
Debugging Parallel Programs with Instant Replay.
IEEE Trans. Computers 36(4): 471-482 (1987) |