2007 |
26 | EE | Xiaoyang Gao,
Sriram Krishnamoorthy,
Swarup Kumar Sahoo,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Efficient search-space pruning for integrated fusion and tiling transformations.
Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) |
2006 |
25 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
J. Ramanujam,
P. Sadayappan,
Venkatesh Choppella:
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver.
J. Parallel Distrib. Comput. 66(5): 659-673 (2006) |
24 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
Jarek Nieplocha,
P. Sadayappan:
Layout transformation support for the disk resident arrays framework.
The Journal of Supercomputing 36(2): 153-170 (2006) |
2005 |
23 | EE | Albert Hartono,
Alexander Sibiryakov,
Marcel Nooijen,
Gerald Baumgartner,
David E. Bernholdt,
So Hirata,
Chi-Chung Lam,
Russell M. Pitzer,
J. Ramanujam,
P. Sadayappan:
Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations.
International Conference on Computational Science (1) 2005: 155-164 |
22 | EE | Xiaoyang Gao,
Sriram Krishnamoorthy,
Swarup Kumar Sahoo,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations.
LCPC 2005: 215-229 |
21 | EE | Xiaoyang Gao,
Swarup Kumar Sahoo,
Chi-Chung Lam,
J. Ramanujam,
Qingda Lu,
Gerald Baumgartner,
P. Sadayappan:
Performance modeling and optimization of parallel out-of-core tensor contractions.
PPOPP 2005: 266-276 |
2004 |
20 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
Jarek Nieplocha,
P. Sadayappan:
Efficient Layout Transformation for Disk-Based Multidimensional Arrays.
HiPC 2004: 386-398 |
19 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
J. Ramanujam,
P. Sadayappan,
Venkatesh Choppella:
Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver.
IPDPS 2004 |
18 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan:
Efficient parallel out-of-core matrix transposition.
IJHPCN 2(2/3/4): 110-119 (2004) |
2003 |
17 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan:
Efficient Parallel Out-of-Core Matrix Transposition.
CLUSTER 2003: 300-307 |
16 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
David E. Bernholdt,
Venkatesh Choppella:
Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms.
HiPC 2003: 406-417 |
15 | EE | Daniel Cociorva,
Xiaoyang Gao,
Sandhya Krishnan,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam:
Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints.
IPDPS 2003: 37 |
14 | EE | Alina Bibireata,
Sandhya Krishnan,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
David E. Bernholdt,
Venkatesh Choppella:
Memory-Constrained Data Locality Optimization for Tensor Contractions.
LCPC 2003: 93-108 |
2002 |
13 | EE | Gerald Baumgartner,
David E. Bernholdt,
Daniel Cociorva,
Chi-Chung Lam,
J. Ramanujam,
Robert J. Harrison,
Marcel Nooijen,
P. Sadayappan:
A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs.
IPDPS 2002 |
12 | EE | Daniel Cociorva,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam:
Memory-Constrained Communication Minimization for a Class of Array Computations.
LCPC 2002: 1-15 |
11 | EE | Daniel Cociorva,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
Marcel Nooijen,
David E. Bernholdt,
Robert J. Harrison:
Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations.
PLDI 2002: 177-186 |
10 | EE | Gerald Baumgartner,
David E. Bernholdt,
Daniel Cociorva,
Robert J. Harrison,
So Hirata,
Chi-Chung Lam,
Marcel Nooijen,
Russell M. Pitzer,
J. Ramanujam,
P. Sadayappan:
A high-level approach to synthesis of high-performance codes for quantum chemistry.
SC 2002: 1-10 |
2001 |
9 | EE | Daniel Cociorva,
J. W. Wilkins,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Loop optimization for a class of memory-constrained computations.
ICS 2001: 103-113 |
1999 |
8 | | Chi-Chung Lam,
Daniel Cociorva,
Gerald Baumgartner,
P. Sadayappan:
Memory-Optimal Evaluation of Expression Trees Involving Large Objects.
HiPC 1999: 103-110 |
7 | EE | Chi-Chung Lam,
Daniel Cociorva,
Gerald Baumgartner,
P. Sadayappan:
Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals.
LCPC 1999: 350-364 |
6 | | Chi-Chung Lam,
P. Sadayappan,
Daniel Cociorva,
Mebarek Alouani,
John Wilkins:
Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays.
PPSC 1999 |
1997 |
5 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines.
PPSC 1997 |
4 | | Chi-Chung Lam,
Chua-Huang Huang,
P. Sadayappan:
Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori.
J. Parallel Distrib. Comput. 43(1): 3-13 (1997) |
3 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution.
Parallel Processing Letters 7(2): 157-168 (1997) |
1996 |
2 | EE | Chi-Chung Lam:
An efficient distributed channel allocation algorithm based on dynamic channel boundaries.
ICNP 1996: 236-243 |
1 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution.
LCPC 1996: 315-329 |