2007 | ||
---|---|---|
26 | EE | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient search-space pruning for integrated fusion and tiling transformations. Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) |
2006 | ||
25 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distrib. Comput. 66(5): 659-673 (2006) |
24 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Layout transformation support for the disk resident arrays framework. The Journal of Supercomputing 36(2): 153-170 (2006) |
2005 | ||
23 | EE | Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 |
22 | EE | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 |
21 | EE | Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan: Performance modeling and optimization of parallel out-of-core tensor contractions. PPOPP 2005: 266-276 |
2004 | ||
20 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Efficient Layout Transformation for Disk-Based Multidimensional Arrays. HiPC 2004: 386-398 |
19 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 |
18 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient parallel out-of-core matrix transposition. IJHPCN 2(2/3/4): 110-119 (2004) |
2003 | ||
17 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient Parallel Out-of-Core Matrix Transposition. CLUSTER 2003: 300-307 |
16 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 |
15 | EE | Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 |
14 | EE | Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 |
2002 | ||
13 | EE | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan: A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 |
12 | EE | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 |
11 | EE | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 |
10 | EE | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 1-10 |
2001 | ||
9 | EE | Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 |
1999 | ||
8 | | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Memory-Optimal Evaluation of Expression Trees Involving Large Objects. HiPC 1999: 103-110 |
7 | EE | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals. LCPC 1999: 350-364 |
6 | | Chi-Chung Lam, P. Sadayappan, Daniel Cociorva, Mebarek Alouani, John Wilkins: Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays. PPSC 1999 |
1997 | ||
5 | | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines. PPSC 1997 |
4 | | Chi-Chung Lam, Chua-Huang Huang, P. Sadayappan: Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori. J. Parallel Distrib. Comput. 43(1): 3-13 (1997) |
3 | | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters 7(2): 157-168 (1997) |
1996 | ||
2 | EE | Chi-Chung Lam: An efficient distributed channel allocation algorithm based on dynamic channel boundaries. ICNP 1996: 236-243 |
1 | | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution. LCPC 1996: 315-329 |