2008 |
6 | EE | Jiang Lin,
Qingda Lu,
Xiaoning Ding,
Zhao Zhang,
Xiaodong Zhang,
P. Sadayappan:
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems.
HPCA 2008: 367-378 |
2006 |
5 | EE | Albert Hartono,
Qingda Lu,
Xiaoyang Gao,
Sriram Krishnamoorthy,
Marcel Nooijen,
Gerald Baumgartner,
David E. Bernholdt,
Venkatesh Choppella,
Russell M. Pitzer,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations.
International Conference on Computational Science (1) 2006: 267-275 |
4 | EE | Qingda Lu,
Sriram Krishnamoorthy,
P. Sadayappan:
Combining analytical and empirical approaches in tuning matrix transposition.
PACT 2006: 233-242 |
2005 |
3 | EE | Xiaoyang Gao,
Swarup Kumar Sahoo,
Chi-Chung Lam,
J. Ramanujam,
Qingda Lu,
Gerald Baumgartner,
P. Sadayappan:
Performance modeling and optimization of parallel out-of-core tensor contractions.
PPOPP 2005: 266-276 |
2004 |
2 | EE | Qingda Lu,
Jiesheng Wu,
Dhabaleswar K. Panda,
P. Sadayappan:
Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study.
ICPP Workshops 2004: 538-545 |
1 | EE | Qingda Lu,
Xiaoyang Gao,
Sriram Krishnamoorthy,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Empirical Performance-Model Driven Data Layout Optimization.
LCPC 2004: 72-86 |