2009 |
178 | EE | Muthu Manikandan Baskaran,
Nagavijayalakshmi Vydyanathan,
Uday Bondhugula,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors.
PPOPP 2009: 219-228 |
2008 |
177 | EE | P. Sadayappan,
Manish Parashar,
Ramamurthy Badrinath,
Viktor K. Prasanna:
High Performance Computing - HiPC 2008, 15th International Conference, Bangalore, India, December 17-20, 2008. Proceedings
Springer 2008 |
176 | EE | Uday Bondhugula,
Muthu Manikandan Baskaran,
Sriram Krishnamoorthy,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model.
CC 2008: 132-146 |
175 | EE | N. Desai,
Pavan Balaji,
P. Sadayappan,
Mohammad Islam:
Are nonblocking networks really needed for high-end-computing workloads?
CLUSTER 2008: 152-159 |
174 | EE | Nawab Ali,
Ananth Devulapalli,
Dennis Dalessandro,
Pete Wyckoff,
P. Sadayappan:
An OSD-based approach to managing directory operations in parallel file systems.
CLUSTER 2008: 175-184 |
173 | EE | Jiang Lin,
Qingda Lu,
Xiaoning Ding,
Zhao Zhang,
Xiaodong Zhang,
P. Sadayappan:
Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems.
HPCA 2008: 367-378 |
172 | EE | Gaurav Khanna,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz,
Rajkumar Kettimuthu,
Ian T. Foster:
Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP.
HPDC 2008: 225-226 |
171 | EE | Jarek Nieplocha,
Sriram Krishnamoorthy,
Marat Valiev,
Manojkumar Krishnan,
Bruce Palmer,
P. Sadayappan:
Integrated Data and Task Management for Scientific Applications.
ICCS (1) 2008: 20-31 |
170 | EE | Nagavijayalakshmi Vydyanathan,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows.
ICPP 2008: 254-261 |
169 | EE | James Dinan,
Sriram Krishnamoorthy,
D. Brian Larkins,
Jarek Nieplocha,
P. Sadayappan:
Scioto: A Framework for Global-View Task Parallelism.
ICPP 2008: 586-593 |
168 | EE | Muthu Manikandan Baskaran,
Uday Bondhugula,
Sriram Krishnamoorthy,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
A compiler framework for optimization of affine loop nests for gpgpus.
ICS 2008: 225-234 |
167 | EE | Gaurav Khanna,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
Rajkumar Kettimuthu,
P. Sadayappan,
Joel H. Saltz:
A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP.
IPDPS 2008: 1-12 |
166 | EE | Uday Bondhugula,
Muthu Manikandan Baskaran,
Albert Hartono,
Sriram Krishnamoorthy,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Towards effective automatic parallelization for multicore systems.
IPDPS 2008: 1-5 |
165 | EE | Uday Bondhugula,
Albert Hartono,
J. Ramanujam,
P. Sadayappan:
A practical automatic polyhedral parallelizer and locality optimizer.
PLDI 2008: 101-113 |
164 | EE | Muthu Manikandan Baskaran,
Uday Bondhugula,
Sriram Krishnamoorthy,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories.
PPOPP 2008: 1-10 |
163 | EE | Gaurav Khanna,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
Rajkumar Kettimuthu,
P. Sadayappan,
Ian T. Foster,
Joel H. Saltz:
Using overlays for efficient data transfer over shared wide-area networks.
SC 2008: 47 |
162 | EE | D. Brian Larkins,
James Dinan,
Sriram Krishnamoorthy,
Srinivasan Parthasarathy,
Atanas Rountev,
P. Sadayappan:
Global trees: a framework for linked data structures on distributed memory parallel systems.
SC 2008: 57 |
161 | EE | Aniruddha G. Shet,
P. Sadayappan,
David E. Bernholdt,
Jarek Nieplocha,
Vinod Tipparaju:
A framework for characterizing overlap of communication and computation in parallel applications.
Cluster Computing 11(1): 75-90 (2008) |
160 | EE | James Dinan,
Stephen Olivier,
Gerald Sabin,
Jan Prins,
P. Sadayappan,
Chau-Wen Tseng:
A message passing benchmark for unbalanced applications.
Simulation Modelling Practice and Theory 16(9): 1177-1189 (2008) |
2007 |
159 | EE | Sriram Krishnamoorthy,
Juan Piernas,
Vinod Tipparaju,
Jarek Nieplocha,
P. Sadayappan:
Non-collective parallel I/O for global address space programming models.
CLUSTER 2007: 41-49 |
158 | EE | Nagavijayalakshmi Vydyanathan,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
Toward Optimizing Latency Under Throughput Constraints for Application Workflows on Clusters.
Euro-Par 2007: 173-183 |
157 | EE | Gaurav Khanna,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
Scheduling File Transfers for Data-Intensive Jobs on Heterogeneous Clusters.
Euro-Par 2007: 214-223 |
156 | EE | Mohammad Islam,
Pavan Balaji,
Gerald Sabin,
P. Sadayappan:
Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling.
ICPP 2007: 42 |
155 | EE | Sriram Krishnamoorthy,
Ümit V. Çatalyürek,
Jarek Nieplocha,
Atanas Rountev,
P. Sadayappan:
A global address space framework for locality aware scheduling of block-sparse computations.
IPDPS 2007: 1-8 |
154 | EE | James Dinan,
Stephen Olivier,
Gerald Sabin,
Jan Prins,
P. Sadayappan,
Chau-Wen Tseng:
Dynamic Load Balancing of Unbalanced Computations Using Message Passing.
IPDPS 2007: 1-8 |
153 | EE | Sriram Krishnamoorthy,
Muthu Manikandan Baskaran,
Uday Bondhugula,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Effective automatic parallelization of stencil computations.
PLDI 2007: 235-244 |
152 | EE | Uday Bondhugula,
J. Ramanujam,
P. Sadayappan:
Automatic mapping of nested loops to FPGAS.
PPOPP 2007: 101-111 |
151 | EE | Ananth Devulapalli,
Dennis Dalessandro,
Pete Wyckoff,
Nawab Ali,
P. Sadayappan:
Integrating parallel file systems with object-based storage devices.
SC 2007: 27 |
150 | EE | Xiaoyang Gao,
Sriram Krishnamoorthy,
Swarup Kumar Sahoo,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Efficient search-space pruning for integrated fusion and tiling transformations.
Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) |
2006 |
149 | | Eduard Ayguadé,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers
Springer 2006 |
148 | EE | Aniruddha G. Shet,
P. Sadayappan,
David E. Bernholdt,
Jarek Nieplocha,
Vinod Tipparaju:
A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems.
CLUSTER 2006 |
147 | EE | Nagavijayalakshmi Vydyanathan,
Sriram Krishnamoorthy,
Gerald Sabin,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications.
CLUSTER 2006 |
146 | EE | Uday Bondhugula,
Ananth Devulapalli,
James Dinan,
Joseph Fernando,
Pete Wyckoff,
Eric Stahlberg,
P. Sadayappan:
Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths.
FCCM 2006: 152-164 |
145 | EE | Gaurav Khanna,
Nagavijayalakshmi Vydyanathan,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
Sriram Krishnamoorthy,
P. Sadayappan,
Joel H. Saltz:
Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O.
HPDC 2006: 241-252 |
144 | EE | Nagavijayalakshmi Vydyanathan,
Sriram Krishnamoorthy,
Gerald Sabin,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications.
ICPP 2006: 443-450 |
143 | EE | Sriram Krishnamoorthy,
Ümit V. Çatalyürek,
Jarek Nieplocha,
P. Sadayappan:
An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model.
IPDPS 2006 |
142 | EE | Sriram Krishnamoorthy,
Ümit V. Çatalyürek,
Jarek Nieplocha,
Atanas Rountev,
P. Sadayappan:
An extensible global address space framework with decoupled task and data abstractions.
IPDPS 2006 |
141 | EE | A. Allam,
J. Ramanujam,
Gerald Baumgartner,
P. Sadayappan:
Memory minimization for tensor contractions using integer linear programming.
IPDPS 2006 |
140 | EE | Uday Bondhugula,
Ananth Devulapalli,
Joseph Fernando,
Pete Wyckoff,
P. Sadayappan:
Parallel FPGA-based all-pairs shortest-paths in a directed graph.
IPDPS 2006 |
139 | EE | Albert Hartono,
Qingda Lu,
Xiaoyang Gao,
Sriram Krishnamoorthy,
Marcel Nooijen,
Gerald Baumgartner,
David E. Bernholdt,
Venkatesh Choppella,
Russell M. Pitzer,
J. Ramanujam,
Atanas Rountev,
P. Sadayappan:
Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations.
International Conference on Computational Science (1) 2006: 267-275 |
138 | EE | Gaurav Khanna,
Ümit V. Çatalyürek,
Tahsin M. Kurç,
P. Sadayappan,
Joel H. Saltz:
A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing.
JSSPP 2006: 141-160 |
137 | EE | Gerald Sabin,
Matthew Lang,
P. Sadayappan:
Moldable Parallel Job Scheduling Using Job Efficiency: An Iterative Approach.
JSSPP 2006: 94-114 |
136 | EE | Stephen Olivier,
Jun Huan,
Jinze Liu,
Jan Prins,
James Dinan,
P. Sadayappan,
Chau-Wen Tseng:
UTS: An Unbalanced Tree Search Benchmark.
LCPC 2006: 235-250 |
135 | EE | Qingda Lu,
Sriram Krishnamoorthy,
P. Sadayappan:
Combining analytical and empirical approaches in tuning matrix transposition.
PACT 2006: 233-242 |
134 | EE | Jarek Nieplocha,
Bruce Palmer,
Manojkumar Krishnan,
P. Sadayappan:
M12 - Overview of the global arrays parallel software development toolkit.
SC 2006: 226 |
133 | EE | Sriram Krishnamoorthy,
Ümit V. Çatalyürek,
Jarek Nieplocha,
Atanas Rountev,
P. Sadayappan:
Data management and query - Hypergraph partitioning for automatic memory hierarchy management.
SC 2006: 98 |
132 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
J. Ramanujam,
P. Sadayappan,
Venkatesh Choppella:
Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver.
J. Parallel Distrib. Comput. 66(5): 659-673 (2006) |
131 | EE | Christian Engelmann,
Stephen L. Scott,
David E. Bernholdt,
Narasimha Raju Gottumukkala,
Chokchai Leangsuksun,
Jyothish Varma,
Chao Wang,
Frank Mueller,
Aniruddha G. Shet,
P. Sadayappan:
MOLAR: adaptive runtime support for high-end computing operating and runtime systems.
Operating Systems Review 40(2): 63-72 (2006) |
130 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
Jarek Nieplocha,
P. Sadayappan:
Layout transformation support for the disk resident arrays framework.
The Journal of Supercomputing 36(2): 153-170 (2006) |
2005 |
129 | EE | Gaurav Khanna,
Nagavijayalakshmi Vydyanathan,
Tahsin M. Kurç,
Ümit V. Çatalyürek,
Pete Wyckoff,
Joel H. Saltz,
P. Sadayappan:
A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O.
CCGRID 2005: 792-799 |
128 | EE | Sriram Krishnamoorthy,
Jarek Nieplocha,
P. Sadayappan:
Data and Computation Abstractions for Dynamic and Irregular Computations.
HiPC 2005: 258-269 |
127 | EE | Vipin Chaudhary,
P. Sadayappan:
Message from the Chairs.
ICPP Workshops 2005: 282 |
126 | EE | Swarup Kumar Sahoo,
Rajkiran Panuganti,
Sriram Krishnamoorthy,
P. Sadayappan:
Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors.
IPDPS 2005 |
125 | EE | Albert Hartono,
Alexander Sibiryakov,
Marcel Nooijen,
Gerald Baumgartner,
David E. Bernholdt,
So Hirata,
Chi-Chung Lam,
Russell M. Pitzer,
J. Ramanujam,
P. Sadayappan:
Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations.
International Conference on Computational Science (1) 2005: 155-164 |
124 | EE | Gerald Sabin,
P. Sadayappan:
Unfairness Metrics for Space-Sharing Parallel Job Schedulers.
JSSPP 2005: 238-256 |
123 | EE | Xiaoyang Gao,
Sriram Krishnamoorthy,
Swarup Kumar Sahoo,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations.
LCPC 2005: 215-229 |
122 | EE | Xiaoyang Gao,
Swarup Kumar Sahoo,
Chi-Chung Lam,
J. Ramanujam,
Qingda Lu,
Gerald Baumgartner,
P. Sadayappan:
Performance modeling and optimization of parallel out-of-core tensor contractions.
PPOPP 2005: 266-276 |
121 | EE | Swarup Kumar Sahoo,
Sriram Krishnamoorthy,
Rajkiran Panuganti,
P. Sadayappan:
Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions.
SC 2005: 13 |
120 | EE | Rajkumar Kettimuthu,
Vijay Subramani,
Srividya Srinivasan,
Thiagaraja Gopalsamy,
Dhabaleswar K. Panda,
P. Sadayappan:
Selective preemption strategies for parallel job scheduling.
IJHPCN 3(2/3): 122-152 (2005) |
2004 |
119 | EE | Mohammad Islam,
Pavan Balaji,
P. Sadayappan,
Dhabaleswar K. Panda:
Towards provision of quality of service guarantees in job scheduling.
CLUSTER 2004: 245-254 |
118 | EE | Gerald Sabin,
V. Sahasrabudhe,
P. Sadayappan:
On fairness in distributed job scheduling across multiple sites.
CLUSTER 2004: 35-44 |
117 | EE | Nagavijayalakshmi Vydyanathan,
Gaurav Khanna,
Tahsin M. Kurç,
Ümit V. Çatalyürek,
Pete Wyckoff,
Joel H. Saltz,
P. Sadayappan:
Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O.
GRID 2004: 235-242 |
116 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
Jarek Nieplocha,
P. Sadayappan:
Efficient Layout Transformation for Disk-Based Multidimensional Arrays.
HiPC 2004: 386-398 |
115 | EE | Gerald Sabin,
Garima Kochhar,
P. Sadayappan:
Job Fairness in Non-Preemptive Job Scheduling.
ICPP 2004: 186-194 |
114 | EE | Vipin Chaudhary,
P. Sadayappan:
Message from the Chairs: International Workshop on Compile and Run Time Techniques for Parallel Computing.
ICPP Workshops 2004: 497 |
113 | EE | Qingda Lu,
Jiesheng Wu,
Dhabaleswar K. Panda,
P. Sadayappan:
Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study.
ICPP Workshops 2004: 538-545 |
112 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Chi-Chung Lam,
J. Ramanujam,
P. Sadayappan,
Venkatesh Choppella:
Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver.
IPDPS 2004 |
111 | EE | Qingda Lu,
Xiaoyang Gao,
Sriram Krishnamoorthy,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Empirical Performance-Model Driven Data Layout Optimization.
LCPC 2004: 72-86 |
110 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan:
Efficient parallel out-of-core matrix transposition.
IJHPCN 2(2/3/4): 110-119 (2004) |
109 | EE | Sudha Srinivasan,
Savitha Krishnamoorthy,
P. Sadayappan:
Robust scheduling of moldable parallel jobs.
IJHPCN 2(2/3/4): 120-132 (2004) |
2003 |
108 | EE | Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan:
Efficient Parallel Out-of-Core Matrix Transposition.
CLUSTER 2003: 300-307 |
107 | EE | Sudha Srinivasan,
Sriram Krishnamoorthy,
P. Sadayappan:
A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs.
CLUSTER 2003: 92-99 |
106 | EE | Sandhya Krishnan,
Sriram Krishnamoorthy,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
David E. Bernholdt,
Venkatesh Choppella:
Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms.
HiPC 2003: 406-417 |
105 | EE | Daniel Cociorva,
Xiaoyang Gao,
Sandhya Krishnan,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam:
Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints.
IPDPS 2003: 37 |
104 | EE | Mohammad Islam,
Pavan Balaji,
P. Sadayappan,
Dhabaleswar K. Panda:
QoPS: A QoS Based Scheme for Parallel Job Scheduling.
JSSPP 2003: 252-268 |
103 | EE | Gerald Sabin,
Rajkumar Kettimuthu,
Arun Rajan,
P. Sadayappan:
Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement.
JSSPP 2003: 87-104 |
102 | EE | Konstantin Berlin,
Jun Huan,
Mary Jacob,
Garima Kochhar,
Jan Prins,
William Pugh,
P. Sadayappan,
Jaime Spacco,
Chau-Wen Tseng:
Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures.
LCPC 2003: 194-208 |
101 | EE | Alina Bibireata,
Sandhya Krishnan,
Gerald Baumgartner,
Daniel Cociorva,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
David E. Bernholdt,
Venkatesh Choppella:
Memory-Constrained Data Locality Optimization for Tensor Contractions.
LCPC 2003: 93-108 |
2002 |
100 | EE | Vijay Subramani,
Rajkumar Kettimuthu,
Srividya Srinivasan,
Jeanette Johnston,
P. Sadayappan:
Selective Buddy Allocation for Scheduling Parallel Jobs on Clusters.
CLUSTER 2002: 107- |
99 | EE | Vijay Subramani,
Rajkumar Kettimuthu,
Srividya Srinivasan,
P. Sadayappan:
Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests.
HPDC 2002: 359- |
98 | EE | Srividya Srinivasan,
Vijay Subramani,
Rajkumar Kettimuthu,
Praveen Holenarsipur,
P. Sadayappan:
Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs.
HiPC 2002: 174-183 |
97 | EE | Thiagaraja Gopalsamy,
Mukesh Singhal,
Dhabaleswar K. Panda,
P. Sadayappan:
A Reliable Multicast Algorithm for Mobile Ad Hoc Networks.
ICDCS 2002: 563-570 |
96 | EE | P. Sadayappan:
Message from the Chair.
ICPP Workshops 2002: 495-498 |
95 | EE | Srividya Srinivasan,
Rajkumar Kettimuthu,
Vijay Subramani,
P. Sadayappan:
Characterization of Backfilling Strategies for Parallel Job Scheduling.
ICPP Workshops 2002: 514-522 |
94 | EE | Vipin Chaudhary,
P. Sadayappan:
Message from the Co-Chairs.
ICPP Workshops 2002: 547-550 |
93 | EE | Gerald Baumgartner,
David E. Bernholdt,
Daniel Cociorva,
Chi-Chung Lam,
J. Ramanujam,
Robert J. Harrison,
Marcel Nooijen,
P. Sadayappan:
A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs.
IPDPS 2002 |
92 | EE | Srividya Srinivasan,
Rajkumar Kettimuthu,
Vijay Subramani,
P. Sadayappan:
Selective Reservation Strategies for Backfill Job Scheduling.
JSSPP 2002: 55-71 |
91 | EE | Daniel Cociorva,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam:
Memory-Constrained Communication Minimization for a Class of Array Computations.
LCPC 2002: 1-15 |
90 | EE | Daniel Cociorva,
Gerald Baumgartner,
Chi-Chung Lam,
P. Sadayappan,
J. Ramanujam,
Marcel Nooijen,
David E. Bernholdt,
Robert J. Harrison:
Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations.
PLDI 2002: 177-186 |
89 | EE | Gerald Baumgartner,
David E. Bernholdt,
Daniel Cociorva,
Robert J. Harrison,
So Hirata,
Chi-Chung Lam,
Marcel Nooijen,
Russell M. Pitzer,
J. Ramanujam,
P. Sadayappan:
A high-level approach to synthesis of high-performance codes for quantum chemistry.
SC 2002: 1-10 |
2001 |
88 | EE | Daniel Cociorva,
J. W. Wilkins,
Gerald Baumgartner,
P. Sadayappan,
J. Ramanujam,
Marcel Nooijen,
David E. Bernholdt,
Robert J. Harrison:
Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization.
HiPC 2001: 237-248 |
87 | EE | Mohammad Banikazemi,
Jiuxing Liu,
Dhabaleswar K. Panda,
P. Sadayappan:
Implementing TreadMarksover VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation.
ICPP 2001: 167-174 |
86 | EE | Abhishek Gulati,
Dhabaleswar K. Panda,
P. Sadayappan,
Pete Wyckoff:
NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters.
ICPP 2001: 305-312 |
85 | EE | Daniel Cociorva,
J. W. Wilkins,
Chi-Chung Lam,
Gerald Baumgartner,
J. Ramanujam,
P. Sadayappan:
Loop optimization for a class of memory-constrained computations.
ICS 2001: 103-113 |
84 | | Darius Buntinas,
Dhabaleswar K. Panda,
P. Sadayappan:
Performance Benefits of NIC-Based Barrier on Myrinet/GM.
IPDPS 2001: 166 |
83 | | Mohammad Banikazemi,
Jiuxing Liu,
S. Kutlug,
P. Sadayappan,
H. Shah,
Dhabaleswar K. Panda:
VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations.
IPDPS 2001: 24 |
82 | | Darius Buntinas,
Dhabaleswar K. Panda,
P. Sadayappan:
Fast NIC-Based Barrier over Myrinet/GM.
IPDPS 2001: 52 |
81 | | Amit Singhal,
Mohammad Banikazemi,
P. Sadayappan,
Dhabaleswar K. Panda:
Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations.
IPDPS 2001: 71 |
2000 |
80 | | Vijay Moorthy,
Dhabaleswar K. Panda,
P. Sadayappan:
Fast Collective Communication Algorithms for Reflective Memory Network Clusters.
CANPC 2000: 100-114 |
79 | | Darius Buntinas,
Dhabaleswar K. Panda,
José Duato,
P. Sadayappan:
Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages.
CANPC 2000: 115-129 |
78 | EE | Praveen Holenarsipur,
Vladimir Yarmolenko,
José Duato,
Dhabaleswar K. Panda,
P. Sadayappan:
Characterization and enhancement of Static Mapping Heuristics for Heterogeneous Systems.
HiPC 2000: 37-48 |
77 | EE | P. Sadayappan:
Message from the Chair.
ICPP Workshops 2000: 391- |
76 | EE | Vladimir Yarmolenko,
José Duato,
Dhabaleswar K. Panda,
P. Sadayappan:
Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems.
ICPP Workshops 2000: 437- |
75 | EE | Arindam Paul,
Wu-chi Feng,
Dhabaleswar K. Panda,
P. Sadayappan:
Balancing Web Server Load for Adaptable Video Distribution.
ICPP Workshops 2000: 469- |
1999 |
74 | | Matthew G. Jacunski,
Vijay Moorthy,
Peter P. Ware,
Manoj Pillai,
Dhabaleswar K. Panda,
P. Sadayappan:
Low Latency Message-Passing for Reflective Memory Networks.
CANPC 1999: 211-224 |
73 | EE | Mohammad Banikazemi,
Jayanthi Sampathkumar,
Sandeep Prabhu,
Dhabaleswar K. Panda,
P. Sadayappan:
Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations.
Heterogeneous Computing Workshop 1999: 125- |
72 | | Chi-Chung Lam,
Daniel Cociorva,
Gerald Baumgartner,
P. Sadayappan:
Memory-Optimal Evaluation of Expression Trees Involving Large Objects.
HiPC 1999: 103-110 |
71 | EE | N. S. Sundar,
S. Jayanthi,
P. Sadayappan,
Miguel Visbal:
An Incremental Methodology for Parallelizing Legacy Stencil Codes on Message-Passing Computers.
ICPP 1999: 302-310 |
70 | EE | Vijay Moorthy,
Matthew G. Jacunski,
Manoj Pillai,
Peter P. Ware,
Dhabaleswar K. Panda,
Thomas W. Page Jr.,
P. Sadayappan,
V. Nagarajan,
Johns Daniel:
Low-Latency Message Passing on Workstation Clusters using SCRAMNet.
IPPS/SPDP 1999: 148-152 |
69 | EE | Matthew G. Jacunski,
P. Sadayappan,
Dhabaleswar K. Panda:
All-to-All Broadcast on Switch-Based Clusters of Workstations.
IPPS/SPDP 1999: 325-329 |
68 | EE | Chi-Chung Lam,
Daniel Cociorva,
Gerald Baumgartner,
P. Sadayappan:
Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals.
LCPC 1999: 350-364 |
67 | | Chi-Chung Lam,
P. Sadayappan,
Daniel Cociorva,
Mebarek Alouani,
John Wilkins:
Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays.
PPSC 1999 |
1998 |
66 | | Zhiyuan Li,
Pen-Chung Yew,
Siddhartha Chatterjee,
Chua-Huang Huang,
P. Sadayappan,
David C. Sehr:
Languages and Compilers for Parallel Computing, 10th International Workshop, LCPC'97, Minneapolis, Minnesota, USA, August 7-9, 1997, Proceedings
Springer 1998 |
65 | | Sandeep K. S. Gupta,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
A technique for overlapping computation and communication for block recursive algorithms.
Concurrency - Practice and Experience 10(2): 73-90 (1998) |
64 | | P. Sadayappan,
Fikret Erçal,
J. Ramanujam:
Partitioning Graphs on Message-Passing Machines by Pairwise Mincut.
Inf. Sci. 111(1-4): 223-237 (1998) |
1997 |
63 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines.
PPSC 1997 |
62 | | Chi-Chung Lam,
Chua-Huang Huang,
P. Sadayappan:
Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori.
J. Parallel Distrib. Comput. 43(1): 3-13 (1997) |
61 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution.
Parallel Processing Letters 7(2): 157-168 (1997) |
1996 |
60 | | Chua-Huang Huang,
P. Sadayappan,
Utpal Banerjee,
David Gelernter,
Alexandru Nicolau,
David A. Padua:
Languages and Compilers for Parallel Computing, 8th International Workshop, LCPC'95, Columbus, Ohio, USA, August 10-12, 1995, Proceedings
Springer 1996 |
59 | EE | N. S. Sundar,
D. N. Jayasimha,
Dhabaleswar K. Panda,
P. Sadayappan:
Hybrid Algorithms for Complete Exchange in 2D Meshes.
International Conference on Supercomputing 1996: 181-188 |
58 | | Chi-Chung Lam,
P. Sadayappan,
Rephael Wenger:
Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution.
LCPC 1996: 315-329 |
57 | EE | S. D. Kaushik,
Sanjay Sharma,
Chua-Huang Huang,
John R. Johnson,
Rodney W. Johnson,
P. Sadayappan:
An Algebraic Theory for Modeling Directt Interconnection Networks.
J. Inf. Sci. Eng. 12(1): 25-49 (1996) |
56 | | Sandeep K. S. Gupta,
S. D. Kaushik,
Chua-Huang Huang,
P. Sadayappan:
Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines.
J. Parallel Distrib. Comput. 32(2): 155-172 (1996) |
55 | | Sandeep K. S. Gupta,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms.
J. Parallel Distrib. Comput. 34(2): 137-153 (1996) |
54 | | S. D. Kaushik,
Chua-Huang Huang,
P. Sadayappan:
Efficient Index Set Generation for Compiling HPF Array Statements on Distributed-Memory Machines.
J. Parallel Distrib. Comput. 38(2): 237-247 (1996) |
53 | | Himanshu Gupta,
P. Sadayappan:
Communication-Efficient Matrix Multiplication on Hypercubes.
Parallel Computing 22(1): 75-99 (1996) |
1995 |
52 | EE | S. D. Kaushik,
Chua-Huang Huang,
J. Ramanujam,
P. Sadayappan:
Multi-phase array redistribution: modeling and evaluation.
IPPS 1995: 441-445 |
51 | | S. D. Kaushik,
Chua-Huang Huang,
P. Sadayappan:
Compiling Array Statements for Efficient Execution on Distributed-Memory Machines: Two-Level Mappings.
LCPC 1995: 209-223 |
50 | | Bharat Kumar,
Kalluri Eswar,
P. Sadayappan,
Chua-Huang Huang:
A Clustering Algorithm for Parallel Sparse Cholesky Factorization.
Parallel Processing Letters 5: 685-696 (1995) |
49 | EE | Bharat Kumar,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction.
Scientific Programming 4(4): 275-289 (1995) |
1994 |
48 | | Sandeep K. S. Gupta,
Chua-Huang Huang,
Rodney W. Johnson,
P. Sadayappan:
Communication-Efficient Implementation of Block Recursive Algorithms on Distributed-Memory Machines.
ICPADS 1994: 113-119 |
47 | | Alpesh Amin,
P. Sadayappan,
Murali Gudavalli:
A Clustered Reduced Communication Element by Element Preconditioned Conjugate Gradient Algorithm for Finite Element Computations.
IPPS 1994: 509-516 |
46 | EE | S. D. Kaushik,
Chua-Huang Huang,
Rodney W. Johnson,
P. Sadayappan:
An approach to communication-efficient data redistribution.
International Conference on Supercomputing 1994: 364-373 |
45 | EE | Bharat Kumar,
P. Sadayappan,
Chua-Huang Huang:
On sparse matrix reordering for parallel factorization.
International Conference on Supercomputing 1994: 431-438 |
44 | | S. D. Kaushik,
Chua-Huang Huang,
P. Sadayappan:
Incremental Generation of Index Sets for Array Statement Execution on Distributed-Memory Machines.
LCPC 1994: 251-265 |
43 | EE | D. L. Dai,
Sandeep K. S. Gupta,
S. D. Kaushik,
J. H. Lu,
R. V. Singh,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms.
SC 1994: 49-58 |
42 | EE | Himanshu Gupta,
P. Sadayappan:
Communication Efficient Matrix Multiplication on Hypercubes.
SPAA 1994: 320-329 |
41 | | Sandeep K. S. Gupta,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
Implementing Fast Fourier Transforms on Distributed-Memory Multiprocessors Using Data Redistributions.
Parallel Processing Letters 4: 477-488 (1994) |
1993 |
40 | EE | Debabrata Ghosh,
S. K. Nandy,
P. Sadayappan,
K. Parthasarathy:
Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving.
DAC 1993: 303-307 |
39 | | Kalluri Eswar,
P. Sadayappan,
Chua-Huang Huang:
Compile-Time Characterization of Recurrent Patterns in Irregular Computations.
ICPP 1993: 148-155 |
38 | | Kalluri Eswar,
P. Sadayappan,
Chua-Huang Huang,
V. Visvanathan:
Supernodal Sparse Cholesky Facotrization on Distributed-Memory Multiprocessors.
ICPP 1993: 18-22 |
37 | | Sandeep K. S. Gupta,
S. D. Kaushik,
S. Mufti,
Sanjay Sharma,
Chua-Huang Huang,
P. Sadayappan:
On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines.
ICPP 1993: 301-305 |
36 | | S. K. Nandy,
Ranjani Narayan,
V. Visvanathan,
P. Sadayappan,
Prashant S. Chauhan:
A Parallel Progressive Refinement Image Rendering Algorithm on a Scalable Multithreaded VLSI Processor Array.
ICPP 1993: 94-97 |
35 | | Bharat Kumar,
Chua-Huang Huang,
Rodney W. Johnson,
P. Sadayappan:
A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction.
IPPS 1993: 582-588 |
34 | | S. D. Kaushik,
Chua-Huang Huang,
Rodney W. Johnson,
P. Sadayappan:
A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas.
LCPC 1993: 358-373 |
33 | EE | S. D. Kaushik,
Chua-Huang Huang,
John R. Johnson,
Rodney W. Johnson,
P. Sadayappan:
Efficient transposition algorithms for large matrices.
SC 1993: 656-665 |
32 | | Chua-Huang Huang,
P. Sadayappan:
Communication-Free Hyperplane Partitioning of Nested Loops.
J. Parallel Distrib. Comput. 19(2): 90-102 (1993) |
1992 |
31 | | Sandeep K. S. Gupta,
Chua-Huang Huang,
P. Sadayappan,
Rodney W. Johnson:
On the Synthesis of Parallel Programs from Tensor Product Formulas for Block Recursive Algorithms.
LCPC 1992: 264-280 |
30 | | S. D. Kaushik,
Sanjay Sharma,
Chua-Huang Huang,
Jeremy R. Johnson,
Rodney W. Johnson,
P. Sadayappan:
An Algebraic Theory for Modeling Direct Interconnection Networks.
SC 1992: 488-497 |
29 | | Sanjay Sharma,
Chua-Huang Huang,
P. Sadayappan:
On Data Dependence Analysis for Compiling Programs on Distributed-Memory Machines (Extended Abstract).
SIGPLAN Workshop 1992: 13-16 |
28 | | Sandeep K. S. Gupta,
S. D. Kaushik,
Chua-Huang Huang,
John R. Johnson,
Rodney W. Johnson,
P. Sadayappan:
On the Automatic Generation of Data Distributions.
SIGPLAN Workshop 1992: 82 |
27 | | Sandeep K. S. Gupta,
S. D. Kaushik,
Chua-Huang Huang,
John R. Johnson,
Rodney W. Johnson,
P. Sadayappan:
A Methodology for Generating Data Distributions to Optimize Communication.
SPDP 1992: 436-441 |
26 | | Sailesh K. Rao,
P. Sadayappan,
Frank K. Hwang,
Peter W. Shor:
The Rectilinear Steiner Arborescence Problem.
Algorithmica 7(2&3): 277-288 (1992) |
25 | | J. Ramanujam,
P. Sadayappan:
Tiling Multidimensional Itertion Spaces for Multicomputers.
J. Parallel Distrib. Comput. 16(2): 108-120 (1992) |
1991 |
24 | | Kalluri Eswar,
P. Sadayappan,
V. Visvanathan:
Multifrontal Factorization of Sparse Matrices on Shared-Memory Multiprocessors.
ICPP (3) 1991: 159-166 |
23 | | Scott Whitman,
P. Sadayappan:
Computer Graphics Rendering on a Shared Memory Multiprocessor.
ICPP (3) 1991: 197-200 |
22 | | Chua-Huang Huang,
P. Sadayappan:
Communication-Free Hyperplane Partitioning of Nested Loops.
LCPC 1991: 186-200 |
21 | | V. Prasad Krothapalli,
P. Sadayappan:
Removal of Redundant Dependences in DOACROSS Lops with Constant Dependences.
PPOPP 1991: 51-60 |
20 | EE | J. Ramanujam,
P. Sadayappan:
Tiling multidimensional iteration spaces for nonshared memory machines.
SC 1991: 111-120 |
19 | EE | V. Prasad Krothapalli,
P. Sadayappan:
Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences.
IEEE Trans. Parallel Distrib. Syst. 2(3): 281-289 (1991) |
18 | EE | J. Ramanujam,
P. Sadayappan:
Compile-Time Techniques for Data Distribution in Distributed Memory Machines.
IEEE Trans. Parallel Distrib. Syst. 2(4): 472-482 (1991) |
1990 |
17 | EE | V. Prasad Krothapalli,
P. Sadayappan:
Exploiting Parallelism Through Run-Time Analysis on a Vector Processor (Abstract).
ACM Conference on Computer Science 1990: 434 |
16 | | J. Ramanujam,
P. Sadayappan:
Tiling of Iteration Spaces for Multicomputers.
ICPP (2) 1990: 179-186 |
15 | | V. Prasad Krothapalli,
P. Sadayappan:
Dynamic Scheduling of DOACROSS Loops for Multiprocessors.
PARBASE / Architectures 1990: 141-160 |
14 | | Fikret Erçal,
J. Ramanujam,
P. Sadayappan:
Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning.
J. Parallel Distrib. Comput. 10(1): 35-44 (1990) |
13 | | P. Sadayappan,
Fikret Erçal,
J. Ramanujam:
Cluster partitioning approaches to mapping parallel programs onto a hypercube.
Parallel Computing 13(1): 1-16 (1990) |
1989 |
12 | EE | P. Sadayappan,
V. Visvanathan:
Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers.
DAC 1989: 13-18 |
11 | | Amr Zaky,
P. Sadayappan:
Optimal Static Scheduling of Sequential Loops on Multiprocessors.
ICPP (3) 1989: 130-137 |
10 | EE | Fikret Erçal,
P. Sadayappan:
One-to-one mapping of process graphs onto a hypercube.
ICS 1989: 91-98 |
9 | EE | P. Sadayappan,
V. Visvanathan:
Efficient sparse matrix factorization for circuit simulation on vector supercomputers.
IEEE Trans. on CAD of Integrated Circuits and Systems 8(12): 1276-1285 (1989) |
1988 |
8 | EE | P. Sadayappan,
V. Visvanathan:
Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor.
ICS 1988: 254-265 |
7 | EE | V. Prasad Krothapalli,
P. Sadayappan:
An approach to synchronization for parallel computing.
ICS 1988: 573-581 |
6 | | Cevdet Aykanat,
Füsun Özgüner,
Fikret Erçal,
P. Sadayappan:
Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes.
IEEE Trans. Computers 37(12): 1554-1568 (1988) |
5 | | P. Sadayappan,
V. Visvanathan:
Circuit Simulation on Shared-Memory Multiprocessors.
IEEE Trans. Computers 37(12): 1634-1642 (1988) |
1987 |
4 | | P. Sadayappan,
Fikret Erçal,
Steven Martin:
Mapping Finite Element Graphs onto Processor Meshes.
ICPP 1987: 192-195 |
3 | | P. Sadayappan,
Fikret Erçal:
Cluster-Partitioning Approaches to Mapping Parallel Programs onto a Hypercube.
ICS 1987: 475-497 |
2 | | P. Sadayappan,
Fikret Erçal:
Nearest-Neighbor Mapping of Finite Element Graphs onto Processor Meshes.
IEEE Trans. Computers 36(12): 1408-1424 (1987) |
1985 |
1 | EE | V. Ashok,
Roger L. Costello,
P. Sadayappan:
Modeling switch-level simulation using data flow.
DAC 1985: 637-644 |