2009 | ||
---|---|---|
178 | EE | Muthu Manikandan Baskaran, Nagavijayalakshmi Vydyanathan, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors. PPOPP 2009: 219-228 |
2008 | ||
177 | EE | P. Sadayappan, Manish Parashar, Ramamurthy Badrinath, Viktor K. Prasanna: High Performance Computing - HiPC 2008, 15th International Conference, Bangalore, India, December 17-20, 2008. Proceedings Springer 2008 |
176 | EE | Uday Bondhugula, Muthu Manikandan Baskaran, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic Transformations for Communication-Minimized Parallelization and Locality Optimization in the Polyhedral Model. CC 2008: 132-146 |
175 | EE | N. Desai, Pavan Balaji, P. Sadayappan, Mohammad Islam: Are nonblocking networks really needed for high-end-computing workloads? CLUSTER 2008: 152-159 |
174 | EE | Nawab Ali, Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, P. Sadayappan: An OSD-based approach to managing directory operations in parallel file systems. CLUSTER 2008: 175-184 |
173 | EE | Jiang Lin, Qingda Lu, Xiaoning Ding, Zhao Zhang, Xiaodong Zhang, P. Sadayappan: Gaining insights into multicore cache partitioning: Bridging the gap between simulation and real systems. HPCA 2008: 367-378 |
172 | EE | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz, Rajkumar Kettimuthu, Ian T. Foster: Multi-hop path splitting and multi-pathing optimizations for data transfers over shared wide-area networks using gridFTP. HPDC 2008: 225-226 |
171 | EE | Jarek Nieplocha, Sriram Krishnamoorthy, Marat Valiev, Manojkumar Krishnan, Bruce Palmer, P. Sadayappan: Integrated Data and Task Management for Scientific Applications. ICCS (1) 2008: 20-31 |
170 | EE | Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: A Duplication Based Algorithm for Optimizing Latency Under Throughput Constraints for Streaming Workflows. ICPP 2008: 254-261 |
169 | EE | James Dinan, Sriram Krishnamoorthy, D. Brian Larkins, Jarek Nieplocha, P. Sadayappan: Scioto: A Framework for Global-View Task Parallelism. ICPP 2008: 586-593 |
168 | EE | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: A compiler framework for optimization of affine loop nests for gpgpus. ICS 2008: 225-234 |
167 | EE | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Joel H. Saltz: A dynamic scheduling approach for coordinated wide-area data transfers using GridFTP. IPDPS 2008: 1-12 |
166 | EE | Uday Bondhugula, Muthu Manikandan Baskaran, Albert Hartono, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Towards effective automatic parallelization for multicore systems. IPDPS 2008: 1-5 |
165 | EE | Uday Bondhugula, Albert Hartono, J. Ramanujam, P. Sadayappan: A practical automatic polyhedral parallelizer and locality optimizer. PLDI 2008: 101-113 |
164 | EE | Muthu Manikandan Baskaran, Uday Bondhugula, Sriram Krishnamoorthy, J. Ramanujam, Atanas Rountev, P. Sadayappan: Automatic data movement and computation mapping for multi-level parallel architectures with explicitly managed memories. PPOPP 2008: 1-10 |
163 | EE | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, Rajkumar Kettimuthu, P. Sadayappan, Ian T. Foster, Joel H. Saltz: Using overlays for efficient data transfer over shared wide-area networks. SC 2008: 47 |
162 | EE | D. Brian Larkins, James Dinan, Sriram Krishnamoorthy, Srinivasan Parthasarathy, Atanas Rountev, P. Sadayappan: Global trees: a framework for linked data structures on distributed memory parallel systems. SC 2008: 57 |
161 | EE | Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju: A framework for characterizing overlap of communication and computation in parallel applications. Cluster Computing 11(1): 75-90 (2008) |
160 | EE | James Dinan, Stephen Olivier, Gerald Sabin, Jan Prins, P. Sadayappan, Chau-Wen Tseng: A message passing benchmark for unbalanced applications. Simulation Modelling Practice and Theory 16(9): 1177-1189 (2008) |
2007 | ||
159 | EE | Sriram Krishnamoorthy, Juan Piernas, Vinod Tipparaju, Jarek Nieplocha, P. Sadayappan: Non-collective parallel I/O for global address space programming models. CLUSTER 2007: 41-49 |
158 | EE | Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Toward Optimizing Latency Under Throughput Constraints for Application Workflows on Clusters. Euro-Par 2007: 173-183 |
157 | EE | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Scheduling File Transfers for Data-Intensive Jobs on Heterogeneous Clusters. Euro-Par 2007: 214-223 |
156 | EE | Mohammad Islam, Pavan Balaji, Gerald Sabin, P. Sadayappan: Analyzing and Minimizing the Impact of Opportunity Cost in QoS-aware Job Scheduling. ICPP 2007: 42 |
155 | EE | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: A global address space framework for locality aware scheduling of block-sparse computations. IPDPS 2007: 1-8 |
154 | EE | James Dinan, Stephen Olivier, Gerald Sabin, Jan Prins, P. Sadayappan, Chau-Wen Tseng: Dynamic Load Balancing of Unbalanced Computations Using Message Passing. IPDPS 2007: 1-8 |
153 | EE | Sriram Krishnamoorthy, Muthu Manikandan Baskaran, Uday Bondhugula, J. Ramanujam, Atanas Rountev, P. Sadayappan: Effective automatic parallelization of stencil computations. PLDI 2007: 235-244 |
152 | EE | Uday Bondhugula, J. Ramanujam, P. Sadayappan: Automatic mapping of nested loops to FPGAS. PPOPP 2007: 101-111 |
151 | EE | Ananth Devulapalli, Dennis Dalessandro, Pete Wyckoff, Nawab Ali, P. Sadayappan: Integrating parallel file systems with object-based storage devices. SC 2007: 27 |
150 | EE | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient search-space pruning for integrated fusion and tiling transformations. Concurrency and Computation: Practice and Experience 19(18): 2425-2443 (2007) |
2006 | ||
149 | Eduard Ayguadé, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Languages and Compilers for Parallel Computing, 18th International Workshop, LCPC 2005, Hawthorne, NY, USA, October 20-22, 2005, Revised Selected Papers Springer 2006 | |
148 | EE | Aniruddha G. Shet, P. Sadayappan, David E. Bernholdt, Jarek Nieplocha, Vinod Tipparaju: A Performance Instrumentation Framework to Characterize Computation-Communication Overlap in Message-Passing Systems. CLUSTER 2006 |
147 | EE | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: Locality Conscious Processor Allocation and Scheduling for Mixed Parallel Applications. CLUSTER 2006 |
146 | EE | Uday Bondhugula, Ananth Devulapalli, James Dinan, Joseph Fernando, Pete Wyckoff, Eric Stahlberg, P. Sadayappan: Hardware/Software Integration for FPGA-based All-Pairs Shortest-Paths. FCCM 2006: 152-164 |
145 | EE | Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Ümit V. Çatalyürek, Tahsin M. Kurç, Sriram Krishnamoorthy, P. Sadayappan, Joel H. Saltz: Task Scheduling and File Replication for Data-Intensive Jobs with Batch-shared I/O. HPDC 2006: 241-252 |
144 | EE | Nagavijayalakshmi Vydyanathan, Sriram Krishnamoorthy, Gerald Sabin, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: An Integrated Approach for Processor Allocation and Scheduling of Mixed-Parallel Applications. ICPP 2006: 443-450 |
143 | EE | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, P. Sadayappan: An approach to locality-conscious load balancing and transparent memory hierarchy management with a global-address-space parallel programming model. IPDPS 2006 |
142 | EE | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: An extensible global address space framework with decoupled task and data abstractions. IPDPS 2006 |
141 | EE | A. Allam, J. Ramanujam, Gerald Baumgartner, P. Sadayappan: Memory minimization for tensor contractions using integer linear programming. IPDPS 2006 |
140 | EE | Uday Bondhugula, Ananth Devulapalli, Joseph Fernando, Pete Wyckoff, P. Sadayappan: Parallel FPGA-based all-pairs shortest-paths in a directed graph. IPDPS 2006 |
139 | EE | Albert Hartono, Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, Venkatesh Choppella, Russell M. Pitzer, J. Ramanujam, Atanas Rountev, P. Sadayappan: Identifying Cost-Effective Common Subexpressions to Reduce Operation Count in Tensor Contraction Evaluations. International Conference on Computational Science (1) 2006: 267-275 |
138 | EE | Gaurav Khanna, Ümit V. Çatalyürek, Tahsin M. Kurç, P. Sadayappan, Joel H. Saltz: A Data Locality Aware Online Scheduling Approach for I/O-Intensive Jobs with File Sharing. JSSPP 2006: 141-160 |
137 | EE | Gerald Sabin, Matthew Lang, P. Sadayappan: Moldable Parallel Job Scheduling Using Job Efficiency: An Iterative Approach. JSSPP 2006: 94-114 |
136 | EE | Stephen Olivier, Jun Huan, Jinze Liu, Jan Prins, James Dinan, P. Sadayappan, Chau-Wen Tseng: UTS: An Unbalanced Tree Search Benchmark. LCPC 2006: 235-250 |
135 | EE | Qingda Lu, Sriram Krishnamoorthy, P. Sadayappan: Combining analytical and empirical approaches in tuning matrix transposition. PACT 2006: 233-242 |
134 | EE | Jarek Nieplocha, Bruce Palmer, Manojkumar Krishnan, P. Sadayappan: M12 - Overview of the global arrays parallel software development toolkit. SC 2006: 226 |
133 | EE | Sriram Krishnamoorthy, Ümit V. Çatalyürek, Jarek Nieplocha, Atanas Rountev, P. Sadayappan: Data management and query - Hypergraph partitioning for automatic memory hierarchy management. SC 2006: 98 |
132 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient synthesis of out-of-core algorithms using a nonlinear optimization solver. J. Parallel Distrib. Comput. 66(5): 659-673 (2006) |
131 | EE | Christian Engelmann, Stephen L. Scott, David E. Bernholdt, Narasimha Raju Gottumukkala, Chokchai Leangsuksun, Jyothish Varma, Chao Wang, Frank Mueller, Aniruddha G. Shet, P. Sadayappan: MOLAR: adaptive runtime support for high-end computing operating and runtime systems. Operating Systems Review 40(2): 63-72 (2006) |
130 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Layout transformation support for the disk resident arrays framework. The Journal of Supercomputing 36(2): 153-170 (2006) |
2005 | ||
129 | EE | Gaurav Khanna, Nagavijayalakshmi Vydyanathan, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan: A hypergraph partitioning based approach for scheduling of tasks with batch-shared I/O. CCGRID 2005: 792-799 |
128 | EE | Sriram Krishnamoorthy, Jarek Nieplocha, P. Sadayappan: Data and Computation Abstractions for Dynamic and Irregular Computations. HiPC 2005: 258-269 |
127 | EE | Vipin Chaudhary, P. Sadayappan: Message from the Chairs. ICPP Workshops 2005: 282 |
126 | EE | Swarup Kumar Sahoo, Rajkiran Panuganti, Sriram Krishnamoorthy, P. Sadayappan: Cache Miss Characterization and Data Locality Optimization for Imperfectly Nested Loops on Shared Memory Multiprocessors. IPDPS 2005 |
125 | EE | Albert Hartono, Alexander Sibiryakov, Marcel Nooijen, Gerald Baumgartner, David E. Bernholdt, So Hirata, Chi-Chung Lam, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: Automated Operation Minimization of Tensor Contraction Expressions in Electronic Structure Calculations. International Conference on Computational Science (1) 2005: 155-164 |
124 | EE | Gerald Sabin, P. Sadayappan: Unfairness Metrics for Space-Sharing Parallel Job Schedulers. JSSPP 2005: 238-256 |
123 | EE | Xiaoyang Gao, Sriram Krishnamoorthy, Swarup Kumar Sahoo, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Efficient Search-Space Pruning for Integrated Fusion and Tiling Transformations. LCPC 2005: 215-229 |
122 | EE | Xiaoyang Gao, Swarup Kumar Sahoo, Chi-Chung Lam, J. Ramanujam, Qingda Lu, Gerald Baumgartner, P. Sadayappan: Performance modeling and optimization of parallel out-of-core tensor contractions. PPOPP 2005: 266-276 |
121 | EE | Swarup Kumar Sahoo, Sriram Krishnamoorthy, Rajkiran Panuganti, P. Sadayappan: Integrated Loop Optimizations for Data Locality Enhancement of Tensor Contraction Expressions. SC 2005: 13 |
120 | EE | Rajkumar Kettimuthu, Vijay Subramani, Srividya Srinivasan, Thiagaraja Gopalsamy, Dhabaleswar K. Panda, P. Sadayappan: Selective preemption strategies for parallel job scheduling. IJHPCN 3(2/3): 122-152 (2005) |
2004 | ||
119 | EE | Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda: Towards provision of quality of service guarantees in job scheduling. CLUSTER 2004: 245-254 |
118 | EE | Gerald Sabin, V. Sahasrabudhe, P. Sadayappan: On fairness in distributed job scheduling across multiple sites. CLUSTER 2004: 35-44 |
117 | EE | Nagavijayalakshmi Vydyanathan, Gaurav Khanna, Tahsin M. Kurç, Ümit V. Çatalyürek, Pete Wyckoff, Joel H. Saltz, P. Sadayappan: Use of PVFS for Efficient Execution of Jobs with Pipeline-Shared I/O. GRID 2004: 235-242 |
116 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, Jarek Nieplocha, P. Sadayappan: Efficient Layout Transformation for Disk-Based Multidimensional Arrays. HiPC 2004: 386-398 |
115 | EE | Gerald Sabin, Garima Kochhar, P. Sadayappan: Job Fairness in Non-Preemptive Job Scheduling. ICPP 2004: 186-194 |
114 | EE | Vipin Chaudhary, P. Sadayappan: Message from the Chairs: International Workshop on Compile and Run Time Techniques for Parallel Computing. ICPP Workshops 2004: 497 |
113 | EE | Qingda Lu, Jiesheng Wu, Dhabaleswar K. Panda, P. Sadayappan: Applying MPI Derived Datatypes to the NAS Benchmarks: A Case Study. ICPP Workshops 2004: 538-545 |
112 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Chi-Chung Lam, J. Ramanujam, P. Sadayappan, Venkatesh Choppella: Efficient Synthesis of Out-of-Core Algorithms Using a Nonlinear Optimization Solver. IPDPS 2004 |
111 | EE | Qingda Lu, Xiaoyang Gao, Sriram Krishnamoorthy, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Empirical Performance-Model Driven Data Layout Optimization. LCPC 2004: 72-86 |
110 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient parallel out-of-core matrix transposition. IJHPCN 2(2/3/4): 110-119 (2004) |
109 | EE | Sudha Srinivasan, Savitha Krishnamoorthy, P. Sadayappan: Robust scheduling of moldable parallel jobs. IJHPCN 2(2/3/4): 120-132 (2004) |
2003 | ||
108 | EE | Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan: Efficient Parallel Out-of-Core Matrix Transposition. CLUSTER 2003: 300-307 |
107 | EE | Sudha Srinivasan, Sriram Krishnamoorthy, P. Sadayappan: A Robust Scheduling Strategy for Moldable Scheduling of Parallel Jobs. CLUSTER 2003: 92-99 |
106 | EE | Sandhya Krishnan, Sriram Krishnamoorthy, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Data Locality Optimization for Synthesis of Efficient Out-of-Core Algorithms. HiPC 2003: 406-417 |
105 | EE | Daniel Cociorva, Xiaoyang Gao, Sandhya Krishnan, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Global Communication Optimization for Tensor Contraction Expressions under Memory Constraints. IPDPS 2003: 37 |
104 | EE | Mohammad Islam, Pavan Balaji, P. Sadayappan, Dhabaleswar K. Panda: QoPS: A QoS Based Scheme for Parallel Job Scheduling. JSSPP 2003: 252-268 |
103 | EE | Gerald Sabin, Rajkumar Kettimuthu, Arun Rajan, P. Sadayappan: Scheduling of Parallel Jobs in a Heterogeneous Multi-site Environement. JSSPP 2003: 87-104 |
102 | EE | Konstantin Berlin, Jun Huan, Mary Jacob, Garima Kochhar, Jan Prins, William Pugh, P. Sadayappan, Jaime Spacco, Chau-Wen Tseng: Evaluating the Impact of Programming Language Features on the Performance of Parallel Applications on Cluster Architectures. LCPC 2003: 194-208 |
101 | EE | Alina Bibireata, Sandhya Krishnan, Gerald Baumgartner, Daniel Cociorva, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, David E. Bernholdt, Venkatesh Choppella: Memory-Constrained Data Locality Optimization for Tensor Contractions. LCPC 2003: 93-108 |
2002 | ||
100 | EE | Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, Jeanette Johnston, P. Sadayappan: Selective Buddy Allocation for Scheduling Parallel Jobs on Clusters. CLUSTER 2002: 107- |
99 | EE | Vijay Subramani, Rajkumar Kettimuthu, Srividya Srinivasan, P. Sadayappan: Distributed Job Scheduling on Computational Grids Using Multiple Simultaneous Requests. HPDC 2002: 359- |
98 | EE | Srividya Srinivasan, Vijay Subramani, Rajkumar Kettimuthu, Praveen Holenarsipur, P. Sadayappan: Effective Selection of Partition Sizes for Moldable Scheduling of Parallel Jobs. HiPC 2002: 174-183 |
97 | EE | Thiagaraja Gopalsamy, Mukesh Singhal, Dhabaleswar K. Panda, P. Sadayappan: A Reliable Multicast Algorithm for Mobile Ad Hoc Networks. ICDCS 2002: 563-570 |
96 | EE | P. Sadayappan: Message from the Chair. ICPP Workshops 2002: 495-498 |
95 | EE | Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan: Characterization of Backfilling Strategies for Parallel Job Scheduling. ICPP Workshops 2002: 514-522 |
94 | EE | Vipin Chaudhary, P. Sadayappan: Message from the Co-Chairs. ICPP Workshops 2002: 547-550 |
93 | EE | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Chi-Chung Lam, J. Ramanujam, Robert J. Harrison, Marcel Nooijen, P. Sadayappan: A Performance Optimization Framework for Compilation of Tensor Contraction Expressions into Parallel Programs. IPDPS 2002 |
92 | EE | Srividya Srinivasan, Rajkumar Kettimuthu, Vijay Subramani, P. Sadayappan: Selective Reservation Strategies for Backfill Job Scheduling. JSSPP 2002: 55-71 |
91 | EE | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam: Memory-Constrained Communication Minimization for a Class of Array Computations. LCPC 2002: 1-15 |
90 | EE | Daniel Cociorva, Gerald Baumgartner, Chi-Chung Lam, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Space-Time Trade-Off Optimization for a Class of Electronic Structure Calculations. PLDI 2002: 177-186 |
89 | EE | Gerald Baumgartner, David E. Bernholdt, Daniel Cociorva, Robert J. Harrison, So Hirata, Chi-Chung Lam, Marcel Nooijen, Russell M. Pitzer, J. Ramanujam, P. Sadayappan: A high-level approach to synthesis of high-performance codes for quantum chemistry. SC 2002: 1-10 |
2001 | ||
88 | EE | Daniel Cociorva, J. W. Wilkins, Gerald Baumgartner, P. Sadayappan, J. Ramanujam, Marcel Nooijen, David E. Bernholdt, Robert J. Harrison: Towards Automatic Synthesis of High-Performance Codes for Electronic Structure Calculations: Data Locality Optimization. HiPC 2001: 237-248 |
87 | EE | Mohammad Banikazemi, Jiuxing Liu, Dhabaleswar K. Panda, P. Sadayappan: Implementing TreadMarksover VIA on Myrinet and Gigabit Ethernet: Challenges, Design Experience, and Performance Evaluation. ICPP 2001: 167-174 |
86 | EE | Abhishek Gulati, Dhabaleswar K. Panda, P. Sadayappan, Pete Wyckoff: NIC-Based Rate Control for Proportional Bandwidth Allocation in Myrinet Clusters. ICPP 2001: 305-312 |
85 | EE | Daniel Cociorva, J. W. Wilkins, Chi-Chung Lam, Gerald Baumgartner, J. Ramanujam, P. Sadayappan: Loop optimization for a class of memory-constrained computations. ICS 2001: 103-113 |
84 | Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan: Performance Benefits of NIC-Based Barrier on Myrinet/GM. IPDPS 2001: 166 | |
83 | Mohammad Banikazemi, Jiuxing Liu, S. Kutlug, P. Sadayappan, H. Shah, Dhabaleswar K. Panda: VIBe: A Micro-benchmark Suite for Evaluating Virtual Interface Architecture (VIA) Implementations. IPDPS 2001: 24 | |
82 | Darius Buntinas, Dhabaleswar K. Panda, P. Sadayappan: Fast NIC-Based Barrier over Myrinet/GM. IPDPS 2001: 52 | |
81 | Amit Singhal, Mohammad Banikazemi, P. Sadayappan, Dhabaleswar K. Panda: Efficient Multicast Algorithms for Heterogeneous Switch-based Irregular Networks of Workstations. IPDPS 2001: 71 | |
2000 | ||
80 | Vijay Moorthy, Dhabaleswar K. Panda, P. Sadayappan: Fast Collective Communication Algorithms for Reflective Memory Network Clusters. CANPC 2000: 100-114 | |
79 | Darius Buntinas, Dhabaleswar K. Panda, José Duato, P. Sadayappan: Broadcast/Multicast over Myrinet Using NIC-Assisted Multidestination Messages. CANPC 2000: 115-129 | |
78 | EE | Praveen Holenarsipur, Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan: Characterization and enhancement of Static Mapping Heuristics for Heterogeneous Systems. HiPC 2000: 37-48 |
77 | EE | P. Sadayappan: Message from the Chair. ICPP Workshops 2000: 391- |
76 | EE | Vladimir Yarmolenko, José Duato, Dhabaleswar K. Panda, P. Sadayappan: Characterization and Enhancement of Dynamic Mapping Heuristics for Heterogeneous Systems. ICPP Workshops 2000: 437- |
75 | EE | Arindam Paul, Wu-chi Feng, Dhabaleswar K. Panda, P. Sadayappan: Balancing Web Server Load for Adaptable Video Distribution. ICPP Workshops 2000: 469- |
1999 | ||
74 | Matthew G. Jacunski, Vijay Moorthy, Peter P. Ware, Manoj Pillai, Dhabaleswar K. Panda, P. Sadayappan: Low Latency Message-Passing for Reflective Memory Networks. CANPC 1999: 211-224 | |
73 | EE | Mohammad Banikazemi, Jayanthi Sampathkumar, Sandeep Prabhu, Dhabaleswar K. Panda, P. Sadayappan: Communication Modeling of Heterogeneous Networks of Workstations for Performance Characterization of Collective Operations. Heterogeneous Computing Workshop 1999: 125- |
72 | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Memory-Optimal Evaluation of Expression Trees Involving Large Objects. HiPC 1999: 103-110 | |
71 | EE | N. S. Sundar, S. Jayanthi, P. Sadayappan, Miguel Visbal: An Incremental Methodology for Parallelizing Legacy Stencil Codes on Message-Passing Computers. ICPP 1999: 302-310 |
70 | EE | Vijay Moorthy, Matthew G. Jacunski, Manoj Pillai, Peter P. Ware, Dhabaleswar K. Panda, Thomas W. Page Jr., P. Sadayappan, V. Nagarajan, Johns Daniel: Low-Latency Message Passing on Workstation Clusters using SCRAMNet. IPPS/SPDP 1999: 148-152 |
69 | EE | Matthew G. Jacunski, P. Sadayappan, Dhabaleswar K. Panda: All-to-All Broadcast on Switch-Based Clusters of Workstations. IPPS/SPDP 1999: 325-329 |
68 | EE | Chi-Chung Lam, Daniel Cociorva, Gerald Baumgartner, P. Sadayappan: Optimization of Memory Usage Requirement for a Class of Loops Implementing Multi-dimensional Integrals. LCPC 1999: 350-364 |
67 | Chi-Chung Lam, P. Sadayappan, Daniel Cociorva, Mebarek Alouani, John Wilkins: Performance Optimization of a Class of Loops Involving Sums of Products of Sparse Arrays. PPSC 1999 | |
1998 | ||
66 | Zhiyuan Li, Pen-Chung Yew, Siddhartha Chatterjee, Chua-Huang Huang, P. Sadayappan, David C. Sehr: Languages and Compilers for Parallel Computing, 10th International Workshop, LCPC'97, Minneapolis, Minnesota, USA, August 7-9, 1997, Proceedings Springer 1998 | |
65 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A technique for overlapping computation and communication for block recursive algorithms. Concurrency - Practice and Experience 10(2): 73-90 (1998) | |
64 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Partitioning Graphs on Message-Passing Machines by Pairwise Mincut. Inf. Sci. 111(1-4): 223-237 (1998) | |
1997 | ||
63 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimization of a Class of Multi-Dimensional Integrals on Parallel Machines. PPSC 1997 | |
62 | Chi-Chung Lam, Chua-Huang Huang, P. Sadayappan: Optimal Algorithms for All-to-All Personalized Communication on Rings and Two Dimensional Tori. J. Parallel Distrib. Comput. 43(1): 3-13 (1997) | |
61 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: On Optimizing a Class of Multi-Dimensional Loops with Reductions for Parallel Execution. Parallel Processing Letters 7(2): 157-168 (1997) | |
1996 | ||
60 | Chua-Huang Huang, P. Sadayappan, Utpal Banerjee, David Gelernter, Alexandru Nicolau, David A. Padua: Languages and Compilers for Parallel Computing, 8th International Workshop, LCPC'95, Columbus, Ohio, USA, August 10-12, 1995, Proceedings Springer 1996 | |
59 | EE | N. S. Sundar, D. N. Jayasimha, Dhabaleswar K. Panda, P. Sadayappan: Hybrid Algorithms for Complete Exchange in 2D Meshes. International Conference on Supercomputing 1996: 181-188 |
58 | Chi-Chung Lam, P. Sadayappan, Rephael Wenger: Optimal Reordering and Mapping of a Class of Nested-Loops for Parallel Execution. LCPC 1996: 315-329 | |
57 | EE | S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: An Algebraic Theory for Modeling Directt Interconnection Networks. J. Inf. Sci. Eng. 12(1): 25-49 (1996) |
56 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. J. Parallel Distrib. Comput. 32(2): 155-172 (1996) | |
55 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A Framework for Generating Distributed-Memory Parallel Programs for Block Recursive Algorithms. J. Parallel Distrib. Comput. 34(2): 137-153 (1996) | |
54 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Efficient Index Set Generation for Compiling HPF Array Statements on Distributed-Memory Machines. J. Parallel Distrib. Comput. 38(2): 237-247 (1996) | |
53 | Himanshu Gupta, P. Sadayappan: Communication-Efficient Matrix Multiplication on Hypercubes. Parallel Computing 22(1): 75-99 (1996) | |
1995 | ||
52 | EE | S. D. Kaushik, Chua-Huang Huang, J. Ramanujam, P. Sadayappan: Multi-phase array redistribution: modeling and evaluation. IPPS 1995: 441-445 |
51 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Compiling Array Statements for Efficient Execution on Distributed-Memory Machines: Two-Level Mappings. LCPC 1995: 209-223 | |
50 | Bharat Kumar, Kalluri Eswar, P. Sadayappan, Chua-Huang Huang: A Clustering Algorithm for Parallel Sparse Cholesky Factorization. Parallel Processing Letters 5: 685-696 (1995) | |
49 | EE | Bharat Kumar, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. Scientific Programming 4(4): 275-289 (1995) |
1994 | ||
48 | Sandeep K. S. Gupta, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: Communication-Efficient Implementation of Block Recursive Algorithms on Distributed-Memory Machines. ICPADS 1994: 113-119 | |
47 | Alpesh Amin, P. Sadayappan, Murali Gudavalli: A Clustered Reduced Communication Element by Element Preconditioned Conjugate Gradient Algorithm for Finite Element Computations. IPPS 1994: 509-516 | |
46 | EE | S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: An approach to communication-efficient data redistribution. International Conference on Supercomputing 1994: 364-373 |
45 | EE | Bharat Kumar, P. Sadayappan, Chua-Huang Huang: On sparse matrix reordering for parallel factorization. International Conference on Supercomputing 1994: 431-438 |
44 | S. D. Kaushik, Chua-Huang Huang, P. Sadayappan: Incremental Generation of Index Sets for Array Statement Execution on Distributed-Memory Machines. LCPC 1994: 251-265 | |
43 | EE | D. L. Dai, Sandeep K. S. Gupta, S. D. Kaushik, J. H. Lu, R. V. Singh, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: EXTENT: a portable programming environment for designing and implementing high-performance block recursive algorithms. SC 1994: 49-58 |
42 | EE | Himanshu Gupta, P. Sadayappan: Communication Efficient Matrix Multiplication on Hypercubes. SPAA 1994: 320-329 |
41 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: Implementing Fast Fourier Transforms on Distributed-Memory Multiprocessors Using Data Redistributions. Parallel Processing Letters 4: 477-488 (1994) | |
1993 | ||
40 | EE | Debabrata Ghosh, S. K. Nandy, P. Sadayappan, K. Parthasarathy: Architectural Synthesis of Performance-Driven Multipliers with Accumulator Interleaving. DAC 1993: 303-307 |
39 | Kalluri Eswar, P. Sadayappan, Chua-Huang Huang: Compile-Time Characterization of Recurrent Patterns in Irregular Computations. ICPP 1993: 148-155 | |
38 | Kalluri Eswar, P. Sadayappan, Chua-Huang Huang, V. Visvanathan: Supernodal Sparse Cholesky Facotrization on Distributed-Memory Multiprocessors. ICPP 1993: 18-22 | |
37 | Sandeep K. S. Gupta, S. D. Kaushik, S. Mufti, Sanjay Sharma, Chua-Huang Huang, P. Sadayappan: On Compiling Array Expressions for Efficient Execution on Distributed-Memory Machines. ICPP 1993: 301-305 | |
36 | S. K. Nandy, Ranjani Narayan, V. Visvanathan, P. Sadayappan, Prashant S. Chauhan: A Parallel Progressive Refinement Image Rendering Algorithm on a Scalable Multithreaded VLSI Processor Array. ICPP 1993: 94-97 | |
35 | Bharat Kumar, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: A Tensor Product Formulation of Strassen's Matrix Multiplication Algorithm with Memory Reduction. IPPS 1993: 582-588 | |
34 | S. D. Kaushik, Chua-Huang Huang, Rodney W. Johnson, P. Sadayappan: A Methodology for Generating Efficient Disk-Based Algorithms from Tensor Product Formulas. LCPC 1993: 358-373 | |
33 | EE | S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: Efficient transposition algorithms for large matrices. SC 1993: 656-665 |
32 | Chua-Huang Huang, P. Sadayappan: Communication-Free Hyperplane Partitioning of Nested Loops. J. Parallel Distrib. Comput. 19(2): 90-102 (1993) | |
1992 | ||
31 | Sandeep K. S. Gupta, Chua-Huang Huang, P. Sadayappan, Rodney W. Johnson: On the Synthesis of Parallel Programs from Tensor Product Formulas for Block Recursive Algorithms. LCPC 1992: 264-280 | |
30 | S. D. Kaushik, Sanjay Sharma, Chua-Huang Huang, Jeremy R. Johnson, Rodney W. Johnson, P. Sadayappan: An Algebraic Theory for Modeling Direct Interconnection Networks. SC 1992: 488-497 | |
29 | Sanjay Sharma, Chua-Huang Huang, P. Sadayappan: On Data Dependence Analysis for Compiling Programs on Distributed-Memory Machines (Extended Abstract). SIGPLAN Workshop 1992: 13-16 | |
28 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: On the Automatic Generation of Data Distributions. SIGPLAN Workshop 1992: 82 | |
27 | Sandeep K. S. Gupta, S. D. Kaushik, Chua-Huang Huang, John R. Johnson, Rodney W. Johnson, P. Sadayappan: A Methodology for Generating Data Distributions to Optimize Communication. SPDP 1992: 436-441 | |
26 | Sailesh K. Rao, P. Sadayappan, Frank K. Hwang, Peter W. Shor: The Rectilinear Steiner Arborescence Problem. Algorithmica 7(2&3): 277-288 (1992) | |
25 | J. Ramanujam, P. Sadayappan: Tiling Multidimensional Itertion Spaces for Multicomputers. J. Parallel Distrib. Comput. 16(2): 108-120 (1992) | |
1991 | ||
24 | Kalluri Eswar, P. Sadayappan, V. Visvanathan: Multifrontal Factorization of Sparse Matrices on Shared-Memory Multiprocessors. ICPP (3) 1991: 159-166 | |
23 | Scott Whitman, P. Sadayappan: Computer Graphics Rendering on a Shared Memory Multiprocessor. ICPP (3) 1991: 197-200 | |
22 | Chua-Huang Huang, P. Sadayappan: Communication-Free Hyperplane Partitioning of Nested Loops. LCPC 1991: 186-200 | |
21 | V. Prasad Krothapalli, P. Sadayappan: Removal of Redundant Dependences in DOACROSS Lops with Constant Dependences. PPOPP 1991: 51-60 | |
20 | EE | J. Ramanujam, P. Sadayappan: Tiling multidimensional iteration spaces for nonshared memory machines. SC 1991: 111-120 |
19 | EE | V. Prasad Krothapalli, P. Sadayappan: Removal of Redundant Dependences in DOACROSS Loops with Constant Dependences. IEEE Trans. Parallel Distrib. Syst. 2(3): 281-289 (1991) |
18 | EE | J. Ramanujam, P. Sadayappan: Compile-Time Techniques for Data Distribution in Distributed Memory Machines. IEEE Trans. Parallel Distrib. Syst. 2(4): 472-482 (1991) |
1990 | ||
17 | EE | V. Prasad Krothapalli, P. Sadayappan: Exploiting Parallelism Through Run-Time Analysis on a Vector Processor (Abstract). ACM Conference on Computer Science 1990: 434 |
16 | J. Ramanujam, P. Sadayappan: Tiling of Iteration Spaces for Multicomputers. ICPP (2) 1990: 179-186 | |
15 | V. Prasad Krothapalli, P. Sadayappan: Dynamic Scheduling of DOACROSS Loops for Multiprocessors. PARBASE / Architectures 1990: 141-160 | |
14 | Fikret Erçal, J. Ramanujam, P. Sadayappan: Task Allocation onto a Hypercube by Recursive Mincut Bipartitioning. J. Parallel Distrib. Comput. 10(1): 35-44 (1990) | |
13 | P. Sadayappan, Fikret Erçal, J. Ramanujam: Cluster partitioning approaches to mapping parallel programs onto a hypercube. Parallel Computing 13(1): 1-16 (1990) | |
1989 | ||
12 | EE | P. Sadayappan, V. Visvanathan: Efficient Sparse Matrix Factorization for Circuit Simulation on Vector Supercomputers. DAC 1989: 13-18 |
11 | Amr Zaky, P. Sadayappan: Optimal Static Scheduling of Sequential Loops on Multiprocessors. ICPP (3) 1989: 130-137 | |
10 | EE | Fikret Erçal, P. Sadayappan: One-to-one mapping of process graphs onto a hypercube. ICS 1989: 91-98 |
9 | EE | P. Sadayappan, V. Visvanathan: Efficient sparse matrix factorization for circuit simulation on vector supercomputers. IEEE Trans. on CAD of Integrated Circuits and Systems 8(12): 1276-1285 (1989) |
1988 | ||
8 | EE | P. Sadayappan, V. Visvanathan: Parallelization and performance evaluation of circuit simulation on a shared-memory multiprocessor. ICS 1988: 254-265 |
7 | EE | V. Prasad Krothapalli, P. Sadayappan: An approach to synchronization for parallel computing. ICS 1988: 573-581 |
6 | Cevdet Aykanat, Füsun Özgüner, Fikret Erçal, P. Sadayappan: Iterative Algorithms for Solution of Large Sparse Systems of Linear Equations on Hypercubes. IEEE Trans. Computers 37(12): 1554-1568 (1988) | |
5 | P. Sadayappan, V. Visvanathan: Circuit Simulation on Shared-Memory Multiprocessors. IEEE Trans. Computers 37(12): 1634-1642 (1988) | |
1987 | ||
4 | P. Sadayappan, Fikret Erçal, Steven Martin: Mapping Finite Element Graphs onto Processor Meshes. ICPP 1987: 192-195 | |
3 | P. Sadayappan, Fikret Erçal: Cluster-Partitioning Approaches to Mapping Parallel Programs onto a Hypercube. ICS 1987: 475-497 | |
2 | P. Sadayappan, Fikret Erçal: Nearest-Neighbor Mapping of Finite Element Graphs onto Processor Meshes. IEEE Trans. Computers 36(12): 1408-1424 (1987) | |
1985 | ||
1 | EE | V. Ashok, Roger L. Costello, P. Sadayappan: Modeling switch-level simulation using data flow. DAC 1985: 637-644 |