| 2009 |
| 63 | EE | Gregorio Quintana-Ortí,
Francisco D. Igual,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Solving dense linear systems on platforms with multiple hardware accelerators.
PPOPP 2009: 121-130 |
| 2008 |
| 62 | EE | Gregorio Quintana-Ortí,
Enrique S. Quintana-Ortí,
Ernie Chan,
Robert A. van de Geijn,
Field G. Van Zee:
Design of scalable dense linear algebra libraries for multithreaded architectures: the LU factorization.
IPDPS 2008: 1-8 |
| 61 | EE | Gregorio Quintana-Ortí,
Enrique S. Quintana-Ortí,
Ernie Chan,
Robert A. van de Geijn,
Field G. Van Zee:
Scheduling of QR Factorization Algorithms on SMP and Multi-Core Architectures.
PDP 2008: 301-310 |
| 60 | EE | Ernie Chan,
Field G. Van Zee,
Paolo Bientinesi,
Enrique S. Quintana-Ortí,
Gregorio Quintana-Ortí,
Robert A. van de Geijn:
SuperMatrix: a multithreaded runtime scheduling system for algorithms-by-blocks.
PPOPP 2008: 123-132 |
| 59 | EE | Jeffrey R. Diamond,
Behnam Robatmili,
Stephen W. Keckler,
Robert A. van de Geijn,
Kazushige Goto,
Doug Burger:
High performance dense linear algebra on a spatially distributed processor.
PPOPP 2008: 63-72 |
| 58 | EE | Gregorio Quintana-Ortí,
Enrique S. Quintana-Ortí,
Alfredo Remón,
Robert A. van de Geijn:
An Algorithm-by-Blocks for SuperMatrix Band Cholesky Factorization.
VECPAR 2008: 228-239 |
| 57 | EE | Field G. Van Zee,
Paolo Bientinesi,
Tze Meng Low,
Robert A. van de Geijn:
Scalable parallelization of FLAME code via the workqueuing model.
ACM Trans. Math. Softw. 34(2): (2008) |
| 56 | EE | Kazushige Goto,
Robert A. van de Geijn:
Anatomy of high-performance matrix multiplication.
ACM Trans. Math. Softw. 34(3): (2008) |
| 55 | EE | Paolo Bientinesi,
Brian C. Gunter,
Robert A. van de Geijn:
Families of algorithms related to the inversion of a Symmetric Positive Definite matrix.
ACM Trans. Math. Softw. 35(1): (2008) |
| 54 | EE | Kazushige Goto,
Robert A. van de Geijn:
High-performance implementation of the level-3 BLAS.
ACM Trans. Math. Softw. 35(1): (2008) |
| 53 | EE | Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Updating an LU Factorization with Pivoting.
ACM Trans. Math. Softw. 35(2): (2008) |
| 2007 |
| 52 | EE | Ernie Chan,
Field G. Van Zee,
Enrique S. Quintana-Ortí,
Gregorio Quintana-Ortí,
Robert A. van de Geijn:
Satisfying your dependencies with SuperMatrix.
CLUSTER 2007: 91-99 |
| 51 | EE | Robert A. van de Geijn:
The science of programming dense linear algebra libraries.
CLUSTER 2007 |
| 50 | EE | Bryan Marker,
Field G. Van Zee,
Kazushige Goto,
Gregorio Quintana-Ortí,
Robert A. van de Geijn:
Toward Scalable Matrix Multiply on Multithreaded Architectures.
Euro-Par 2007: 748-757 |
| 49 | EE | Ernie Chan,
Enrique S. Quintana-Ortí,
Gregorio Quintana-Ortí,
Robert A. van de Geijn:
Supermatrix out-of-order scheduling of matrix operations for SMP and multi-core architectures.
SPAA 2007: 116-125 |
| 48 | EE | Ernie Chan,
Marcel Heimlich,
Avi Purkayastha,
Robert A. van de Geijn:
Collective communication: theory, practice, and experience.
Concurrency and Computation: Practice and Experience 19(13): 1749-1783 (2007) |
| 2006 |
| 47 | EE | Ernie Chan,
Robert A. van de Geijn,
William Gropp,
Rajeev Thakur:
Collective communication on architectures that support simultaneous communication over multiple links.
PPOPP 2006: 2-11 |
| 46 | EE | Thierry Joffrain,
Tze Meng Low,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn,
Field G. Van Zee:
Accumulating Householder transformations, revisited.
ACM Trans. Math. Softw. 32(2): 169-179 (2006) |
| 45 | EE | Gregorio Quintana-Ortí,
Robert A. van de Geijn:
Improving the performance of reduction to Hessenberg form.
ACM Trans. Math. Softw. 32(2): 180-194 (2006) |
| 2005 |
| 44 | EE | Tze Meng Low,
Robert A. van de Geijn,
Field G. Van Zee:
Extracting SMP parallelism for dense linear algebra algorithms from high-level specifications.
PPOPP 2005: 153-163 |
| 43 | EE | Paolo Bientinesi,
John A. Gunnels,
Margaret E. Myers,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
The science of deriving dense linear algebra algorithms.
ACM Trans. Math. Softw. 31(1): 1-26 (2005) |
| 42 | EE | Paolo Bientinesi,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Representing linear algebra algorithms in code: the FLAME application program interfaces.
ACM Trans. Math. Softw. 31(1): 27-59 (2005) |
| 41 | EE | Brian C. Gunter,
Robert A. van de Geijn:
Parallel out-of-core computation and updating of the QR factorization.
ACM Trans. Math. Softw. 31(1): 60-78 (2005) |
| 40 | EE | Paolo Bientinesi,
Inderjit S. Dhillon,
Robert A. van de Geijn:
A Parallel Eigensolver for Dense Symmetric Matrices Based on Multiple Relatively Robust Representations.
SIAM J. Scientific Computing 27(1): 43-66 (2005) |
| 2004 |
| 39 | EE | E. W. Chan,
M. F. Heimlich,
Avi Purkayastha,
Robert A. van de Geijn:
On optimizing collective communication.
CLUSTER 2004: 145-155 |
| 38 | EE | E. W. Chan,
M. F. Heimlich,
Avi Purkayastha,
Robert A. van de Geijn:
Attaining higher performance in collective communication.
CLUSTER 2004: 484 |
| 37 | EE | John A. Gunnels,
Fred G. Gustavson,
Greg Henry,
Robert A. van de Geijn:
A Family of High-Performance Matrix Multiplication Algorithms.
PARA 2004: 256-265 |
| 36 | EE | Paolo Bientinesi,
John A. Gunnels,
Fred G. Gustavson,
Greg Henry,
Margaret E. Myers,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Rapid Development of High-Performance Linear Algebra Libraries.
PARA 2004: 376-384 |
| 35 | EE | Paolo Bientinesi,
Sergey Kolos,
Robert A. van de Geijn:
Automatic Derivation of Linear Algebra Algorithms with Application to Control Theory.
PARA 2004: 385-394 |
| 34 | EE | Thierry Joffrain,
Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Rapid Development of High-Performance Out-of-Core Solvers.
PARA 2004: 413-422 |
| 2003 |
| 33 | EE | Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Formal derivation of algorithms: The triangular sylvester equation.
ACM Trans. Math. Softw. 29(2): 218-243 (2003) |
| 2002 |
| 32 | EE | Thuan D. Cao,
John F. Hall,
Robert A. van de Geijn:
Parallel Cholesky Factorization of a Block Tridiagonal Matrix.
ICPP Workshops 2002: 327-335 |
| 2001 |
| 31 | EE | John A. Gunnels,
Robert A. van de Geijn,
Daniel S. Katz,
Enrique S. Quintana-Ortí:
Fault-Tolerant High-Performance Matrix Multiplication: Theory and Practice.
DSN 2001: 47-56 |
| 30 | | Brian C. Gunter,
Wesley C. Reiley,
Robert A. van de Geijn:
Parallel Out-of-Core Cholesky and QR Factorization with POOCLAPACK.
IPDPS 2001: 179 |
| 29 | EE | John A. Gunnels,
Greg Henry,
Robert A. van de Geijn:
A Family of High-Performance Matrix Multiplication Algorithms.
International Conference on Computational Science (1) 2001: 51-60 |
| 28 | EE | John A. Gunnels,
Fred G. Gustavson,
Greg Henry,
Robert A. van de Geijn:
FLAME: Formal Linear Algebra Methods Environment.
ACM Trans. Math. Softw. 27(4): 422-455 (2001) |
| 27 | EE | Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Specialized Parallel Algorithms for Solving Lyapunov and Stein Equations.
J. Parallel Distrib. Comput. 61(10): 1489-1504 (2001) |
| 2000 |
| 26 | | John A. Gunnels,
Robert A. van de Geijn:
Formal Methods for High-Performance Linear Algebra Libraries.
The Architecture of Scientific Software 2000: 193-210 |
| 1999 |
| 25 | | James Overfelt,
Yuhong Fu,
Gregory J. Rodin,
Robert A. van de Geijn:
Application Driven Fast Summation Methods.
PPSC 1999 |
| 24 | | Enrique S. Quintana-Ortí,
Robert A. van de Geijn:
Fast Parallel Kernels for Selected Problems in Control Theory.
PPSC 1999 |
| 1998 |
| 23 | EE | Gregory S. Baker,
John A. Gunnels,
Greg Morrow,
Béatrice Riviére,
Robert A. van de Geijn:
PLAPACK: High Performance through High-Level Abstraction.
ICPP 1998: 414- |
| 22 | EE | John A. Gunnels,
Calvin Lin,
Greg Morrow,
Robert A. van de Geijn:
A Flexible Class of Parallel Matrix Multiplication Algorithms.
IPPS/SPDP 1998: 110-116 |
| 1997 |
| 21 | | Phillip Alpatov,
Gregory S. Baker,
Carter Edwards,
John A. Gunnels,
Greg Morrow,
James Overfelt,
Robert A. van de Geijn,
Yuan-Jye J. Wu:
PLAPACK: Parallel Linear Algebra Package.
PPSC 1997 |
| 20 | | Robert A. van de Geijn,
Jerrell Watts:
SUMMA: scalable universal matrix multiplication algorithm.
Concurrency - Practice and Experience 9(4): 255-274 (1997) |
| 19 | | Domingo Giménez,
Vicente Hernández,
Robert A. van de Geijn,
Antonio M. Vidal:
A block Jacobi method on a mesh of processors.
Concurrency - Practice and Experience 9(5): 391-411 (1997) |
| 18 | | Almadena Yu. Chtchelkanova,
John A. Gunnels,
Greg Morrow,
James Overfelt,
Robert A. van de Geijn:
Parallel implementation of BLAS: general techniques for Level 3 BLAS.
Concurrency - Practice and Experience 9(9): 837-857 (1997) |
| 1996 |
| 17 | EE | Domingo Giménez,
Robert A. van de Geijn,
Vicente Hernández,
Antonio M. Vidal:
Exploiting the Symmetry on the Jacobi Method on a Mesh of Processors.
PDP 1996: 377-384 |
| 16 | | Michael Barnett,
David G. Payne,
Robert A. van de Geijn,
Jerrell Watts:
Broadcasting on Meshes with Wormhole Routing.
J. Parallel Distrib. Comput. 35(2): 111-122 (1996) |
| 1995 |
| 15 | | Kenneth Klimkowski,
Robert A. van de Geijn:
Anatomy of a Parallel Out-of-Core Dense Linear Solver.
ICPP (3) 1995: 29-33 |
| 14 | | Michael Barnett,
Richard J. Littlefield,
David G. Payne,
Robert A. van de Geijn:
Global Combine Algorithms for 2-D Meshes with Wormhole Routing.
J. Parallel Distrib. Comput. 24(2): 191-201 (1995) |
| 13 | | Jerrell Watts,
Robert A. van de Geijn:
A Pipelined Broadcast for Multidimensional Meshes.
Parallel Processing Letters 5: 281-292 (1995) |
| 1994 |
| 12 | EE | Michael Barnett,
Lance Shuler,
Satya Gupta,
David G. Payne,
Robert A. van de Geijn,
Jerrell Watts:
Building a high-performance collective communication library.
SC 1994: 107-116 |
| 11 | | Edward J. Barragy,
Graham F. Carey,
Robert A. van de Geijn:
Performance and Scalability of Finite Element Analysis for Distributed Parallel Computation.
J. Parallel Distrib. Comput. 21(2): 202-212 (1994) |
| 10 | | Robert A. van de Geijn:
On Global Combine Operations.
J. Parallel Distrib. Comput. 22(2): 324-328 (1994) |
| 9 | | Jack Dongarra,
Robert A. van de Geijn,
David W. Walker:
Scalability Issues Affecting the Design of a Dense Linear Algebra Library.
J. Parallel Distrib. Comput. 22(3): 523-537 (1994) |
| 1993 |
| 8 | | Michael Barnett,
Richard J. Littlefield,
David G. Payne,
Robert A. van de Geijn:
Global Combine on Mesh Architectures with Wormhole Routing.
IPPS 1993: 156-162 |
| 7 | | James Demmel,
Jack Dongarra,
Robert A. van de Geijn,
David W. Walker:
LAPACK for Distributed Memory Architectures: The Next Generation.
PPSC 1993: 323-329 |
| 6 | | Jack Dongarra,
Robert A. van de Geijn,
R. Clinton Whaley:
Two Dimensional Basic Linear Algebra Communication Subprograms.
PPSC 1993: 347-352 |
| 5 | | Michael Barnett,
Richard J. Littlefield,
David G. Payne,
Robert A. van de Geijn:
Efficient Communication Primitives on Mesh Architectures with Hardware Routing.
PPSC 1993: 943-948 |
| 4 | EE | John G. Lewis,
Robert A. van de Geijn:
Distributed memory matrix-vector multiplication and conjugate gradient algorithms.
SC 1993: 484-492 |
| 1992 |
| 3 | | Jack Dongarra,
Robert A. van de Geijn:
Reduction to condensed form for the eigenvalue problem on distributed memory architectures.
Parallel Computing 18(9): 973-982 (1992) |
| 1991 |
| 2 | | Ed Anderson,
Annamaria Benzoni,
Jack Dongarra,
Steve Moulton,
Susan Ostrouchov,
Bernard Tourancheau,
Robert A. van de Geijn:
LAPACK for Distributed Memory Architectures: Progress Report.
PPSC 1991: 625-630 |
| 1990 |
| 1 | EE | Duncan G. Hudson III,
Robert A. van de Geijn:
An asymptotically 100% efficient parallel implementation of the nonsymmetric QR algorithm.
SPDP 1990: 243-249 |