2009 |
16 | EE | Alfredo Buttari,
Julien Langou,
Jakub Kurzak,
Jack Dongarra:
A class of parallel tiled linear algebra algorithms for multicore architectures.
Parallel Computing 35(1): 38-53 (2009) |
15 | EE | Jakub Kurzak,
Wesley Alvaro,
Jack Dongarra:
Optimizing matrix multiplication for a short-vector SIMD architecture - CELL processor.
Parallel Computing 35(3): 138-150 (2009) |
2008 |
14 | EE | Wesley Alvaro,
Jakub Kurzak,
Jack Dongarra:
Fast and Small Short Vector SIMD Matrix Multiplication Kernels for the Synergistic Processing Element of the CELL Processor.
ICCS (1) 2008: 935-944 |
13 | EE | Alfredo Buttari,
Jack Dongarra,
Jakub Kurzak,
Piotr Luszczek,
Stanimire Tomov:
Using Mixed Precision for Sparse Matrix Computations to Enhance the Performance while Achieving 64-bit Accuracy.
ACM Trans. Math. Softw. 34(4): (2008) |
12 | EE | Marc Baboulin,
Alfredo Buttari,
Jack Dongarra,
Jakub Kurzak,
Julie Langou,
Julien Langou,
Piotr Luszczek,
Stanimire Tomov:
Accelerating Scientific Computations with Mixed Precision Algorithms
CoRR abs/0808.2794: (2008) |
11 | EE | Alfredo Buttari,
Julien Langou,
Jakub Kurzak,
Jack Dongarra:
Parallel tiled QR factorization for multicore architectures.
Concurrency and Computation: Practice and Experience 20(13): 1573-1590 (2008) |
10 | EE | Jakub Kurzak,
Alfredo Buttari,
Jack Dongarra:
Solving Systems of Linear Equations on the CELL Processor Using Cholesky Factorization.
IEEE Trans. Parallel Distrib. Syst. 19(9): 1175-1186 (2008) |
2007 |
9 | EE | Alfredo Buttari,
Julien Langou,
Jakub Kurzak,
Jack Dongarra:
Parallel Tiled QR Factorization for Multicore Architectures.
PPAM 2007: 639-648 |
8 | EE | Alfredo Buttari,
Julien Langou,
Jakub Kurzak,
Jack Dongarra:
A Class of Parallel Tiled Linear Algebra Algorithms for Multicore Architectures
CoRR abs/0709.1272: (2007) |
7 | EE | Jakub Kurzak,
Jack Dongarra:
Implementation of mixed precision in solving systems of linear equations on the Cell processor.
Concurrency and Computation: Practice and Experience 19(10): 1371-1385 (2007) |
2006 |
6 | EE | Alfredo Buttari,
Jack Dongarra,
Jakub Kurzak,
Julien Langou,
Piotr Luszczek,
Stanimire Tomov:
The Impact of Multicore on Math Software.
PARA 2006: 1-10 |
5 | EE | James Demmel,
Jack Dongarra,
Beresford N. Parlett,
William Kahan,
Ming Gu,
David Bindel,
Yozo Hida,
Xiaoye S. Li,
Osni Marques,
E. Jason Riedy,
Christof Vömel,
Julien Langou,
Piotr Luszczek,
Jakub Kurzak,
Alfredo Buttari,
Julie Langou,
Stanimire Tomov:
Prospectus for the Next LAPACK and ScaLAPACK Libraries.
PARA 2006: 11-23 |
4 | EE | Jakub Kurzak,
Jack Dongarra:
Implementing Linear Algebra Routines on Multi-core Processors with Pipelining and a Look Ahead.
PARA 2006: 147-156 |
3 | EE | Julie Langou,
Julien Langou,
Piotr Luszczek,
Jakub Kurzak,
Alfredo Buttari,
Jack Dongarra:
Tools and techniques for performance - Exploiting the performance of 32 bit floating point arithmetic in obtaining 64 bit accuracy (revisiting iterative refinement for linear systems).
SC 2006: 113 |
2 | EE | Alfredo Buttari,
Jakub Kurzak,
Jack Dongarra:
Poster reception - Targeting multi-core architectures for linear algebra applications.
SC 2006: 162 |
2005 |
1 | EE | Jakub Kurzak,
B. Montgomery Pettitt:
Massively parallel implementation of a fast multipole method for distributed memory machines.
J. Parallel Distrib. Comput. 65(7): 870-881 (2005) |