2008 |
53 | EE | Keith D. Underwood:
From Silicon to Science: The Long Road to Production Reconfigurable Supercomputing.
ARC 2008: 2 |
52 | EE | Keith D. Underwood,
Michael Levenhagen,
K. Scott Hemmert,
Ron Brightwell:
High message rate, NIC-based atomics: Design and performance considerations.
CLUSTER 2008: 133-141 |
51 | EE | Michael J. Beauchamp,
Scott Hauck,
Keith D. Underwood,
K. Scott Hemmert:
Architectural Modifications to Enhance the Floating-Point Performance of FPGAs.
IEEE Trans. VLSI Syst. 16(2): 177-187 (2008) |
2007 |
50 | EE | K. Scott Hemmert,
Keith D. Underwood,
Arun Rodrigues:
An architecture to perform NIC based MPI matching.
CLUSTER 2007: 211-221 |
49 | EE | Kyle Rupnow,
Keith D. Underwood,
Katherine Compton:
Scientific Application Acceleration with Reconfigurable Functional Units.
FCCM 2007: 261-274 |
48 | EE | Keith D. Underwood,
Michael Levenhagen,
Arun Rodrigues:
Simulating Red Storm: Challenges and Successes in Building a System Simulation.
IPDPS 2007: 1-10 |
47 | EE | Keith D. Underwood,
Megan Vance,
Jonathan W. Berry,
Bruce Hendrickson:
Analyzing the Scalability of Graph Algorithms on Eldorado.
IPDPS 2007: 1-8 |
46 | EE | Keith D. Underwood,
Michael Levenhagen,
Ron Brightwell:
Evaluating NIC hardware requirements to achieve high message rate PGAS support on multi-core processors.
SC 2007: 36 |
45 | EE | K. Scott Hemmert,
Keith D. Underwood:
Floating-Point Divider Design for FPGAs.
IEEE Trans. VLSI Syst. 15(1): 115-118 (2007) |
2006 |
44 | EE | Steven J. Plimpton,
Ron Brightwell,
Courtenay Vaughan,
Keith D. Underwood,
Mike Davis:
A Simple Synchronous Distributed-Memory Algorithm for the HPCC RandomAccess Benchmark.
CLUSTER 2006 |
43 | EE | Arun Rodrigues,
Kyle Wheeler,
Peter M. Kogge,
Keith D. Underwood:
Fine-Grained Message Pipelining for Improved MPI Performance.
CLUSTER 2006 |
42 | EE | K. Scott Hemmert,
Keith D. Underwood:
Open Source High Performance Floating-Point Modules.
FCCM 2006: 349-350 |
41 | EE | Michael J. Beauchamp,
Scott Hauck,
Keith D. Underwood,
K. Scott Hemmert:
Embedded floating-point units in FPGAs.
FPGA 2006: 12-20 |
40 | EE | Michael J. Beauchamp,
Scott Hauck,
Keith D. Underwood,
K. Scott Hemmert:
Architectural Modifications to Improve Floating-Point Unit Efficiency in FPGAs.
FPL 2006: 1-6 |
39 | EE | Kyle Rupnow,
Arun Rodrigues,
Keith D. Underwood,
Katherine Compton:
Scientific applications vs. SPEC-FP: a comparison of program behavior.
ICS 2006: 66-74 |
38 | EE | Ron Brightwell,
Douglas Doerfler,
Keith D. Underwood:
A preliminary analysis of the InfiniPath and XD1 network interfaces.
IPDPS 2006 |
37 | EE | Keith D. Underwood:
Challenges and Issues in Benchmarking MPI.
PVM/MPI 2006: 339-346 |
36 | EE | Keith D. Underwood,
K. Scott Hemmert,
Craig Ulmer:
Tools and techniques for performance - Architectures and APIs: assessing requirements for delivering FPGA performance to applications.
SC 2006: 111 |
35 | EE | Arun Rodrigues,
Richard C. Murphy,
Peter M. Kogge,
Keith D. Underwood:
Poster reception - The structural simulation toolkit: exploring novel architectures.
SC 2006: 157 |
34 | EE | Tarek A. El-Ghazawi,
Dave Bennett,
Daniel S. Poznanovic,
Allan Cantle,
Keith D. Underwood,
Rob Pennington,
Duncan A. Buell,
Alan George,
Volodymyr V. Kindratenko:
Reconfigurable supercomputing - Is high-performance reconfigurable computing the next supercomputing paradigm?
SC 2006: 71 |
33 | EE | Ron Brightwell,
Kevin T. Pedretti,
Keith D. Underwood,
Trammell Hudson:
SeaStar Interconnect: Balanced Bandwidth for Scalable Performance.
IEEE Micro 26(3): 41-57 (2006) |
32 | EE | Ron Brightwell,
Sue Goudy,
Arun Rodrigues,
Keith D. Underwood:
Implications of application usage characteristics for collective communication offload.
IJHPCN 4(3/4): 104-116 (2006) |
2005 |
31 | EE | Keith D. Underwood,
Arun Rodrigues,
K. Scott Hemmert:
Accelerating List Management for MPI.
CLUSTER 2005: 1-10 |
30 | EE | Ron Brightwell,
Trammell Hudson,
Kevin T. Pedretti,
Rolf Riesen,
Keith D. Underwood:
Implementation and Performance of Portals 3.3 on the Cray XT3.
CLUSTER 2005: 1-10 |
29 | EE | K. Scott Hemmert,
Keith D. Underwood:
An Analysis of the Double-Precision Floating-Point FFT on FPGAs.
FCCM 2005: 171-180 |
28 | EE | Michael Haselman,
Michael J. Beauchamp,
Aaron Wood,
Scott Hauck,
Keith D. Underwood,
K. Scott Hemmert:
A Comparison of Floating Point and Logarithmic Number Systems for FPGAs.
FCCM 2005: 181-190 |
27 | EE | Ron Brightwell,
Kevin T. Pedretti,
Keith D. Underwood:
Initial Performance Evaluation of the Cray SeaStar Interconnect.
Hot Interconnects 2005: 51-57 |
26 | EE | Ron Brightwell,
Sue Goudy,
Keith D. Underwood:
A Preliminary Analysis of the MPI Queue Characteristics of Several Applications.
ICPP 2005: 175-183 |
25 | EE | William Lawry,
Keith D. Underwood:
Considering the Relative Importance of Network Performance and Network Features.
ICPP 2005: 329-337 |
24 | EE | Richard C. Murphy,
Arun Rodrigues,
Peter M. Kogge,
Keith D. Underwood:
The implications of working set analysis on supercomputing memory hierarchy design.
ICS 2005: 332-340 |
23 | EE | Keith D. Underwood,
K. Scott Hemmert,
Arun Rodrigues,
Richard C. Murphy,
Ron Brightwell:
A Hardware Acceleration Unit for MPI Queue Processing.
IPDPS 2005 |
22 | EE | Arun Rodrigues,
Richard C. Murphy,
Ron Brightwell,
Keith D. Underwood:
Enhancing NIC Performance for MPI using Processing-in-Memory.
IPDPS 2005 |
21 | EE | Krishna Muriki,
Keith D. Underwood,
Ron Sass:
RC-BLAST: Towards a Portable, Cost-Effective Open Source Hardware Implementation.
IPDPS 2005 |
2004 |
20 | EE | Ron Brightwell,
Douglas Doerfler,
Keith D. Underwood:
A comparison of 4X InfiniBand and Quadrics Elan-4 technologies.
CLUSTER 2004: 193-204 |
19 | EE | Keith D. Underwood,
K. Scott Hemmert:
Closing the Gap: CPU and FPGA Trends in Sustainable Floating-Point BLAS Performance.
FCCM 2004: 219-228 |
18 | EE | Keith D. Underwood:
FPGAs vs. CPUs: trends in peak floating-point performance.
FPGA 2004: 171-180 |
17 | EE | Keith D. Underwood,
Ron Brightwell:
The Impact of MPI Queue Usage on Message Latency.
ICPP 2004: 152-160 |
16 | EE | Arun Rodrigues,
Richard C. Murphy,
Peter M. Kogge,
Keith D. Underwood:
Characterizing a new class of threads in scientific applications for high end supercomputers.
ICS 2004: 164-174 |
15 | EE | Ron Brightwell,
Keith D. Underwood:
An analysis of the impact of MPI overlap and independent progress.
ICS 2004: 298-305 |
14 | EE | Ron Brightwell,
Keith D. Underwood:
An Analysis of NIC Resource Usage for Offloading MPI.
IPDPS 2004 |
13 | EE | Ron Brightwell,
Keith D. Underwood,
Rolf Riesen:
An Initial Analysis of the Impact of Overlap and Independent Progress for MPI.
PVM/MPI 2004: 370-377 |
12 | EE | Keith D. Underwood,
Walter B. Ligon III,
Ron R. Sass:
An Analysis of the Cost Effectiveness of an Adaptable Computing Cluster.
Cluster Computing 7(4): 357-371 (2004) |
2003 |
11 | EE | Ron Brightwell,
Rolf Riesen,
Keith D. Underwood,
Trammell Hudson,
Patrick G. Bridges,
Arthur B. Maccabe:
A Performance Comparison of Linux and a Lightweight Kernel.
CLUSTER 2003: 251-258 |
10 | EE | Arun Rodrigues,
Richard C. Murphy,
Peter M. Kogge,
Jay B. Brockman,
Ron Brightwell,
Keith D. Underwood:
Implications of a PIM Architectural Model for MPI.
CLUSTER 2003: 259- |
9 | EE | Ranjesh G. Jaganathan,
Keith D. Underwood,
Ron R. Sass:
A Configurable Network Protocol for Cluster Based Communications using Modular Hardware Primitives on an Intelligent NIC.
FCCM 2003: 286-287 |
8 | EE | Ron Brightwell,
Keith D. Underwood:
Evaluation of an Eager Protocol Optimization for MPI.
PVM/MPI 2003: 327-334 |
7 | EE | Ranjesh G. Jaganathan,
Keith D. Underwood,
Ron Sass:
A Configurable Network Protocol for Cluster Based Communications using Modular Hardware Primitives on an Intelligent NIC.
SC 2003: 22 |
6 | | Keith D. Underwood,
Walter B. Ligon III,
Ron Sass:
Analysis of a prototype intelligent network interface.
Concurrency and Computation: Practice and Experience 15(7-8): 751-777 (2003) |
2002 |
5 | EE | Peter Bellows,
Jaroslav Flidr,
Tom Lehman,
Brian Schott,
Keith D. Underwood:
GRIP: A Reconfigurable Architecture for Host-Based Gigabit-Rate Packet Processing.
FCCM 2002: 121-130 |
2001 |
4 | EE | Keith D. Underwood,
Ron Sass,
Walter B. Ligon III:
A Reconfigurable Extension to the Network Interface of Beowulf Clusters.
CLUSTER 2001: 212- |
3 | EE | Keith D. Underwood,
Ron R. Sass,
Walter B. Ligon III:
Cost effectiveness of an adaptable computing cluster.
SC 2001: 54 |
1998 |
2 | EE | Walter B. Ligon III,
Scott McMillan,
Greg Monn,
Kevin Schoonover,
Fred Stivers,
Keith D. Underwood:
A Re-evaluation of the Practicality of Floating-Point Operations on FPGAs.
FCCM 1998: 206-215 |
1 | EE | Walter B. Ligon III,
Greg Monn,
S. P. McMillan,
Kevin Schoonover,
Fred Stivers,
Keith D. Underwood:
Implementation of IEEE Single-Precision Floating-Point Operations on FPGAs (Abstract).
FPGA 1998: 258 |