2009 |
29 | EE | Stephen L. Scott,
Christian Engelmann,
Geoffroy Vallée,
Thomas Naughton,
Anand Tikotekar,
George Ostrouchov,
Chokchai Leangsuksun,
Nichamon Naksinehaboon,
Raja Nassar,
Mihaela Paun,
Frank Mueller,
Chao Wang,
Arun Babu Nagarajan,
Jyothish Varma:
A tunable holistic resiliency approach for high-performance computing systems.
PPOPP 2009: 305-306 |
2008 |
28 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin He:
Symmetric Active/Active Replication for Dependent Services.
ARES 2008: 260-267 |
27 | EE | Geoffroy Vallée,
Kulathep Charoenpornwattana,
Christian Engelmann,
Anand Tikotekar,
Chokchai Leangsuksun,
Thomas Naughton,
Stephen L. Scott:
A Framework for Proactive Fault Tolerance.
ARES 2008: 659-664 |
26 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin He:
Symmetric Active/Active High Availability for High-Performance Computing System Services: Accomplishments and Limitations.
CCGRID 2008: 813-818 |
25 | EE | Anand Tikotekar,
Geoffroy Vallée,
Thomas Naughton,
Hong Ong,
Christian Engelmann,
Stephen L. Scott:
An Analysis of HPC Benchmarks in Virtual Machine Environments.
Euro-Par Workshops 2008: 63-71 |
24 | EE | Björn Könning,
Christian Engelmann,
Stephen L. Scott,
Al Geist:
Virtualized Environments for the Harness High Performance Computing Workbench.
PDP 2008: 133-140 |
23 | EE | Geoffroy Vallée,
Thomas Naughton,
Christian Engelmann,
Hong Ong,
Stephen L. Scott:
System-Level Virtualization for High Performance Computing.
PDP 2008: 636-643 |
22 | EE | Chao Wang,
Frank Mueller,
Christian Engelmann,
Stephen L. Scott:
Proactive process-level live migration in HPC environments.
SC 2008: 43 |
2007 |
21 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin He:
On Programming Models for Service-Level High Availability.
ARES 2007: 999-1008 |
20 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin He:
Transparent Symmetric Active/Active Replication for Service-Level High Availability.
CCGRID 2007: 755-760 |
19 | EE | Li Ou,
Xubin He,
Christian Engelmann,
Stephen L. Scott:
A Fast Delivery Protocol for Total Order Broadcasting.
ICCCN 2007: 730-734 |
18 | EE | Arun Babu Nagarajan,
Frank Mueller,
Christian Engelmann,
Stephen L. Scott:
Proactive fault tolerance for HPC with Xen virtualization.
ICS 2007: 23-32 |
17 | EE | Chao Wang,
Frank Mueller,
Christian Engelmann,
Stephen L. Scott:
A Job Pause Service under LAM/MPI+BLCR for Transparent Fault Tolerance.
IPDPS 2007: 1-10 |
16 | EE | Christian Engelmann,
Hong Ong,
Stephen L. Scott:
Middleware in Modern High Performance Computing System Architectures.
International Conference on Computational Science (2) 2007: 784-791 |
15 | EE | Emanuele Di Saverio,
Marco Cesati,
Christian Di Biagio,
Guido Pennella,
Christian Engelmann:
Distributed Real-Time Computing with Harness.
PVM/MPI 2007: 281-288 |
14 | EE | Xubin (Ben) He,
Li Ou,
Martha J. Kosa,
Stephen L. Scott,
Christian Engelmann:
A unified multiple-level cache for high performance storage systems.
IJHPCN 5(1/2): 97-109 (2007) |
2006 |
13 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin (Ben) He:
Active/Active Replication for Highly Available HPC System Services.
ARES 2006: 639-645 |
12 | EE | Kai Uhlemann,
Christian Engelmann,
Stephen L. Scott:
JOSHUA: Symmetric Active/Active Replication for Highly Available HPC Job and Resource Management.
CLUSTER 2006 |
11 | EE | Ronald Baumann,
Christian Engelmann,
Al Geist:
A Parallel Plug-In Programming Paradigm.
HPCC 2006: 823-832 |
10 | EE | Jyothish Varma,
Chao Wang,
Frank Mueller,
Christian Engelmann,
Stephen L. Scott:
Scalable, fault tolerant membership for MPI tasks on HPC systems.
ICS 2006: 219-228 |
9 | EE | Christian Engelmann,
Al Geist:
RMIX: A Dynamic, Heterogeneous, Reconfigurable Communication Framework.
International Conference on Computational Science (2) 2006: 573-580 |
8 | EE | Christian Engelmann,
Stephen L. Scott,
Chokchai Leangsuksun,
Xubin (Ben) He:
Symmetric Active/Active High Availability for High-Performance Computing System Services.
JCP 1(8): 43-54 (2006) |
7 | EE | Christian Engelmann,
Stephen L. Scott,
David E. Bernholdt,
Narasimha Raju Gottumukkala,
Chokchai Leangsuksun,
Jyothish Varma,
Chao Wang,
Frank Mueller,
Aniruddha G. Shet,
P. Sadayappan:
MOLAR: adaptive runtime support for high-end computing operating and runtime systems.
Operating Systems Review 40(2): 63-72 (2006) |
2005 |
6 | EE | Kshitij Limaye,
Box Leangsuksun,
Zeno Greenwood,
Stephen L. Scott,
Christian Engelmann,
Richard Libby,
Kasidit Chanchio:
Job-Site Level Fault Tolerance for Cluster and Grid environments.
CLUSTER 2005: 1-9 |
5 | EE | Christian Engelmann,
Al Geist:
A Lightweight Kernel for the Harness Metacomputing Framework.
IPDPS 2005 |
4 | EE | Christian Engelmann,
Al Geist:
Super-Scalable Algorithms for Computing on 100,000 Processors.
International Conference on Computational Science (1) 2005: 313-321 |
3 | | Hertong Song,
Chokchai Leangsuksun,
Raja Nassar,
Yudan Liu,
Christian Engelmann,
Stephen L. Scott:
UML-based Beowulf Cluster Availability Modeling.
Software Engineering Research and Practice 2005: 161-167 |
2003 |
2 | EE | Christian Engelmann,
Al Geist:
A Diskless Checkpointing Algorithm for Super-scale Architectures Applied to the Fast Fourier Transform.
CLADE 2003: 47 |
2002 |
1 | EE | Christian Engelmann,
Stephen L. Scott,
G. A. Geist II:
Distributed Peer-to-Peer Control in Harness.
International Conference on Computational Science (2) 2002: 720-728 |