dblp.uni-trier.dewww.uni-trier.de

Per Stenström

List of publications from the DBLP Bibliography Server - FAQ
Coauthor Index - Ask others: ACM DL/Guide - CiteSeer - CSB - Google - MSN - Yahoo

2009
108EEPer Stenström: Transactions on High-Performance Embedded Architectures and Compilers II Springer 2009
107EEMartin Thuresson, Magnus Själander, Per Stenström: A Flexible Code Compression Scheme Using Partitioned Look-Up Tables. HiPEAC 2009: 95-109
106EEM. M. Waliullah, Per Stenström: Schemes for avoiding starvation in transactional memory systems. Concurrency and Computation: Practice and Experience 21(7): 859-873 (2009)
105EEPer Stenström, David B. Whalley: Introduction. T. HiPEAC 2: 3 (2009)
2008
104 Per Stenström, Michel Dubois, Manolis Katevenis, Rajiv Gupta, Theo Ungerer: High Performance Embedded Architectures and Compilers, Third International Conference, HiPEAC 2008, Göteborg, Sweden, January 27-29, 2008, Proceedings Springer 2008
103EEMartin Thuresson, Per Stenström: Accommodation of the Bandwidth of Large Cache Blocks Using Cache/Memory Link Compression. ICPP 2008: 478-486
102EEM. M. Waliullah, Per Stenström: Intermediate checkpointing with conflicting access prediction in transactional memory systems. IPDPS 2008: 1-11
101EEReinhard Wilhelm, Jakob Engblom, Andreas Ermedahl, Niklas Holsti, Stephan Thesing, David B. Whalley, Guillem Bernat, Christian Ferdinand, Reinhold Heckmann, Tulika Mitra, Frank Mueller, Isabelle Puaut, Peter P. Puschner, Jan Staschulat, Per Stenström: The worst-case execution-time problem - overview of methods and survey of tools. ACM Trans. Embedded Comput. Syst. 7(3): (2008)
100EEMartin Thuresson, Lawrence Spracklen, Per Stenström: Memory-Link Compression Schemes: A Value Locality Perspective. IEEE Trans. Computers 57(7): 916-927 (2008)
99EEFredrik Warg, Per Stenström: Dual-thread Speculation: A Simple Approach to Uncover Thread-level Parallelism on a Simultaneous Multithreaded Processor. International Journal of Parallel Programming 36(2): 166-183 (2008)
98EEMd. Mafijul Islam, Magnus Själander, Per Stenström: Early detection and bypassing of trivial operations to improve energy efficiency of processors. Microprocessors and Microsystems - Embedded Hardware Design 32(4): 183-196 (2008)
2007
97 Utpal Banerjee, José Moreira, Michel Dubois, Per Stenström: Proceedings of the 4th Conference on Computing Frontiers, 2007, Ischia, Italy, May 7-9, 2007 ACM 2007
96 Per Stenström, Michael F. P. O'Boyle, François Bodin, Marcelo Cintra, Sally A. McKee: Transactions on High-Performance Embedded Architectures and Compilers I Springer 2007
95 Koen De Bosschere, David R. Kaeli, Per Stenström, David B. Whalley, Theo Ungerer: High Performance Embedded Architectures and Compilers, Second International Conference, HiPEAC 2007, Ghent, Belgium, January 28-30, 2007, Proceedings Springer 2007
94EEMarco Galluzzi, Enrique Vallejo, Adrián Cristal, Fernando Vallejo, Ramón Beivide, Per Stenström, James E. Smith, Mateo Valero: Implicit Transactional Memory in Kilo-Instruction Multiprocessors. Asia-Pacific Computer Systems Architecture Conference 2007: 339-353
93EEShekhar Borkar, Norman P. Jouppi, Per Stenström: Microprocessors in the era of terascale integration. DATE 2007: 237-242
92EEM. M. Waliullah, Per Stenström: Starvation-Free Transactional Memory-System Protocols. Euro-Par 2007: 280-291
91EEHaakon Dybdahl, Per Stenström: An Adaptive Shared/Private NUCA Cache Partitioning Scheme for Chip Multiprocessors. HPCA 2007: 2-12
90EEMd. Mafijul Islam, Alexander Busck, Mikael Engbom, Simji Lee, Michel Dubois, Per Stenström: Loop-level Speculative Parallelism in Embedded Applications. ICPP 2007: 3
89EEMartin Thuresson, Magnus Själander, Magnus Björk, Lars J. Svensson, Per Larsson-Edefors, Per Stenström: FlexCore: Utilizing Exposed Datapath Control for Efficient Computing. ICSAMOS 2007: 18-25
88EEPer Stenström: IPDPS Panel: Is the Multi-Core Roadmap going to Live Up to its Promises? IPDPS 2007: 14
87EEMd. Mafijul Islam, Per Stenström: Energy and Performance Trade-offs between Instruction Reuse and Trivial Computations for Embedded Applications. SIES 2007: 86-93
86EEJianwei Chen, Michel Dubois, Per Stenström: SimWattch: Integrating Complete-System and User-Level Performance and Power Simulators. IEEE Micro 27(4): 34-48 (2007)
85EEJochen Hollmann, Anders Ardö, Per Stenström: Effectiveness of caching in a distributed digital library system. Journal of Systems Architecture 53(7): 403-416 (2007)
84EEPer Stenström: Introduction to Part 1. T. HiPEAC 1: 33 (2007)
83EEKoen De Bosschere, Wayne Luk, Xavier Martorell, Nacho Navarro, Michael F. P. O'Boyle, Dionisios N. Pnevmatikatos, Alex Ramírez, Pascal Sainrat, André Seznec, Per Stenström, Olivier Temam: High-Performance Embedded Architecture and Compilation Roadmap. T. HiPEAC 1: 5-29 (2007)
2006
82EEHaakon Dybdahl, Per Stenström: Enhancing Last-Level Cache Performance by Block Bypassing and Early Miss Determination. Asia-Pacific Computer Systems Architecture Conference 2006: 52-66
81EEJaeheon Jeong, Per Stenström, Michel Dubois: Simple penalty-sensitive replacement policies for caches. Conf. Computing Frontiers 2006: 341-352
80EEPer Stenström: Chip-multiprocessing and beyond. HPCA 2006: 109
79EEHaakon Dybdahl, Per Stenström, Lasse Natvig: A Cache-Partitioning Aware Replacement Policy for Chip Multiprocessors. HiPC 2006: 22-34
78EEMd. Mafijul Islam, Per Stenström: Reduction of Energy Consumption in Processors by Early Detection and Bypassing of Trivial Operations. ICSAMOS 2006: 28-34
77EEMartin Thuresson, Per Stenström: Scalable Value-Cache Based Compression Schemes for Multiprocessors. SBAC-PAD 2006: 117-124
76EEFredrik Warg, Per Stenström: Dual-Thread Speculation: Two Threads in the Machine are Worth Eight in the Bush. SBAC-PAD 2006: 91-98
75EEBurkhard Monien, Guang Gao, Horst Simon, Paul G. Spirakis, Per Stenström: Introduction. J. Parallel Distrib. Comput. 66(5): 615-616 (2006)
2005
74EEFredrik Warg, Per Stenström: Reducing misspeculation overhead for module-level speculative execution. Conf. Computing Frontiers 2005: 289-298
73EEMartin Thuresson, Per Stenström: Evaluation of extended dictionary-based static code compression schemes. Conf. Computing Frontiers 2005: 77-86
72EEPer Stenström: The Chip-Multiprocessing Paradigm Shift: Opportunities and Challenges. HiPEAC 2005: 5
71EEMagnus Ekman, Per Stenström: A Cost-Effective Main Memory Organization for Future Servers. IPDPS 2005
70EEMagnus Ekman, Per Stenström: A Robust Main-Memory Compression Scheme. ISCA 2005: 74-85
69EEMagnus Ekman, Per Stenström: Enhancing Multiprocessor Architecture Simulation Speed Using Matched-Pair Comparison. ISPASS 2005: 89-99
68EEFrank Mueller, Per Stenström: Introduction to the special issue. ACM Trans. Embedded Comput. Syst. 4(1): 1-2 (2005)
2004
67EEMartin Kämpe, Per Stenström, Michel Dubois: Self-correcting LRU replacement policies. Conf. Computing Frontiers 2004: 181-191
66EEMagnus Ekman, Per Stenström: A case for multi-level main memory. WMPI 2004: 1-8
65EEHåkan Grahn, Per Stenström: A comparative evaluation of hardware-only and software-only directory protocols in shared-memory multiprocessors. Journal of Systems Architecture 50(9): 537-561 (2004)
64EEJonas Jalminger, Per Stenström: A cache block reuse prediction scheme. Microprocessors and Microsystems 28(7): 373-385 (2004)
2003
63 Jochen Hollmann, Anders Ardö, Per Stenström: An Evaluation of Document Prefetching in a Distributed Digital Library. ECDL 2003: 276-287
62EEPer Stenström: One Chip, One Server: How Do We Exploit Its Power? HiPC 2003: 405
61EEJonas Jalminger, Per Stenström: A Novel Approach to Cache Block Reuse Predictions. ICPP 2003: 294-
60EEMagnus Ekman, Per Stenström: Performance and Power Impact of Issue-width in Chip-Multiprocessor Cores. ICPP 2003: 359-368
59EEJim Nilsson, Anders Landin, Per Stenström: The Coherence Predictor Cache: A Resource-Efficient and Accurate Coherence Prediction Infrastructure. IPDPS 2003: 10
58EEPeter Rundberg, Per Stenström: Speculative Lock Reordering: Optimistic Out-of-Order Execution of Critical Sections. IPDPS 2003: 11
57EEFredrik Warg, Per Stenström: Improving Speculative Thread-Level Parallelism Through Module Run-Length Prediction. IPDPS 2003: 12
56EEJianwei Chen, Michel Dubois, Per Stenström: Integrating complete-system and user-level performance/power simulators: the SimWattch approach. ISPASS 2003: 1-10
2002
55EEMartin Kämpe, Per Stenström, Michel Dubois: The FAB Predictor: Using Fourier Analysis to Predict the Outcome of Conditional Branches. HPCA 2002: 223-232
54EEJochen Hollmann, Anders Ardö, Per Stenström: Empirical Observations Regarding Predictability in User Access-Behavior in a Distributed Digital Library System. IPDPS 2002
53EEMagnus Ekman, Per Stenström, Fredrik Dahlgren: TLB and snoop energy-reduction using virtual caches in low-power chip-multiprocessors. ISLPED 2002: 243-246
52EEJonas Jalminger, Per Stenström: Improvement of energy-efficiency in off-chip caches by selective prefetching. Microprocessors and Microsystems 26(3): 107-121 (2002)
2001
51EEUlf Assarsson, Per Stenström: A Case Study of Load Distribution in Parallel View Frustum Culling and Collision Detection. Euro-Par 2001: 663-673
50EEFredrik Warg, Per Stenström: Limits on Speculative Module-Level Parallelism in Imperative and Object-Oriented Programs on CMP Platforms. IEEE PACT 2001: 221-230
49EEPeter Rundberg, Per Stenström: An All-Software Thread-Level Data Dependence Speculation System for Multiprocessors. J. Instruction-Level Parallelism 3: (2001)
2000
48EESilvia M. Müller, Per Stenström, Mateo Valero, Stamatis Vassiliadis: Parallel Computer Architecture. Euro-Par 2000: 537-538
47EEMagnus Karlsson, Fredrik Dahlgren, Per Stenström: A Prefetching Technique for Irregular Accesses to Linked Data Structures. HPCA 2000: 206-217
46EEAshley Saulsbury, Fredrik Dahlgren, Per Stenström: Recency-based TLB preloading. ISCA 2000: 117-127
45EEMagnus Karlsson, Per Stenström: An analytical model of the working-set sizes in decision-support systems. SIGMETRICS 2000: 275-285
44 Per Stenström, Erik Hagersten, David J. Lilja, Margaret Martonosi, Madan Venugopal: Shared-memory multiprocessing: Current state and future directions. Advances in Computers 53: 2-55 (2000)
43 Håkan Grahn, Per Stenström: Comparative Evaluation of Latency-Tolerating and -Reducing Techniques for Hardware-Only and Software-Only Directory Protocols. J. Parallel Distrib. Comput. 60(7): 807-834 (2000)
1999
42EEThomas Lundqvist, Per Stenström: Timing Anomalies in Dynamically Scheduled Microprocessors. IEEE Real-Time Systems Symposium 1999: 12-21
41EEThomas Lundqvist, Per Stenström: A Method to Improve the Estimated Worst-Case Performance of Data Caching. RTCSA 1999: 255-262
40 Jonas Skeppstedt, Fredrik Dahlgren, Per Stenström: Evaluation of Compiler-Controlled Updating to Reduce Coherence-Miss Penalties in Shared-Memory Multiprocessors. J. Parallel Distrib. Comput. 56(2): 122-143 (1999)
39 Thomas Lundqvist, Per Stenström: An Integrated Path and Timing Analysis Method based on Cycle-Level Symbolic Execution. Real-Time Systems 17(2-3): 183-207 (1999)
1998
38EEThomas Lundqvist, Per Stenström: Integrating Path and Timing Analysis Using Instruction-Level Simulation Techniques. LCTES 1998: 1-15
37 Fredrik Dahlgren, Michel Dubois, Per Stenström: Performance Evaluation and Cost Analysis of Cache Protocol Extensions for Shared-Memory Multiprocessors. IEEE Trans. Computers 47(10): 1041-1055 (1998)
1997
36 Per Stenström, Jonas Skeppstedt: A Performance Tuning Approach for Shared-Memory Multiprocessors. Euro-Par 1997: 72-83
35EEHåkan Grahn, Per Stenström: Relative Performance of Hardware and Software-Only Directory Protocols Under Latency Tolerating and Reducing Techniques. IPPS 1997: 500-
34 Fredrik Dahlgren, Per Stenström, Mårten Björkman: Reducing the Read-Miss Penalty for Flat COMA Protocols. Comput. J. 40(4): 208-219 (1997)
33 Per Stenström, Erik Hagersten, David J. Lilja, Margaret Martonosi, Madan Venugopal: Trends in Shared Memory Multiprocessing. IEEE Computer 30(12): 44-50 (1997)
32 Per Stenström, Mats Brorsson, Fredrik Dahlgren, Håkan Grahn, Michel Dubois: Boosting the Performance of Shared Memory Multiprocessors. IEEE Computer 30(7): 63-70 (1997)
31 Magnus Karlsson, Per Stenström: Effectivness of Dynamic Prefetching in Multiple-Writer Distributed Virtual Shared-Memory Systems. J. Parallel Distrib. Comput. 43(2): 79-93 (1997)
1996
30EEMagnus Karlsson, Per Stenström: Performance Evaluation of a Cluster-Based Multiprocessor Built from ATM Switches and Bus-Based Multiprocessor Servers. HPCA 1996: 4-13
29EEJonas Skeppstedt, Per Stenström: Using Dataflow Analysis Techniques to Reduce Ownership Overhead in Cache Coherence Protocols. ACM Trans. Program. Lang. Syst. 18(6): 659-682 (1996)
28 Per Stenström, Fredrik Dahlgren: Applications for Shared Memory Multiprocessors (Guest Editors' Introduction). IEEE Computer 29(12): 29-31 (1996)
27EEFredrik Dahlgren, Per Stenström: Evaluation of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors. IEEE Trans. Parallel Distrib. Syst. 7(4): 385-398 (1996)
26 Håkan Grahn, Per Stenström: Evaluation of a Competitive-Update Cache Coherence Protocol with Migratory Data Detection. J. Parallel Distrib. Comput. 39(2): 168-180 (1996)
25 Mats Brorsson, Per Stenström: Characterising and Modelling Shared Memory Accesses in Multiprocessor Programs. Parallel Computing 22(6): 869-893 (1996)
1995
24EEMårten Björkman, Fredrik Dahlgren, Per Stenström: Using hints to reduce the read miss penalty for flat COMA protocols. HICSS (1) 1995: 242-251
23 Fredrik Dahlgren, Per Stenström: Effectiveness of Hardware-Based Stride and Sequential Prefetching in Shared-Memory Multiprocessors. HPCA 1995: 68-77
22EEHåkan Grahn, Per Stenström: Efficient Strategies for Software-Only Protocols in Shared-Memory Multiprocessors. ISCA 1995: 38-47
21EEFredrik Dahlgren, Michel Dubois, Per Stenström: Sequential Hardware Prefetching in Shared-Memory Multiprocessors. IEEE Trans. Parallel Distrib. Syst. 6(7): 733-746 (1995)
20 Fredrik Dahlgren, Per Stenström: Using Write Caches to Improve Performance of Cache Coherence Protocols in Shared-Memory Multiprocessors. J. Parallel Distrib. Comput. 26(2): 193-210 (1995)
19 Michel Dubois, Jonas Skeppstedt, Per Stenström: Essential Misses and Data Traffic in Coherence Protocols. J. Parallel Distrib. Comput. 29(2): 108-125 (1995)
1994
18 Jonas Skeppstedt, Per Stenström: Simple Compiler Algorithms to Reduce Ownership Operhead in Cache Coherence Protocols. ASPLOS 1994: 286-296
17 Per Stenström: Introduction. HICSS (1) 1994: 520-521
16 Fong Pong, Per Stenström, Michel Dubois: An Integrated Methodology for the Verification of Directory-Based Cache Protocols. ICPP (1) 1994: 158-165
15 Fredrik Dahlgren, Per Stenström: Reducing the Write Traffic for a Hybrid Cache Protocol. ICPP (1) 1994: 166-173
14 Fredrik Dahlgren, Michel Dubois, Per Stenström: Combined Performance Gains of Simple Cache Protocol Extensions. ISCA 1994: 187-197
13 Håkan Nilsson, Per Stenström: An Adaptive Update-Based Cache Coherence Protocol for Reduction of Miss Rate and Traffic. PARLE 1994: 363-374
1993
12 Fredrik Dahlgren, Michel Dubois, Per Stenström: Fixed and Adaptive Sequential Prefetching in Shared Memory Multiprocessors. ICPP 1993: 56-63
11 Per Stenström, Mats Brorsson, Lars Sandberg: An Adaptive Cache Coherence Protocol Optimized for Migratory Sharing. ISCA 1993: 109-118
10 Michel Dubois, Jonas Skeppstedt, Livio Ricciulli, Krishnan Ramamurthy, Per Stenström: The Detection and Elimination of Useless Misses in Multiprocessors. ISCA 1993: 88-97
1992
9 Per Stenström: A Latency-Hiding Scheme for Multiprocessors with Buffered Multistage Networks. IPPS 1992: 39-42
8 Per Stenström, Truman Joe, Anoop Gupta: Comparative Performance Evaluation of Cache-Coherent NUMA and COMA Architectures. ISCA 1992: 80-91
7 Håkan Nilsson, Per Stenström: The Scalable Tree Protocol - A Cache Coherence Approach for Large-Scale Multiprocessors. SPDP 1992: 498-506
1991
6 Per Stenström, Fredrik Dahlgren, Lars Lundberg: A Lockup-Free Multiprocessor Cache Design. ICPP (1) 1991: 246-250
5EEFredrik Dahlgren, Per Stenström: On Reconfigurable On-Chip Data Caches. MICRO 1991: 189-198
1990
4 Per Stenström: A Survey of Cache Coherence Schemes for Multiprocessors. IEEE Computer 23(6): 12-24 (1990)
1989
3EEPer Stenström: A Cache Consistency Protocol for Multiprocessors with Multistage Networks. ISCA 1989: 407-415
1988
2 Per Stenström: Reducing Contention in Sharde-Memory Multiprocessors. IEEE Computer 21(11): 26-37 (1988)
1987
1 Per Stenström, Lars Philipson: A Layered Emulator for Design Evaluation of MIMD Multiprocessors with Shared Memory. PARLE (1) 1987: 329-344

Coauthor Index

1Anders Ardö [54] [63] [85]
2Ulf Assarsson [51]
3Utpal Banerjee [97]
4Ramón Beivide [94]
5Guillem Bernat [101]
6Magnus Björk [89]
7Mårten Björkman [24] [34]
8François Bodin [96]
9Shekhar Y. Borkar (Shekhar Borkar) [93]
10Koen De Bosschere (Koenraad De Bosschere) [83] [95]
11Mats Brorsson [11] [25] [32]
12Alexander Busck [90]
13Jianwei Chen [56] [86]
14Marcelo Cintra [96]
15Adrián Cristal [94]
16Fredrik Dahlgren [5] [6] [12] [14] [15] [20] [21] [23] [24] [27] [28] [32] [34] [37] [40] [46] [47] [53]
17Michel Dubois [10] [12] [14] [16] [19] [21] [32] [37] [55] [56] [67] [81] [86] [90] [97] [104]
18Haakon Dybdahl [79] [82] [91]
19Magnus Ekman [53] [60] [66] [69] [70] [71]
20Jakob Engblom [101]
21Mikael Engbom [90]
22Andreas Ermedahl [101]
23Christian Ferdinand [101]
24Marco Galluzzi [94]
25Guang Gao [75]
26Håkan Grahn [22] [26] [32] [35] [43] [65]
27Anoop Gupta [8]
28Rajiv Gupta [104]
29Erik Hagersten [33] [44]
30Reinhold Heckmann [101]
31Jochen Hollmann [54] [63] [85]
32Niklas Holsti [101]
33Md. Mafijul Islam [78] [87] [90] [98]
34Jonas Jalminger [52] [61] [64]
35Jaeheon Jeong [81]
36Truman Joe [8]
37Norman P. Jouppi [93]
38David R. Kaeli [95]
39Martin Kämpe [55] [67]
40Magnus Karlsson [30] [31] [45] [47]
41Manolis Katevenis [104]
42Anders Landin [59]
43Per Larsson-Edefors [89]
44Simji Lee [90]
45David J. Lilja [33] [44]
46Wayne Luk [83]
47Lars Lundberg [6]
48Thomas Lundqvist [38] [39] [41] [42]
49Margaret Martonosi [33] [44]
50Xavier Martorell [83]
51Sally A. McKee [96]
52Tulika Mitra [101]
53Burkhard Monien [75]
54José Moreira [97]
55Frank Mueller [68] [101]
56Silvia M. Müller [48]
57Lasse Natvig [79]
58Nacho Navarro [83]
59Håkan Nilsson [7] [13]
60Jim Nilsson [59]
61Michael F. P. O'Boyle [83] [96]
62Lars Philipson [1]
63Dionisios N. Pnevmatikatos [83]
64Fong Pong [16]
65Isabelle Puaut [101]
66Peter P. Puschner [101]
67Krishnan Ramamurthy [10]
68Alex Ramírez [83]
69Livio Ricciulli [10]
70Peter Rundberg [49] [58]
71Pascal Sainrat [83]
72Lars Sandberg [11]
73Ashley Saulsbury [46]
74André Seznec [83]
75Horst Simon [75]
76Magnus Själander [89] [98] [107]
77Jonas Skeppstedt [10] [18] [19] [29] [36] [40]
78James E. Smith [94]
79Paul G. Spirakis [75]
80Lawrence Spracklen [100]
81Jan Staschulat [101]
82Lars J. Svensson [89]
83Olivier Temam [83]
84Stephan Thesing [101]
85Martin Thuresson [73] [77] [89] [100] [103] [107]
86Theo Ungerer [95] [104]
87Mateo Valero [48] [94]
88Enrique Vallejo [94]
89Fernando Vallejo [94]
90Stamatis Vassiliadis [48]
91Madan Venugopal [33] [44]
92M. M. Waliullah [92] [102] [106]
93Fredrik Warg [50] [57] [74] [76] [99]
94David B. Whalley [95] [101] [105]
95Reinhard Wilhelm [101]

Colors in the list of coauthors

Copyright © Sun May 17 03:24:02 2009 by Michael Ley (ley@uni-trier.de)