Publications
Publications related to Scientific Computing, STAPL (Standard Template Adaptive Parallel Library), Parallel Computing, and Compiler Optimization

  In a few cases a paper appears more than once in the list. Please note that the copyrights of the papers with full citations have been signed over to the publishers.


Bounded Asynchrony and Nested Parallelism for Scalable Graph Processing, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. Supercomputing (SC), Doctoral Showcase Poster, Nov 2017.
Proceedings(pdf, abstract)

Fast Approximate Distance Queries in Unweighted Graphs using Bounded Asynchrony, Adam Fidel, Francisco Coral, Colton Riedel, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Sep 2016.
Proceedings(pdf, abstract)

MPMD Framework for Offloading Load Balance Computation, Olga Pearce, Todd Gamblin, Bronis de Supinski, Nancy Amato, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), Chicago, IL, USA, May 2016.
Proceedings(pdf, abstract)

An Algorithmic Approach to Communication Reduction in Parallel Graph Algorithms (Conference Best Paper Finalist), Harshvardhan, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), San Francisco, CA, Oct 2015.
Proceedings(pdf, abstract)

Finding Schedule Sensitive Branches, Jeff Huang, Lawrence Rauchwerger, In Proc. of the 2015 10th Joint Meeting on Foundations of Software Engineering , pp. 439-449, New York, Sep 2015.
Proceedings(pdf, abstract)

Asynchronous Nested Parallelism for Dynamic Applications in Distributed Memory, Ioannis Papadopoulos, Nathan Thomas, Adam Fidel, Dielli Hoxha, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 106-121, Chapel Hill, NC, Sep 2015.
Proceedings(pdf, abstract)

Composing Algorithmic Skeletons to Express High-Performance Scientific Applications (Conference Best Paper), Mani Zandifar, Mustafa Abdujabbar, Alireza Majidi, David Keyes, Nancy M. Amato, Lawrence Rauchwerger, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 415--424, Newport Beach, CA, USA, Jun 2015.
Proceedings(pdf, abstract)

STAPL-RTS: An Application Driven Runtime System, Ioannis Papadopoulos, Nathan Thomas, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 425-434 , Newport Beach, CA, USA, Jun 2015.
Proceedings(pdf, ppt, abstract)

A Hybrid Approach To Processing Big Data Graphs on Memory-Restricted Systems, Harshvardhan, Brandon West, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), pp. 799-808, Hyderabad, India, May 2015.
Proceedings(pdf, abstract)

Scalable conditional induction variables (CIV) analysis , Cosmin Oancea, Lawrence Rauchwerger, In Int. Symposium on Code Generation and Optimization, pp. 213-224, San Francisco, CA, Feb 2015.
Proceedings(pdf, abstract)

A Hierarchical Approach to Reducing Communication in Parallel Graph Algorithms, Harshvardhan, Nancy M. Amato, Lawrence Rauchwerger, In Proc. ACM SIGPLAN Symp. Prin. Prac. Par. Prog. (PPOPP), pp. 285-286 (Poster), San Francisco, CA, USA, Jan 2015.
Proceedings(pdf, abstract)

Efficient, Reachability-based, Parallel Algorithms for Finding Strongly Connected Components, Daniel Tomkins, Timmie Smith, Nancy M. Amato, Lawrence Rauchwerger, Technical Report, TR15-002, Parasol Laboratory, Department of Computer Science, Texas A&M University, College Station, TX 77843-3112, Jan 2015.
Technical Report(pdf, abstract)

Faster Parallel Traversal of Scale Free Graphs at Extreme Scale with Vertex Delegates, Roger Pearce, Maya Gokhale, Nancy M. Amato, In Proc. Supercomputing (SC), New Orleans, LA, Nov 2014.
Proceedings(pdf, abstract)

The STAPL Skeleton Framework, Mani Zandifar, Nathan Thomas, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 176--190, Hillsboro, OR, USA, Sep 2014.
Proceedings(pdf, abstract)

KLA: A New Algorithmic Paradigm for Parallel Graph Computations (Conference Best Paper), Harshvardhan, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 27-38, Edmonton, AB, Canada, Aug 2014.
Proceedings(pdf, abstract)

Processing Big Data Graphs on Memory-Restricted Systems, Harshvardhan, Nancy M. Amato, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 517-518 (SRC Poster), Edmonton, AB, Canada, Aug 2014.
Proceedings(pdf, abstract)

From Petascale to the Pocket: Adaptively Scaling Parallel Programs for Mobile SoCs, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), SRC Poster, Aug 2014.
Proceedings(pdf)

Load Balancing N-Body Simulations with Highly Non-Uniform Density, Olga Pearce, Todd Gamblin, Bronis de Supinski, Nancy M. Amato, Tom Arsenlis, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 113-122 , Munchen, Germany, Jun 2014.
Proceedings(pdf, abstract)

Validation of Full-Domain Massively Parallel Transport Sweep Algorithms, W Hawkins, Marvin Adams, Michael Adams, Timmie Smith, Nancy Amato, Lawrence Rauchwerger, Teresa Bailey, Peter Brown, Adam Kunen, In Trans. Amer. Nucl. Soc., pp. 699-792, 2014.
Proceedings(pdf, abstract)

Using Load Balancing to Scalably Parallelize Sampling-Based Motion Planning Algorithms, Adam Fidel, Sam Ade Jacobs, Shishir Sharma, Nancy M. Amato, Lawrence Rauchwerger, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), Phoenix, Arizona, USA, May 2014.
Proceedings(pdf, abstract)

Scaling Techniques for Massive Scale-Free Graphs in Distributed (External) Memory, Roger Pearce, Maya Gokhale, Nancy M. Amato, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), May 2013.
Proceedings(pdf, abstract)

Provably Optimal Parallel Transport Sweeps on Regular Grids, W Hawkins, Timmie Smith, Michael Adams, Lawrence Rauchwerger, Nancy Amato, Marvin Adams, Teresa Bailey, Robert Falgout, In Proc. Int. Conf. on Math. Meth. and Supercomp. for Nuc. App., Idaho, May 2013.
Proceedings(pdf, abstract)

Load Balancing Techniques for Scalable Parallelization of Sampling-Based Motion Planning Algorithms, Adam Fidel, Sam Ade Jacobs, Shishir Sharma, Lawrence Rauchwerger, Nancy M. Amato, Technical Report, TR13-002 , Parasol Laboratory, Department of Computer Science, Texas A&M University, Mar 2013.
Technical Report(pdf, abstract)

Efficient Massively Parallel Transport Sweeps, W Hawkins, Timmie Smith, Michael Adams, Lawrence Rauchwerger, Nancy Amato, Marvin Adams, Trans. Amer. Nucl. Soc., 107(1):477-481, Nov 2012.
Journal(pdf)

The STAPL Parallel Graph Library, Harshvardhan, Adam Fidel, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Tokyo, Japan, Sep 2012.
Proceedings(pdf, abstract)

Quantifying the Effectiveness of Load Balance Algorithms, Olga Pearce, Todd Gamblin, Bronis de Supinski, Martin Schulz, Nancy M. Amato, In Proc. ACM Int. Conf. Supercomputing (ICS), Jun 2012.
Proceedings(pdf, abstract)

Logical inference techniques for loop parallelization, Lawrence Rauchwerger, Cosmin Oancea, In Prog. Lang. Design and Implementation (PLDI), pp. 509-520, New York, NY, USA, Jun 2012.
Proceedings(pdf, abstract)

A Hybrid Approach to Proving Memory Reference Monotonicity, Cosmin Oancea, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Fort Collins, Colorado, U.S.A., Sep 2011.
Proceedings(pdf, abstract)

Radiative Effects in Radiative Shocks in Shock Tubes, R Drake, Forrest Doss, Ryan McClarren, Marvin Adams, Nancy Amato, Derek Bingham, C Chou, Carlos DiStefano, Krzysztof Fidkowski, Bruce Fryxell, Tamas Gombosi, Michael Grosskopf, James Holloway, Bart van der Holst, Channing Huntington, Smadar Karni, Christine Krauland, Carolyn Kuranz, Edward Larsen, Bram van Leer, Bani Mallick, D Marion, William Martin, Jim Morel, Eric Myra, Vijay Nair, Kenneth Powell, Lawrence Rauchwerger, Philip Roe, Erica Rutter, Igor Sokolov, Quentin Stout, Ben Torralva, Gabor Toth, Katsuyo Thornton, Anthony Visco, , 7(3):130-140, 2011.
Journal(pdf, abstract)

Speculative Parallelization of Loops, Encyclopedia of Parallel Computing, Lawrence Rauchwerger, , 1901--1912, 2011.
Journal(pdf)

The STAPL Parallel Container Framework, Gabriel Tanase, Antal Buss, Adam Fidel, Harshvardhan, Ioannis Papadopoulos, Olga Pearce, Timmie Smith, Nathan Thomas, Xiabing Xu, Nedhal Mourad, Jeremy Vu, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Proc. ACM SIGPLAN Symp. Prin. Prac. Par. Prog. (PPOPP), Feb 2011.
Proceedings(pdf, abstract)

Multithreaded Asynchronous Graph Traversal for In-Memory and Semi-External Memory, Roger Pearce, Maya Gokhale, Nancy M. Amato, In Proc. Supercomputing (SC), New Orleans, LA, Nov 2010.
Proceedings(pdf, abstract)

The STAPL pView, Antal Buss, Adam Fidel, Harshvardhan, Timmie Smith, Gabriel Tanase, Nathan Thomas, Xiabing Xu, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Oct 2010. Also, Technical Report, TR10-001, Parasol Laboratory, Department of Computer Science, Texas A&M University, Jul 2010.
Proceedings(pdf, abstract) Technical Report(pdf, abstract)

STAPL: Standard Template Adaptive Parallel Library, Antal Buss, Harshvardhan, Ioannis Papadopoulos, Olga Tkachyshyn, Timmie Smith, Gabriel Tanase, Nathan Thomas, Xiabing Xu, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Haifa Experimental Systems Conference, Haifa, Israel, May 2010.
Proceedings(pdf, abstract)

The STAPL pList, Gabriel Tanase, Xiabing Xu, Antal Buss, Harshvardhan, Ioannis Papadopoulos, Olga Tkachyshyn, Timmie Smith, Nathan Thomas, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Newark, Delaware, Oct 2009.
Proceedings(pdf, abstract)

Two Memory Allocators that Use Hints to Improve Locality, Alin Jula, Lawrence Rauchwerger, In International Symposium on Memory Management, Trinity College Dublin, Dublin, Ireland, Jun 2009.
Proceedings(pdf, abstract)

Implementation of Sensitivity Analysis for Automatic Parallelization., Silvius Rus, Maikel Pennings, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 316-330, Edmonton, Canada, Aug 2008.
Proceedings(pdf, abstract)

Design for Interoperability in STAPL : pMatrices and Linear Algebra Algorithms, Antal Buss, Timmie Smith, Gabriel Tanase, Nathan Thomas, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Jul 2008.
Proceedings(pdf, abstract)

Design and Use of htalib – A Library for Hierarchically Tiled Arrays, Ganesh Bikshandy, Jia Guo, Christoph von Praun, Gabriel Tanase, Basilio Fraguela, Maria Jesus Garzaran, David Padua, Lawrence Rauchwerger, Lecture Notes in Computer Science, 4382/2007:17-32, 2008.
Journal(pdf, abstract)

Design for Interoperability in STAPL: pMatrices and Linear Algebra Algorithms, Antal Buss, Timmie Smith, Gabriel Tanase, Nathan Thomas, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, Lecture Notes in Computer Science, 5335/2008:304-315, 2008.
Journal(pdf, abstract)

Associative Parallel Containers In STAPL, Gabriel Tanase, Chidambareswaran (Chids) Raman, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, Lecture Notes in Computer Science, 5234/2008:156-171, 2008. Also, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Oct 2007.
Journal(pdf, abstract) Proceedings(pdf, abstract)

Balancing Allocation Speed, Locality and Fragmentation in a Locality Improving Allocator, Alin Jula, Lawrence Rauchwerger, Technical Report, TR08-002, Department of Computer Science and Engineering, Texas A&M University, Feb 2008.
Technical Report(pdf, abstract)

The STAPL pArray, Gabriel Tanase, Mauro Bianco, Nancy M. Amato, Lawrence Rauchwerger, In Proc. of Workshop MEDEA, pp. 81-88, Brasov, Romania, Sep 2007.
Proceedings(pdf, abstract)

Sensitivity Analysis for Automatic Parallelization on Multi-Cores, Silvius Rus, Maikel Pennings, Lawrence Rauchwerger, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 11, Jun 2007.
Proceedings(pdf, abstract)

How to Focus on Memory Allocation Strategies, Alin Jula, Lawrence Rauchwerger, Technical Report, TR07-003, Department of Computer Science, Texas A&M University, College Station, TX 77843-3112, Jun 2007.
Technical Report(pdf, abstract)

Sensitivity Analysis for Migrating Programs to Multi-Cores, Silvius Rus, Marinus Pennings, Lawrence Rauchwerger, Technical Report, TR06-015, Parasol Laboratory, Department of Computer Science, Texas A&M University, College Station, TX, Dec 2006.
Technical Report(ps, pdf, abstract)

Custom Memory Allocation for Free: Improving Data Locality with Container-Centric Memory Allocation, Alin Jula, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), New Orleans, Louisiana, Nov 2006.
Proceedings(pdf, abstract)

ARMI: A High Level Communication Library for STAPL, Nathan Thomas, Steven Saunders, Tim Smith, Gabriel Tanase, Lawrence Rauchwerger, Parallel Processing Letters, 16(2):261-280, Jun 2006.
Journal(ps, pdf, abstract)

An Adaptive Algorithm Selection Framework for Reduction Parallelization, Hao Yu, Lawrence Rauchwerger, Transactions on Parallel and Distributed Systems, 17(19):1084 - 1096 , 2006.
Journal(pdf, abstract)

Region Array SSA, Silvius Rus, Guobin He, Lawrence Rauchwerger, Technical Report, TR06-007, Parasol Laboratory, Department of Computer Science, Texas A&M University, College Station, TX, May 2006.
Technical Report(ps, pdf, abstract)

SmartApps: Middleware for Adaptive Applications on Reconfigurable Platforms, Lawrence Rauchwerger, Nancy Amato, Operating System Reviews, 40(2):73-82, Apr 2006.
Journal(pdf, abstract)

Defero - a Semantic Driven Memory Allocator, Alin Jula, Lawrence Rauchwerger, Technical Report, TR06-001, Department of Computer Science and Engineering, Texas A&M University, Parasol Lab, Department of Computer Science, Texas A&M University, College Station, TX 77843-3112, Jan 2006.
Technical Report(abstract)

Parallel Protein Folding with STAPL, Shawna Thomas, Gabriel Tanase, Lucia K. Dale, Jose M. Moreira, Lawrence Rauchwerger, Nancy M. Amato, Concurrency and Computation: Practice and Experience, 17(14):1643-1656, Dec 2005.
Journal(ps, pdf, abstract)

Hybrid Dependence Analysis for Automatic Parallelization, Silvius Rus, Lawrence Rauchwerger, Technical Report, TR05-013, Parasol Laboratory, Department of Computer Science, Texas A&M University, Nov 2005.
Technical Report(ps, pdf, ppt, abstract)

Tradeoffs In Buffering Speculative Memory State for Thread-Level Speculation in Multiprocessors, Maria Jesus Garzaran, Milos Prvulovic, Jose Llaberia, Victor Vinals, Lawrence Rauchwerger, Josep Torrellas, Transactions on Architecture and COde Optimization (TACO), 2(3):247-279, Sep 2005.
Journal(pdf, abstract)

An Experimental Evaluation of the HP V-Class and SGI Origin 2000 Multiprocessors using Microbenchmarks and Scientific Applications, Ravi Iyer, Jack Perdue, Nancy M. Amato, Lawrence Rauchwerger, Laxmi Bhuyan, International Journal of Parallel Programming, 33(4):307-350, 2005.
Journal(ps, pdf, abstract)

Finding strongly connected components in distributed graphs, Bruce Hendrickson, Steve Plimpton, William McLendon, Lawrence Rauchwerger, Distributed Computing, 65(8):901-910, Aug 2005.
Journal(pdf, abstract)

A Framework for Adaptive Algorithm Selection in STAPL, Nathan Thomas, Gabriel Tanase, Olga Tkachyshyn, Jack Perdue, Nancy M. Amato, Lawrence Rauchwerger, In Proc. ACM SIGPLAN Symp. Prin. Prac. Par. Prog. (PPOPP), pp. 277-288, Chicago, Illinois, Jun 2005.
Proceedings(ps, pdf, abstract)

Parallel Reduction: An Application of Adaptive Algorithm Selection, Hao Yu, Francis Dang, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 188-202, Springer-Verlag, 2005.
Proceedings(pdf, abstract)

Scalable Array SSA and Array Data Flow Analysis, Silvius Rus, Guobin He, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 397-412, Hawthorne, NY, 2005.
Proceedings(pdf, abstract)

An Adaptive Algorithm Selection Framework, Hao Yu, Dongmin Zhang, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), Antibes Juan-les-Pins, France, Sep 2004.
Proceedings(ps, pdf, abstract)

The Value Evolution Graph and its Use in Memory Reference Analysis, Silvius Rus, Dongmin Zhang, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 243-254, Antibes Juan-les-Pins, France, Sep 2004.
Proceedings(ps, pdf, abstract)

Automatic Parallelization Using the Value Evolution Graph, Silvius Rus, Dongmin Zhang, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), West Lafayette, Indiana, Sep 2004.
Proceedings(ps, pdf, abstract)

An Adaptive Algorithm Selection Framework, Hao Yu, Dongmin Zhang, Francis Dang, Lawrence Rauchwerger, Technical Report, TR04-002, Parasol Laboratory, Department of Computer Science, Texas A&M University, Mar 2004.
Technical Report(ps, pdf, abstract)

Software Logging under Speculative Parallelization, Maria Jesus Garzaran, Milos Prvulovic, Victor Vinals, Jose Llaberia, Lawrence Rauchwerger, Josep Torrellas, High Performance Memory Systems, 181-193, Nov 2003. Also, In Proc. of Workshop on High Performance Memory Systems, Goteborg, Sweden, Jun 2001.
Proceedings(ps, pdf, abstract)

Using Software Logging to Support Multi-Version Buffering in Thread-Level Speculation, Maria Jesus Garzaran, Milos Prvulovic, Victor Vinals, Jose Llaberia, Lawrence Rauchwerger, Josep Torrellas, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), pp. 170-181, New Orleans, LA, Sep 2003.
Proceedings(ps, pdf, abstract)

Hybrid Analysis: Static & Dynamic Memory Reference Analysis, Silvius Rus, Lawrence Rauchwerger, Jay Hoeflinger, International Journal of Parallel Programming, 31(4):251-283, Aug 2003. Also, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 274-284, New York City, Jun 2002. Also, Technical Report, TR02-002, Parasol Laboratory, Department of Computer Science, Texas A&M University, Jan 2002.
Journal(pdf, ppt, abstract) Proceedings(ps, pdf, ppt, abstract) Technical Report(ps, pdf, ppt, abstract)

ARMI: An Adaptive, Platform Independent Communication Library, Steven Saunders, Lawrence Rauchwerger, In Proc. ACM SIGPLAN Symp. Prin. Prac. Par. Prog. (PPOPP), pp. 12, San Diego, CA, Jun 2003.
Proceedings(ps, pdf, abstract)

STAPL: An Adaptive, Generic Parallel C++ Library, Ping An, Alin Jula, Silvius Rus, Steven Saunders, Tim Smith, Gabriel Tanase, Nathan Thomas, Nancy Amato, Lawrence Rauchwerger, Lecture Notes in Computer Science, 2624/2003:195-210, 2003. Also, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 193-208, Cumberland Falls, Kentucky, Aug 2001.
Journal(pdf, abstract) Proceedings(ps, pdf, abstract)

Tradeoffs in Buffering Memory State for Thread-Level Speculation in Multiprocessors, Maria Jesus Garzaran, Milos Prvulovic, Jose Llaberia, Victor Vinals, Lawrence Rauchwerger, Josep Torrellas, In Proc. High Performance Computer Architecture (HPCA), pp. 191-202, Anaheim, CA, Feb 2003.
Proceedings(ps, pdf, abstract)

A parallel communication infrastructure for STAPL, Steven Saunders, Lawrence Rauchwerger, In Wkshp. on Perf. Opt. for High-Level Lang. and Lib. (POHLL), New York, NY, Jun 2002.
Proceedings(ps, pdf, ppt, abstract)

The R-LRPD Test: Speculative Parallelization of Partially Parallel Loops, Francis Dang, Hao Yu, Lawrence Rauchwerger, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), Fort Lauderdale, FL, Apr 2002. Also, Technical Report, TR02-001, Department of Computer Science and Engineering, Texas A&M University, College Station, TX, Jan 2002.
Proceedings(ps, pdf, abstract) Technical Report(ps, pdf, abstract)

SmartApps, An Application Centric Approach to High Performance Computing: Compiler-Assisted Software and Hardware Support for Reduction Operations, Francis Dang, Maria Jesus Garzaran, Milos Prvulovic, Ye Zhang, Alin Jula, Hao Yu, Nancy Amato, Lawrence Rauchwerger, Josep Torrellas, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), pp. 172-181, Fort Lauderdale, Florida, Apr 2002.
Proceedings(ps, pdf, abstract)

Adaptive Parallel Sorting in the STAPL library, Steven Saunders, Nathan Thomas, Nancy Amato, Lawrence Rauchwerger, Technical Report, TR01-005, Department of Computer Science and Engineering, Texas A&M University, Nov 2001.
Technical Report(abstract)

Architectural Support for Parallel Reductions in Scalable Shared-Memory Multiprocessors, Maria Jesus Garzaran, Milos Prvulovic, Ye Zhang, Alin Jula, Hao Yu, Lawrence Rauchwerger, Josep Torrellas, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), Barcelona, Spain, Sep 2001.
Proceedings(ps, pdf, abstract)

Finding Strongly Connected Components in Parallel in Particle Transport Sweeps, William McLendon, Bruce Hendrickson, Steve Plimpton, Lawrence Rauchwerger, In Proc. ACM Symp. Par. Alg. Arch. (SPAA), pp. 328-329, Crete, Greece, Jul 2001.
Proceedings(ps, pdf, abstract)

STAPL: A Standard Template Adaptive Parallel C++ Library, Ping An, Alin Jula, Silvius Rus, Steven Saunders, Tim Smith, Gabriel Tanase, Nathan Thomas, Nancy Amato, Lawrence Rauchwerger, In Int. Wkshp on Adv. Compiler Technology for High Perf. and Embedded Processors, pp. 10, Bucharest, Romania, Jul 2001.
Proceedings(ps, pdf, abstract)

Removing Architectural Bottlenecks to the Scalability of Speculative Parallelization, Milos Prvulovic, Maria Jesus Garzaran, Lawrence Rauchwerger, Josep Torrellas, In Proc. IEEE Int. Symp. Comput. Arch. (ISCA), pp. 204-215, Goteborg, Sweden, Jun 2001.
Proceedings(pdf, abstract)

Run-time Assisted Interprocedural Analysis of Memory Access Patterns, Silvius Rus, Lawrence Rauchwerger, J Hoeflinger, Technical Report, TR01-001, Department of Computer Science and Engineering, Texas A&M University, College Station, TX, 2001.
Technical Report(ps, pdf, abstract)

Identifying Strongly Connected Components in Parallel, William McLendon, Bruce Hendrickson, Steve Plimpton, Lawrence Rauchwerger, In Proc. SIAM Conference on Parallel Processing for Scientific Computing, Portsmouth, VA, Mar 2001.
Proceedings(ps, pdf, abstract)

SmartApps: An Application Centric Approach to High Performance Computing, Lawrence Rauchwerger, Nancy Amato, Josep Torrellas, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Yorktown Heights, New York, USA, Aug 2000.
Proceedings(ps, pdf, abstract)

Predicting Performance on SMPs. A Case Study: The SGI Power Challenge, Nancy M. Amato, Jack Perdue, Andrea Pietracaprina, Geppino Pucci, Mark Mathis, In Proc. Int. Par. and Dist. Proc. Symp. (IPDPS), pp. 729-737, Cancun, Mexico, May 2000. Also, Technical Report, TR99-020, Department of Computer Science and Engineering, Texas A&M University, Oct 1999.
Proceedings(ps, pdf, abstract) Technical Report(ps, pdf, abstract)

A General Performance Model for Parallel Sweeps on Orthogonal Grids for Particle Transport Calculations, Mark M. Mathis, Nancy M. Amato, Marvin Adams, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 255-263, Santa Fe, NM, May 2000. Also, Technical Report, TR00-004, Parasol Laboratory, Department of Computer Science, Texas A&M University, Dec 1999.
Proceedings(ps, pdf, abstract) Technical Report(ps, pdf, abstract)

Task Scheduling and Parallel Mesh-Sweeps in Transport Computations, Nancy M. Amato, Ping An, Technical Report, TR00-009, Department of Computer Science and Engineering, Texas A&M University, Jan 2000.
Technical Report(ps, pdf)

Speculative Parallelization of Partially Parallel Loops, Francis Dang, Lawrence Rauchwerger, In Wkshp. on Lang. Comp. and Run-time Sys. for Scal. Comp. (LCR)., Rochester, New York, USA, May 2000.
Proceedings(ps, pdf, abstract)

Adaptive Reduction Parallelization Techniques, Hao Yu, Lawrence Rauchwerger, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 66-77, Santa Fe, New Mexico, USA, May 2000.
Proceedings(ps, pdf, abstract)

Techniques for Reducing the Overhead of Run-time Parallelization, Hao Yu, Lawrence Rauchwerger, In Int. Conf. Compiler Construction (CC), pp. 232, Berlin, Germany, Mar 2000.
Proceedings(ps, pdf, abstract)

An Experimental Evaluation of the HP V-Class and SGI Origin 2000 Multiprocessors using Microbenchmarks and Scientific Applications, Ravi Iyer, Nancy Amato, Lawrence Rauchwerger, Laxmi Bhuyan, In Annual Conference of Hewlett Packard's High Performance Computing User Group, San Jose, CA, Mar 2000.
Proceedings(abstract)

A Unified Approach to Speculative Parallelization of Loops in DSM Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, Technical Report, 1546, Center for Supercomputing Research & Development, University of Illinois, Oct 1999.
Technical Report(ps, pdf, abstract)

Run-time Parallelization Optimization Techniques, Hao Yu, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), San Diego, CA, Aug 1999.
Proceedings(ps, pdf, abstract)

Comparing the Memory System Performance of the HP V-Class and SGI Origin 2000 Multiprocessors using Microbenchmarks and Scientific Applications, Ravi Iyer, Nancy M. Amato, Lawrence Rauchwerger, Laxmi Bhuyan, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 9, Rhodes, Greece, Jun 1999.
Proceedings(ps, pdf, abstract)

Comparing the Memory System Performance of the HP V-Class and SGI Origin 2000 Multiprocessors using Microbenchmarks and Scientific Applications, Ravi Iyer, Nancy M. Amato, Lawrence Rauchwerger, Laxmi Bhuyan, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 339-347, Rhodes, Greece, Jun 1999.
Proceedings(ps, pdf, abstract)

A Cost Model for Communication on a Symmetric MultiProcessor, Nancy M. Amato, Andrea Pietracaprina, Geppino Pucci, Lucia K. Dale, Jack Perdue, Technical Report, TR98-004, Department of Computer Science and Engineering, Texas A&M University, Presented at SPAA Revue, 1998., Jan 1998.
Technical Report(ps, pdf, abstract)

Hindsight Helps: Deterministic Task Scheduling with Backtracking, Yueh-O Wang, Nancy M. Amato, D. K. Friesen, In Workshop on Challenges for Parallel Processing, Aug 1997.
Proceedings(ps, pdf)

A Comparison of Parallel Sorting Algorithms on Different Architectures, Nancy M. Amato, Ravi Iyer, Sharad Sundaresan, Yan Wu, Technical Report, TR98-029, Department of Computer Science and Engineering, Texas A&M University, Jan 1996.
Technical Report(ps, pdf, abstract)

Improved Processor Bounds for Parallel Algorithms for Weighted Digraphs, Nancy M. Amato, Information Processing Letters, 45:147-152, 1993.
Journal(pdf, abstract)

The Aggregate LRPD Test, Hao Yu, Lawrence Rauchwerger, Technical Report, TR99-025, Department of Computer Science and Engineering, Texas A&M University, College Station, TX, Jun 1999.
Technical Report(ps, pdf, abstract)

Hardware for Speculative Reduction Parallelization and Optimization in DSM Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, In Workshop for Parallel Computing for Irregular Applications, Orlando, FL, 1999. Also, Technical Report, 1557, Center for Supercomputing Research & Development, University of Illinois, Jan 1999.
Technical Report(ps, pdf, abstract)

Hardware for Speculative Parallelization of Partially-Parallel Loops in DSM Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, In Proc. High Performance Computer Architecture (HPCA), pp. 135-141, Orlando, FL, 1999.
Proceedings(ps, pdf)

Implementation Issues of Loop-level Speculative Run-time Parallelization, Devang Patel, Lawrence Rauchwerger, In Int. Conf. Compiler Construction (CC), Amsterdam, The Netherlands, Mar 1999.
Proceedings(ps, pdf, abstract)

Hardware for Speculative Parallelization in High-End Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, In The Third PetaFlop Workshop, Annapolis, MD, Feb 1999.
Proceedings(ps, abstract)

Speculative Parallel Execution of Loops with Cross-Iteration Dependences in DSM Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, Technical Report, 1536, University of Illinois, Jan 1999.
Technical Report(ps, pdf, abstract)

Principles of Speculative Run-time Parallelization, Devang Patel, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Chapel Hill, NC, Aug 1998.
Proceedings(ps, pdf, abstract)

Run-Time Parallelization: It's Time Has Come, Lawrence Rauchwerger, Parallel Computing, 24(3-4):527-556, 1998.
Journal(ps, pdf, abstract)

Standard Templates Adaptive Parallel Library (STAPL), Lawrence Rauchwerger, Francisco Arzu, K Ouchi, In Wkshp. on Lang. Comp. and Run-time Sys. for Scal. Comp. (LCR)., pp. 402-410, LNCS 1511. Pittsburgh, PA, May 1998.
Proceedings(ps, pdf, abstract)

Hardware for Speculative Parallelization in Large- and Small- Multiprocessors, Ye Zhang, Venkata Krishnan, Lawrence Rauchwerger, Josep Torrellas, In Workshop on Scalable Shared Memory Multiprocessors, Barcelona, Spain, Apr 1998.
Proceedings(abstract)

Hardware for Speculative Run-Time Parallelization in Distributed Shared-Memory Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, In Proc. High Performance Computer Architecture (HPCA), pp. 162-174, Las Vegas, NV, Feb 1998. Also, Technical Report, 1523, University of Illinois, Jul 1997.
Proceedings(ps, pdf, abstract) Technical Report(ps, pdf, abstract)

Hardware for Speculative Run-Time Parallelization in Distributed Hardware for Speculative Run-Time Parallelization in Distributed Shared-Memory Multiprocessors, Ye Zhang, Lawrence Rauchwerger, Josep Torrellas, In Proc. Int. Conf. on the Sim. and Syn. of Living Sys. (Alife), pp. 12, Las Vegas, NE, Feb 1998.
Proceedings(ps, pdf, abstract)

Parallel Programming with Polaris, W Blume, R Doallo, R Eigenmann, J Grout, J Hoeflinger, T Lawrence, J Lee, David Padua, Y Paek, B Pottenger, Lawrence Rauchwerger, IEEE Computer, 29(12):78-82, Dec 1996.
Journal(abstract)

Restructuring Programs for High-Speed Computers with Polaris, William Blume, Rudolf Eigenmann, Keith Faigin, J Grout, J Lee, T Lawrence, Jay Hoeflinger, David Padua, Y Paek, Paul Petersen, B Pottenger, Lawrence Rauchwerger, P Tu, S Weatherford, In Workshop on Challenges for Parallel Processing, pp. 149-162, Aug 1996.
Proceedings(pdf, abstract)

Advanced Program Restructuring for High-Performance Computers with Polaris, W Blume, R Doallo, R Eigenmann, J Grout, J Hoeflinger, T Lawrence, J Lee, David Padua, Y Paek, B Pottenger, Lawrence Rauchwerger, P Tu, Technical Report, 1473, Univ. of Illinois at Urbana-Champaign, Center for Supercomputing Res. & Dev., Jan 1996.
Technical Report(ps, pdf, abstract)

Run-Time Methods for Parallelizing Partially Parallel Loops, Lawrence Rauchwerger, Nancy M. Amato, David A. Padua, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 137-146, Barcelona, Spain, Aug 1995.
Proceedings(ps, pdf, abstract)

Run-Time Parallelization: A Framework for Parallel Computation, Lawrence Rauchwerger, Ph.D. Thesis, Center for Supercomputing Research & Development, University of Illinois, Urbana-Champaign, IL, Aug 1995.
Ph.D. Thesis(ps, pdf, abstract)

A Scalable Method for Run-Time Loop Parallelization, Lawrence Rauchwerger, Nancy M. Amato, David A. Padua, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 137-146, Barcelona, Spain, Jul 1995. Also, International Journal of Parallel Programming, 23(6):537-576, 1995.
Proceedings(ps, pdf, abstract)

The LRPD Test: Speculative Run-Time Parallelization of Loops with Privatization and Reduction Parallelization, Lawrence Rauchwerger, David Padua, In Prog. Lang. Design and Implementation (PLDI), pp. 218-232, La Jolla, CA, Jun 1995.
Proceedings(ps, pdf, abstract)

Effective Automatic Parallelization with Polaris, William Blume, R Eigenmann, K Faigin, J Grout, J Hoeflinger, David Padua, P Petersen, B Pottenger, Lawrence Rauchwerger, P Tu, S Weatherford, International Journal of Parallel Programming, 28, May 1995.
Journal(ps, pdf, abstract)

Parallelizing While Loops for Multiprocessor Systems, Lawrence Rauchwerger, David Padua, In Proc. IEEE Int. Par. Proc. Symp. (IPPS), pp. 347-356, Santa Barbara, CA, Apr 1995. Also, Technical Report, 1349, Center for Supercomputing Research & Development, University of Illinois, Urbana-Champaign, IL, Oct 1994.
Proceedings(ps, pdf, abstract) Technical Report(ps, pdf, abstract)

Run-Time Methods for Parallelizing DO Loops, Lawrence Rauchwerger, David A. Padua, In Proc. Int. Wkshp. on Massive Parallelism (IWMP), pp. 1-15, Capri, Italy, Oct 1994.
Proceedings(ps, pdf, abstract)

The Privatizing DOALL Test: A Run-Time Technique for DOALL Loop Identification and Array Privatization, Lawrence Rauchwerger, David Padua, Technical Report, 1383, Center for Supercomputing Research & Development, University of Illinois, Oct 1994. Also, In Proc. ACM Int. Conf. Supercomputing (ICS), pp. 33-43, Manchester, England, Jul 1994.
Technical Report(ps, pdf, abstract) Proceedings(ps, pdf, abstract)

Automatic Detection of Parallelism: A Grand Challenge for High-Performance Computing, William Blume, Rudolf Eigenmann, Jay Hoeflinger, David Padua, Paul Petersen, Lawrence Rauchwerger, Peng Tu, IEEE Parallel and Distributed Technology, 2(3):37-47, Sep 1994.
Journal(ps, pdf, abstract)

Polaris: The Next Generation in Parallelizing Compilers, Bill Blume, Rudolf Eigenmann, Keith Faigin, John Grout, Jay Hoeflinger, David Padua, Paul Petersen, Bill Pottenger, Lawrence Rauchwerger, Peng Tu, Stephen Weatherford, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), pp. 10.1-10.18, Ithaca, New York, Aug 1994.
Proceedings(ps, pdf, abstract)

Polaris: Improving the Effectiveness of Parallelizing Compilers, William Blume, Rudolf Eigenmann, Keith Faigin, John Grout, Jay Hoeflinger, David Padua, Paul Petersen, William Pottenger, Lawrence Rauchwerger, Peng Tu, Stephen Weatherford, In Wkshp. on Lang. and Comp. for Par. Comp. (LCPC), Ithaca, New York, Aug 1994.
Proceedings(ps, pdf, abstract)

Speculative Run-Time Parallelization of Loops, Lawrence Rauchwerger, David Padua, Technical Report, 1339, Center for Supercomputing Research & Development, University of Illinois, Mar 1994.
Technical Report(ps, pdf, abstract)

Measuring limits of parallelism and characterizing its vulnerability to resource constraints, Lawrence Rauchwerger, P. Dubey, R. Nair, In Proc. Int. Sym. on Microarchitecture (MICRO 26), pp. 105-117, Austin, Texas, Dec 1993.
Proceedings(ps, pdf, abstract)

Perfect BenchmarksTM: Instrumented Version, Priyamvada Sinvhal-Sharma, Lawrence Rauchwerger, John Larson, Technical Report, 1152, Center for Supercomputing Research & Development, University of Illinois, Urbana-Champaign, Sep 1991.
Technical Report(ps, pdf, abstract)

pi-PERFECT: The Portably Instrumented Perfect Benchmarks, Lawrence Rauchwerger, Technical Report, 1150, Center for Supercomputing Research & Development, University of Illinois, Univ. of Illinois at Urbana-Champaign, Center for Supercomputing Res. & Dev., Sep 1991.
Technical Report(ps, pdf, abstract)

A multiple floating point coprocessor architecture, Lawrence Rauchwerger, P. Michael Farmwald, In Proc. Sym. Microprogramming and Microarchitecture (MICRO 23), pp. 216-222, Orlando, Florida, Nov 1990.
Proceedings(ps, pdf, abstract)