Hybrid Analysis of Memory Reference Patterns
Lawrence Rauchwerger,
Hao Yu,
Dongmin Zhang
Description
One of the most important operations in scientific applications is the reduction operation, and the parallelization of reductions is crucial to the overall performance of parallel codes. In this work, we adapt reduction parallelization to the actual reference pattern found from executing the reduction loop. The following picture describes our algorithm.
Reduction Algorithm
In order to recognize the access pattern, we have defined the following parameters:
Reduction Parameters
In addition, we have developed three new methods for reduction parallelization:
Reduction Decision Tree
Experimental Results
The experimental results show that our adaptive modeling scheme works rather well most of the time. Although it failed to select the best reduction parallelization scheme in some cases, experimental results showed that the absolute performance of our choice was quite close to the best scheme in these cases.
An Adaptive Algorithm Selection Framework, Hao Yu, Dongmin Zhang, Lawrence Rauchwerger, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), Antibes Juan-les-Pins, France, Sep 2004.
Proceedings(ps, pdf, abstract)
An Adaptive Algorithm Selection Framework, Hao Yu, Dongmin Zhang, Francis Dang, Lawrence Rauchwerger, Technical Report, TR04-002, Parasol Laboratory, Department of Computer Science, Texas A&M University, Mar 2004.
Technical Report(ps, pdf, abstract)
Architectural Support for Parallel Reductions in Scalable Shared-Memory Multiprocessors, Maria Jesus Garzaran, Milos Prvulovic, Ye Zhang, Alin Jula, Hao Yu, Lawrence Rauchwerger, Josep Torrellas, In Proc. IEEE Int.Conf. on Parallel Architectures and Compilation Techniques (PACT), Barcelona, Spain, Sep 2001.
Proceedings(ps, pdf, abstract)
Adaptive Reduction Parallelization Techniques, Hao Yu, Lawrence Rauchwerger, In Proc. ACM Int. Conf.
Supercomputing (ICS), pp. 66-77, Santa Fe, New Mexico, USA, May 2000.
Proceedings(ps, pdf, abstract)
Run-time Parallelization Optimization Techniques, Hao Yu, Lawrence Rauchwerger, In Wkshp. on Lang. and Comp.
for Par. Comp. (LCPC), San Diego, CA, Aug 1999.
Proceedings(ps, pdf, abstract)