On-the-fly elimination of dynamic irregularities for GPU computing
- 5 March 2011
- proceedings article
- Published by Association for Computing Machinery (ACM)
- Vol. 39 (1), 369-380
- https://doi.org/10.1145/1950365.1950408
Abstract
No abstract available
This publication has 18 references indexed in Scilit:
- A GPGPU compiler for memory optimization and parallelism management. Published by Association for Computing Machinery (ACM), 2010
- Increasing memory miss tolerance for SIMD cores. Published by Association for Computing Machinery (ACM), 2009
- OpenMP to GPGPU. Published by Association for Computing Machinery (ACM), 2009
- A compiler framework for optimization of affine loop nests for gpgpus. Published by Association for Computing Machinery (ACM), 2008
- Scalable Parallel Programming with CUDA. Queue, 2008
- Optimization principles and application performance evaluation of a multithreaded GPU using CUDA. Published by Association for Computing Machinery (ACM), 2008
- CUDA-Lite: Reducing GPU Programming Complexity. Published by Springer Nature, 2008
- Lattice Boltzmann based PDE solver on the GPU. The Visual Computer, 2007
- Improving effective bandwidth through compiler enhancement of global cache reuse. Journal of Parallel and Distributed Computing, 2003
- The Elements of Statistical Learning. Published by Springer Nature, 2001