Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning

Cámara, J.; Cuenca, J.; Giménez, D.; García, LP.; Vidal Maciá, AM. (2014). Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning. International Journal of Parallel Programming. 42(3):408-434. https://doi.org/10.1007/s10766-013-0249-6

Resumen

The introduction of auto-tuning techniques in linear algebra shared-memory routines is analyzed. Information obtained in the installation of the routines is used at running time to take some decisions to reduce the total execution time. The study is carried out with routines at different levels (matrix multiplication, LU and Cholesky factorizations and linear systems symmetric or general routines) and with calls to routines in the LAPACK and PLASMA libraries with multithread implementations. Medium NUMA and large cc-NUMA systems are used in the experiments. This variety of routines, libraries and systems allows us to obtain general conclusions about the methodology to use for linear algebra shared-memory routines auto-tuning. Satisfactory execution times are obtained with the proposed methodology.

Descripción

The final publication is available at Springer via http://dx.doi.org/10.1007/s10766-013-0249-6

Palabras clave

Linear algebra libraries, Linear algebra routines, Empirical installation, Shared-memory, Auto-tuning

Fuente

International Journal of Parallel Programming issn: 0885-7458

DOI

10.1007/s10766-013-0249-6

Versión del editor

http://dx.doi.org/10.1007/s10766-013-0249-6

Colecciones

Artículos, conferencias, monografías

Página completa del ítem

Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning

Archivos

Fecha

Autores

Directores

Editores

Otras autorías

Unidades organizativas

Compartir

Handle

Cita bibliográfica

Titulación