Cámara, Jesús; Cuenca, Javier; Giménez, Domingo; García, Luis Pedro; Vidal Maciá, Antonio Manuel(Springer Verlag (Germany), 2014-06)
The introduction of auto-tuning techniques in linear algebra shared-memory routines is analyzed. Information obtained in the installation of the routines is used at running time to take some decisions to reduce the total ...
[EN] We provide a practical demonstration that it is possible to systematically generate a variety of high-performance micro-kernels for the general matrix multiplication (gemm) via generic templates which can be easily ...