Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems

RiuNet: Institutional repository of the Polytechnic University of Valencia

Do Carmo Boratto, M.; Alonso-Jordá, P.; Giménez Cánovas, D.; Barreto, M. (2014). Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems. Journal of Supercomputing. 70(2):733-745. https://doi.org/10.1007/s11227-014-1191-0

Please use this identifier to cite or link to this item: http://hdl.handle.net/10251/59312

Item Metadata

Title: Automatic routine tuning to represent landform attributes on multicore and multi-GPU systems
Author: Do Carmo Boratto, Murilo; Alonso-Jordá, Pedro; Giménez Cánovas, Domingo; Barreto, Marcos
UPV Unit: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Issued date: 2014
Abstract:
Auto-tuning techniques have been used in the design of routines in recent years. The goal is to develop routines which automatically adapt to the conditions of the computational system in such a way that efficient executions ...
Subjects: Auto-tuning, Landform representation, Parallel computing, Performance estimation, Multicore, Multi-GPU
Copyrights: Closed
Source: Journal of Supercomputing (ISSN: 0920-8542)
DOI: 10.1007/s11227-014-1191-0
Publisher: Springer Verlag (Germany)
Publisher version: http://dx.doi.org/10.1007/s11227-014-1191-0
Project ID:
info:eu-repo/grantAgreement/MINECO//TEC2012-38142-C04-01/ES/PROCESADO DISTRIBUIDO Y COLABORATIVO DE SEÑALES SONORAS: CONTROL ACTIVO/
Description: The final publication is available at Springer via http://dx.doi.org/10.1007/s11227-014-1191-0
Thanks:
This work has been partially supported by the European Union ERDF and the Spanish Government through the TEC2012-38142-C04 project.
Type: Article

References

Alberti P, Alonso P, Vidal A, Cuenca J, Giménez D (2004) Designing polylibraries to speed up parallel computations. Int J High Perform Comput Appl 1(1/2/3):75–84

Frigo M, Johnson S (1998) FFTW: an adaptive software architecture for the FFT. Proc IEEE Int Conf Acoust Speech Signal Process 3:1381–1384

Garland M (2010) Parallel computing with CUDA. In: IPDPS, pp 10–26

Jerez S, Montávez JP, Giménez D (2009) Optimizing the execution of a parallel meteorology simulation code. In: IPDPS. IEEE Computer Society, Los Alamitos, CA, USA

Nogueira L, Abrantes RP, Leal B (2008) A methodology of distributed processing using a mathematical model for landform attributes representation. In: Proceeding of the IADIS International Conference on applied computing, pp 17–21

Nogueira L, Abrantes RP, Leal B, Goulart C (2008) A model of landform attributes representation for application in distributed systems. In: Proceeding of the IADIS International Conference on applied computing

Rawlings JO, Pantula SG, Dickey DA (1998) Applied regression analysis: a research tool. Springer, London

Rufino I, Galvao C, Rego J, Albuquerque J (2009) Water resources and urban planning: the case of a coastal area in Brazil. J Urban Environ Eng 3:32–42

Song F, Tomov S, Dongarra J (2011) Efficient support for matrix computations on heterogeneous multicore and multi-GPU architectures. Tech Rep 250, LAPACK working note

Whaley C, Petitet A, Dongarra JJ (2000) Automated empirical optimization of software and the ATLAS project. Parallel Comput 27:21–31
