Castelló, Adrián; Mayo Gual, Rafael; Seo, Sangmin; Balaji, Pavan; Quintana Ortí, Enrique Salvador; Peña, Antonio J.(Institute of Electrical and Electronics Engineers, 2020-09-01)
[EN] With the appearance of multi-/many core machines, applications and runtime systems have evolved in order to exploit the new on-node concurrency brought by new software paradigms. POSIX threads (Pthreads) was widely-adopted ...
Castelló-Gimeno, Adrián; Peña Monferrer, Antonio José; Mayo Gual, Rafael; Planas,Judit; Quintana Ortí, Enrique Salvador; Balaji, Pavan(Springer-Verlag, 2018-11)
[EN] Directive-based programming models, such as OpenMP, OpenACC, and OmpSs, enable users to accelerate applications by using coprocessors with little effort. These devices offer significant computing power, but their use ...
Alaejos López, Guillermo(Universitat Politècnica de València, 2022-04-27)
[ES] El auge en la aplicación de redes neuronales profundas (RNPs) en una gran variedad de campos científicos ha propiciado su uso no solo en servidores de cómputo sino también en dispositivos de bajo consumo.
Los cálculos ...
Reaño González, Carlos; Silla Jiménez, Federico; Castello Gimeno, Adrián; Peña Monferrer, Antonio José; Mayo Gual, Rafael; Quintana Ortí, Enrique Salvador; Duato Marín, José Francisco(Wiley, 2015-09-25)
Graphics processing units (GPUs) are being increasingly embraced by the high-performance computing
community as an effective way to reduce execution time by accelerating parts of their applications. remote
CUDA (rCUDA) ...
[EN] We investigate a parallelization strategy for dense matrix factorization (DMF) algorithms, using OpenMP, that departs from the legacy (or conventional) solution, which simply extracts concurrency from a multi-threaded ...
Iserte Agut, Sergio; Castello Gimeno, Adrián; Mayo Gual, Rafael; Quintana Ortí, Enrique Salvador; Silla Jiménez, Federico; Duato Marín, José Francisco; Reaño González, Carlos; Prades Gasulla, Javier(IEEE, 2014-10-22)
SLURM is a resource manager that can be leveraged to share a collection of heterogeneous resources among the jobs in execution in a cluster. However, SLURM is not designed to handle resources such as graphics processing ...