Alonso-Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Igual, Francisco D.; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Springer Verlag (Germany), 2012-11)
[EN] This paper analyzes the impact on power con- sumption of two DVFS-control strategies when applied to the execution of dense linear algebra operations on multi- core processors. The strategies considered here, prototyped ...
[EN] Near Threshold Voltage (NTV) computing has been recently proposed as a technique to save energy, at the cost of incurring higher error rates including, among others, Silent Data Corruption (SDC). In this paper, we ...
This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear alge- bra operations, from the point of view of both computa- tional performance and energy consumption. The strategies ...
Alonso-Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Igual, Francisco D.; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Wiley, 2014-10)
The road towards Exascale Computing requires a holistic effort to address three different challenges simultaneously: high performance, energy efficiency, and programmability. The use of runtime task schedulers to orchestrate ...
Alonso Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Wiley, 2014-12)
[EN] In this paper, we propose a model for the energy consumption of the concurrent execution of three key dense matrix factorizations, with task parallelism leveraged via the Symmetric Multi-Processing Superscalar (SMPSs) ...
[EN] In this paper we introduce a model for the total energy consumption of the Cholesky factorization on a multicore processor. Our model assumes a task- parallel execution of the factorization process, with con- currency ...
Catalán, Sandra; Herrero, José R.; Igual Peña, Francisco Daniel; Rodríguez-Sánchez, Rafael; Quintana Ortí, Enrique Salvador; Adeniyi-Jones, Chris(Elsevier, 2018-03)
[EN] Dense linear algebra libraries, such as BLAS and LAPACK, provide a relevant collection of numerical tools for many scientific and engineering applications. While there exist high performance implementations of the ...
Dolz Zaragozá, Manuel Francisco(Universitat Politècnica de València, 2011-09-06)
Desde años, el principal objetivo de la computación de altas prestaciones ha sido la optimización de algoritmos
aplicados a la resolución de problemas complejos que, constantemente, aparecen en un amplio abanico ...
Catalán, Sandra; Igual, Francisco D.; Herrero, José R.; Rodríguez-Sánchez, Rafael; Quintana-Ortí, Enrique S.(Elsevier, 2023-05)
[EN] We propose a methodology to address the programmability issues derived from the emergence of newgeneration shared-memory NUMA architectures. For this purpose, we employ dense matrix factorizations and matrix inversion ...
[EN] We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct "asymmetric" multicore scenarios. The first one corresponds to an actual hardware-asymmetric ...
[EN] We investigate how to leverage the heterogeneous resources of an Asymmetric Multicore Processor (AMP) in order to deliver high performance in the reduction to condensed forms for the solution of dense eigenvalue and ...