Diouri, Mohammed El Mehdi; Dolz Zaragozá, Manuel Francisco; Glück, Olivier; Lefèvre, Laurent; Alonso-Jordá, Pedro; Catalán, Sandra; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Elsevier, 2014-06)
Large-scale distributed systems (e.g., datacenters, HPC systems, clouds, large-scale networks, etc.) con- sume and will consume enormous amounts of energy. Therefore, accurately monitoring the power dissipation and energy ...
Alonso-Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Igual, Francisco D.; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Springer Verlag (Germany), 2012-11)
[EN] This paper analyzes the impact on power con- sumption of two DVFS-control strategies when applied to the execution of dense linear algebra operations on multi- core processors. The strategies considered here, prototyped ...
This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear alge- bra operations, from the point of view of both computa- tional performance and energy consumption. The strategies ...
Alonso-Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Igual, Francisco D.; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Wiley, 2014-10)
The road towards Exascale Computing requires a holistic effort to address three different challenges simultaneously: high performance, energy efficiency, and programmability. The use of runtime task schedulers to orchestrate ...
Alonso Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Mayo, Rafael; Quintana Ortí, Enrique Salvador(Wiley, 2014-12)
[EN] In this paper, we propose a model for the energy consumption of the concurrent execution of three key dense matrix factorizations, with task parallelism leveraged via the Symmetric Multi-Processing Superscalar (SMPSs) ...
[EN] In this paper we introduce a model for the total energy consumption of the Cholesky factorization on a multicore processor. Our model assumes a task- parallel execution of the factorization process, with con- currency ...
[EN] We present accurate piece-wise models for the time and energy costs of high performance implementations of both the matrix multiplication (gemm) and the triangular system solve with multiple right-hand sides (trsm) ...