Mukherjee, Shubhendu; Silla Jiménez, Federico; Bannon, Peter; Emer, Joel; Lang, Steve; Webb, David(Association for Computing Machinery (ACM), 2002)
Interconnection networks usually consist of a fabric of interconnected routers, which receive packets arriving at their input ports and forward them to appropriate output ports. Unfortunately, network packets moving through ...
Peña Monferrer, Antonio José; Reaño González, Carlos; Silla Jiménez, Federico; Mayo Gual, Rafael; Quintana-Orti, Enrique S.; Duato Marín, José Francisco(Elsevier, 2014-12)
In this paper we detail the key features, architectural design, and implementation of rCUDA,
an advanced framework to enable remote and transparent GPGPU acceleration in HPC
clusters. rCUDA allows decoupling GPUs from ...
Kujat ., Knut(Universitat Politècnica de València, 2013-02-21)
[ES] Los lenguajes PGAS han demostrado ser una forma intuitiva de programación paralela. Con GASNet, una capa de red independiente del lenguaje, preocupaciones por la compatibilidad ya no son ningún problema, ya que GASNet ...
Roca Pérez, Antoni; Flich Cardo, José; Silla Jiménez, Federico; Duato Marín, José Francisco(Elsevier, 2011-11)
[EN] As technology advances, the number of cores in Chip MultiProcessor systems and MultiProcessor Systems-on-Chips keeps increasing. The network must provide sustained throughput and ultra-low latencies. In this paper we ...
Montaner Mas, Héctor; Silla Jiménez, Federico; Fröning, Holger; Duato Marín, José Francisco(Springer Verlag (Germany), 2012-06)
Improvements in parallel computing hardware usually involve increments in the number of available resources for a given application such as the number of computing cores and the amount of memory. In the case of shared-memory ...
[EN] GPUs are commonly used to accelerate the execution of applications in domains such as deep learning. Deep learning applications are applied to an increasing variety of scenarios, with edge computing being one of them. ...
Reaño González, Carlos; Silla Jiménez, Federico(IEEE, 2015-09-08)
Using GPUs reduces execution time of many applications
but increases acquisition cost and power consumption.
Furthermore, GPUs usually attain a relatively low utilization.
In this context, remote GPU virtualization ...
Varghese, Blesson; Reaño González, Carlos; Silla Jiménez, Federico(Institute of Electrical and Electronics Engineers (IEEE), 2018)
[EN] Hardware accelerators are available on the cloud for enhanced analytics. Next-generation clouds aim to bring enhanced analytics using accelerators closer to user devices at the edge of the network for improving quality ...
Montaner Mas, Héctor(Universitat Politècnica de València, 2011-10-26)
Este estudio se centra en la granularidad de las comunicaciones en aplicaciones MPI. Nuestra hipótesis consiste en que la granularidad óptima para una red exterior al chip no es la óptima para una red interior al chip. ...
Hernández, Daniel; Cano, Juan-Carlos; Silla, Federico; Tavares De Araujo Cesariny Calafate, Carlos Miguel; Cecilia-Canales, José María(Institute of Electrical and Electronics Engineers, 2022-05-15)
[EN] Climate change is one of the greatest challenges for modern societies. Its consequences, often associated with extreme events, have dramatic results worldwide. New synergies between different disciplines, including ...
Reaño González, Carlos; Prades, Javier; Silla Jiménez, Federico(Elsevier, 2019-10)
[EN] The computing power of supercomputers and data centers has noticeably grown during the last decades at the cost of an ever increasing energy demand. The need for energy (and power) of these facilities has finally ...
Rodríguez Alepuz, Sergio(Universitat Politècnica de València, 2018-09-11)
[ES] Los sistemas de aprendizaje automático —y, en concreto, los modelos de aprendizaje profundo
(del inglés, deep learning) o redes neuronales— se han popularizado recientemente
debido a que han supuesto un salto ...
Baiget Orts, Carlos Jose(Universitat Politècnica de València, 2013-10-04)
La presencia de Unidades de Proceso Gráfico (Graphics Processing Units, GPUs) en las instalaciones de Computación de Alto Rendimiento (High Performance Computing, HPC) es una opción cada vez más extendida por la mejora del ...
Bautista Perales, Ismael(Universitat Politècnica de València, 2016-10-17)
[ES] En este TFM se va a diseñar e implementar un programa de software que
permita realizar de una forma automática tests de validación de software. El
objetivo principal es realizar tests masivos del entorno de ...
Reaño González, Carlos; Silla Jiménez, Federico; Peña Monferrer, Antonio José; Shainer, Gilad; Schultz, Scot; Castello Gimeno, Adrián; Quintana Ortí, Enrique Salvador; Duato Marín, José Francisco(IEEE, 2014-09-22)
[EN] A clear trend has emerged involving the acceleration of scientific applications by using GPUs. However, the capabilities of these devices are still generally underutilized. Remote GPU virtualization techniques can ...
Hernández Luz, Carles; Roca Pérez, Antoni; Flich Cardo, José; Silla Jiménez, Federico; Duato Marín, José Francisco(Elsevier, 2011-05)
[EN] Current integration scales make possible to design chip multiprocessors with a large amount of cores interconnected by a NoC. Unfortunately, they also bring process variation, posing a new burden to processor ...
Rodrigo Mocholí, Samuel; Flich Cardo, José; Roca Pérez, Antoni; Medardoni, Simone; Bertozzi, Davide; Camacho Villanueva, Jesús; Silla Jiménez, Federico; Duato Marín, José Francisco(Institute of Electrical and Electronics Engineers (IEEE), 2011-04)
[EN] The high-performance computing domain is enriching with the inclusion of networks-on-chip (NoCs) as a key component of many-core (CMPs or MPSoCs) architectures. NoCs face the communication scalability challenge while ...
Osuna Fontan, Alejandro Carlos(Universitat Politècnica de València, 2016-09-07)
[ES] En este trabajo hemos realizado la implementación completa un sistema de Cloud
Computing con la tecnología OpenStack. Hemos realizado un pequeño estudio de las
diferentes opciones libres de sistemas cloud. Se ha ...