Buscar en RiuNet

Listar

Todo RiuNet

Mi cuenta

Acceder

Ayuda RiuNet

Admin. UPV

Listar por autor "Silla Jiménez, Federico"

Mostrando ítems 1-20 de 74

Página siguiente

A comparative study of arbitration algorithms for the Alpha 21364 pipelined router

Mukherjee, Shubhendu; Silla Jiménez, Federico; Bannon, Peter; Emer, Joel; Lang, Steve; Webb, David (Association for Computing Machinery (ACM), 2002)

Interconnection networks usually consist of a fabric of interconnected routers, which receive packets arriving at their input ports and forward them to appropriate output ports. Unfortunately, network packets moving through ...
A complete and efficient CUDA-sharing solution for HPC clusters

Peña Monferrer, Antonio José; Reaño González, Carlos; Silla Jiménez, Federico; Mayo Gual, Rafael; Quintana-Orti, Enrique S.; Duato Marín, José Francisco (Elsevier, 2014-12)

In this paper we detail the key features, architectural design, and implementation of rCUDA, an advanced framework to enable remote and transparent GPGPU acceleration in HPC clusters. rCUDA allows decoupling GPUs from ...
A GASNet Conduit for the New EXTOLL Interconnection Network Architecture

Kujat ., Knut (Universitat Politècnica de València, 2013-02-21)

[ES] Los lenguajes PGAS han demostrado ser una forma intuitiva de programación paralela. Con GASNet, una capa de red independiente del lenguaje, preocupaciones por la compatibilidad ya no son ningún problema, ya que GASNet ...
A low-latency modular switch for CMP systems

Roca Pérez, Antoni; Flich Cardo, José; Silla Jiménez, Federico; Duato Marín, José Francisco (Elsevier, 2011-11)

[EN] As technology advances, the number of cores in Chip MultiProcessor systems and MultiProcessor Systems-on-Chips keeps increasing. The network must provide sustained throughput and ultra-low latencies. In this paper we ...
A new degree of freedom for memory allocation in clusters

Montaner Mas, Héctor; Silla Jiménez, Federico; Fröning, Holger; Duato Marín, José Francisco (Springer Verlag (Germany), 2012-06)

Improvements in parallel computing hardware usually involve increments in the number of available resources for a given application such as the number of computing cores and the amount of memory. In the case of shared-memory ...
A Parallel Compression Pipeline for Improving GPU Virtualization Data Transfers

Peñaranda-Cebrián, Cristian; Reaño, Carlos; Silla, Federico (MDPI AG, 2024-07)

[EN] GPUs are commonly used to accelerate the execution of applications in domains such as deep learning. Deep learning applications are applied to an increasing variety of scenarios, with edge computing being one of them. ...
A performance comparison of CUDA remote GPU virtualization frameworks

Reaño González, Carlos; Silla Jiménez, Federico (IEEE, 2015-09-08)

Using GPUs reduces execution time of many applications but increases acquisition cost and power consumption. Furthermore, GPUs usually attain a relatively low utilization. In this context, remote GPU virtualization ...
Accelerator Virtualization in Fog Computing: Moving from the Cloud to the Edge

Varghese, Blesson; Reaño González, Carlos; Silla Jiménez, Federico (Institute of Electrical and Electronics Engineers (IEEE), 2018)

[EN] Hardware accelerators are available on the cloud for enhanced analytics. Next-generation clouds aim to bring enhanced analytics using accelerators closer to user devices at the edge of the network for improving quality ...
Adaptación a PCI-Express de un diseño de memoria compartida distribuida basado en FPGAS

Mislata Valero, Santiago (Universitat Politècnica de València, 2011-10-11)
Addressing Manufacturing Challenges in NoC-based ULSI Designs

Hernández Luz, Carles (Universitat Politècnica de València, 2012-07-19)
Adecuación de la granularidad de las comunicaciones en aplicaciones MPI a las características de la red de interconexión

Montaner Mas, Héctor (Universitat Politècnica de València, 2011-10-26)

Este estudio se centra en la granularidad de las comunicaciones en aplicaciones MPI. Nuestra hipótesis consiste en que la granularidad óptima para una red exterior al chip no es la óptima para una red interior al chip. ...
AI-enabled autonomous drones for fast climate change crisis assessment

Hernández, Daniel; Cano, Juan-Carlos; Silla, Federico; Tavares De Araujo Cesariny Calafate, Carlos Miguel; Cecilia-Canales, José María (Institute of Electrical and Electronics Engineers, 2022-05-15)

[EN] Climate change is one of the greatest challenges for modern societies. Its consequences, often associated with extreme events, have dramatic results worldwide. New synergies between different disciplines, including ...
Analyzing the performance/power tradeoff of the rCUDA middleware for future exascale systems

Reaño González, Carlos; Prades, Javier; Silla Jiménez, Federico (Elsevier, 2019-10)

[EN] The computing power of supercomputers and data centers has noticeably grown during the last decades at the cost of an ever increasing energy demand. The need for energy (and power) of these facilities has finally ...
Análisis de las prestaciones del entorno de deep learning TensorFlow

Rodríguez Alepuz, Sergio (Universitat Politècnica de València, 2018-09-11)

[ES] Los sistemas de aprendizaje automático —y, en concreto, los modelos de aprendizaje profundo (del inglés, deep learning) o redes neuronales— se han popularizado recientemente debido a que han supuesto un salto ...
Análisis del impacto de rCUDA en las prestaciones de mCUDA-MEME y HOOMD-Blue

Baiget Orts, Carlos Jose (Universitat Politècnica de València, 2013-10-04)

La presencia de Unidades de Proceso Gráfico (Graphics Processing Units, GPUs) en las instalaciones de Computación de Alto Rendimiento (High Performance Computing, HPC) es una opción cada vez más extendida por la mejora del ...
Automatización de la validación y exactitud de un entorno de virtualización remota de GPUs

Bautista Perales, Ismael (Universitat Politècnica de València, 2016-10-17)

[ES] En este TFM se va a diseñar e implementar un programa de software que permita realizar de una forma automática tests de validación de software. El objetivo principal es realizar tests masivos del entorno de ...
Boosting the performance of remote GPU virtualization using InfiniBand Connect-IB and PCIe 3.0

Reaño González, Carlos; Silla Jiménez, Federico; Peña Monferrer, Antonio José; Shainer, Gilad; Schultz, Scot; Castello Gimeno, Adrián; Quintana Ortí, Enrique Salvador; Duato Marín, José Francisco (IEEE, 2014-09-22)

[EN] A clear trend has emerged involving the acceleration of scientific applications by using GPUs. However, the capabilities of these devices are still generally underutilized. Remote GPU virtualization techniques can ...
Characterizing the impact of process variation on 45 nm NoC-based CMPs

Hernández Luz, Carles; Roca Pérez, Antoni; Flich Cardo, José; Silla Jiménez, Federico; Duato Marín, José Francisco (Elsevier, 2011-05)

[EN] Current integration scales make possible to design chip multiprocessors with a large amount of cores interconnected by a NoC. Unfortunately, they also bring process variation, posing a new burden to processor ...
Cost-efficient on-chip routing implementations for CMP and MPSoC systems

Rodrigo Mocholí, Samuel; Flich Cardo, José; Roca Pérez, Antoni; Medardoni, Simone; Bertozzi, Davide; Camacho Villanueva, Jesús; Silla Jiménez, Federico; Duato Marín, José Francisco (Institute of Electrical and Electronics Engineers (IEEE), 2011-04)

[EN] The high-performance computing domain is enriching with the inclusion of networks-on-chip (NoCs) as a key component of many-core (CMPs or MPSoCs) architectures. NoCs face the communication scalability challenge while ...
Creación de sistema cloud con OpenStack

Osuna Fontan, Alejandro Carlos (Universitat Politècnica de València, 2016-09-07)

[ES] En este trabajo hemos realizado la implementación completa un sistema de Cloud Computing con la tecnología OpenStack. Hemos realizado un pequeño estudio de las diferentes opciones libres de sistemas cloud. Se ha ...