In the context of computed tomography (CT), iterative image reconstruction techniques are gaining attention because high-quality images are becoming computationally feasible. They involve the solution of large systems of ...
[EN] Tuning and optimising the operations executed in deep learning frameworks is a fundamental task in accelerating the processing of deep neural networks (DNNs). However, this optimisation usually requires extensive ...
[EN] We contribute to the optimization of the sparse matrix-vector product by introducing a variant of the coordinate sparse matrix format that balances the workload distribution and compresses both the indexing arrays and ...
[EN] Event warnings are critical in the context of ITS, being dependent on reliable and low-delay delivery ofmessages to nearby vehicles. One of the main challenges to address in this context is intersection management. ...
Background: Short sequence mapping methods for Next Generation Sequencing consist on a combination of
seeding techniques followed by local alignment based on dynamic programming approaches. Most seeding
algorithms are ...
[EN] We present FloatX (Float eXtended), a C++ framework to investigate the effect of leveraging customized floating-point formats in numerical applications. FloatX formats are based on binary IEEE 754 with smaller significand ...
HERRERA-TAPIA, JORGE; Manzoni, Pietro; Hernández Orallo, Enrique; Tomás Domínguez, Andrés Enrique; Tavares de Araujo Cesariny Calafate, Carlos Miguel; Cano Escribá, Juan Carlos(MDPI, 2016)
[EN] Regular citizens equipped with smart devices are being increasingly used as sensors by Smart Cities applications. Using contacts among users, data in the form of messages is obtained and shared. Contact-based messaging ...
Castelló, Adrián; SERGIO BARRACHINA; DOLZ ZARAGOZÁ, MANUEL FRANCISCO; Enrique S. Quintana-Ortí; San Juan-Sebastian, Pablo; Tomás Domínguez, Andrés Enrique(Elsevier, 2022-04)
[EN] We evolve PyDTNN, a framework for distributed parallel training of Deep Neural Networks (DNNs), into an efficient inference tool for convolutional neural networks. Our optimization process on multicore ARM processors ...
HERRERA TAPIA, JORGE; Hernández Orallo, Enrique; Tomás Domínguez, Andrés Enrique; Manzoni, Pietro; Tavares de Araujo Cesariny Calafate, Carlos Miguel; Cano Escribá, Juan Carlos(Springer, 2016-07)
The performance of mobile opportunistic networks strongly depends on contact duration. If the contact lasts less than the required transmission times, some messages will not get delivered, and the whole diffusion scheme ...
Error correction is typically the first step of de Novo genome assembly from NGS data. This step has an important impact on the quality and speed of the assembly process. However, the majority of available stand-alone error ...
[EN] Due to the NGS data deluge, sequence mapping has become an intensive task that, depending on the experiment, may demand high amounts of computing power or memory capacity.
On the one hand, GPGPU architectures have ...
The QR decomposition with column pivoting (QRP) of a matrix is widely used for rank revealing. The performance of LAPACK implementation (DGEQP3) of the Householder QRP algorithm is limited by Level 2 BLAS operations required ...
[EN] In this work, we assess the performance and energy efciency of high-performance
codes for the convolution operator, based on the direct, explicit/implicit lowering and Winograd algorithms used for deep learning (DL) ...
[EN] Cloud4Science is a research activity funded by Microsoft that develops a unique
online platform providing cloud services, datasets, tools, documentations, tutorial and best
practices to meet the needs of researchers ...
[EN] We present a novel method for the QR factorization of large tall-and-skinny matrices that introduces an approximation technique for computing the Householder vectors. This approach is very competitive on a hybrid ...
General Purpose Graphic Processing Units (GPGPUs) constitute an inexpensive resource for computing-intensive
applications that could exploit an intrinsic fine-grain parallelism. This paper presents the design and ...