Mostrar el registro sencillo del ítem
dc.contributor.author | Reaño González, Carlos | es_ES |
dc.contributor.author | Silla Jiménez, Federico | es_ES |
dc.date.accessioned | 2017-06-19T07:15:28Z | |
dc.date.available | 2017-06-19T07:15:28Z | |
dc.date.issued | 2016-12 | |
dc.identifier.issn | 1573-0484 | |
dc.identifier.uri | http://hdl.handle.net/10251/83128 | |
dc.description | The final publication is available at Springer via http://dx.doi.org/ 10.1007/s11227-016-1754-3 | es_ES |
dc.description.abstract | In the past few years, a tendency towards using InfiniBand networks to interconnect high performance computing clusters can be observed. Thus, most of the supercomputers appearing in the TOP500 list either use Ethernet or InfiniBand interconnects. Regarding the latter, the complexity of the InfiniBand programming API (i.e., InfiniBand Verbs) makes it difficult for applications to get the maximum performance of these networks. In this paper we expose how we have tuned a remote GPU virtualization framework whose communications module is implemented using InfiniBand Verbs. The net result is a noticeable increase in the performance of this framework, significantly reducing the gap between remote and local GPUs. | es_ES |
dc.description.sponsorship | This work was funded by the Spanish MINECO and FEDER funds under Grant TIN2012-38341-C04-01. Authors are also grateful for the generous support provided by Mellanox Technologies. | en_EN |
dc.language | Inglés | es_ES |
dc.publisher | Springer Verlag (Germany) | es_ES |
dc.relation.ispartof | Journal of Supercomputing | es_ES |
dc.rights | Reserva de todos los derechos | es_ES |
dc.subject | HPC | es_ES |
dc.subject | InfiniBand | es_ES |
dc.subject | CUDA | es_ES |
dc.subject | Remote GPU virtualization | es_ES |
dc.subject | Networks | es_ES |
dc.subject | Performance | es_ES |
dc.subject | Tuning | es_ES |
dc.subject.classification | ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES | es_ES |
dc.title | Tuning remote GPU virtualization for InfiniBand networks | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.1007/s11227-016-1754-3 | |
dc.relation.projectID | info:eu-repo/grantAgreement/MINECO//TIN2012-38341-C04-01/ES/MEJORA DE LA ARQUITECTURA DE SERVIDORES, SERVICIOS Y APLICACIONES/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Departamento de Informática de Sistemas y Computadores - Departament d'Informàtica de Sistemes i Computadors | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica | es_ES |
dc.description.bibliographicCitation | Reaño González, C.; Silla Jiménez, F. (2016). Tuning remote GPU virtualization for InfiniBand networks. Journal of Supercomputing. 72(12):4520-4545. https://doi.org/10.1007/s11227-016-1754-3 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | https://link.springer.com/article/10.1007/s11227-016-1754-3 | es_ES |
dc.description.upvformatpinicio | 4520 | es_ES |
dc.description.upvformatpfin | 4545 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 72 | es_ES |
dc.description.issue | 12 | es_ES |
dc.relation.senia | 321411 | es_ES |
dc.contributor.funder | Mellanox Technologies Ltd. | |
dc.description.references | InfiniBand Trade Association (IBTA) (2015) [Online]. http://www.infinibandta.org | es_ES |
dc.description.references | DAmbrosia J (2014) Ethernet in the TOP500 [Online]. http://www.scientificcomputing.com/blogs/2014/07/ethernet-top500 | es_ES |
dc.description.references | TOP500 Supercomputer Sites (2014) [Online]. http://www.top500.org/ | es_ES |
dc.description.references | InfiniBand Trade Association (IBTA) (2007) The InfiniBand Trade Association Specification | es_ES |
dc.description.references | Kerr G (2011) Dissecting a small infiniband application using the verbs API. CoRR abs/1105.1827 [Online]. arxiv:1105.1827 | es_ES |
dc.description.references | Woodruff B, Hefty S, Dreier R, Rosenstock H (2005) Introduction to the infiniband core software. In: Linux symposium, vol 2 | es_ES |
dc.description.references | Bedeir T (2010) Building an RDMA-capable application with ib verbs, Technical report, HPC Advisory Council, Tech. Rep., 2010. http://www.hpcadvisorycouncil.com/pdf/building-an-rdma-capable-application-with-ib-verbs.pdf | es_ES |
dc.description.references | Liu Q, Russell RD (2014) A performance study of infiniband fourteen data rate (fdr). In: Proceedings of the High performance computing symposium, ser. HPC ’14. San Diego, CA, USA: Society for Computer Simulation International, 2014, pp 16:1–16:10 [Online]. http://dl.acm.org/citation.cfm?id=2663510.2663526 | es_ES |
dc.description.references | Hjelm N (2014) Optimizing one-sided operations in open mpi. In: Proceedings of the 21st European MPI Users’ Group Meeting, ser. EuroMPI/ASIA ’14. New York, NY, USA: ACM, 2014, pp 123:123–123:124 [Online]. http://doi.acm.org/10.1145/2642769.2642792 | es_ES |
dc.description.references | Subramoni H, Hamidouche K, Venkatesh A, Chakraborty S, Panda D (2014) Designing mpi library with dynamic connected transport (dct) of infiniband: Early experiences. In: Kunkel J , Ludwig T, Meuer H (eds) Supercomputing, ser. lecture notes in computer science. Springer International Publishing, 2014, vol 8488, pp 278–295 [Online]. doi: 10.1007/978-3-319-07518-1_18 | es_ES |
dc.description.references | Unified Communication X (UCX), 2015 [Online]. http://www.openucx.org | es_ES |
dc.description.references | NVIDIA (2014) CUDA C Programming Guide 6.5 | es_ES |
dc.description.references | Peña AJ, Reaño C, Silla F, Mayo R, Quintana-Ortí ES, Duato J (2014) A complete and efficient cuda-sharing solution for hpc clusters. Parallel Comput 40(10):574– 588 [Online]. http://www.sciencedirect.com/science/article/pii/S0167819114001227 | es_ES |
dc.description.references | Reaño C, Silla F, Gimeno AC, Peña AJ, Mayo R, Quintana-Ortí ES, Duato J (2015) Improving the user experience of the rcuda remote GPU virtualization framework. Concurr Comput Pract Exp 27(14)3746–3770 [Online]. doi: 10.1002/cpe.3409 | es_ES |
dc.description.references | Prades J, Reaño C, Silla F (2016) Flexible access to CUDA accelerators from Xen virtual machines in InfiniBand clusters using rCUDA. In: 21st ACM SIGPLAN symposium on principles and practice of parallel programming, PPoPP 2016 | es_ES |
dc.description.references | Iserte S, Gimeno AC, Mayo R, Quintana-Ortí ES, Silla F, Duato J, Reaño C, Prades J (2014) SLURM support for remote GPU virtualization: implementation and performance study. In: 26th IEEE international symposium on computer architecture and high performance computing, SBAC-PAD, 2014, pp 318–325 [Online]. doi: 10.1109/SBAC-PAD.2014.49 | es_ES |
dc.description.references | NVIDIA (2014) NVIDIA CUDA Samples 6.5 | es_ES |
dc.description.references | Che S, Boyer M, Meng J, Tarjan D, Sheaffer J, Lee S-H, Skadron K (2009) Rodinia: a benchmark suite for heterogeneous computing. In: Workload Characterization, 2009. IISWC 2009. IEEE international symposium on, 2009, pp 44–54 | es_ES |
dc.description.references | University of Tennessee, MAGMA: matrix algebra on GPU and multicore architectures [Online]. http://icl.cs.utk.edu/magma | es_ES |
dc.description.references | Bosma W, Cannon J, Playoust C (1997) The Magma algebra system. I. The user language. Computational algebra and number theory (London, 1993). J Symbol Comput 24(3–4) 235–265 [Online]. doi: 10.1006/jsco.1996.0125 | es_ES |
dc.description.references | GROMACS web page (2014 ) [Online]. http://www.gromacs.org/ | es_ES |
dc.description.references | Pronk S, Pll S, Schulz R, Larsson P, Bjelkmar P, Apostolov R, Shirts MR, Smith JC, Kasson PM, van der Spoel D, Hess B, Lindahl E (2013) Gromacs 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinformatics 29(7)845–854 [Online]. http://bioinformatics.oxfordjournals.org/content/29/7/845.abstract | es_ES |
dc.description.references | Brown WM, Kohlmeyer A, Plimpton SJ, Tharrington AN (2012) Implementing molecular dynamics on hybrid high performance computers: particle–particle particle–mesh. Comp Phys Commun 183(3):449–459 | es_ES |
dc.description.references | Athanasopoulos A, Dimou A, Mezaris V, Kompatsiaris I (2011) GPU acceleration for support vector machines. In: 12th international workshop on image analysis for multimedia interactive services (WIAMIS) | es_ES |