- -

A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Catalán, Sandra es_ES
dc.contributor.author Herrero, José R. es_ES
dc.contributor.author Quintana Ortí, Enrique Salvador es_ES
dc.contributor.author Rodríguez-Sánchez, Rafael es_ES
dc.contributor.author van de Geijn, Robert es_ES
dc.date.accessioned 2020-06-02T05:37:14Z
dc.date.available 2020-06-02T05:37:14Z
dc.date.issued 2019-01-31 es_ES
dc.identifier.uri http://hdl.handle.net/10251/144816
dc.description (c) 2019 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other users, including reprinting/ republishing this material for advertising or promotional purposes, creating new collective works for resale or redistribution to servers or lists, or reuse of any copyrighted components of this work in other works. es_ES
dc.description.abstract [EN] We propose two novel techniques for overcoming load-imbalance encountered when implementing so-called look-ahead mechanisms in relevant dense matrix factorizations for the solution of linear systems. Both techniques target the scenario where two thread teams are created/activated during the factorization, with each team in charge of performing an independent task/branch of execution. The first technique promotes worker sharing (WS) between the two tasks, allowing the threads of the task that completes first to be reallocated for use by the costlier task. The second technique allows a fast task to alert the slower task of completion, enforcing the early termination (ET) of the second task, and a smooth transition of the factorization procedure into the next iteration. The two mechanisms are instantiated via a new malleable thread-level implementation of the basic linear algebra subprograms, and their benefits are illustrated via an implementation of the LU factorization with partial pivoting enhanced with look-ahead. Concretely, our experimental results on an Intel-Xeon system with 12 cores show the benefits of combining WS+ET, reporting competitive performance in comparison with a task-parallel runtime-based solution. es_ES
dc.description.sponsorship This work was supported in part by the Spanish Ministerio de Economia y Competitividad under Project TIN2014-53495-R, Project TIN2015-65316-P, and Project TIN2017-82972-R, in part by the H2020 EU FETHPC "INTERTWinE" under Project 671602, in part by the Generalitat de Catalunya under Project 2017-SGR-1414, and in part by the NSF under Grant ACI-1550493. es_ES
dc.language Inglés es_ES
dc.publisher Institute of Electrical and Electronics Engineers es_ES
dc.relation.ispartof IEEE Access es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Solution of linear systems es_ES
dc.subject Multi-threading es_ES
dc.subject Workload balancing es_ES
dc.subject Thread malleability es_ES
dc.subject Basic linear algebra subprograms (BLAS) es_ES
dc.subject Linear algebra package (LAPACK) es_ES
dc.subject.classification ARQUITECTURA Y TECNOLOGIA DE COMPUTADORES es_ES
dc.title A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1109/ACCESS.2019.2895541 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/671602/EU/Programming Model INTERoperability ToWards Exascale (INTERTWinE)/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2014-53495-R/ES/COMPUTACION HETEROGENEA DE BAJO CONSUMO/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/NSF//1550493/US/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2015-65316-P/ES/COMPUTACION DE ALTAS PRESTACIONES VII/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2013-2016/TIN2017-82972-R/ES/TECNICAS ALGORITMICAS PARA COMPUTACION DE ALTO RENDIMIENTO CONSCIENTE DEL CONSUMO ENERGETICO Y RESISTENTE A ERRORES/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/Generalitat de Catalunya//2017 SGR 1414/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Informática de Sistemas y Computadores - Departament d'Informàtica de Sistemes i Computadors es_ES
dc.description.bibliographicCitation Catalán, S.; Herrero, JR.; Quintana Ortí, ES.; Rodríguez-Sánchez, R.; Van De Geijn, R. (2019). A case for malleable thread-level linear algebra libraries: The LU factorization with partial pivoting. IEEE Access. 7:17617-17633. https://doi.org/10.1109/ACCESS.2019.2895541 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1109/ACCESS.2019.2895541 es_ES
dc.description.upvformatpinicio 17617 es_ES
dc.description.upvformatpfin 17633 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 7 es_ES
dc.identifier.eissn 2169-3536 es_ES
dc.relation.pasarela S\381212 es_ES
dc.contributor.funder Generalitat de Catalunya es_ES
dc.contributor.funder National Science Foundation, EEUU es_ES
dc.contributor.funder Ministerio de Economía y Competitividad es_ES
dc.contributor.funder Agencia Estatal de Investigación es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem