- -

GPU implementation of Krylov solvers for block-tridiagonal eigenvalue problems

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

GPU implementation of Krylov solvers for block-tridiagonal eigenvalue problems

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Lamas Daviña, Alejandro es_ES
dc.contributor.author Román Moltó, José Enrique es_ES
dc.date.accessioned 2016-06-27T08:13:46Z
dc.date.available 2016-06-27T08:13:46Z
dc.date.issued 2016-04-02
dc.identifier.isbn 978-3-319-32148-6
dc.identifier.issn 0302-9743
dc.identifier.issn 10.1007/978-3-319-32149-3_18
dc.identifier.uri http://hdl.handle.net/10251/66516
dc.description The final publication is available at Springer via http://dx.doi.org/10.1007/978-3-319-32149-3_18 es_ES
dc.description.abstract In an eigenvalue problem defined by one or two matrices with block-tridiagonal structure, if only a few eigenpairs are required it is interesting to consider iterative methods based on Krylov subspaces, even if matrix blocks are dense. In this context, using the GPU for the associated dense linear algebra may provide high performance. We analyze this in an implementation done in the context of SLEPc, the Scalable Library for Eigenvalue Problem Computations. In the case of a generalized eigenproblem or when interior eigenvalues are computed with shift-and-invert, the main computational kernel is the solution of linear systems with a block-tridiagonal matrix. We explore possible implementations of this operation on the GPU, including a block cyclic reduction algorithm. es_ES
dc.description.sponsorship This work was partially supported by the Spanish Ministry of Economy and Competitiveness under grant TIN2013-41049-P. Alejandro Lamas was supported by the Spanish Ministry of Education, Culture and Sport through grant FPU13-06655. es_ES
dc.format.extent 10 es_ES
dc.language Inglés es_ES
dc.publisher Springer es_ES
dc.relation.ispartof Parallel Processing and Applied Mathematics es_ES
dc.relation.ispartofseries Lecture Notes in Computer Science;9573
dc.rights Reserva de todos los derechos es_ES
dc.subject GPU computing es_ES
dc.subject Eigenvalue computation es_ES
dc.subject Krylov methods es_ES
dc.subject Block-tridiagonal linear solvers es_ES
dc.subject.classification CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL es_ES
dc.title GPU implementation of Krylov solvers for block-tridiagonal eigenvalue problems es_ES
dc.type Capítulo de libro es_ES
dc.type Comunicación en congreso es_ES
dc.identifier.doi 10.1007%2F978-3-319-32149-3_18
dc.relation.projectID info:eu-repo/grantAgreement/MINECO//TIN2013-41049-P/ES/EXTENSION DE LA LIBRERIA SLEPC PARA POLINOMIOS MATRICIALES, FUNCIONES MATRICIALES Y ECUACIONES MATRICIALES EN PLATAFORMAS DE COMPUTACION EMERGENTES/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/MECD//FPU13%2F06655/ES/FPU13%2F06655/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Lamas Daviña, A.; Román Moltó, JE. (2016). GPU implementation of Krylov solvers for block-tridiagonal eigenvalue problems. En Parallel Processing and Applied Mathematics. Springer. 182-191. https://doi.org/10.1007%2F978-3-319-32149-3_18 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename 11th International Conference on Parallel Processing and Applied Mathematics (PPAM 2015) es_ES
dc.relation.conferencedate September 6-9, 2015 es_ES
dc.relation.conferenceplace Krakow, Poland es_ES
dc.relation.publisherversion http://link.springer.com/chapter/10.1007%2F978-3-319-32149-3_18 es_ES
dc.description.upvformatpinicio 182 es_ES
dc.description.upvformatpfin 191 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.senia 311439 es_ES
dc.contributor.funder Ministerio de Economía y Competitividad es_ES
dc.contributor.funder Ministerio de Educación, Cultura y Deporte es_ES
dc.description.references Baghapour, B., Esfahanian, V., Torabzadeh, M., Darian, H.M.: A discontinuous Galerkin method with block cyclic reduction solver for simulating compressible flows on GPUs. Int. J. Comput. Math. 92(1), 110–131 (2014) es_ES
dc.description.references Bientinesi, P., Igual, F.D., Kressner, D., Petschow, M., Quintana-Ortí, E.S.: Condensed forms for the symmetric eigenvalue problem on multi-threaded architectures. Concur. Comput. Pract. Exp. 23, 694–707 (2011) es_ES
dc.description.references Haidar, A., Ltaief, H., Dongarra, J.: Toward a high performance tile divide and conquer algorithm for the dense symmetric eigenvalue problem. SIAM J. Sci. Comput. 34(6), C249–C274 (2012) es_ES
dc.description.references Heller, D.: Some aspects of the cyclic reduction algorithm for block tridiagonal linear systems. SIAM J. Numer. Anal. 13(4), 484–496 (1976) es_ES
dc.description.references Hernandez, V., Roman, J.E., Vidal, V.: SLEPc: a scalable and flexible toolkit for the solution of eigenvalue problems. ACM Trans. Math. Softw. 31(3), 351–362 (2005) es_ES
dc.description.references Hirshman, S.P., Perumalla, K.S., Lynch, V.E., Sanchez, R.: BCYCLIC: a parallel block tridiagonal matrix cyclic solver. J. Comput. Phys. 229(18), 6392–6404 (2010) es_ES
dc.description.references Minden, V., Smith, B., Knepley, M.G.: Preliminary implementation of PETSc using GPUs. In: Yuen, D.A., Wang, L., Chi, X., Johnsson, L., Ge, W., Shi, Y. (eds.) GPU Solutions to Multi-scale Problems in Science and Engineering. Lecture Notes in Earth System Sciences, pp. 131–140. Springer, Heidelberg (2013) es_ES
dc.description.references NVIDIA: CUBLAS Library V7.0. Technical report, DU-06702-001 $$\_$$ v7.0, NVIDIA Corporation (2015) es_ES
dc.description.references Park, A.J., Perumalla, K.S.: Efficient heterogeneous execution on large multicore and accelerator platforms: case study using a block tridiagonal solver. J. Parallel and Distrib. Comput. 73(12), 1578–1591 (2013) es_ES
dc.description.references Reguly, I., Giles, M.: Efficient sparse matrix-vector multiplication on cache-based GPUs. In: Innovative Parallel Computing (InPar), pp. 1–12 (2012) es_ES
dc.description.references Roman, J.E., Vasconcelos, P.B.: Harnessing GPU power from high-level libraries: eigenvalues of integral operators with SLEPc. In: International Conference on Computational Science. Procedia Computer Science, vol. 18, pp. 2591–2594. Elsevier (2013) es_ES
dc.description.references Seal, S.K., Perumalla, K.S., Hirshman, S.P.: Revisiting parallel cyclic reduction and parallel prefix-based algorithms for block tridiagonal systems of equations. J. Parallel Distrib. Comput. 73(2), 273–280 (2013) es_ES
dc.description.references Stewart, G.W.: A Krylov-Schur algorithm for large eigenproblems. SIAM J. Matrix Anal. Appl. 23(3), 601–614 (2001) es_ES
dc.description.references Tomov, S., Nath, R., Dongarra, J.: Accelerating the reduction to upper Hessenberg, tridiagonal, and bidiagonal forms through hybrid GPU-based computing. Parallel Comput. 36(12), 645–654 (2010) es_ES
dc.description.references Vomel, C., Tomov, S., Dongarra, J.: Divide and conquer on hybrid GPU-accelerated multicore systems. SIAM J. Sci. Comput. 34(2), C70–C82 (2012) es_ES
dc.description.references Zhang, Y., Cohen, J., Owens, J.D.: Fast tridiagonal solvers on the GPU. In: Proceedings of the 15th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. PPopp 2010, pp. 127–136 (2010) es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem