Energy-efficient execution of dense linear algebra algorithms on multi-core processors

Alonso-Jordá, Pedro; Dolz Zaragozá, Manuel Francisco; Mayo, Rafael; Quintana-Ortí, Enrique S.

doi:10.1007/s10586-012-0215-x

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Energy-efficient execution of dense linear algebra algorithms on multi-core processors

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: 10.1007_s10586-01 ...

Tamaño: 881.1Kb

Formato: PDF

Descripción: Versión editorial

Solicitar una copia al autor

dc.contributor.author	Alonso-Jordá, Pedro	es_ES
dc.contributor.author	Dolz Zaragozá, Manuel Francisco	es_ES
dc.contributor.author	Mayo, Rafael	es_ES
dc.contributor.author	Quintana-Ortí, Enrique S.	es_ES
dc.date.accessioned	2015-01-12T12:27:05Z
dc.date.available	2015-01-12T12:27:05Z
dc.date.issued	2013-09
dc.identifier.issn	1386-7857
dc.identifier.uri	http://hdl.handle.net/10251/45984
dc.description.abstract	This paper addresses the efficient exploitation of task-level parallelism, present in many dense linear alge- bra operations, from the point of view of both computa- tional performance and energy consumption. The strategies considered here, referred to as the Slack Reduction Algo- rithm (SRA) and the Race-to-Idle Algorithm (RIA), adjust the operation frequency of the cores during the execution of a collection of tasks (in which many dense linear alge- bra algorithms can be decomposed) with very different ap- proaches to save energy. The procedures are evaluated using an energy-aware simulator, which is in charge of schedul- ing/mapping the execution of these tasks to the cores, lever- aging dynamic frequency voltage scaling featured by current technology. Experiments with this tool and the practical in- tegration of the RIA strategy into a runtime show the energy gains for two versions of the QR factorization.	es_ES
dc.description.sponsorship	This work was supported by project CICYT TIN2011-23283 and FEDER.	en_EN
dc.language	Inglés	es_ES
dc.publisher	Springer Verlag (Germany)	es_ES
dc.relation.ispartof	Cluster Computing	es_ES
dc.rights	Reserva de todos los derechos	es_ES
dc.subject	Dense linear algebra	es_ES
dc.subject	Power consumption	es_ES
dc.subject	Multi-core processors	es_ES
dc.subject	DVFS	es_ES
dc.subject.classification	CIENCIAS DE LA COMPUTACION E INTELIGENCIA ARTIFICIAL	es_ES
dc.subject.classification	LENGUAJES Y SISTEMAS INFORMATICOS	es_ES
dc.title	Energy-efficient execution of dense linear algebra algorithms on multi-core processors	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.1007/s10586-012-0215-x
dc.relation.projectID	info:eu-repo/grantAgreement/MICINN//TIN2011-23283/ES/POWER-AWARE HIGH PERFORMANCE COMPUTING/	es_ES
dc.relation.projectID	info:eu-repo/grantAgreement/EC/FP7/287759/EU/High Performance and Embedded Architecture and Compilation/
dc.rights.accessRights	Cerrado	es_ES
dc.contributor.affiliation	Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació	es_ES
dc.description.bibliographicCitation	Alonso-Jordá, P.; Dolz Zaragozá, MF.; Mayo, R.; Quintana-Ortí, ES. (2013). Energy-efficient execution of dense linear algebra algorithms on multi-core processors. Cluster Computing. 16(3):497-509. https://doi.org/10.1007/s10586-012-0215-x	es_ES
dc.description.accrualMethod	S	es_ES
dc.relation.publisherversion	http://dx.doi.org/10.1007/s10586-012-0215-x	es_ES
dc.description.upvformatpinicio	497	es_ES
dc.description.upvformatpfin	509	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	16	es_ES
dc.description.issue	3	es_ES
dc.relation.senia	232717
dc.contributor.funder	Ministerio de Ciencia e Innovación	es_ES
dc.description.references	Borkar, S., Chien, A.: The future of microprocessors. Commun. ACM 54, 67–77 (2011)	es_ES
dc.description.references	Esmaeilzadeh, H., Blem, E., Amant, R.St., Sankaralingam, K., Burger, D.: Dark silicon and the end of multicore scaling. In: Proceeding of the 38th Annual International Symposium on Computer Architecture, ISCA’11, New York, NY, USA, pp. 365–376. ACM Press, New York (2011)	es_ES
dc.description.references	Dongarra, J., Beckman, P., Moore, T., Aerts, P., Aloisio, G., Andre, J.C., Barkai, D., Berthou, J.Y., Boku, T., Braunschweig, B., et al.: The international exascale software project roadmap. Int. J. High Perform. Comput. Appl. 25(1), 3 (2011)	es_ES
dc.description.references	Duranton, M., et al.: The HiPEAC vision (2010). Available from http://www.hipeac.net/roadmap	es_ES
dc.description.references	Feng, W.-c., Feng, X., Ce, R.: Green supercomputing comes of age. IT Prof. 10(1), 17–23 (2008)	es_ES
dc.description.references	Hsu, C., Feng, W.: A feasibility analysis of power awareness in commodity-based high-performance clusters. In: Cluster 2005 (2005)	es_ES
dc.description.references	Albers, S.: Energy-efficient algorithms. Commun. ACM 53, 86–96 (2010)	es_ES
dc.description.references	Cilk project home page (2012). http://supertech.csail.mit.edu/cilk/	es_ES
dc.description.references	SMP superscalar project home page (2012). http://www.bsc.es/plantillaG.php?cat_id=385	es_ES
dc.description.references	StarPU project home page (2012). http://runtime.bordeaux.inria.fr/StarPU/	es_ES
dc.description.references	Van Zee, F.G.: libflame: The Complete Reference (2009). www.lulu.com	es_ES
dc.description.references	Anderson, E., Bai, Z., Bischof, C., Blackford, L.S., Demmel, J., Dongarra, J.J., Du Croz, J., Hammarling, S., Greenbaum, A., McKenney, A., Sorensen, D.: LAPACK Users’ Guide, 3rd edn. SIAM, Philadelphia (1999)	es_ES
dc.description.references	PLASMA project home page (2012). http://icl.cs.utk.edu/plasma/	es_ES
dc.description.references	Alonso, P., Dolz, M.F., Mayo, R., Quintana-Ortí, E.S.: Improving power efficiency on multi-core processors via slack control. In: Proceedings of the 2011 International Conference on High Performance Computing & Simulation (HPCS 2011). IEE Catalog Number CFP1178H-CDR, pp. 463–470 (2011)	es_ES
dc.description.references	Alonso, P., Dolz, M.F., Igual, F., Mayo, R., Quintana-Ortí, E.S.: DVFS-control techniques for dense linear algebra operations on multi-core processors. Comput. Sci. Res. Dev., 1–10 (2011). doi: 10.1007/s00450-011-0188-7	es_ES
dc.description.references	Golub, G.H., Van Loan, C.F.: Matrix Computations, 3rd edn. The Johns Hopkins University Press, Baltimore (1996)	es_ES
dc.description.references	Gunter, B.C., van de Geijn, R.A.: Parallel out-of-core computation and updating the QR factorization. ACM Trans. Math. Softw. 31(1), 60–78 (2005)	es_ES
dc.description.references	Etinski, M., Corbalán, J., Labarta, J., Valero, M.: Utilization driven power-aware parallel job scheduling. Comput. Sci. Res. Dev. 25(3–4), 207–216 (2010)	es_ES
dc.description.references	Yao, F., Demers, A., Shenker, S.: A scheduling model for reduced cpu energy. In: Proceedings of the 36th Annual Symposium on Foundations of Computer Science, FOCS’95, Washington, DC, USA, p. 374. IEEE Computer Society, Los Alamitos (1995)	es_ES
dc.description.references	Manzak, A., Chakrabarti, C.: Variable voltage task scheduling for minimizing energy or minimizing power. In: Proceedings on IEEE International Conference of the Acoustics, Speech, and Signal Processing, 2000, Washington, DC, USA, vol. 06, pp. 3239–3242. IEEE Computer Society, Los Alamitos (2000)	es_ES
dc.description.references	Gruian, F., Kuchcinski, K.: Lenes: task scheduling for low-energy systems using variable supply voltage processors. In: Proceedings of the 2001 Asia and South Pacific Design Automation Conference, ASP-DAC’01, New York, NY, USA, pp. 449–455. ACM Press, New York (2001)	es_ES
dc.description.references	Martin, S.M., Flautner, K., Mudge, T., Blaauw, D.: Combined dynamic voltage scaling and adaptive body biasing for lower power microprocessors under dynamic workloads. In: Proceedings of the 2002 IEEE/ACM International Conference on Computer-aided Design, ICCAD’02, New York, NY, USA, pp. 721–725. ACM Press, New York (2002)	es_ES
dc.description.references	Zhang, Y., Hu, X.S., Chen, D.Z.: Task scheduling and voltage selection for energy minimization. In: Proceedings of the 39th Annual Design Automation Conference, DAC’02, New York, NY, USA, pp. 183–188. ACM Press, New York (2002)	es_ES
dc.description.references	Robert, Y., Parashar, M., Badrinath, R., Prasanna, V.K.: High performance computing—HiPC 2006. In: Proceedings of 13th International Conference, Bangalore, India, December 18–21, 2006. Lecture Notes in Computer Science, vol. 4297. Springer, Berlin (2006)	es_ES
dc.description.references	Lee, Y.C., Zomaya, A.Y.: Minimizing energy consumption for precedence-constrained applications using dynamic voltage scaling. In: Proceedings of the 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid-Volume 00, pp. 92–99. IEEE Computer Society, Los Alamitos (2009)	es_ES
dc.description.references	Kimura, H., Sato, M., Hotta, Y., Boku, T., Takahashi, D.: Empirical study on reducing energy of parallel programs using slack reclamation by DVFS in a power-scalable high performance cluster. In: IEEE International Conference on Cluster Computing, 2006, pp. 1–10. IEEE Press, New York (2007)	es_ES
dc.description.references	Shekar, V., Izadi, B.: Energy aware scheduling for DAG structured applications on heterogeneous and DVS enabled processors. In: International Conference on Green Computing, pp. 495–502. IEEE Press, New York (2010)	es_ES
dc.description.references	King, D., Ahmad, I., Sheikh, H.F.: Stretch and compress based re-scheduling techniques for minimizing the execution times of DAGs on multi-core processors under energy constraints. In: International Conference on Green Computing, pp. 49–60. IEEE Press, New York (2010)	es_ES
dc.description.references	Palli, K.: Scheduling dags for minimum finish time and power consumption on heterogeneous processors. Master’s thesis, Albers University, Albers, AL (2005)	es_ES
dc.description.references	Shaffer, L.R., Ritter, J.B., Meyer, W.L.: The Critical-Path Method. McGraw-Hill, New York (1965)	es_ES
dc.description.references	Li, R., Huang, H.C.: List scheduling for jobs with arbitrary release times and similar lengths. J. Sched. 10(6), 365–373 (2007)	es_ES
dc.description.references	Mtibaa, A., Ouni, B., Abid, M.: An efficient list scheduling algorithm for time placement problem. Comput. Electr. Eng. 33(4), 285–298 (2007)	es_ES
dc.description.references	Quintana-Ortí, G., Quintana-Ortí, E.S., van de Geijn, R.A., Van Zee, F.G., Chan, E.: Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Trans. Math. Softw. 36(3), 14:1–14:26 (2009)	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Artículos, conferencias, monografías [47575]

Mostrar el registro sencillo del ítem

Energy-efficient execution of dense linear algebra algorithms on multi-core processors

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Energy-efficient execution of dense linear algebra algorithms on multi-core processors

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)