OpenCL, the open standard for parallel programming of heterogeneous systems. Project home page. http://www.khronos.org/opencl/
The Green500 list. (2010). http://www.green500.org
The TOP500 list. (2010). http://www.top500.org
OmpSs project home page http://pm.bsc.es/ompss/
StarPU project home page http://runtime.bordeaux.inria.fr/StarPU/
Mentat project http://www.cs.virginia.edu/~mentat/
Harmony project home page http://code.google.com/p/harmonyruntime/
Cilk project http://supertech.csail.mit.edu/cilk/
Badia, R. M., Herrero, J. R., Labarta, J., Pérez, J. M., Quintana-Ortí, E. S., & Quintana-Ortí, G. (2009). Parallelizing dense and banded linear algebra libraries using SMPSs. Concurrency and Computation: Practice and Experience, 21(18), 2438-2456. doi:10.1002/cpe.1463
PLASMA project home page http://icl.cs.utk.edu/plasma/
FLAME project home page http://www.cs.utexas.edu/users/flame/
Borkar, S., & Chien, A. A. (2011). The future of microprocessors. Communications of the ACM, 54(5), 67. doi:10.1145/1941487.1941507
Esmaeilzadeh, H., Blem, E., St. Amant, R., Sankaralingam, K., & Burger, D. (2011). Dark silicon and the end of multicore scaling. Proceedings of the 38th Annual International Symposium on Computer Architecture, 365-376.
Duranton, M., et al. (2013). The HiPEAC vision for advanced computing in Horizon 2020. http://www.hipeac.net/roadmap
Van Zee, F. G. (2008). libflame: The complete reference. http://www.cs.utexas.edu/users/flame
Quintana-Ortí, G., Quintana-Ortí, E. S., van de Geijn, R. A., Van Zee, F. G., & Chan, E. (2009). Programming matrix algorithms-by-blocks for thread-level parallelism. ACM Transactions on Mathematical Software, 36(3), 1-26. doi:10.1145/1527286.1527288
Quintana-Ortí, G., Igual, F. D., Quintana-Ortí, E. S., & van de Geijn, R. (2009). Solving dense linear algebra problems on platforms with multiple hardware accelerators. PPoPP '09: The 14th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, 121-129.
Alonso, P., Dolz, M. F., Igual, F. D., Quintana-Ortí, E. S., & Mayo, R. (2013). Runtime Scheduling of the LU Factorization: Performance and Energy. Lecture Notes in Computer Science, 153-167. doi:10.1007/978-3-642-40517-4_14
Bientinesi, P., Gunnels, J. A., Myers, M. E., Quintana-Ortí, E. S., & van de Geijn, R. A. (2005). The science of deriving dense linear algebra algorithms. ACM Transactions on Mathematical Software, 31(1), 1-26. doi:10.1145/1055531.1055532
Alonso, P., Badia, R. M., Labarta, J., Barreda, M., Dolz, M. F., Mayo, R., Quintana-Ortí, E. S., & Reyes, R. (2012). Tools for power-energy modelling and analysis of parallel scientific applications. pp. 420-429.
Barrachina, S., Castillo, M., Igual, F. D., Mayo, R., Quintana-Ortí, E. S., & Quintana-Ortí, G. (2009). Exploiting the capabilities of modern GPUs for dense matrix computations. Concurrency and Computation: Practice and Experience, 21(18), 2457-2477. doi:10.1002/cpe.1472
Chan, E., van de Geijn, R., & Chapman, A. (2010). Managing the complexity of lookahead for LU factorization with pivoting. Proceedings of the 22nd ACM Symposium on Parallelism in Algorithms and Architectures, 200-208. http://doi.acm.org/10.1145/1810479.1810520
Igual, F. D., Chan, E., Quintana-Ortí, E. S., Quintana-Ortí, G., van de Geijn, R. A., & Van Zee, F. G. (2012). The FLAME approach: From dense linear algebra algorithms to high-performance multi-accelerator implementations. Journal of Parallel and Distributed Computing, 72(9), 1134-1143. doi:10.1016/j.jpdc.2011.10.014
Perez, J. M., Bellens, P., Badia, R. M., & Labarta, J. (2007). CellSs: Making it easier to program the Cell Broadband Engine processor. IBM Journal of Research and Development, 51(5), 593-604. doi:10.1147/rd.515.0593
Paraver project http://www.cepba.upc.es/paraver
Alonso, P., Dolz, M. F., Mayo, R., & Quintana-Ortí, E. S. (2012). Modeling power and energy of the task-parallel Cholesky factorization on multicore processors. Computer Science - Research and Development, 29(2), 105-112. doi:10.1007/s00450-012-0227-z
Elnozahy, E. N., Kistler, M., & Rajamony, R. (2003). Energy-Efficient Server Clusters. Lecture Notes in Computer Science, 179-197. doi:10.1007/3-540-36612-1_12
AnandTech Forums. (2011). Power-consumption scaling with clockspeed and Vcc for the i7-2600K. http://forums.anandtech.com/showthread.php?t=2195927