- -

Dual Indicators to Analyse AI Benchmarks: Difficulty, Discrimination, Ability and Generality

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Dual Indicators to Analyse AI Benchmarks: Difficulty, Discrimination, Ability and Generality

Mostrar el registro completo del ítem

Martínez-Plumed, F.; Hernández-Orallo, J. (2020). Dual Indicators to Analyse AI Benchmarks: Difficulty, Discrimination, Ability and Generality. IEEE Transactions on Games. 12(2):121-131. https://doi.org/10.1109/TG.2018.2883773

Por favor, use este identificador para citar o enlazar este ítem: http://hdl.handle.net/10251/169021

Ficheros en el ítem

Metadatos del ítem

Título: Dual Indicators to Analyse AI Benchmarks: Difficulty, Discrimination, Ability and Generality
Autor: Martínez-Plumed, Fernando Hernández-Orallo, José
Entidad UPV: Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació
Fecha difusión:
Resumen:
[EN] With the purpose of better analyzing the result of artificial intelligence (AI) benchmarks, we present two indicators on the side of the AI problems, difficulty and discrimination, and two indicators on the side of ...[+]
Palabras clave: Artificial intelligence , Games , Benchmark testing , Task analysis , Adaptation models , Guidelines , Indexes , Artificial intelligence (AI) benchmarks , AI evaluation , Generality , Item response theory (ITR)
Derechos de uso: Reserva de todos los derechos
Fuente:
IEEE Transactions on Games. (issn: 2475-1502 )
DOI: 10.1109/TG.2018.2883773
Editorial:
Institute of Electrical and Electronics Engineers (IEEE)
Versión del editor: https://doi.org/10.1109/TG.2018.2883773
Código del Proyecto:
info:eu-repo/grantAgreement/INCIBE//INCIBEI-2015-27345/
...[+]
info:eu-repo/grantAgreement/INCIBE//INCIBEI-2015-27345/
info:eu-repo/grantAgreement/EC//CT-EX2018D335821-101/EU//HUMAINT/
info:eu-repo/grantAgreement/UPV//SP20180210/
info:eu-repo/grantAgreement/MECD//PRX17%2F00467/
info:eu-repo/grantAgreement/GVA//BEST%2F2017%2F045/
info:eu-repo/grantAgreement/FLI//RFP2-152/
info:eu-repo/grantAgreement/UPV//PAID-06-18/
info:eu-repo/grantAgreement/AFOSR//FA9550-17-1-0287/
info:eu-repo/grantAgreement/MINECO//TIN2015-69175-C4-1-R/ES/SOLUCIONES EFECTIVAS BASADAS EN LA LOGICA/
info:eu-repo/grantAgreement/GVA//PROMETEOII%2F2015%2F013/ES/SmartLogic: Logic Technologies for Software Security and Performance/
[-]
Agradecimientos:
This work was supported by the U.S. Air Force Office of Scientific Research under Award FA9550-17-1-0287; in part by the EU (FEDER) and the Spanish MINECO under Grant TIN 2015-69175-C4-1-R; and in part by the Generalitat ...[+]
Tipo: Artículo

recommendations

 

Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro completo del ítem