- -

A New AI Evaluation Cosmos: Ready to Play the Game?

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

A New AI Evaluation Cosmos: Ready to Play the Game?

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Hernández-Orallo, José es_ES
dc.contributor.author Baroni, Marco es_ES
dc.contributor.author Bieger, Jordi es_ES
dc.contributor.author Chmait, Nader es_ES
dc.contributor.author Dowe, David L es_ES
dc.contributor.author Hofmann, Katja es_ES
dc.contributor.author Martínez-Plumed, Fernando es_ES
dc.contributor.author Strannegard, Claes es_ES
dc.contributor.author Thorissons, Kristinn R. es_ES
dc.date.accessioned 2018-06-01T04:26:16Z
dc.date.available 2018-06-01T04:26:16Z
dc.date.issued 2017 es_ES
dc.identifier.issn 0738-4602 es_ES
dc.identifier.uri http://hdl.handle.net/10251/103145
dc.description.abstract [EN] We report on a series of new platforms and events dealing with AI evaluation that may change the way in which AI systems are compared and their progress is measured. The introduction of a more diverse and challenging set of tasks in these platforms can feed AI research in the years to come, shaping the notion of success and the directions of the field. However, the playground of tasks and challenges presented there may misdirect the field without some meaningful structure and systematic guidelines for its organization and use. Anticipating this issue, we also report on several initiatives and workshops that are putting the focus on analyzing the similarity and dependencies between tasks, their difficulty, what capabilities they really measure and ultimately on elaborating new concepts and tools that can arrange tasks and benchmarks into a meaningful taxonomy. es_ES
dc.language Inglés es_ES
dc.publisher Association for the Advancement of Artificial Intelligence (AAAI) es_ES
dc.relation.ispartof AI Magazine es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A New AI Evaluation Cosmos: Ready to Play the Game? es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1609/aimag.v38i3.2748 es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Hernández-Orallo, J.; Baroni, M.; Bieger, J.; Chmait, N.; Dowe, DL.; Hofmann, K.; Martínez-Plumed, F.... (2017). A New AI Evaluation Cosmos: Ready to Play the Game?. AI Magazine. 38(3):66-69. doi:10.1609/aimag.v38i3.2748 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://doi.org/10.1609/aimag.v38i3.2748 es_ES
dc.description.upvformatpinicio 66 es_ES
dc.description.upvformatpfin 69 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 38 es_ES
dc.description.issue 3 es_ES
dc.relation.pasarela S\353310 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem