Mostrar el registro sencillo del ítem
dc.contributor.author | Voudouris, Konstantinos | es_ES |
dc.contributor.author | Crosby, Matthew | es_ES |
dc.contributor.author | Beyret, Benjamin | es_ES |
dc.contributor.author | Hernández-Orallo, José | es_ES |
dc.contributor.author | Shanahan, Murray | es_ES |
dc.contributor.author | Halina, Marta | es_ES |
dc.contributor.author | Cheke, Lucy G. | es_ES |
dc.date.accessioned | 2023-05-03T18:01:53Z | |
dc.date.available | 2023-05-03T18:01:53Z | |
dc.date.issued | 2022-05-24 | es_ES |
dc.identifier.uri | http://hdl.handle.net/10251/193086 | |
dc.description.abstract | [EN] Artificial Intelligence is making rapid and remarkable progress in the development of more sophisticated and powerful systems. However, the acknowledgement of several problems with modern machine learning approaches has prompted a shift in AI benchmarking away from task-oriented testing (such as Chess and Go) towards ability-oriented testing, in which AI systems are tested on their capacity to solve certain kinds of novel problems. The Animal-AI Environment is one such benchmark which aims to apply the ability-oriented testing used in comparative psychology to AI systems. Here, we present the first direct human-AI comparison in the Animal-AI Environment, using children aged 6-10 (n = 52). We found that children of all ages were significantly better than a sample of 30 AIs across most of the tests we examined, as well as performing significantly better than the two top-scoring AIs, "ironbar" and "Trrrrr," from the Animal-AI Olympics Competition 2019. While children and AIs performed similarly on basic navigational tasks, AIs performed significantly worse in more complex cognitive tests, including detour tasks, spatial elimination tasks, and object permanence tasks, indicating that AIs lack several cognitive abilities that children aged 6-10 possess. Both children and AIs performed poorly on tool-use tasks, suggesting that these tests are challenging for both biological and non-biological machines. | es_ES |
dc.description.sponsorship | This work was supported by ESRC DTP funding to KV, ESRC award reference: ES/P000738/1. Research was conducted as a project within the Kinds of Intelligence Program at the Leverhulme Centre for the Future of Intelligence, award number: G108086, and the US DARPA HR00112120007 (RECoG-AI) Grant. | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | Frontiers Media SA | es_ES |
dc.relation.ispartof | Frontiers in Psychology | es_ES |
dc.rights | Reconocimiento (by) | es_ES |
dc.subject | Human-AI comparison | es_ES |
dc.subject | Artificial intelligence | es_ES |
dc.subject | AI benchmarks | es_ES |
dc.subject | Comparative cognition | es_ES |
dc.subject | Out-of-distribution testing | es_ES |
dc.subject | Animal-AI Olympics | es_ES |
dc.subject | Cognitive AI | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | Direct Human-AI Comparison in the Animal-AI Environment | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.3389/fpsyg.2022.711821 | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/ESRC//ES%2FP000738%2F1/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/Leverhulme Trust//G108086//Leverhulme Centre for the Future of Intelligence/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/DOD//HR00112120007/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica | es_ES |
dc.description.bibliographicCitation | Voudouris, K.; Crosby, M.; Beyret, B.; Hernández-Orallo, J.; Shanahan, M.; Halina, M.; Cheke, LG. (2022). Direct Human-AI Comparison in the Animal-AI Environment. Frontiers in Psychology. 13:1-22. https://doi.org/10.3389/fpsyg.2022.711821 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | https://doi.org/10.3389/fpsyg.2022.711821 | es_ES |
dc.description.upvformatpinicio | 1 | es_ES |
dc.description.upvformatpfin | 22 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.description.volume | 13 | es_ES |
dc.identifier.eissn | 1664-1078 | es_ES |
dc.identifier.pmid | 35686061 | es_ES |
dc.identifier.pmcid | PMC9172850 | es_ES |
dc.relation.pasarela | S\488504 | es_ES |
dc.contributor.funder | Leverhulme Trust | es_ES |
dc.contributor.funder | U.S. Department of Defense | es_ES |
dc.contributor.funder | Economic and Social Research Council, Reino Unido | es_ES |