Mostrar el registro sencillo del ítem
dc.contributor.author | Martínez-Plumed, Fernando | es_ES |
dc.contributor.author | Jaimovitch-López, Gonzalo Eduardo | es_ES |
dc.contributor.author | Ferri, C. | es_ES |
dc.contributor.author | Ramírez Quintana, María José | es_ES |
dc.contributor.author | Hernández-Orallo, José | es_ES |
dc.date.accessioned | 2024-10-03T18:26:49Z | |
dc.date.available | 2024-10-03T18:26:49Z | |
dc.date.issued | 2024-08 | es_ES |
dc.identifier.uri | http://hdl.handle.net/10251/209287 | |
dc.description.abstract | [EN] Language models and other recent machine learning paradigms blur the distinction between generative and discriminative tasks, in a continuum that is regulated by the degree of pre- and post-supervision that is required from users, as well as the tolerated level of error. In few-shot inference, we need to find a trade-off between the number and cost of the solved examples that have to be supplied, those that have to be inspected (some of them accurate but others needing correction) and those that are wrong but pass undetected. In this paper, we define a new Supply-Inspect Cost Framework, associated graphical representations and comprehensive metrics that consider all these elements. To optimise few-shot inference under specific operating conditions, we introduce novel algorithms that go beyond the concept of rejection rules in both static and dynamic contexts. We illustrate the effectiveness of all these elements for a transformative domain, data wrangling, for which language models can have a huge impact if we are able to properly regulate the reliability-usability trade-off, as we do in this paper. | es_ES |
dc.description.sponsorship | The funding has been received from ValgrAI - Valencian Graduate School and Research Network for Artificial Intelligence; the Norwegian Research Council with Grant no. 329745 (Machine Teaching for Explainable AI); Generalitat Valenciana with Grant nos. CIPROM/2022/6 (FASSLOW) and IDIFEDER/2021/05 (CLUSTERIA); the European Commission under H2020-EU with Grant no. 952215 (TAILOR); US DARPA with Grant no. HR00112120007 (RECoG-AI); the Future of Life Institute with Grant no. RFP2-152; the Spanish Ministry of Science and Innovation (MCIN/AEI/10.13039/ 501100011033) with Grant no. PID2021-122830OB-C42 (SFERA) and ERDF A way of making Europe ; and the Spanish Ministry of Universities with Grant no. PID2022-140110OA-I00 (FISCALTICS) funded by MICIU/AEI/10.13039/501100011033 and by ERDF, EU. | es_ES |
dc.language | Inglés | es_ES |
dc.publisher | SpringerOpen | es_ES |
dc.relation.ispartof | COMPLEX & INTELLIGENT SYSTEMS | es_ES |
dc.rights | Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) | es_ES |
dc.subject | Few-shot inference | es_ES |
dc.subject | Language models | es_ES |
dc.subject | Evaluation | es_ES |
dc.subject | Reliability | es_ES |
dc.subject | Usability | es_ES |
dc.subject.classification | LENGUAJES Y SISTEMAS INFORMATICOS | es_ES |
dc.title | A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference | es_ES |
dc.type | Artículo | es_ES |
dc.identifier.doi | 10.1007/s40747-024-01599-6 | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-122830OB-C42/ES/METODOS FORMALES ESCALABLES PARA APLICACIONES REALES/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-140110OA-I00/ES/ADECUACION DE LOS SISTEMAS TRIBUTARIOS A LA CUARTA REVOLUCION INDUSTRIAL: INTELIGENCIA ARTIFICIAL, ROBOTICA Y NUEVAS REALIDADES TECNOLOGICAS/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/EC/H2020/952215/EU/Integrating Reasoning, Learning and Optimization/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/RCN//329745/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/GVA//CIPROM%2F2022%2F6//Tecnologías de Aprendizaje y Razonamiento Rápido y Lento/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/GVA//IDIFEDER%2F2021%2F050//Instrumentación avanzada para investigación puntera en fotónica de microondas y programable fase 2/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/FLI//RFP2-152/ | es_ES |
dc.relation.projectID | info:eu-repo/grantAgreement/DOD//HR00112120007/ | es_ES |
dc.rights.accessRights | Abierto | es_ES |
dc.contributor.affiliation | Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica | es_ES |
dc.description.bibliographicCitation | Martínez-Plumed, F.; Jaimovitch-López, GE.; Ferri, C.; Ramírez Quintana, MJ.; Hernández-Orallo, J. (2024). A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference. COMPLEX & INTELLIGENT SYSTEMS. https://doi.org/10.1007/s40747-024-01599-6 | es_ES |
dc.description.accrualMethod | S | es_ES |
dc.relation.publisherversion | https://doi.org/10.1007/s40747-024-01599-6 | es_ES |
dc.type.version | info:eu-repo/semantics/publishedVersion | es_ES |
dc.identifier.eissn | 2199-4536 | es_ES |
dc.relation.pasarela | S\525630 | es_ES |
dc.contributor.funder | European Commission | es_ES |
dc.contributor.funder | Generalitat Valenciana | es_ES |
dc.contributor.funder | Future of Life Institute | es_ES |
dc.contributor.funder | Research Council of Norway | es_ES |
dc.contributor.funder | U.S. Department of Defense | es_ES |
dc.contributor.funder | Agencia Estatal de Investigación | es_ES |
dc.contributor.funder | European Regional Development Fund | es_ES |
dc.contributor.funder | Valencian Graduate School and Research Network of Artificial Intelligence | es_ES |