- -

A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Martínez-Plumed, Fernando es_ES
dc.contributor.author Jaimovitch-López, Gonzalo Eduardo es_ES
dc.contributor.author Ferri, C. es_ES
dc.contributor.author Ramírez Quintana, María José es_ES
dc.contributor.author Hernández-Orallo, José es_ES
dc.date.accessioned 2024-10-03T18:26:49Z
dc.date.available 2024-10-03T18:26:49Z
dc.date.issued 2024-08 es_ES
dc.identifier.uri http://hdl.handle.net/10251/209287
dc.description.abstract [EN] Language models and other recent machine learning paradigms blur the distinction between generative and discriminative tasks, in a continuum that is regulated by the degree of pre- and post-supervision that is required from users, as well as the tolerated level of error. In few-shot inference, we need to find a trade-off between the number and cost of the solved examples that have to be supplied, those that have to be inspected (some of them accurate but others needing correction) and those that are wrong but pass undetected. In this paper, we define a new Supply-Inspect Cost Framework, associated graphical representations and comprehensive metrics that consider all these elements. To optimise few-shot inference under specific operating conditions, we introduce novel algorithms that go beyond the concept of rejection rules in both static and dynamic contexts. We illustrate the effectiveness of all these elements for a transformative domain, data wrangling, for which language models can have a huge impact if we are able to properly regulate the reliability-usability trade-off, as we do in this paper. es_ES
dc.description.sponsorship The funding has been received from ValgrAI - Valencian Graduate School and Research Network for Artificial Intelligence; the Norwegian Research Council with Grant no. 329745 (Machine Teaching for Explainable AI); Generalitat Valenciana with Grant nos. CIPROM/2022/6 (FASSLOW) and IDIFEDER/2021/05 (CLUSTERIA); the European Commission under H2020-EU with Grant no. 952215 (TAILOR); US DARPA with Grant no. HR00112120007 (RECoG-AI); the Future of Life Institute with Grant no. RFP2-152; the Spanish Ministry of Science and Innovation (MCIN/AEI/10.13039/ 501100011033) with Grant no. PID2021-122830OB-C42 (SFERA) and ERDF A way of making Europe ; and the Spanish Ministry of Universities with Grant no. PID2022-140110OA-I00 (FISCALTICS) funded by MICIU/AEI/10.13039/501100011033 and by ERDF, EU. es_ES
dc.language Inglés es_ES
dc.publisher SpringerOpen es_ES
dc.relation.ispartof COMPLEX & INTELLIGENT SYSTEMS es_ES
dc.rights Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) es_ES
dc.subject Few-shot inference es_ES
dc.subject Language models es_ES
dc.subject Evaluation es_ES
dc.subject Reliability es_ES
dc.subject Usability es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1007/s40747-024-01599-6 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2021-122830OB-C42/ES/METODOS FORMALES ESCALABLES PARA APLICACIONES REALES/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2021-2023/PID2022-140110OA-I00/ES/ADECUACION DE LOS SISTEMAS TRIBUTARIOS A LA CUARTA REVOLUCION INDUSTRIAL: INTELIGENCIA ARTIFICIAL, ROBOTICA Y NUEVAS REALIDADES TECNOLOGICAS/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/EC/H2020/952215/EU/Integrating Reasoning, Learning and Optimization/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/RCN//329745/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//CIPROM%2F2022%2F6//Tecnologías de Aprendizaje y Razonamiento Rápido y Lento/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/GVA//IDIFEDER%2F2021%2F050//Instrumentación avanzada para investigación puntera en fotónica de microondas y programable fase 2/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/FLI//RFP2-152/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/DOD//HR00112120007/ es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Escola Tècnica Superior d'Enginyeria Informàtica es_ES
dc.description.bibliographicCitation Martínez-Plumed, F.; Jaimovitch-López, GE.; Ferri, C.; Ramírez Quintana, MJ.; Hernández-Orallo, J. (2024). A general supply-inspect cost framework to regulate the reliability-usability trade-offs for few-shot inference. COMPLEX & INTELLIGENT SYSTEMS. https://doi.org/10.1007/s40747-024-01599-6 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1007/s40747-024-01599-6 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.identifier.eissn 2199-4536 es_ES
dc.relation.pasarela S\525630 es_ES
dc.contributor.funder European Commission es_ES
dc.contributor.funder Generalitat Valenciana es_ES
dc.contributor.funder Future of Life Institute es_ES
dc.contributor.funder Research Council of Norway es_ES
dc.contributor.funder U.S. Department of Defense es_ES
dc.contributor.funder Agencia Estatal de Investigación es_ES
dc.contributor.funder European Regional Development Fund es_ES
dc.contributor.funder Valencian Graduate School and Research Network of Artificial Intelligence es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem