
Exploring AI Safety in Degrees: Generality, Capability and Control

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

dc.contributor.author Burden, John es_ES
dc.contributor.author Hernández-Orallo, José es_ES
dc.date.accessioned 2021-11-24T07:50:08Z
dc.date.available 2021-11-24T07:50:08Z
dc.date.issued 2020-02-07 es_ES
dc.identifier.issn 1613-0073 es_ES
dc.identifier.uri http://hdl.handle.net/10251/177484
dc.description.abstract [EN] The landscape of AI safety is frequently explored in different ways: by contrasting specialised AI with general AI (or AGI), by analysing the short-term hazards of systems with limited capabilities against the longer-term risks posed by 'superintelligence', and by conceptualising sophisticated ways of bounding the control an AI system has over its environment and itself (impact, harm to humans, self-harm, containment, etc.). In this position paper we reconsider these three aspects of AI safety as quantitative factors (generality, capability and control), suggesting that by defining metrics for these dimensions, AI risks can be characterised and analysed more precisely. As an example, we illustrate how to define these metrics and their values for some simple agents in a toy scenario within a reinforcement learning setting. es_ES
dc.description.sponsorship We thank the anonymous reviewers for their comments. This work was funded by the Future of Life Institute, FLI, under grant RFP2-152, and also supported by the EU (FEDER) and Spanish MINECO under RTI2018-094403-B-C32, and Generalitat Valenciana under PROMETEO/2019/098. es_ES
dc.language English es_ES
dc.publisher ceur-ws.org es_ES
dc.relation.ispartof Proceedings of the Workshop on Artificial Intelligence Safety (SafeAI 2020) co-located with the 34th AAAI Conference on Artificial Intelligence (AAAI 2020) es_ES
dc.rights Attribution (BY) es_ES
dc.subject.classification COMPUTER LANGUAGES AND SYSTEMS es_ES
dc.title Exploring AI Safety in Degrees: Generality, Capability and Control es_ES
dc.type Conference paper es_ES
dc.type Article es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI/Plan Estatal de Investigación Científica y Técnica y de Innovación 2017-2020/RTI2018-094403-B-C32/ES/RAZONAMIENTO FORMAL PARA TECNOLOGIAS FACILITADORAS Y EMERGENTES/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement/FLI//RFP2-152/ es_ES
dc.relation.projectID info:eu-repo/grantAgreement///PROMETEO%2F2019%2F098//DEEPTRUST/ es_ES
dc.rights.accessRights Open es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation Burden, J.; Hernández-Orallo, J. (2020). Exploring AI Safety in Degrees: Generality, Capability and Control. ceur-ws.org. 36-40. http://hdl.handle.net/10251/177484 es_ES
dc.description.accrualMethod S es_ES
dc.relation.conferencename AAAI Workshop on Artificial Intelligence Safety (SafeAI 2020) es_ES
dc.relation.conferencedate February 7, 2020 es_ES
dc.relation.conferenceplace New York, USA es_ES
dc.relation.publisherversion http://ceur-ws.org/Vol-2560/ es_ES
dc.description.upvformatpinicio 36 es_ES
dc.description.upvformatpfin 40 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.relation.pasarela S\431934 es_ES
dc.contributor.funder Future of Life Institute es_ES
dc.contributor.funder European Regional Development Fund es_ES

