- -

Does ChatGPT have sociolinguistic competence?

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Does ChatGPT have sociolinguistic competence?

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Duncan, Daniel es_ES
dc.date.accessioned 2024-11-26T12:14:42Z
dc.date.available 2024-11-26T12:14:42Z
dc.date.issued 2024-11-15
dc.identifier.uri http://hdl.handle.net/10251/212286
dc.description.abstract [EN] Large language models are now able to generate content- and genre-appropriate prose with grammatical sentences. However, these targets do not fully encapsulate human-like language use. For example, set aside is the fact that human language use involves sociolinguistic variation that is regularly constrained by internal and external factors. This article tests whether one widely used LLM application, ChatGPT, is capable of generating such variation. I construct an English corpus of sociolinguistic interviews using the application and analyze the generation of seven morphosyntactic features. I show that the application largely fails to generate any variation at all when one variant is prescriptively incorrect, but that it is able to generate variable deletion of the complementizer that that is internally constrained, with variants occurring at human-like rates. ChatGPT fails, however, to properly generate externally constrained complementizer that deletion. I argue that these outcomes reflect bias both in the training data and Reinforcement Learning from Human Feedback. I suggest that testing whether an LLM can properly generate sociolinguistic variation is a useful metric for evaluating if it generates human-like language. es_ES
dc.language Inglés es_ES
dc.publisher Universitat Politècnica de València es_ES
dc.relation.ispartof Journal of Computer-Assisted Linguistic Research es_ES
dc.rights Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) es_ES
dc.subject Large language models es_ES
dc.subject ChatGPT es_ES
dc.subject Variation es_ES
dc.subject Morphosyntactic variation es_ES
dc.subject Sociolinguistics es_ES
dc.title Does ChatGPT have sociolinguistic competence? es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.4995/jclr.2024.21958
dc.rights.accessRights Abierto es_ES
dc.description.bibliographicCitation Duncan, D. (2024). Does ChatGPT have sociolinguistic competence?. Journal of Computer-Assisted Linguistic Research. 8:51-75. https://doi.org/10.4995/jclr.2024.21958 es_ES
dc.description.accrualMethod OJS es_ES
dc.relation.publisherversion https://doi.org/10.4995/jclr.2024.21958 es_ES
dc.description.upvformatpinicio 51 es_ES
dc.description.upvformatpfin 75 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 8 es_ES
dc.identifier.eissn 2530-9455
dc.relation.pasarela OJS\21958 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem