Does ChatGPT have sociolinguistic competence?

Duncan, Daniel

doi:10.4995/jclr.2024.21958

RiuNet repositorio UPV
:
Investigación
:
Material investigación. Editorial UPV
:
Revistas UPV. Editorial UPV
:
Journal of Computer-Assisted Linguistic Research
:
Journal of Computer-Assisted Linguistic Research - Vol 08 (2024)
:
Ver ítem

Identificarse

Buscar en RiuNet

Listar

Todo RiuNet
Esta colección

Mi cuenta

Acceder

Estadísticas

Ver Estadísticas de uso

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Does ChatGPT have sociolinguistic competence?

Mostrar el registro sencillo del ítem

Ficheros en el ítem

Nombre: Duncan - Does ChatGPT ...

Tamaño: 862.8Kb

Formato: PDF

Descripción: Versión editorial

Abrir

dc.contributor.author	Duncan, Daniel	es_ES
dc.date.accessioned	2024-11-26T12:14:42Z
dc.date.available	2024-11-26T12:14:42Z
dc.date.issued	2024-11-15
dc.identifier.uri	http://hdl.handle.net/10251/212286
dc.description.abstract	[EN] Large language models are now able to generate content- and genre-appropriate prose with grammatical sentences. However, these targets do not fully encapsulate human-like language use. For example, set aside is the fact that human language use involves sociolinguistic variation that is regularly constrained by internal and external factors. This article tests whether one widely used LLM application, ChatGPT, is capable of generating such variation. I construct an English corpus of sociolinguistic interviews using the application and analyze the generation of seven morphosyntactic features. I show that the application largely fails to generate any variation at all when one variant is prescriptively incorrect, but that it is able to generate variable deletion of the complementizer that that is internally constrained, with variants occurring at human-like rates. ChatGPT fails, however, to properly generate externally constrained complementizer that deletion. I argue that these outcomes reflect bias both in the training data and Reinforcement Learning from Human Feedback. I suggest that testing whether an LLM can properly generate sociolinguistic variation is a useful metric for evaluating if it generates human-like language.	es_ES
dc.language	Inglés	es_ES
dc.publisher	Universitat Politècnica de València	es_ES
dc.relation.ispartof	Journal of Computer-Assisted Linguistic Research	es_ES
dc.rights	Reconocimiento - No comercial - Sin obra derivada (by-nc-nd)	es_ES
dc.subject	Large language models	es_ES
dc.subject	ChatGPT	es_ES
dc.subject	Variation	es_ES
dc.subject	Morphosyntactic variation	es_ES
dc.subject	Sociolinguistics	es_ES
dc.title	Does ChatGPT have sociolinguistic competence?	es_ES
dc.type	Artículo	es_ES
dc.identifier.doi	10.4995/jclr.2024.21958
dc.rights.accessRights	Abierto	es_ES
dc.description.bibliographicCitation	Duncan, D. (2024). Does ChatGPT have sociolinguistic competence?. Journal of Computer-Assisted Linguistic Research. 8:51-75. https://doi.org/10.4995/jclr.2024.21958	es_ES
dc.description.accrualMethod	OJS	es_ES
dc.relation.publisherversion	https://doi.org/10.4995/jclr.2024.21958	es_ES
dc.description.upvformatpinicio	51	es_ES
dc.description.upvformatpfin	75	es_ES
dc.type.version	info:eu-repo/semantics/publishedVersion	es_ES
dc.description.volume	8	es_ES
dc.identifier.eissn	2530-9455
dc.relation.pasarela	OJS\21958	es_ES

Este ítem aparece en la(s) siguiente(s) colección(ones)

Journal of Computer-Assisted Linguistic Research - Vol 08 (2024) [4]

Mostrar el registro sencillo del ítem

Does ChatGPT have sociolinguistic competence?

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Buscar en RiuNet

Listar

Todo RiuNet

Esta colección

Mi cuenta

Estadísticas

Ayuda RiuNet

Admin. UPV

Compartir/Enviar a

Citas

Estadísticas

Does ChatGPT have sociolinguistic competence?

Ficheros en el ítem

Este ítem aparece en la(s) siguiente(s) colección(ones)