- -

clickR: Semi-automatic pre-processing of messy data with change tracking for integral dataset cleaning

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

clickR: Semi-automatic pre-processing of messy data with change tracking for integral dataset cleaning

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Hervás-Marín, David es_ES
dc.contributor.author Fuente, David es_ES
dc.date.accessioned 2024-11-14T19:13:05Z
dc.date.available 2024-11-14T19:13:05Z
dc.date.issued 2024-09 es_ES
dc.identifier.uri http://hdl.handle.net/10251/211792
dc.description.abstract [EN] In this contribution, we present clickR, an R package intended for data cleaning following a semi-automatic and supervised procedure. Few packages and commercial software with cleaning capacities are available. In all cases, their functionalities just cover part of the overall data pre-processing and do not follow an integral approach to cleaning up the data. In contrast, clickR brings together all functions needed for correcting the main structural, variable-assignment and typographical errors found in databases and allows researchers to have a strict control on the suggested changes. This is possible because the package creates a data frame that keeps track of all the implemented data modifications. To prove its capacity for detecting and fixing errors, we clean a messy database that exhibits multiple types of errors within date, numeric and factor variables. es_ES
dc.language Inglés es_ES
dc.publisher Elsevier es_ES
dc.relation.ispartof SoftwareX es_ES
dc.rights Reconocimiento (by) es_ES
dc.subject Data pre-processing es_ES
dc.subject Data cleaning es_ES
dc.subject R package es_ES
dc.subject.classification ESTADISTICA E INVESTIGACION OPERATIVA es_ES
dc.title clickR: Semi-automatic pre-processing of messy data with change tracking for integral dataset cleaning es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1016/j.softx.2024.101865 es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Escuela Técnica Superior de Ingeniería Agronómica y del Medio Natural - Escola Tècnica Superior d'Enginyeria Agronòmica i del Medi Natural es_ES
dc.description.bibliographicCitation Hervás-Marín, D.; Fuente, D. (2024). clickR: Semi-automatic pre-processing of messy data with change tracking for integral dataset cleaning. SoftwareX. 27. https://doi.org/10.1016/j.softx.2024.101865 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1016/j.softx.2024.101865 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 27 es_ES
dc.identifier.eissn 2352-7110 es_ES
dc.relation.pasarela S\525399 es_ES
dc.contributor.funder Universitat Politècnica de València es_ES
upv.costeAPC 621 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem