- -

Which method to Use? An assessment of Data Mining Methods in Environmental Data Science

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Which method to Use? An assessment of Data Mining Methods in Environmental Data Science

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Gibert, K. es_ES
dc.contributor.author Izquierdo Sebastián, Joaquín es_ES
dc.contributor.author Sànchez-Marrè, M. es_ES
dc.contributor.author Hamilton, S. H. es_ES
dc.contributor.author Rodríguez-Roda, I. es_ES
dc.contributor.author Holmes, G. es_ES
dc.date.accessioned 2019-05-11T20:03:41Z
dc.date.available 2019-05-11T20:03:41Z
dc.date.issued 2018 es_ES
dc.identifier.issn 1364-8152 es_ES
dc.identifier.uri http://hdl.handle.net/10251/120346
dc.description.abstract [EN] Data Mining (DM) is a fundamental component of the Data Science process. Over recent years a huge library of DM algorithms has been developed to tackle a variety of problems in fields such as medical imaging and traffic analysis. Many DM techniques are far more flexible than more classical numerial simulation or statistical modelling approaches. These could be usefully applied to data-rich environmental problems. Certain techniques such as artificial neural networks, clustering, case-based reasoning or Bayesian networks have been applied in environmental modelling, while other methods, like support vector machines among others, have yet to be taken up on a wide scale. There is greater scope for many lesser known techniques to be applied in environmental research, with the potential to contribute to addressing some of the current open environmental challenges. However, selecting the best DM technique for a given environmental problem is not a simple decision, and there is a lack of guidelines and criteria that helps the data scientist and environmental scientists to ensure effective knowledge extraction from data. This paper provides a broad introduction to the use of DM in Data Science processes for environmental researchers. Data Science contains three main steps (pre-processing, data mining and post-processing). This paper provides a conceptualization of Environmental Systems and a conceptualization of DM methods, which are in the core step of the Data Science process. These two elements define a conceptual framework that is on the basis of a new methodology proposed for relating the characteristics of a given environmental problem with a family of Data Mining methods. The paper provides a general overview and guidelines of DM techniques to a non-expert user, who can decide with this support which is the more suitable technique to solve their problem at hand. The decision is related to the bidimensional relationship between the type of environmental system and the type of DM method. An illustrative two way table containing references for each pair Environmental System-Data Mining method is presented and discussed. Some examples of how the proposed methodology is used to support DM method selection are also presented, and challenges and future trends are identified. es_ES
dc.language Inglés es_ES
dc.publisher Elsevier es_ES
dc.relation.ispartof Environmental Modelling & Software es_ES
dc.rights Reconocimiento - No comercial - Sin obra derivada (by-nc-nd) es_ES
dc.subject Data mining es_ES
dc.subject Data science es_ES
dc.subject Method selection es_ES
dc.subject Multidisciplinarity es_ES
dc.subject Environmental systems es_ES
dc.subject.classification MATEMATICA APLICADA es_ES
dc.title Which method to Use? An assessment of Data Mining Methods in Environmental Data Science es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1016/j.envsoft.2018.09.021 es_ES
dc.rights.accessRights Abierto es_ES
dc.date.embargoEndDate 2020-12-01 es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Matemática Aplicada - Departament de Matemàtica Aplicada es_ES
dc.description.bibliographicCitation Gibert, K.; Izquierdo Sebastián, J.; Sànchez-Marrè, M.; Hamilton, SH.; Rodríguez-Roda, I.; Holmes, G. (2018). Which method to Use? An assessment of Data Mining Methods in Environmental Data Science. Environmental Modelling & Software. 110:3-27. https://doi.org/10.1016/j.envsoft.2018.09.021 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://doi.org/10.1016/j.envsoft.2018.09.021 es_ES
dc.description.upvformatpinicio 3 es_ES
dc.description.upvformatpfin 27 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 110 es_ES
dc.relation.pasarela S\371482 es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem