- -

A unified view of performance metrics: translating threshold choice into expected classification loss

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

A unified view of performance metrics: translating threshold choice into expected classification loss

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author José Hernández-Orallo es_ES
dc.contributor.author Flach, Peter es_ES
dc.contributor.author Ferri Ramírez, César es_ES
dc.date.accessioned 2015-03-04T11:47:37Z
dc.date.available 2015-03-04T11:47:37Z
dc.date.issued 2012
dc.identifier.issn 1533-7928
dc.identifier.uri http://hdl.handle.net/10251/47702
dc.description.abstract [EN] Many performance metrics have been introduced in the literature for the evaluation of classification performance, each of them with different origins and areas of application. These metrics include accuracy, unweighted accuracy, the area under the ROC curve or the ROC convex hull, the mean absolute error and the Brier score or mean squared error (with its decomposition into refinement and calibration). One way of understanding the relations among these metrics is by means of variable operating conditions (in the form of misclassification costs and/or class distributions). Thus, a metric may correspond to some expected loss over different operating conditions. One dimension for the analysis has been the distribution for this range of operating conditions, leading to some important connections in the area of proper scoring rules. We demonstrate in this paper that there is an equally important dimension which has so far received much less attention in the analysis of performance metrics. This dimension is given by the decision rule, which is typically implemented as a threshold choice method when using scoring models. In this paper, we explore many old and new threshold choice methods: fixed, score-uniform, score-driven, rate-driven and optimal, among others. By calculating the expected loss obtained with these threshold choice methods for a uniform range of operating conditions we give clear interpretations of the 0-1 loss, the absolute error, the Brier score, the AUC and the refinement loss respectively. Our analysis provides a comprehensive view of performance metrics as well as a systematic approach to loss minimisation which can be summarised as follows: given a model, apply the threshold choice methods that correspond with the available information about the operating condition, and compare their expected losses. In order to assist in this procedure we also derive several connections between the aforementioned performance metrics, and we highlight the role of calibration in choosing the threshold choice method. es_ES
dc.language Inglés es_ES
dc.publisher Microtome Publishing es_ES
dc.relation.ispartof Journal of Machine Learning Research es_ES
dc.rights Reserva de todos los derechos es_ES
dc.subject Classification performance metrics es_ES
dc.subject Cost-sensitive evaluation es_ES
dc.subject Operating condition es_ES
dc.subject Brier score es_ES
dc.subject Area under the ROC curve (AUC) es_ES
dc.subject Calibration loss es_ES
dc.subject Refinement loss es_ES
dc.subject.classification LENGUAJES Y SISTEMAS INFORMATICOS es_ES
dc.title A unified view of performance metrics: translating threshold choice into expected classification loss es_ES
dc.type Artículo es_ES
dc.rights.accessRights Abierto es_ES
dc.contributor.affiliation Universitat Politècnica de València. Departamento de Sistemas Informáticos y Computación - Departament de Sistemes Informàtics i Computació es_ES
dc.description.bibliographicCitation José Hernández-Orallo; Flach, P.; Ferri Ramírez, C. (2012). A unified view of performance metrics: translating threshold choice into expected classification loss. Journal of Machine Learning Research. 13:2813-2869. http://hdl.handle.net/10251/47702 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion http://www.jmlr.org/papers/v13/hernandez-orallo12a.html es_ES
dc.description.upvformatpinicio 2813 es_ES
dc.description.upvformatpfin 2869 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 13 es_ES
dc.relation.senia 238086


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem