- -

Impact of multi-output and stacking methods on feed efficiency prediction from genotype using machine learning algorithms

RiuNet: Repositorio Institucional de la Universidad Politécnica de Valencia

Compartir/Enviar a

Citas

Estadísticas

  • Estadisticas de Uso

Impact of multi-output and stacking methods on feed efficiency prediction from genotype using machine learning algorithms

Mostrar el registro sencillo del ítem

Ficheros en el ítem

dc.contributor.author Mora, Mónica es_ES
dc.contributor.author González, Pablo es_ES
dc.contributor.author Quevedo, José Ramón es_ES
dc.contributor.author Montañés, Elena es_ES
dc.contributor.author Tusell, Llibertat es_ES
dc.contributor.author Bersgma, Rob es_ES
dc.contributor.author Piles, Miriam es_ES
dc.date.accessioned 2024-06-12T18:18:28Z
dc.date.available 2024-06-12T18:18:28Z
dc.date.issued 2023-11 es_ES
dc.identifier.issn 0931-2668 es_ES
dc.identifier.uri http://hdl.handle.net/10251/205067
dc.description.abstract [EN] Feeding represents the largest economic cost in meat production; therefore, selection to improve traits related to feed efficiency is a goal in most livestock breeding programs. Residual feed intake (RFI), that is, the difference between the actual and the expected feed intake based on animal's requirements, has been used as the selection criteria to improve feed efficiency since it was proposed by Kotch in 1963. In growing pigs, it is computed as the residual of the multiple regression model of daily feed intake (DFI), on average daily gain (ADG), backfat thickness (BFT), and metabolic body weight (MW). Recently, prediction using single-output machine learning algorithms and information from SNPs as predictor variables have been proposed for genomic selection in growing pigs, but like in other species, the prediction quality achieved for RFI has been generally poor. However, it has been suggested that it could be improved through multi-output or stacking methods. For this purpose, four strategies were implemented to predict RFI. Two of them correspond to the computation of RFI in an indirect way using the predicted values of its components obtained from (i) individual (multiple single-output strategy) or (ii) simultaneous predictions (multi-output strategy). The other two correspond to the direct prediction of RFI using (iii) the individual predictions of its components as predictor variables jointly with the genotype (stacking strategy), or (iv) using only the genotypes as predictors of RFI (single-output strategy). The single-output strategy was considered the benchmark. This research aimed to test the former three hypotheses using data recorded from 5828 growing pigs and 45,610 SNPs. For all the strategies two different learning methods were fitted: random forest (RF) and support vector regression (SVR). A nested cross-validation (CV) with an outer 10-folds CV and an inner threefold CV for hyperparameter tuning was implemented to test all strategies. This scheme was repeated using as predictor variables different subsets with an increasing number (from 200 to 3000) of the most informative SNPs identified with RF. Results showed that the highest prediction performance was achieved with 1000 SNPs, although the stability of feature selection was poor (0.13 points out of 1). For all SNP subsets, the benchmark showed the best prediction performance. Using the RF as a learner and the 1000 most informative SNPs as predictors, the mean (SD) of the 10 values obtained in the test sets were: 0.23 (0.04) for the Spearman correlation, 0.83 (0.04) for the zero-one loss, and 0.33 (0.03) for the rank distance loss. We conclude that the information on predicted components of RFI (DFI, ADG, MW, and BFT) does not contribute to improve the quality of the prediction of this trait in relation to the one obtained with the single-output strategy. es_ES
dc.description.sponsorship Universidad Politecnica de Valencia. MM is a recipient of a (FPI), Grant/Award Number: RTI2018-097610R-100. es_ES
dc.language Inglés es_ES
dc.publisher Blackwell Publishing es_ES
dc.relation.ispartof Journal of Animal Breeding and Genetics es_ES
dc.rights Reconocimiento (by) es_ES
dc.subject Artificial intelligence es_ES
dc.subject Multi-trait es_ES
dc.subject Regression problem es_ES
dc.subject Residual feed intake es_ES
dc.subject SNPs es_ES
dc.subject Stacking es_ES
dc.title Impact of multi-output and stacking methods on feed efficiency prediction from genotype using machine learning algorithms es_ES
dc.type Artículo es_ES
dc.identifier.doi 10.1111/jbg.12815 es_ES
dc.relation.projectID info:eu-repo/grantAgreement/AEI//RTI2018-097610-R-100/ es_ES
dc.rights.accessRights Abierto es_ES
dc.description.bibliographicCitation Mora, M.; González, P.; Quevedo, JR.; Montañés, E.; Tusell, L.; Bersgma, R.; Piles, M. (2023). Impact of multi-output and stacking methods on feed efficiency prediction from genotype using machine learning algorithms. Journal of Animal Breeding and Genetics. 140(6):638-652. https://doi.org/10.1111/jbg.12815 es_ES
dc.description.accrualMethod S es_ES
dc.relation.publisherversion https://doi.org/10.1111/jbg.12815 es_ES
dc.description.upvformatpinicio 638 es_ES
dc.description.upvformatpfin 652 es_ES
dc.type.version info:eu-repo/semantics/publishedVersion es_ES
dc.description.volume 140 es_ES
dc.description.issue 6 es_ES
dc.identifier.pmid 37403756 es_ES
dc.relation.pasarela S\496952 es_ES
dc.contributor.funder Agencia Estatal de Investigación es_ES
dc.contributor.funder Universitat Politècnica de València es_ES


Este ítem aparece en la(s) siguiente(s) colección(ones)

Mostrar el registro sencillo del ítem