A simple data pruning algorithm to reduce computational load in genetic evaluation of population-traits with high rates of missing phenotypes
DOI:
https://doi.org/10.22319/rmcp.v17i2.6952Palabras clave:
Algorithm, Breeding value, Parent average, Pedigree, PhenotypeResumen
This study presents a streamlined data pruning algorithm aimed to reducing the computational load in genetic evaluations, particularly for population-traits with high proportions of missing phenotypes. The method addresses the challenge posed by large datasets by selectively retaining only phenotyped animals and their parents for the primary genetic evaluation. This approach, applicable to both pedigree-based (BLUP) and hybrid (ssGBLUP) genetic evaluation methods, significantly reduces the number of equations to solve, thereby reducing computational requirements. The pruning process involves either upward pedigree tracing from phenotyped animals (and genotyped animals in the case of ssGBLUP), or the iterative removal of non-phenotyped non-parents. Genetic evaluations for breeding values and reliabilities are then conducted on the reduced dataset. Subsequently, evaluations for pruned animals are back-solved iteratively, calculating their breeding values and reliabilities based on those of their parents from the main analysis. This iterative back-solving ensures that all animals receive an evaluation. Due to no phenotypic data loss, the estimated and breeding values and reliabilities, and the back-solved breeding values are not compromised. However, back-solved reliabilities for pruned animals were inflated due to the back-solving assumption of unrelated parents. The algorithms and corresponding R code are provided for educational purposes, offering a practical and efficient solution for genetic evaluation in sparse-data contexts.
Descargas
Citas
Nilforooshan MA, Ruı́z-Flores A, Pérez-Rodrı́guez P, Pabiou T, Saavedra-Jiménez LA, Valerio-Hernández JE. Multitrait analysis of growth traits for the optimization of breeding value prediction in Braunvieh cattle. Rev Mex Cienc Pecu 2025;16(1):55-68. https://doi.org/10.22319/rmcp.v16i1.6648. DOI: https://doi.org/10.22319/rmcp.v16i1.6648
Saavedra-Jiménez LA, Ramírez-Valverde R, Núñez-Domínguez R, Ruíz-Flores A, García-Muñiz JG, Nilforooshan MA. Effect of two phantom parent grouping strategies on the genetic evaluation of growth traits in Mexican Braunvieh cattle. Rev Mex Cienc Pecu 2021;12(3):878-892. https://doi.org/10.22319/rmcp.v12i3.5562. DOI: https://doi.org/10.22319/rmcp.v12i3.5562
Henderson CR. Best linear unbiased estimation and prediction under a selection model. Biometrics 1975;31(2):423-447. https://doi.org/10.2307/2529430. DOI: https://doi.org/10.2307/2529430
VanRaden PM. Efficient methods to compute genomic predictions. J Dairy Sci 2008;91(11):4414-4423. https://doi.org/10.3168/jds.2007-0980. DOI: https://doi.org/10.3168/jds.2007-0980
Aguilar I, Misztal I, Johnson DL, Legarra A, Tsuruta S, Lawlor TJ. Hot topic: A unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score. J Dairy Sci 2010;93(2):743-752. https://doi.org/10.3168/jds.2009-2730. DOI: https://doi.org/10.3168/jds.2009-2730
Christensen OF, Lund MS. Genomic prediction when some animals are not genotyped. Genet Sel Evol 2010;42(1):2. https://doi.org/10.1186/1297-9686-42-2. DOI: https://doi.org/10.1186/1297-9686-42-2
Harris B, Johnson D. Approximate reliability of genetic evaluations under an Animal Model. J Dairy Sci 1998;81(10):2723-2728. https://doi.org/10.3168/jds.S0022-0302(98)75829-1. DOI: https://doi.org/10.3168/jds.S0022-0302(98)75829-1
Nilforooshan MA, Garrick D. Reduced animal models fitting only equations for phenotyped animals. Front Genet 2021;12:372. https://doi.org/10.3389/fgene.2021.637626. DOI: https://doi.org/10.3389/fgene.2021.637626
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. https://www.R-project.org/.
Nilforooshan MA, Saavedra-Jiménez LA. ggroups: An R package for pedigree and genetic groups data. Hereditas 2020;157:17. https://doi.org/10.1186/s41065-020-00124-2. DOI: https://doi.org/10.1186/s41065-020-00124-2
Nilforooshan MA. Iterative pruning of non-phenotyped non-parent animals and back-solving of their solutions followed by the genetic evaluation analysis. Mendeley data 2025. https://doi.org/10.17632/sjts2m64fd.2.
Misztal I, Tsuruta S, Lourenco DAL, Aguilar I, Legarra A, Vitezica Z. Manual for BLUPF90 family of programs. 2014. https://nce.ads.uga.edu/wiki/lib/exe/fetch.php?media=blupf90_all.pdf.
Publicado
Cómo citar
-
Resumen301
-
PDF86
-
PDF 59
-
XML15
-
XML 20
-
Texto completo46
-
Full text 60
Número
Sección
Licencia

Esta obra está bajo una licencia internacional Creative Commons Atribución-NoComercial-CompartirIgual 4.0.
Los autores/as que publiquen en la Revista Mexicana de Ciencias Pecuarias aceptan las siguientes condiciones:
De acuerdo con la legislación de derechos de autor, la Revista Mexicana de Ciencias Pecuarias reconoce y respeta el derecho moral de los autores/as, así como la titularidad del derecho patrimonial, el cual será cedido a la revista para su difusión en acceso abierto.

Esta obra está bajo una Licencia Creative Commons Atribución-NoComercial-CompartirIgual 4.0 Internacional.




