0:00:01.189,0:00:02.240 Hello. 0:00:02.240,0:00:05.960 The topic of this lecture is genomic selection. 0:00:05.960,0:00:08.711 The lecture is part of Module 3, Animal Breeding. 0:00:08.711,0:00:16.800 The creation of this presentation was supported by the ERASMUS+ KA2 grant within the project 0:00:16.800,0:00:22.320 ISAGREED, Innovation of content and structure of study programs in the field of management 0:00:22.320,0:00:26.910 of animal genetic and food resources using digitization. 0:00:26.910,0:00:32.259 To begin, let's define a genome. 0:00:32.259,0:00:37.140 A genome is the complete set of genetic information of an organism. 0:00:37.140,0:00:44.840 In eukaryotic organisms, the genome is contained in the haploid set of chromosomes. 0:00:44.840,0:00:53.320 After 2000, the development of next-generation sequencing technologies allowed for faster 0:00:53.320,0:00:57.129 and cheaper sequencing of whole genomes. 0:00:57.129,0:01:03.410 The first sequencing of the whole genomes of major livestock species took place around 0:01:03.410,0:01:05.640 2009. 0:01:05.640,0:01:21.460 Cattle have a genome size of 2.7 Gbp (30 chromosomes / 1n), pigs 2.5 Gbp (19 chromosomes / 1n), 0:01:21.460,0:01:28.350 and chickens 1.05 Gbp (40 chromosomes / 1n). 0:01:28.350,0:01:36.740 Generally, birds have about half the genome size of mammals. 0:01:36.740,0:01:43.399 Thanks to the sequencing of whole genomes, tremendous variability has been found in short 0:01:43.399,0:01:48.549 variants (mainly SNP, indel - insertion-deletion). 0:01:48.549,0:01:58.369 Cattle have 97 million, pigs have approximately 71 million, sheep have 58 million, and chickens 0:01:58.369,0:02:03.939 have 22 million of these polymorphisms. 0:02:03.939,0:02:09.820 There are several genomic databases available, and one of them is ENSEMBL www.ensembl.org/, 0:02:09.820,0:02:13.709 or on the NCBI server. 0:02:13.709,0:02:25.150 This made it possible to perform whole-genome association analyses (GWAS) to identify QTL 0:02:25.150,0:02:33.050 regions using thousands of SNP markers that evenly cover the entire genome. 0:02:33.050,0:02:40.980 Results from GWAS in livestock species and humans lead to the conclusion that the individual 0:02:40.980,0:02:48.840 QTL effect on complex traits is very small, and therefore a large number of QTL is needed 0:02:48.840,0:02:52.880 to explain the genetic variance in these traits. 0:02:52.880,0:02:59.290 The gains from MAS programs using a small number of DNA markers to detect a limited 0:02:59.290,0:03:09.230 number of QTL are small, and alternative technologies needed to be developed to use denser information 0:03:09.230,0:03:15.220 through genomic SNPs, called genomic selection. 0:03:15.220,0:03:22.879 Genomic selection uses a panel of whole-genome markers where QTL are in linkage disequilibrium 0:03:22.879,0:03:26.689 with one or more SNP. 0:03:26.689,0:03:32.700 Estimated genomic breeding values are predicted as the sum of the effects of these SNPs across 0:03:32.700,0:03:37.550 the entire genome. 0:03:37.550,0:03:40.439 Genomic selection is a higher version of MAS. 0:03:40.439,0:03:47.709 This is possible thanks to the use of many SNPs discovered during genome sequencing and 0:03:47.709,0:03:55.610 new methods of genotyping a large number of SNPs (DNA microarrays, SNP chips containing 0:03:55.610,0:03:59.799 60,000 or 700,000 SNPs). 0:03:59.799,0:04:09.879 The ideal method for estimating BVs from genomic data is to calculate the conditional mean 0:04:09.879,0:04:15.500 of the breeding value of a given genotype of the animal at each QTL (~ marker). 0:04:15.500,0:04:24.490 In practice, markers (SNPs) are used instead of QTL genotypes, but the more we approach 0:04:24.490,0:04:32.539 larger sequences and SNP data, the more ideal the method will be. 0:04:32.539,0:04:35.650 The idea of genomic selection is quite old. 0:04:35.650,0:04:43.490 In 2001, Meuwissen et al. proposed the concept of genomic selection as the use of a large 0:04:43.490,0:04:54.350 number of genetic markers covering the entire genome to predict the genetic value of individuals. 0:04:54.350,0:05:01.570 The application of genomic selection is currently used in dairy cattle, beef cattle, and pigs. 0:05:01.570,0:05:05.260 However, this varies between different countries. 0:05:05.260,0:05:10.080 In the Czech Republic, it is currently only used in dairy cattle. 0:05:10.080,0:05:17.800 The basic principle of genomic selection involves genotyping of whole-genome SNP markers and 0:05:17.800,0:05:23.979 utilizing this information to estimate the true genetic variability. 0:05:23.979,0:05:31.539 Subsequently, based on genomic SNP markers, a kinship matrix is calculated and incorporated 0:05:31.539,0:05:40.050 into the equations for Best Linear Unbiased Prediction (BLUP), which have various variants 0:05:40.050,0:05:42.510 and are constantly evolving. 0:05:42.510,0:05:50.280 Further, it is necessary to estimate the genomic heritability of the trait, resulting in an 0:05:50.280,0:05:54.690 estimation of genomic estimated breeding value (GEBV). 0:05:54.690,0:06:02.820 The reason why genomic selection is expanding to other groups of animals is that it enables 0:06:02.820,0:06:11.040 more efficient breeding by reducing costs, improving accuracy of predicted breeding values, 0:06:11.040,0:06:18.860 shortening generation intervals, and reducing inbreeding. 0:06:18.860,0:06:27.520 Is there any difference in estimating breeding values between traditional and genomic methods? 0:06:27.520,0:06:32.199 Estimating breeding values using traditional methods requires calculation of the additive 0:06:32.199,0:06:40.069 genetic relationship matrix (A) based on the expected proportion of shared genes from parents, 0:06:40.069,0:06:45.690 so knowledge of pedigree data is necessary. 0:06:45.690,0:06:53.169 Estimating genomic breeding values (~genomic selection), on the other hand, also requires 0:06:53.169,0:06:59.680 knowledge of relatedness, but this time based on genotypes of genetic markers (SNPs), from 0:06:59.680,0:07:05.330 which a genomic relationship matrix is calculated. 0:07:05.330,0:07:11.729 Its elements express the estimates of the realized proportion of genome that two individuals 0:07:11.729,0:07:14.240 share from their parents. 0:07:14.240,0:07:20.740 Basically, there is no difference between traditional and genomic estimation of breeding 0:07:20.740,0:07:21.740 values. 0:07:21.740,0:07:26.349 However, the latter is more advantageous. 0:07:26.349,0:07:31.789 The genomic selection system is depicted here. 0:07:31.789,0:07:38.139 It is always necessary to create a prediction equation for estimating genomic breeding values 0:07:38.139,0:07:47.849 on a reference population and test it on reference populations with known SNP genotypes and phenotypes 0:07:47.849,0:07:50.599 (performance traits). 0:07:50.599,0:07:57.300 Subsequently, the resulting prediction equation is used to evaluate candidate parents (often 0:07:57.300,0:08:05.150 very young, theoretically even embryos) for which we do not need to know phenotypes, only 0:08:05.150,0:08:06.150 SNP genotypes. 0:08:06.150,0:08:16.099 The result is a very accurate estimation of genomic breeding values for candidate parents. 0:08:16.099,0:08:20.280 In the following diagram, we can see its application in cattle. 0:08:20.280,0:08:25.870 As you can see, the principle of breeding has not really changed. 0:08:25.870,0:08:31.400 We start with the reality of the population from which we want to select individuals, 0:08:31.400,0:08:39.740 we must know phenotypes, genetic parameters, relatedness, and in the end, from a lot of 0:08:39.740,0:08:49.170 data, we have only one number, which is the value of genomic breeding value. 0:08:49.170,0:08:54.580 The advantages of genomic selection include improved accuracy of statistical estimates, 0:08:54.580,0:09:04.070 shortened generation interval, increased genetic and therefore economic gain, and the ability 0:09:04.070,0:09:06.000 to effectively manage inbreeding. 0:09:06.000,0:09:13.420 It is particularly advantageous for traits with low heritability, sex-linked traits, 0:09:13.420,0:09:18.010 or traits determined post mortem. 0:09:18.010,0:09:20.220 What about the future? 0:09:20.220,0:09:26.510 Due to constant technological advancement in NGS and TGS, it will be possible to obtain 0:09:26.510,0:09:34.070 more and more information from more individuals - longer sequence reads in a shorter time, 0:09:34.070,0:09:41.670 which will improve SNP identification - increasing the number of detectable SNPs and reducing 0:09:41.670,0:09:46.570 their false detection, leading to more informative markers. 0:09:46.570,0:09:53.149 It will also be economically advantageous due to decreasing costs of whole genome resequencing 0:09:53.149,0:09:57.320 to a few hundred USD. 0:09:57.320,0:10:04.910 Furthermore, the estimation of whole-genome heritability will be accurate down to the 0:10:04.910,0:10:07.950 nucleotide level. 0:10:07.950,0:10:16.790 In conclusion, genomic selection is very successful in cattle as it provides greater genetic gain 0:10:16.790,0:10:20.899 at similar or lower costs. 0:10:20.899,0:10:26.779 Genomic selection is a very recent innovation, yet it is quickly being implemented. 0:10:26.779,0:10:32.790 Genomic selection is rapidly evolving, including reducing the cost of genotyping, strategies 0:10:32.790,0:10:40.019 for phenotyping new traits, approaches for creating or replacing reference populations, 0:10:40.019,0:10:46.519 and increasing the robustness and stability of genomic predictions through the identification 0:10:46.519,0:10:53.980 of causal mutations from genome sequences or genomic predictions of genetics × environment 0:10:53.980,0:10:58.480 interactions. 0:10:58.480,0:11:00.629 And thank you for your attention.