0:00:01.440,0:00:06.840 Hello. Molecular genetics  influences new breeding methods. 0:00:06.840,0:00:10.980 This lecture focuses on genetic  markers and their applications. 0:00:12.120,0:00:18.600 The lecture is part of Module 3, Animal Breeding.  The creation of this presentation was supported by 0:00:18.600,0:00:27.600 ERASMUS+ KA2 grant within the ISAGREED project,  Innovation of content and structure of study 0:00:27.600,0:00:33.660 programs in the field of management of animal  genetic and food resources using digitization. 0:00:38.160,0:00:44.820 With the development of molecular genetics since  the 1970s, and especially molecular genetic 0:00:44.820,0:00:52.080 methods working with DNA molecules, it is possible  to identify real genetic variability (genotypes) 0:00:52.080,0:00:59.640 using molecular genetic markers. This is used,  for example, for mapping genes or regions in 0:00:59.640,0:01:08.520 the genome (QTL) where genes for complex utility  traits might be located. Thanks to these analyses, 0:01:08.520,0:01:14.220 the use of this information in breeding  is possible, specifically for improving 0:01:14.220,0:01:20.520 estimates of genetic parameters (such as  heritability) and breeding values, which leads 0:01:20.520,0:01:27.300 to the efficiency of breeding through increased  genetic gain and shortened generation interval. 0:01:29.340,0:01:35.640 At the beginning (i.e. since the 1980s),  the idea of using genetic markers as a 0:01:35.640,0:01:41.340 selection criterion emerged, i.e. after  identifying the genotype of individuals, 0:01:41.340,0:01:47.940 subsequent selection of a genotype-appropriate  parent. This is called marker-assisted selection 0:01:48.540,0:01:56.100 (MAS). However, this approach is limited in  its actual use. Such selection has been used, 0:01:56.100,0:02:03.660 for example, for the stress gene in pigs (CRC  gene) or BLAD in cattle. These were genes with 0:02:03.660,0:02:08.460 monogenic inheritance, mutations of which  caused a reduction in individual fitness. 0:02:09.480,0:02:17.520 For utility, complex, quantitative traits, direct  selection based on the genotype of a single gene 0:02:17.520,0:02:24.780 is inappropriate, or multiple markers had to  be included in statistical models and breeding 0:02:24.780,0:02:31.200 values were estimated. If the variability in the  causal gene was directly determined by a marker, 0:02:31.200,0:02:39.300 it is called gene-assisted selection (GAS). A great revolution in these approaches was the 0:02:39.300,0:02:45.540 advent of new technologies for massive genome  sequencing and identification of millions of 0:02:45.540,0:02:52.920 markers (SNPs) and more precise determination  of QTL regions and genes in the genome for 0:02:52.920,0:03:00.660 quantitative traits. Since 2009, after sequencing  the genomes of cattle, and gradually other 0:03:00.660,0:03:07.560 economically important animal species, genomic  selection has been gradually introduced. This is 0:03:07.560,0:03:14.220 a method that incorporates genomic SNP markers  (tens of thousands to hundreds of thousands) 0:03:14.880,0:03:21.240 into a genomic relationship matrix and into  equations such as BLUP (various variants) in 0:03:21.240,0:03:25.140 order to estimate breeding values -  GEBV (genomic estimated breeding value). 0:03:25.680,0:03:31.080 This is again just one number that is  easily usable in breeding practice. 0:03:32.520,0:03:38.940 Because we use real genetic variability, breeding  is significantly streamlined - costs are reduced, 0:03:38.940,0:03:44.280 breeding value estimates are improved,  and the generation interval is shortened. 0:03:45.960,0:03:52.800 So what is a genetic molecular marker? It is a  detectable polymorphism (multiple alleles) with a 0:03:54.300,0:04:02.520 known position in the genome. There are three  types of these markers: Type I are coding genes, 0:04:02.520,0:04:16.200 so-called candidate genes (e.g. CRC gene, ESR  in pigs). Type II markers are microsatellites, 0:04:17.100,0:04:25.020 i.e. short tandemly repeating base sequences  (STR) - found outside coding sequences. 0:04:25.980,0:04:34.680 Type III markers are biallelic single nucleotide  polymorphisms (SNPs) in coding or more commonly 0:04:34.680,0:04:42.840 in non-coding intronic or intergenic regions. An  example of one SNP is shown in the bottom image. 0:04:45.900,0:04:51.180 The diagram shows polymorphism in  SNPs for a single nucleotide pair, 0:04:52.080,0:04:57.180 and polymorphism in microsatellites,  which is caused by length variation of 0:04:57.180,0:05:05.280 tandem repeats (especially dinucleotide  repeats). SNPs usually have two alleles, 0:05:05.280,0:05:09.720 while microsatellites can have up  to twenty alleles in a population, 0:05:09.720,0:05:15.900 although there are significantly fewer  of them in genomes compared to SNPs. 0:05:18.600,0:05:25.140 The main significance of a genetic marker is that  it can be associated with phenotypic variability 0:05:25.140,0:05:32.220 of an important production trait in breeding, even  though it may not have a direct biological effect 0:05:32.220,0:05:40.080 on the trait. Such a marker is then referred to  as indirect. It is called indirect because it 0:05:40.080,0:05:48.360 is in linkage (i.e. close proximity on the  chromosome) with another gene (QTL region) 0:05:48.360,0:05:56.640 that directly influences the trait. The marker's  linkage disequilibrium with QTL is then utilized. 0:05:56.640,0:06:02.400 The stronger the linkage, the more  informative the marker is for breeding. 0:06:05.100,0:06:12.420 An example of a SNP that is also a candidate  gene is a SNP on the 4th chromosome, 0:06:12.420,0:06:17.760 located in the leptin gene, where there is a  polymorphism of adenine and guanine substitution. 0:06:19.560,0:06:25.800 Furthermore, we can see that the mutation is  in the coding sequence and causes a change in 0:06:25.800,0:06:34.200 reading, substituting the 40th amino acid from  threonine to alanine. This SNP is therefore a 0:06:34.200,0:06:40.980 direct marker. Subsequent association analysis  must determine whether this SNP affects the 0:06:40.980,0:06:47.640 variability in the effect of leptin and, in this  case, the indicators of fat in cattle muscle. 0:06:51.180,0:06:58.080 The relationship between the marker and QTL  is therefore related to their distance from 0:06:58.080,0:07:01.380 each other and the possibility  of crossing over between them. 0:07:02.040,0:07:08.340 If the marker is not direct and causal, it  may be in linkage with the causal locus QTL. 0:07:09.000,0:07:15.300 In the figure, we can see different combinations  of linkage disequilibrium between marker alleles 0:07:15.300,0:07:26.100 (M1 and M2) and QTL alleles (L - worse-low and  H - better-high performance). Depending on the 0:07:26.100,0:07:32.880 combination on the chromosome, allele M1 may  be advantageous in one case, according to which 0:07:32.880,0:07:41.220 selection can be made, while allele M2 may be  advantageous in another case. In the third case, 0:07:41.220,0:07:48.120 no marker allele is suitable because both  are linked to the same H allele in QTL. 0:07:50.940,0:07:56.280 In brief, we will describe the  principle of marker-assisted selection 0:07:56.280,0:08:02.220 (MAS). MAS in selection programs for livestock  allows for increased accuracy in selecting 0:08:02.220,0:08:09.120 specific DNA variations associated with measurable  differences in economically important traits. 0:08:10.200,0:08:16.440 The degree of genetic improvement achieved  through MAS can be substantially higher than 0:08:16.440,0:08:21.240 improvement achieved through selection  based on breeding values for traits 0:08:21.240,0:08:29.820 with low heritability values in populations or  traits determined post-mortem. Therefore, MAS 0:08:29.820,0:08:35.940 has the potential to significantly increase the  effectiveness of animal breeding for these traits. 0:08:37.560,0:08:45.180 MAS phases. Detection phase, evaluation phase,  and implementation phase are distinguished. 0:08:45.900,0:08:52.380 In the detection phase, DNA polymorphisms  are used as direct or linked markers to 0:08:52.380,0:08:56.580 determine specific allele frequencies  within QTL segregation populations. 0:08:57.540,0:09:05.820 During this phase, markers associated with QTL are  identified, and the size of allele effects and the 0:09:05.820,0:09:14.220 location of QTL in the genome can be estimated. In the evaluation phase, linked markers are 0:09:14.220,0:09:20.040 tested in target populations to determine  whether QTL segregate within the population. 0:09:21.600,0:09:28.080 The implementation phase uses predictive  linked markers within families, and direct 0:09:28.080,0:09:34.800 markers are used across families to create a  genotype database. These data are combined with 0:09:34.800,0:09:41.880 pedigree and phenotypic information during genetic  evaluation to predict individual genetic values. 0:09:43.800,0:09:49.200 MAS is suitable for direct selection  of individuals based on the genotype 0:09:49.200,0:09:56.280 of a genetic marker/gene for simple traits  - monogenic traits (most commonly monogenic 0:09:56.280,0:10:02.880 diseases). For example, the CRC stress  gene and the BLAD disease gene in cattle. 0:10:03.660,0:10:09.420 For markers associated with quantitative  traits, the use of MAS is limited, 0:10:09.420,0:10:15.840 with a smaller effect - there are not as many  described candidate genes, and there are not many 0:10:15.840,0:10:22.980 traits with simple genetic determinism. For these  traits, it is necessary to include whole-genome 0:10:22.980,0:10:30.120 SNP markers in the genomic selection system. And thank you for your attention.