SHORT COMMUNICATION
doi: 10.1111/age.12488

Genome-wide linkage disequilibrium and past effective population size in three Korean cattle breeds

P. Sudrajad*, D. W. Seo*, T. J. Choi, B. H. Park, S. H. Roh, W. Y. Jung, S. S. Lee, J. H. Lee*, S. Kim and S. H. Lee*

*Division of Animal and Dairy Science, Chungnam National University, Daejeon 305-764, Korea.
Indonesian Agency for Agricultural Research and Development, Ministry of Agriculture, Jakarta 12540, Indonesia.
Animal Breeding and Genetics Division, National Institute of Animal Science, Seonghwan 31000, Korea.
Hanwoo Genetic Improvement Center, National Agricultural Cooperative Federation, Chungnam 356-831, Korea.

Summary
The routine collection and use of genomic data are useful for effectively managing breeding programs for endangered populations. Linkage disequilibrium (LD) using high-density DNA markers has been widely used to determine population structures and predict the genomic regions that are associated with economic traits in beef cattle. The extent of LD also provides information about historical events, including past effective population size (Ne), and it allows inferences on the genetic diversity of breeds. The objective of this study was to estimate the LD and Ne in three Korean cattle breeds that are genetically similar but have different coat colors (Brown, Brindle and Jeju Black Hanwoo). Brindle and Jeju Black are endangered breeds with small populations, whereas Brown Hanwoo is the main breeding population in Korea. DNA samples from these cattle breeds were genotyped using the Illumina BovineSNP50 BeadChip. We examined 13 cattle breeds, including European taurines, African taurines and indicines, and hybrids to compare their LD values. Brown Hanwoo consistently had the lowest mean LD compared to Jeju Black, Brindle and the other 13 cattle breeds (0.13, 0.19, 0.21 and 0.15–0.22 respectively).
The high LD values of Brindle and Jeju Black contributed to small Ne values (53 and 60 respectively), which were distinct from that of Brown Hanwoo (531) 11 generations ago. The differences in LD and Ne for each breed reflect the breeding strategy applied. The Ne values for these endangered cattle breeds remain low; thus, effort is needed to bring them back to a sustainable track.

Keywords genetic diversity, Hanwoo, population parameter

Address for correspondence
S. H. Lee, Division of Animal and Dairy Science, Chungnam National University, 99 Daehak-ro, Yuseong-gu, Daejeon 305-764, Korea. E-mail: slee46@cnu.ac.kr
and
S. Kim, Animal Breeding and Genetics Division, National Institute of Animal Science, Seonghwan 31000, Korea. E-mail: goldstar@korea.kr
Accepted for publication 16 July 2016

The Food and Agricultural Organization of the United Nations (FAO) documents Korean cattle breeds on its Domestic Animal Diversity Information System. Korean cattle have been classified and named based on their body color and geographical distribution (Dadi et al. 2012; Suh et al. 2014). Brown Hanwoo is widely used for breeding in Korea, Brindle Hanwoo is from the Korean Peninsula and Jeju Black is from Jeju Island (Fig. S1). The Korean government has been trying to improve the genetic performance of these cattle breeds even though the conditions of the breeding populations are not the same. Brown Hanwoo has the largest population (3 million animals) with a well-established breeding program and progeny testing to select bulls (Park et al. 2013), whereas the others (Brindle and Jeju Black) are endangered (Suh et al. 2014), although these breeds have the potential to be developed as beef cattle with quality comparable to that of Brown Hanwoo (Lee et al. 2013c; Han et al. 2015). Current genetic improvement programs for Korean cattle rely on genomic selection, population structure analysis and genetic prediction, which have become routine genomic tools (Lee et al. 2014).
Analysis of genomic data is becoming part of the beef cattle industry and is useful for improving the effectiveness of genetic programs for endangered breeds such as Brindle and Jeju Black. Linkage disequilibrium (LD) and effective population size (Ne) analyses are important, as these parameters are strongly influenced by historical events, including genetic drift, migration, selection, mutation, recombination rate and bottlenecks, experienced in a population (Smith & Kuhner 2009).

© 2016 Stichting International Foundation for Animal Genetics

Recent population sizes of Jeju Black and Brindle cattle were 500 and 2885 breeding animals respectively (Chikso Management System 2015). These two cattle breeds are managed through artificial insemination (AI) using sires kept at livestock research centers that belong to the local government. We genotyped all available AI sires for Jeju Black (n = 20) and Brown Hanwoo (n = 667), but we had access to only 10 sires and 10 females for Brindle cattle. All animals were genotyped using the BovineSNP50 BeadChip (Illumina). SNP data for 304 animals, obtained from Gautier et al. (2010), were used as representative of European taurine (EUT; Angus, Brown Swiss, Jersey, Hereford and Guernsey), African taurine (AFT; N'Dama, Oulmes Zaer and Sheko), indicine (ZEB; Brahman, Nelore and Madagascar Zebu) and hybrid (H; Beef Master and Santa Gertrudis) cattle. The quality of the SNP data was maintained using the following criteria: SNPs that were out of Hardy-Weinberg equilibrium (P < 1 × 10⁻⁴) or had a low call rate (<90%) were removed, and the minimum minor allele frequency of SNPs was limited to 1%. Missing genotypes were imputed within each breed using 30 iterations with BEAGLE 4.0 (Browning & Browning 2007). LD was measured as r² among all pairs of syntenic SNPs. SNP cleaning and LD estimation were performed using PLINK v1.07 (Purcell et al. 2007) for each breed. The historical LD phase was calculated using the linear regression of the square root of LD (r) between breeds (Sargolzaei et al. 2008). We used PLINK and ADMIXTURE v1.23 (Alexander et al. 2009) for the population structure analyses, i.e. multidimensional scaling and unsupervised ancestry (admixture) modelling respectively. We also calculated the heterozygosity and runs of homozygosity to examine the genetic diversity of the Korean cattle.
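As an illustration of the r² statistic described above, the following Python sketch computes r² for one pair of biallelic SNPs from phased haplotypes. It is only a minimal example under stated assumptions and not the PLINK routine used in this study: the function name `r_squared` and the 0/1 haplotype coding are hypothetical choices made for the example.

```python
from typing import Sequence


def r_squared(hap_a: Sequence[int], hap_b: Sequence[int]) -> float:
    """Return r^2 between two biallelic SNPs.

    Assumption: each argument lists one allele (coded 0 or 1) per phased
    haplotype, in the same haplotype order for both SNPs.
    """
    if len(hap_a) != len(hap_b) or len(hap_a) == 0:
        raise ValueError("haplotype vectors must be non-empty and equal in length")
    n = len(hap_a)
    p_a = sum(hap_a) / n                                  # frequency of allele 1 at SNP A
    p_b = sum(hap_b) / n                                  # frequency of allele 1 at SNP B
    p_ab = sum(a * b for a, b in zip(hap_a, hap_b)) / n   # frequency of the 1-1 haplotype
    d = p_ab - p_a * p_b                                  # disequilibrium coefficient D
    denom = p_a * (1 - p_a) * p_b * (1 - p_b)
    if denom == 0:
        raise ValueError("r^2 is undefined for a monomorphic SNP")
    return d * d / denom


if __name__ == "__main__":
    # Two SNPs in complete LD, then two SNPs with no association.
    print(r_squared([0, 0, 1, 1], [0, 0, 1, 1]))
    print(r_squared([0, 0, 1, 1], [0, 1, 0, 1]))
```

Raising an error for monomorphic SNPs mirrors the fact that such markers would already have been removed by the minor allele frequency filter.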
Ne was estimated from the LD value following Sved's (1971) equation as explained by Sargolzaei et al. (2008) and fitted by non-linear least squares regression in R (R Core Team 2015). The estimates were then plotted against estimated time on the horizontal axis (Hayes et al. 2003).

Korean cattle breeds were observed to have high proportions of cleaned SNPs (>71%) from a panel of 54 609 SNPs (Table S1). Despite the small sample sizes of Brindle and Jeju Black, we were still confident in our results. These sample sizes were adequate according to the recommendation of Khatkar et al. (2008) regarding the minimum number of samples for r² analysis, as both Brindle and Jeju Black were classified as endangered (FAO 2015). The expected heterozygosity results indicated that our samples were randomly selected (Fig. S2). The runs of homozygosity plot (Fig. S3) supports this result, as no homozygous region was observed. The other 13 breeds were used for comparison, and the LD results for these breeds should be considered only a rough approximation. Based on the multidimensional scaling plot (Fig. S4), we suggest that Korean cattle are genetically closer to EUT, because the distance from Korean cattle to EUT is shorter than that to AFT and about half that to ZEB. We performed admixture analysis with K = 2 to K = 20 and found that the minimum cross-validation error (0.48702) was at K = 15 (Figs. S5 & S6). EUT was seen as dominantly structured (60%) in the Brown Hanwoo population at K = 3. We observed that Korean cattle still have their independent structure at higher K values (Fig. S5; K = 7, K = 11, K = 15), which means they may have a common ancestor. The proportions of the genetic mixtures among Korean cattle breeds were different, indicating their genetic diversity. Lee et al. (2013a) and Porto-Neto et al. (2014) also performed similar studies, in which similar primary results were observed.
Breed, chromosome and genomic distance are known to have significant effects on LD value (Lu et al. 2012). Brown Hanwoo had a lower LD value, and its LD decayed more rapidly across all autosomes and all distance ranges than those of the other Korean cattle breeds (Tables S2–S4) and the other 13 breeds (Fig. 1). EUT breeds tended to have the highest LD values. Brindle and Jeju Black had lower LD values compared to EUT and seemed to overlap with ZEB, AFT and H in the extended distance ranges. The Korean cattle breeds were found to have different LD values, i.e. 0.13, 0.21 and 0.19 for Brown Hanwoo, Brindle and Jeju Black respectively (Table S1). This demonstrates that the LD pattern is a breed-specific feature that depends on genetic events passed on by individuals in the population (Smith & Kuhner 2009). The LD value between adjacent markers for Brown Hanwoo in the present study was lower than the value reported by Edea et al. (2015). The difference could be because of the distinctive types of Hanwoo populations that were analyzed, i.e. steers (Edea et al. 2015) vs. proven bulls in the present study. The levels of LD seemed to be inversely related to the number of marker pairs (Table S5). Higher LD values were obtained in cases of a smaller number of SNP pairs, which were usually found in the short distance range (<20 kb). This result agrees with similar studies in Hanwoo (Edea et al. 2015) and other cattle breeds (Gautier et al. 2007; de Roos et al. 2008; Sargolzaei et al. 2008; Lu et al. 2012). Unfortunately, estimation bias appeared when LD was analyzed for the short distance range for the other 13 cattle breeds because of the small sample size, as predicted by Khatkar et al. (2008). This explains why Angus, N'Dama, Brahman and Beef Master, which are representative of EUT, AFT, ZEB and H respectively, all had very low LD values in the short distance range (<20 kb).
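The summary of LD by distance range discussed above can be sketched in Python as follows. This is a hedged illustration only: the function name `mean_r2_by_distance`, the half-open bins and the example bin edges are assumptions for the example and do not reproduce the distance classes of Table S5.

```python
from bisect import bisect_right
from collections import defaultdict
from typing import Dict, Iterable, Sequence, Tuple


def mean_r2_by_distance(
    pairs: Iterable[Tuple[int, float]],
    bin_edges_bp: Sequence[int],
) -> Dict[Tuple[int, int], Tuple[float, int]]:
    """Average r^2 within distance bins.

    pairs        : (distance in bp, r^2) for each syntenic SNP pair
    bin_edges_bp : ascending bin edges; bin i covers [edge_i, edge_{i+1})
    Returns {(low, high): (mean r^2, number of SNP pairs)} for non-empty bins.
    """
    sums = defaultdict(float)
    counts = defaultdict(int)
    for distance, r2 in pairs:
        idx = bisect_right(bin_edges_bp, distance) - 1
        if idx < 0 or idx >= len(bin_edges_bp) - 1:
            continue  # pair lies outside the requested distance range
        sums[idx] += r2
        counts[idx] += 1
    return {
        (bin_edges_bp[i], bin_edges_bp[i + 1]): (sums[i] / counts[i], counts[i])
        for i in sorted(counts)
    }


if __name__ == "__main__":
    toy_pairs = [(5_000, 0.5), (15_000, 0.3), (30_000, 0.2), (250_000, 0.1)]
    for interval, (mean_r2, n_pairs) in mean_r2_by_distance(
        toy_pairs, [0, 20_000, 40_000, 100_000]
    ).items():
        print(interval, round(mean_r2, 3), n_pairs)
```

Returning the number of SNP pairs alongside each mean makes it easy to see when a bin, typically the shortest one, rests on few pairs.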
Linkage disequilibrium (LD) was persistent between the Korean cattle breeds and extended up to 20 kb (r-values of 0.89 and 0.91 for Brown Hanwoo vs. Brindle and Brown Hanwoo vs. Jeju Black respectively; Table S6). The correlation coefficient is a result of the genetic relationship between populations (de Roos et al. 2008; Sargolzaei et al. 2008), and a strong relationship is usually found with a close genetic distance (Gautier et al. 2007; de Roos et al. 2008), as shown in our study. Furthermore, the correlation coefficient can be used to estimate population history if genomic distance is assigned as historical time (Hayes et al. 2003) and the correlation coefficient is considered as the population relatedness level. We propose that these Korean cattle breeds came from a common ancestral population. We identified that chromosomes 21 and 26 had higher LD for all three Korean cattle breeds (Table S7). Given that Lee et al. (2013a) and Li & Kim (2015b) identified significant associations on chromosomes 21 and 26 with growth-related traits in Brown Hanwoo and Porto-Neto et al. (2014) identified significant associations on chromosome 21 with immune system traits, we suggest that those associations would also affect Brindle and Jeju Black cattle. Moreover, Lee et al. (2013b) reported that one major region on chromosome 14 was related to carcass weight in Brown Hanwoo. In the present study, chromosome 14 was also identified as one of the three chromosomes with the highest LD in Brown Hanwoo but not in the other breeds. This might be because of the strict genetic selection through performance testing that has been applied only to Brown Hanwoo for the past 30 years (Park et al. 2013; Suh et al. 2014).

Figure 1 Linkage disequilibrium plot for Brown Hanwoo, Brindle Hanwoo, Jeju Black, European taurines, African taurines, indicines, and hybrids.

Figure 2 Effective population size (Ne) plot for Brown Hanwoo, Brindle Hanwoo, and Jeju Black.
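The between-breed correlation of r used above as a measure of LD phase persistence can be illustrated with a short Python sketch. It assumes that the signed r value has already been computed for the same SNP pairs in both breeds; the function name `phase_correlation` and the toy values are hypothetical and are not taken from Table S6.

```python
from math import sqrt
from typing import Sequence


def phase_correlation(r_breed_a: Sequence[float], r_breed_b: Sequence[float]) -> float:
    """Pearson correlation of signed r values between two breeds.

    Assumption: element i of both inputs refers to the same SNP pair, and
    the sign of r is defined with respect to the same alleles in both breeds.
    """
    if len(r_breed_a) != len(r_breed_b) or len(r_breed_a) < 2:
        raise ValueError("need two equally long vectors with at least two SNP pairs")
    n = len(r_breed_a)
    mean_a = sum(r_breed_a) / n
    mean_b = sum(r_breed_b) / n
    cov = sum((a - mean_a) * (b - mean_b) for a, b in zip(r_breed_a, r_breed_b))
    var_a = sum((a - mean_a) ** 2 for a in r_breed_a)
    var_b = sum((b - mean_b) ** 2 for b in r_breed_b)
    if var_a == 0 or var_b == 0:
        raise ValueError("correlation is undefined when one vector is constant")
    return cov / sqrt(var_a * var_b)


if __name__ == "__main__":
    breed_1 = [0.8, -0.5, 0.3, -0.1]
    breed_2 = [0.7, -0.4, 0.2, -0.2]
    print(round(phase_correlation(breed_1, breed_2), 3))
```

A value near 1 indicates that the same haplotype phase tends to be preserved in both breeds, whereas values near 0 indicate that marker associations found in one breed would not carry over to the other.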
The estimated Ne tended to decrease over time in all three breeds (Fig. 2; Table S8). The high LD of Brindle and Jeju Black contributed to a small Ne (53 and 60 respectively), which is considerably lower than that of Brown Hanwoo (531) 11 generations ago. The Ne for Brown Hanwoo in the present study was lower than that in the study performed by Li & Kim (2015a), who reported an Ne value of 630 for Hanwoo 11 generations ago. However, our result was higher than that in a previous study performed by Lee et al. (2011), who reported an Ne value of 352 for 10 generations ago. These differences might be due to differences in sample type and size among these studies, because estimates of Ne depend on the number of cattle alive at any time in the population, generation length and variance in family size (Lu et al. 2012). Furthermore, SNP density also influences Ne because the estimate depends on LD, which is biased at low marker density (Edea et al. 2015). We used about 43K cleaned SNPs from 667 proven bulls (sires), a higher density than did Li & Kim (2015a), who used about 36K cleaned SNPs from 61 sires and 486 steers, and Lee et al. (2011), who used about 4.5K cleaned SNPs from 232 Hanwoo steers. Moreover, both paternally and maternally inherited genotypes were used in our study, whereas Li & Kim (2015a) used only maternally inherited genotypes. The low Ne values estimated for Brindle and Jeju Black corroborate the demographic history explained by Dadi et al. (2012), by which the population reduction experienced by both Brindle and Jeju Black cattle accounts for the loss of genetic variation and increase in their inbreeding rate. However, the Ne values for both Brindle and Jeju Black populations were still higher than the limit assigned by the FAO, i.e. 50 per generation to maintain breed fitness (Lu et al. 2012).
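For readers who wish to see how LD translates into a past Ne value, the Python sketch below applies the basic form of Sved's (1971) expectation, E(r²) = 1/(1 + 4·Ne·c), rearranged as Ne = (1/(4c))·(1/r² − 1), together with the approximation t = 1/(2c) generations ago (Hayes et al. 2003), where c is the linkage distance in Morgans. It is a simplified illustration: it does not reproduce the variant described by Sargolzaei et al. (2008) or the non-linear least squares fit used in this study, and the function names and the default of 1 cM per Mb are assumptions made for the example.

```python
def bp_to_morgan(distance_bp: float, cm_per_mb: float = 1.0) -> float:
    """Convert physical distance to Morgans (assumed uniform rate, default 1 cM/Mb)."""
    return distance_bp / 1e6 * cm_per_mb / 100.0


def sved_ne(mean_r2: float, c_morgan: float) -> float:
    """Ne implied by E(r^2) = 1 / (1 + 4 * Ne * c)."""
    if not 0.0 < mean_r2 <= 1.0:
        raise ValueError("mean r^2 must lie in (0, 1]")
    if c_morgan <= 0.0:
        raise ValueError("linkage distance must be positive")
    return (1.0 / (4.0 * c_morgan)) * (1.0 / mean_r2 - 1.0)


def generations_ago(c_morgan: float) -> float:
    """Approximate age of the Ne estimate, t = 1 / (2c) generations."""
    if c_morgan <= 0.0:
        raise ValueError("linkage distance must be positive")
    return 1.0 / (2.0 * c_morgan)


if __name__ == "__main__":
    # Toy input: SNP pairs about 5 Mb apart with a mean r^2 of 0.2.
    c = bp_to_morgan(5_000_000)
    print(round(sved_ne(0.2, c), 1), round(generations_ago(c), 1))
```

Because t = 1/(2c), SNP pairs that are far apart inform recent Ne and closely spaced pairs inform Ne in the more distant past, which is why the estimates are reported by generation.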
The Ne of Brown Hanwoo has decreased dramatically in the past 60–70 years, possibly due to population loss during the Korean War, during which the number of Brown Hanwoo decreased to 390 000 animals (Song 1994), which is 10 times smaller than the current population. The endangered Brindle and Jeju Black animals have low Ne values because Ne is intensely influenced by the applied selection and inbreeding levels (Hayes et al. 2003; de Roos et al. 2008). Inbreeding most likely occurs because of a limited number of bulls in an isolated natural population (Sargolzaei et al. 2008). The other reason is that Brown Hanwoo was selected as the main breed early in the genetic improvement program, whereas Brindle and Jeju Black were excluded (Suh et al. 2014).

In conclusion, distinct LD and Ne values were identified in three Korean cattle breeds, and the three breeds probably came from a common ancestral population. The differences in LD and Ne for each breed reflect historical events and recent selection through the breeding program. The Ne values for the endangered cattle breeds remain low, and efforts are needed to maintain their sustainability.

Acknowledgements
This study was supported by awards from the AGENDA project (Grant no. PJ01091001) and Molecular Breeding Program (PJPJ01134903) of the Next Generation BIO-GREEN21 project of the National Institute of Animal Science, Rural Development Administration, Republic of Korea.

References
Alexander D.H., Novembre J. & Lange K. (2009) Fast model-based estimation of ancestry in unrelated individuals. Genome Research 19, 1655–64.
Browning S.R. & Browning B.L. (2007) Rapid and accurate haplotype phasing and missing data inference for whole genome association studies by use of localized haplotype clustering. The American Journal of Human Genetics 81, 1084–97.
Chikso Management System (2015) Chikso ancestry, photo, DNA genotyping, and management. URL http://121.135.53.76/kbh/kbh.php (Korean).
Dadi H., Lee S.H., Jung K.S., Choi J.W., Ko M.S., Han Y.J., Kim J.J. & Kim K.S. (2012) Effect of population reduction on mtDNA diversity and demographic history of Korean cattle populations. Asian-Australasian Journal of Animal Sciences 25, 1223–8.
Edea Z., Dadi H., Dessie T., Lee S.H. & Kim K.S. (2015) Genome-wide linkage disequilibrium analysis of indigenous cattle breeds of Ethiopia and Korea using different SNP genotyping BeadChips. Genes & Genomics 37, 759–65.
Food and Agricultural Organization of the United Nations (2015) Domestic Animal Diversity Information System. URL http://www.fao.org/dad-is/.
Gautier M., Faraut T., Moazami-Goudarzi K. et al. (2007) Genetic and haplotypic structure in 14 European and African cattle breeds. Genetics 177, 1059–70.
Gautier M., Laloe D. & Moazami-Goudarzi K. (2010) Insights into the genetic history of French cattle from dense SNP data on 47 worldwide breeds. PLoS One 5, e13038.
Han S.H., Oh H.S., Lee J.B. et al. (2015) Effects of genetic polymorphisms of ADD1 gene on economic traits in Hanwoo and Jeju Black cattle derived commercial populations in Jeju-do. Journal of Life Science 25, 21–8.
Hayes B.J., Visscher P.M., McPartlan H.C. & Goddard M.E. (2003) Novel multilocus measure of linkage disequilibrium to estimate past effective population size. Genome Research 13, 635–43.
Khatkar M.S., Nicholas F.W., Collins A.R., Zenger K.R., Cavanagh J.A.L., Barris W., Schnabel R.D., Taylor J.F. & Raadsma H.W. (2008) Extent of genome-wide linkage disequilibrium in Australian Holstein-Friesian cattle based on a high-density SNP panel. BMC Genomics 9, 187.
Lee S.H., Cho Y.M., Lim D. et al. (2011) Linkage disequilibrium and effective population size in Hanwoo Korean Cattle. Asian-Australasian Journal of Animal Sciences 24, 1660–5.
Lee K.T., Chung W.H., Lee S.Y. et al. (2013a) Whole-genome resequencing of Hanwoo (Korean cattle) and insight into regions of homozygosity. BMC Genomics 14, 519.
Lee S.H., Choi B.H., Lim D. et al. (2013b) Genome-wide association study identifies major loci for carcass weight on BTA14 in Hanwoo (Korean cattle). PLoS One 8, e74677.
Lee Y., Park S., Kim H., Lee S.K., Kim J.W., Lee H.H., Choi J.W. & Lee S.J. (2013c) A C1069G SNP of the MC4R gene and its association with economic traits in Korean native cattle (brown, brindle, and black). Electronic Journal of Biotechnology 16, 5.
Lee S.H., Park B.H., Sharma A. et al. (2014) Hanwoo cattle: origin, domestication, breeding strategies and genomic selection. Journal of Animal Science and Technology 56, 2.
Li Y. & Kim J.J. (2015a) Effective population size and signatures of selection using bovine 50K SNP chips in Korean native cattle (Hanwoo). Evolutionary Bioinformatics 11, 143–53.
Li Y. & Kim J.J. (2015b) Multiple linkage disequilibrium mapping methods to validate additive quantitative trait loci in Korean native cattle (Hanwoo). Asian-Australasian Journal of Animal Sciences 28, 926–35.
Lu D., Sargolzaei M., Kelly M., Li C., Vander Voort G., Wang Z., Plastow G., Moore S. & Miller S.P. (2012) Linkage disequilibrium in Angus, Charolais, and Crossbred beef cattle. Frontiers in Genetics 3, 152.
Park B., Choi T., Kim S. & Oh S.H. (2013) National genetic evaluation (system) of Hanwoo (Korean native cattle). Asian-Australasian Journal of Animal Sciences 26, 151–6.
Porto-Neto L.R., Lee S.H., Sonstegard T.S., Van Tassell C.P., Lee H.K., Gibson J.P. & Gondro C. (2014) Genome-wide detection of signatures of selection in Korean Hanwoo cattle. Animal Genetics 45, 180–90.
Purcell S., Neale B., Todd-Brown K. et al. (2007) PLINK: a toolset for whole-genome association and population-based linkage analysis. The American Journal of Human Genetics 81, 559–75.
R Core Team (2015) R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria. URL http://www.r-project.org/.
de Roos A.P.W., Hayes B.J., Spelman R.J. & Goddard M.E. (2008) Linkage disequilibrium and persistence of phase in Holstein-Friesian, Jersey and Angus cattle. Genetics 179, 1503–12.
Sargolzaei M., Schenkel F.S., Jansen G.B. & Schaeffer L.R. (2008) Extent of linkage disequilibrium in Holstein cattle in North America. Journal of Dairy Science 91, 2106–17.
Smith L.P. & Kuhner M.K. (2009) The limits of fine-scale mapping. Genetic Epidemiology 33, 344–56.
Song C.W. (1994) The Korean Hanwoo beef cattle. In: Animal Genetic Resources Information (Ed. by J. Boyazoglu & D. Chupin), pp. 61–71. FAO, Rome.
Suh S., Kim Y.S., Cho C.Y., Byun M.J., Choi S.B., Ko Y.G., Lee C.W., Jung K.S., Bae K.H. & Kim J.H. (2014) Assessment of genetic diversity, relationships and structure among Korean native cattle breeds using microsatellite markers. Asian-Australasian Journal of Animal Sciences 27, 1548–53.
Sved J.A. (1971) Linkage disequilibrium and homozygosity of chromosome segments in finite populations. Theoretical Population Biology 2, 125–41.

Supporting information
Additional supporting information may be found online in the supporting information tab for this article:
Table S1 Data summary.
Table S2 Summary of the Brown Hanwoo SNP data.
Table S3 Summary of the Brindle Hanwoo cattle SNP data.
Table S4 Summary of the Jeju Black cattle SNP data.
Table S5 Linkage disequilibrium (LD) over different distances.
Table S6 Linkage disequilibrium (LD) phase among the Korean cattle breeds.
Table S7 Linkage disequilibrium (LD) on the distance <20 kb of autosomal chromosomes.
Table S8 Past effective population size (Ne).
Figure S1 Images of the three Korean cattle breeds (Brown Hanwoo, Brindle Hanwoo, and Jeju Black).
Figure S2 Observed heterozygosity (Ho) and expected heterozygosity (He) density plot of the Korean cattle breeds.
Figure S3 Runs of homozygosity of the Korean cattle breeds.
Figure S4 Multi-dimensional scaling plot of the cattle breeds.
Figure S5 Admixture bar plot of the cattle breeds using 2, 3, 7, 11 and 15 ancestry models.
Figure S6 Cross-validation plot of the admixture analysis.