• 《东北虎与华南虎全基因组重测序及其比较基因组学分析》
  • 作者:李倩著
  • 单位:东北林业大学
  • 论文名称 东北虎与华南虎全基因组重测序及其比较基因组学分析
    作者 李倩著
    学科 野生动植物保护与利用
    学位授予单位 东北林业大学
    导师 张明海指导
    出版年份 2019
    中文摘要 随着全基因组测序变得商业化,测序成本的降低,在生物多样性,种群历史动态,及适应性进化等方面的应用越来越广泛。将全基因组学与群体遗传学进行结合,对虎的群体进化,群体多样性和亚种的分化机制进行研究,不仅能加深对虎的理解,更可以为其他物种的研究提供参考。最主要的是对种群遗传多样性的恢复并最终放归大自然提供依据。 虎(Panthera tigris)是最大的猫科动物。在生态系统中,虎位于食物链的顶端,具有调节生态平衡和物种多样性的作用。处于不同地理条件下的虎亚种,在身体的某些方面也发生了巨大的变化。在生理方面,比如脂肪的存储,食物的消化和身体代谢的差异;在外型方面,比如毛色,花纹,体型大小等的差异。对这些方面的形成机理进行研究,可以更加理解区域差异造成种间差异的原因,以及为了对环境更好的适应,物种所进行的进化。同时对其他虎亚种的进化机制提供参考。对于保护濒危野生动物,了解他们的生存状态及种群历史动态与环境气候变化之间的规律,对于采用针对性的保护措施具有重要作用。 本文利用二代测序技术,对5只东北虎和3只华南虎进行全基因组重测序。结合SNP和CNV多态性的分析,对两个群体之间的遗传结构特征,基因交流情况和进化机制进行研究,取得的结果有: 1.通过全基因组重测序,本研究共在东北虎和华南虎中得到了647Gb的高质量clean data数据,平均测序深度为27.5×;共获得630万个高质量的SNPs,位于非编码区的SNP有429万个(67.9%),并且编码区域内的非同义突变有18534个,同义突变有25148个;共获得173万个InDels,其中有插入86万个(49.8%),缺失87万个(50.2%),位于编码区域的InDels有2114个(0.12%);共获得27万个CNVs,其中缺失有26万个(97.5%),重复有6783个(2.5%)。平均每个个体含有32676个CNV,平均长度为4029bp。 2.利用邻接法构建了两个虎亚种群体的进化树。从进化树中可以看出两个种群分化明显。利用线粒体全基因组对虎亚种之间的遗传距离和变异程度进行分析,得出亚种之间遗传距离最小的为东北虎与孟加拉虎,孟加拉虎与华南虎和苏门答腊虎的遗传距离最大,而华南虎与苏门答腊虎的遗传距离较小。通过重新构建虎和9种常见哺乳动物线粒体的系统发育树可以看出,大约100万年之前虎就与其他8个哺乳动物分离开。在100多万年以前,虎曾与人类的祖先共同生活在一起。 3.对东北虎与华南虎的PCA分析发现东北虎和华南虎在全基因组水平具有显著差异;从二者的遗传结构分析结果发现,东北虎与华南虎差异显著,这与进化树和PCA分析的结果一致。利用PSMC分析8只虎的种群历史动态得出8只虎亚种具有共同的祖先,并且在距今大约70万年的时间里,种群有一个扩张,并达到了最高的种群数量(Ne=3万)。 4.基于SNP的选择消除分析,在东北虎和华南虎中分别筛选出133和182个选择区域,经东北虎和华南虎的重叠区域进行合并,得到东北虎和华南虎的受选择区域分别有45和103个,二者之间共有79个相同的染色体区域。发现东北虎和华南虎中的基因具有相似的功能富集和通路分析,二者的基因主要富集在9大类生物功能中,其中参与能量代谢的基因有22个,参与肌肉收缩功能的基因共有9个,与肉食性取食相关的基因有20个。在KEGG通路分析中,东北虎和华南虎中的基因分别注释到200和192条生化通路,在这些通路分析中发现有38个基因参与到多条疾病相关通路中,这些基因可能与虎的易患疾病及寄生虫感染相关。 5.基于CNV的选择消除分析,本研究一共鉴定出782个受选择的区域,这些区域的总长度为3.9Mb,平均长度约为4.9Kb,包含94个基因。去除与SNP分析相一致的结果外,发现有30个参与嗔觉功能的基因。 6.对东北虎和华南虎受选择区域的基因进行分析,在东北虎的受选择区域共有702个基因,华南虎的受选择区域共有414个基因,二者共同拥有的基因有50个。发现位于东北虎中的PRKG1具有强烈正选择信号,并与冷适应相应答。TLE3基因在P101L位点上有一个错义SNP突变,从而推测该位点的突变可能是造成东北虎的体型较华南虎差异明显的原因。 关键词 东北虎;华南虎;基因组学;种群多样性;适应性进化
    英文摘要 With the commercialization of whole-genome sequencing, the cost of sequencing is reduced, the application is becoming more and more extensive and in-depth. Especially in biological diversity, population history dynamics, adaptive evolution and other aspects have important applications. The study on population evolution process of the tiger and the formation mechanism of species diversity, can not only deepen the understanding of the tiger, also provide reference for other species research combines the genomics with population genetics. The most important thing is to restore the population genetic diversity and finally release animals to nature. The tiger (Panthera Tigris) is the largest cat. In the ecosystem, tiger is at the top of the food chain and has the function of regulating ecological balance and species diversity. The tiger subspecies under different geographical conditions have undergone great changes in some aspects of the body. The physiology difference such as the fat, food digest and metabolism, appearance difference such as color, coat pattern, body size and so on. The study of formation mechanism is better to understand the species differences caused by regional differences, as well as the evolution for adapting to the environment. Meanwhile providing reference for the evolution mechanism of other subspecies and organisms. For the protection of endangered wild animals, to understand the history of their living condition and the relationship between population change and environmental laws, which play an important role for the corresponding protection measures. In this paper, the whole genome of 5 amur tigers and 3 South China tigers was resequenced by using next generation sequencing. To study the genetic structure and gene exchange of each group base on SNP and CNV, we get the fellowing conclusions: 1.Through whole genome resequencing, we obtain 647 Gb high quality clean datas, the average sequencing depth is 27.5×. A total of 6.3 million high-quality SNPs was obtained, with 4.29 million (67.9%) SNP in the non-coding region, and 18534 non-synonymous SNPs and 25,148 synonym SNPs in the coding region. A total of 1.73 million InDels were obtained, of which 860 thousand (49.8%) were insertion and 870 thousand (50.2%) were deletion, and the InDels in the coding region had 2114 (0.12%). A total of 270 thousand CNVs were obtained, of which 260 thousand (97.5%) were deletion, and 6783 (2.5%) were duplication. On average, each individual contains 32676 CNV, with an average length of 4029bp. 2.The phylogenetic tree of two tiger subspecies was constructed by Neighbor-Joining method. From the evolutionary tree we can see that the two subpopulations obvious differentiation. The genetic distance and variation degree between the subspecies were analyzed base on the mitochondrial genome, the smallest genetic distance amoung subspecies were the amur tiger and bengal tiger, and the biggest was bengal tiger with sumatran tiger and southern China tiger. But the genetic distance between southern China tiger and sumatran tiger was small. Through rebuilding tiger and several common mammalian mitochondrial phylogenetic tree, we estimated the divergence time amoung 9 mammalian genomes. About a million years ago the tiger was separated from other eight mammals. And more than a million years ago, the tiger has lived together with our ancestors. 3.The PCA analysis of amur tiger and South China tiger showed significant differences between them at the whole genome level. The results showed that the difference between amur tiger and South China tiger was significant, which was consistent with the results of the evolutionary tree and PCA analysis. PSMC analysis 8 tiger population history dynamic learns that eight tiger subspecies have a common ancestor, and in about 700 thousand years ago, populations have an expansion, and has achieved the highest population (Ne = 30 thousand). 4.From selective sweep analysis based on SNP, we screened 133 and 182 sweep regions in amur tiger and South China tiger, respectively. Getting rid of 79 common regions, we get 45 and 103 regions in amur tiger and South China tiger, respectively. In amur tiger and South China tiger, there has similar GO enrichment and KEGG pathway analysis, both of the genes mainly enrich in 9 kinds of biological functions, which involved in 22 genes in energy metabolism, 9 genes involved in muscle contraction function and 20 genes related to carnivorous feeding. In KEGG pathway analysis, there are 200 and 192 biochemical pathways in amur tiger and South China tiger, respectively. In these pathways analysis we found 38 genes involved in disease pathways and that may associate with tiger susceptible disease and parasitic infections. 5.Selective sweep analysis based on CNV, we identified 782 selected regions, the total length of these regions was 3.9Mb, and the average length was about 4.9Kb, including 94 genes. In addition to the results of SNP analysis, 30 genes involved in olfactory function were found. 6.In selected region, a total of 702 genes were found in amur tiger and 414 genes in South China tiger, both have 50 genes in common. It was found that the PRKG1 have strong positive selection signal and respond in cold adaptation in amur. TLE3 gene in amur tiger had a SNP non-synonymous mutation at P101L site, suggesting that the mutation of the P101L site may be the cause of the difference between the body size of amur tiger and South China tiger. Keywords Amur tiger; South China tiger; genomic; population diversity; adaptive evolution
    鸥维数据云查询平台
      联系我们
    • 电话:400-139-8015
    • 微信:vbeiyou
    • 邮箱:ovo@qudong.com
    • 总部:北京市海淀区学院路30号科群大厦西楼5层
    Copyright © 西北大学西部大数据研究院旗下“鸥维数据” 京ICP备17065155号-6