Development of genome wide insertion deletion markers in rice based on graphic pipeline platform
- 格式:pdf
- 大小:2.66 MB
- 文档页数:12
生物多样性 2016, 24 (2): 237–243 doi: 10.17520/biods.2015205 Biodiversity Science http: //・综述・InDel标记的研究和应用进展杨洁赫佳王丹碧施恩杨文宇耿其芳王中生*(南京大学生命科学学院, 南京 210023)摘要: InDel是指在近缘种或同一物种不同个体之间基因组同一位点的序列发生不同大小核苷酸片段的插入或缺失(insertion-deletion), 是同源序列比对产生空位(gap)的现象。
InDel在基因组中分布广泛、密度大、数目众多。
InDel 多态性分子标记是基于插入/缺失位点两侧的序列设计特异引物进行PCR扩增的标记, 其本质仍属于长度多态性标记, 可利用便捷的电泳平台进行分型。
InDel标记准确性高、稳定性好, 避免了由于特异性和复杂性导致的后续分析模糊。
此外, InDel标记能扩增混合DNA样品和高度降解的微量DNA样品, 并进行有效分型。
InDel标记目前已开始应用于动植物群体遗传分析、分子辅助育种以及人类法医遗传学、医学诊断等领域。
随着位于功能基因上InDel 标记的开发, 结合染色体步移和基因精细定位, 可将这些标记应用于相关物种经济性状的功能基因的筛选, 有利于优良基因的进一步开发和利用。
关键词:分子标记; InDel; SNP; SSRProgress in research and application of InDel markersJie Yang, Jia He, Danbi Wang, En Shi, Wenyu Yang, Qifang Geng, Zhongsheng Wang*School of Life Sciences, Nanjing University, Nanjing 210023Abstract: InDel indicates insertions or deletions (insertion-deletion) of nucleotide fragments of different siz-es at the same site in the genome sequence between the same or closely related species and is a gap in se-quence derived from alignment of the homologous sequence. InDel is widely distributed across the genome and occurs in a high density and large numbers in a genome. The InDel polymorphic molecular marker is a PCR-amplified marker that is based on specific primers designed from both sides of the site of sequence of insertion / deletion. It is essentially a length polymorphic marker still, and one can use the convenient elec-trophoresis platform for genotyping. InDel molecular markers have the advantage of high accuracy and good stability, which help to avoid confusion in subsequent analysis due to marker specificity and complexity, as is often seen in other length polymorphic markers. Furthermore, mixed or highly degraded DNA samples can be successfully amplified with InDel markers, and effectively typed. Because of its abundance, convenient typing platform and other advantages, InDel molecular markers have been applied to genetic analyses of an-imal and plant populations, molecular assisted crops and farmed animal breeding, human forensic genetics, medical diagnostics and other research areas. The development of the InDel molecular marker located on functional genes, combined with chromosome walking and fine gene mapping, has enabled the application of these molecular markers in the screening of genes related to important economic traits, which is conducive to the further development and utilization of these valuable genes. In this review, on the basis of an overview of the InDel marker development and applications, we discuss some of the technical limitations of the develop-ment and limited efficiency of genetic analysis, as well as potential future applications in the fine mapping and genetic structure of large numbers of individuals.Key words: molecular marker; InDel; SNP; SSR插入/缺失(insertion-deletion, InDel)是指在近缘种或同一物种不同个体之间基因组同一位点的序——————————————————收稿日期: 2015-07-15; 接受日期: 2015-12-17基金项目: 国家自然科学基金(31100270)∗通讯作者Author for correspondence. E-mail: wangzs@238 生物多样性 Biodiversity Science第24卷列发生了不同大小核苷酸片段的插入或缺失, 即一个序列上某一位点相比同源的另一个序列插入或缺失了一个或多个碱基(Weber et al, 2002)。
生物多样性 2016, 24 (2): 237–243 doi: 10.17520/biods.2015205 Biodiversity Science http: //・综述・InDel标记的研究和应用进展杨洁赫佳王丹碧施恩杨文宇耿其芳王中生*(南京大学生命科学学院, 南京 210023)摘要: InDel是指在近缘种或同一物种不同个体之间基因组同一位点的序列发生不同大小核苷酸片段的插入或缺失(insertion-deletion), 是同源序列比对产生空位(gap)的现象。
InDel在基因组中分布广泛、密度大、数目众多。
InDel 多态性分子标记是基于插入/缺失位点两侧的序列设计特异引物进行PCR扩增的标记, 其本质仍属于长度多态性标记, 可利用便捷的电泳平台进行分型。
InDel标记准确性高、稳定性好, 避免了由于特异性和复杂性导致的后续分析模糊。
此外, InDel标记能扩增混合DNA样品和高度降解的微量DNA样品, 并进行有效分型。
InDel标记目前已开始应用于动植物群体遗传分析、分子辅助育种以及人类法医遗传学、医学诊断等领域。
随着位于功能基因上InDel 标记的开发, 结合染色体步移和基因精细定位, 可将这些标记应用于相关物种经济性状的功能基因的筛选, 有利于优良基因的进一步开发和利用。
关键词:分子标记; InDel; SNP; SSRProgress in research and application of InDel markersJie Yang, Jia He, Danbi Wang, En Shi, Wenyu Yang, Qifang Geng, Zhongsheng Wang*School of Life Sciences, Nanjing University, Nanjing 210023Abstract: InDel indicates insertions or deletions (insertion-deletion) of nucleotide fragments of different siz-es at the same site in the genome sequence between the same or closely related species and is a gap in se-quence derived from alignment of the homologous sequence. InDel is widely distributed across the genome and occurs in a high density and large numbers in a genome. The InDel polymorphic molecular marker is a PCR-amplified marker that is based on specific primers designed from both sides of the site of sequence of insertion / deletion. It is essentially a length polymorphic marker still, and one can use the convenient elec-trophoresis platform for genotyping. InDel molecular markers have the advantage of high accuracy and good stability, which help to avoid confusion in subsequent analysis due to marker specificity and complexity, as is often seen in other length polymorphic markers. Furthermore, mixed or highly degraded DNA samples can be successfully amplified with InDel markers, and effectively typed. Because of its abundance, convenient typing platform and other advantages, InDel molecular markers have been applied to genetic analyses of an-imal and plant populations, molecular assisted crops and farmed animal breeding, human forensic genetics, medical diagnostics and other research areas. The development of the InDel molecular marker located on functional genes, combined with chromosome walking and fine gene mapping, has enabled the application of these molecular markers in the screening of genes related to important economic traits, which is conducive to the further development and utilization of these valuable genes. In this review, on the basis of an overview of the InDel marker development and applications, we discuss some of the technical limitations of the develop-ment and limited efficiency of genetic analysis, as well as potential future applications in the fine mapping and genetic structure of large numbers of individuals.Key words: molecular marker; InDel; SNP; SSR插入/缺失(insertion-deletion, InDel)是指在近缘种或同一物种不同个体之间基因组同一位点的序——————————————————收稿日期: 2015-07-15; 接受日期: 2015-12-17基金项目: 国家自然科学基金(31100270)∗通讯作者Author for correspondence. E-mail: wangzs@238 生物多样性 Biodiversity Science第24卷列发生了不同大小核苷酸片段的插入或缺失, 即一个序列上某一位点相比同源的另一个序列插入或缺失了一个或多个碱基(Weber et al, 2002)。
农大考博辅导班:2019中国农业大学预防兽医学考博难度解析及经验分享根据教育部学位与研究生教育发展中心最新公布的第四轮学科评估结果可知,在2018-2019年预防兽医学专业学校排名中,排名第一的是华中农业大学,排名第二的是华南农业大学,排名第三的是南京农业大学。
作为中国农业大学实施国家“211工程”和“985工程”的重点学科,动物医学院的预防兽医学一级学科在历次全国学科评估中均名列第四。
下面是启道考博辅导班整理的关于中国农业大学预防兽医学考博相关内容。
一、专业介绍预防兽医学是研究动物传染病和动物寄生虫病的病原特性、致病机理、流行规律、疾病诊断以及预防、控制的原理及技术的科学。
本学科由原先的动物寄生虫学与寄生虫病学、动物传染病学与预防兽医学、兽医微生物学与免疫学等学科组合而成,研究范围涉及各种致病性寄生虫、微生物及其感染的各种动物和人,深入动物群体、个体、细胞及分子水平。
预防兽医学是兽医学一级学科理论研究和实际应用的体现,该学科不仅事关动物保健,还直接服务于公共卫生、动物防疫检疫、诊断诊疗、生物制品及环境保护等领域。
中国农业大学动物医学院的预防兽医学专业在博士招生方面,划分为4个研究方向:090602预防兽医学研究方向: 01动物传染病诊断与防治技术02动物分子病毒学与免疫学03兽医寄生虫学04兽医细菌学复试科目:专业知识笔试:1、预防兽医学综合考试2、专业英语二、考试内容中国农业大学预防兽医学专业博士研究生招生为资格审查加综合考核形式,由初选+复核+录取构成。
其中,综合考核内容为:(一)初选:2019年1-3月1、由学院组织专家小组对申请人提交的材料进行审核,综合审查结果和招生导师组的意见,形成进入复试考核的申请人名单,并在申请学院主页及研究生招生信息网上公示。
对初选结果有异议的申请人可以通过公布的电话申请复查。
进入复核阶段的申请人在相关学科专业范围内选择填报导师。
2、国家专项计划申请人通过学院初审后,需提交学校“专项计划”招生审核组对当年专项招生计划进行审核排序,本着择优原则确定进入复核阶段的人选。
宁夏农林科技,基金项目:宁夏农林科学院自主研发项目(DWHZC-2017007)。
作者简介:佘奎军(1982-),男,宁夏平罗人,副研究员,硕士研究生,主要从事玉米遗传育种工作。
E-mail:通信作者:杨国虎(1970-),男,宁夏银川人,博士,研究员,主要从事玉米遗传育种工作。
E-mail:收稿日期:2020-04-26修回日期:2021-06-091单倍体的发现及单倍体诱导系的选育玉米单倍体育种技术研究进展及展望佘奎军1,刘艳妮2,程晋龙1,杨国虎11.宁夏农林科学院农作物研究所,宁夏银川750002;2.银川能源学院,宁夏永宁750105摘要:玉米是重要的粮食、饲料作物,以及医药、工业原料。
选育优质高产的玉米杂交种对保障粮食安全,推动畜牧业的发展和保障工业原料的持续供给具有非常重要的作用。
单倍体育种技术是利用诱导系诱导产生孤雌生殖单倍体,单倍体加倍后可以快速获得纯系。
利用该技术可加快玉米自交系选育进程,提高选育效率。
文章阐述了单倍体育种的关键技术,包括单倍体诱导系的选育、单倍体诱导的机理、影响单倍体诱导率的因素、单倍体的鉴定方法、单倍体加倍的条件,分析了单倍体育种技术存在的问题,展望了玉米单倍体育种技术应用前景。
关键词:玉米;单倍体育种技术;诱导系中图分类号:S513文献标识码:A文章编号:1002-204X (2021)10-0025-06doi:10.3969/j.issn.1002-204x.2021.10.007Research Progress in Double Haploid Breeding Technology of MaizeAbstract Key words 2562卷10期R njABPI2产生单倍体诱导的机制qhirBqhirMATRILINEL MATLZmPLA NOTLIKE DAD NLDZmPLA MATRILINEL MATL NOT LIKE DAD NLDqhirZmDMP ZmPLA ZmDMPZmDMP3单倍体的诱导率及其影响因素佘奎军,等玉米单倍体育种技术研究进展及展望264单倍体的鉴定方法R njR njR njR nj R njR njR nj6单倍体育种技术存在的问题及展望R njABPI5单倍体的二倍体化方法CRISPR CasgRNACRISPR Cas参考文献:[1]CHASE S S.Selection for parthenogenesis and monoploids[J].Agronomia,1952,44:263-267.[2]COE E H.A line of maize with high haploid frequency[J].The American Naturalist,1959,93:381-382.[3]KERMICLE J L.Androgenesis conditioned by a mutationin maize[J].Science,1969,166:1422-1424.[4]LASHERMES P,BECKERT M.Genetic control ofmaternal haploidy in maize(Zea mays L.)and selection of haploid inducing lines[J].Theoretical&Applied Genetics,1988,76:405-410.[5]CHALYK S T.Creating new haploid-inducing lines ofmaize[J].Maize Genet Coop Newslet,1999,73:53.[6]Röber F K,GORDILLO G A,GEIGER H H.Invivo haploid induction in maize-performance of new inducers and significance of doubled haploid lines in hybrid breeding[J].Maydica,2005,50:275.[7]CHEN S J,Song T M.Identification haploid withhigh oil xenia effect in maize[J].Acta Agronomica Sinica,2003,4:19.[8]CAI Z G,XU X H,LIU Y L et al.The breeding ofJAAS3-haploid inducer with high frequency partheno-genesis in maize[J].Jorunal of Maize Sciences,2007, 151:1-4.[9]LI L,LI H C,XU X W,et al.Preliminary optimiza-tion of in-vivo haploid induction in maize[J].Journal of China Agricultural University,2012,171:9-13.[10]李向永,姜龙,王薪淇,等.糯玉米单倍体诱导和加倍的研究[J].西北农林科技大学学报(自然科学版),2016,44(8):90-96.[11]WEBER D F.Today’s use of haploids in cornplant breeding[J].Advances in Agronomy,2014,123:123-144.[12]HU H,SCHRAG T A,PEIS R,et al.The geneticbasis of haploid induction in maize identified with anovel genome-wide association method[J].Genetics,2016,202:1267-1276.[13]PRIGGE V,MELCHINGER A E.Production ofhaploids and doubled haploids in maize[M].In:Loy-ola-Vargas VM,Ochoa-Alejo N(eds)Plant cellculture protocols.Berlin:Springer,2012:161-172. [14]DONG X,XU X,MIAO J,et al.Fine mapping ofqhir1influencing in vivo haploid induction in maize[M].Theoretical&Applied Genetics,2013,126:1713-1720.[15]SCHNABLE P S,WARE D,FULTON R S,et al.The B73maize genome:complexity,diversity,anddynamics[J].Science,2009,326:1112-1115.[16]KELLIHER T,STARR D,RICHBOURG L,et al.Matrilineal,a sperm specific phospholipase,triggersmaize haploid induction[J].Nature,2017,542:105-109.[17]LIU C,LI X,MENG D,et al.A4-bp insertion atZmPLA1encoding a putative phospholipase a gener-ates haploid induction in maize[J].Molecular Plant, 2017,10(3):520-522.[18]GILLES L M,KHALED A,LAFFAIRE J,et al.Losso f pollen‐specific phospholipase NOT LIKE DAD triggers gynogenesis in maize[J].The EMBO Journal, 2017,36:707-717.[19]ZHONG Y,LIU C X,QI X L.et al.Mutation ofZmDMP enhances haploid induction in maize[J].Nature Plants,2019,5:575-580.[20]KRIZ A L,LARKINS B A.Molecular genetic approachesto maize improvement[M].Berlin:Springer-Verlag,2009.[21]CHALYK S T.Creating new haploid-inducing linesof maize[J].Maize Genet Coop Newslet,1999,73:53 [22]ROTARENCO V A,DICU G,STAGE D,et al.New inducers of maternal haploids in maize[J].Maize Genet Coop Newsl,2010,84:15.[23]GEIGER H H.Doubled haploids.In:Bennetzen JL,Hake S(eds)Handbook of maize[M].New York: Springer,2009,641-657.[24]LIU Z Z,SONG T M.The breeding and identifica-tion of haploid inducer with high frequency parthenogenesis in maize[J].Acta Agronomica Sinica, 2000,26:570-574.[25]LASHERMES P,BECKERT M.Genetic control ofmaternal haploidy in maize(Zea mays L.)and selection of haploid inducing lines[J].Theoretical&Applied Genetics,1988,76:405-410.[26]EDER J,CHALYK S.In vivo haploid induction inmaize[J].Theoretical&Applied Genetics,2002,104: 703-708.[27]LI L,XU X,JIN W,et al.Morphological and molecu-lar evidences for DNA introgression in haploid induction via a high oil inducer CAUHOI in maize[J].Planta, 2009,230:367-376.[28]PRIGGE V,SCHIPPRACK W,MAHUKU G,et al.Development of in vivo haploid inducers for tropical maize breeding programs[J].Euphytica,2012,185: 481-490.[29]CHAIKAM V,NAIR S K,MARTINEZ L,et al.Marker-assisted breeding of improved maternal hap-loid inducers in maize for the tropical/subtropical regions[J].Front Plant Science,2018,9:1527.[30]刘志增,宋同明.玉米高频率孤雌生殖单倍体诱导系(上接第19页)[8]何进尚,柳伟祥,刘阳春,等.宁夏灵武市郝家桥镇农田土壤盐分含量的空间分布特点[J].安徽农学通报,2009,15(3):54-56.[9]黄绍文,金继运,杨俐苹,等.土壤养分空间变异的影响因素[M].金继运,白由路.精准农业与土壤养分管理.北京:中国大地出版社,2001.39-50.[10]何文寿,何进勤,王菊兰,等.宁夏引黄灌区春小麦不同生育期土壤有效氮磷钾养分含量的动态变化特点[J].西北农业学报,2006,15(3):54-59.责任编辑:达海莉的选育与鉴定[J].作物学报,2000,26(5):570-574. [31]SARKAR K R,PANKE S,SACHAN JKS.Devel-opment of maternal haploidy-inducer lines in maize (Zea mays L.)[J].Indian Journal Agricultural Sciences, 1972,42:781-786.[32]BELICUAS P,GUIMARAES C,PAIVA J,et al.An-drogenetic haploids and SSR markers as tools for the development of tropical maize hybrids[J].Euphytica, 2007,156:95-102.[33]Preciado-Ortiz R E,García-Lara S,Ortiz-Islas S,etal.Response of recurrent selection on yield,kernel oil content and fatty acid composition of subtropical maize populations[J].Field Crops Research,2013,142(20):27-35.[34]DONG X,XU X W,LI L,et al.Marker-assistedselection and evaluation of high oil in vivo haploid inducers in maize[J].Molecular Breeding,2014,34: 1147-1158.[35]MELCHINGER A E,SCHIPPRACK W,Würschum T,et al.Rapid and accurate identification of in vi-vo-induced haploid seeds based on oil content in maize[J].Scientific Reports,2013(3):2129.[36]YU W,BIRCHLER J A.A green fluorescent pro-tein-engineered haploid inducer line facilitates haploid mutant screens and doubled haploid breeding in maize [J].Molecular Breeding,2016,36:1-12.[37]MOLENAAR W S,de Oliveira Couto E G,PIEPHOH,et al.Early diagnosis of ploidy status in doubled haploid production of maize by stomata length and flow cytometry measurements[J].Plant Breeding,2019, 138(3):266-276.[38]CHOE E,CARBONERO C H,MULVANEY K,etal.Improving in vivo maize doubled haploid produc-tion efficiency through early detection of false positives [J].Plant Breeding,2012,131:399-401.[39]BLAKESLEE A F,AVERY A G.Methods of induc-ing chromosome doubling in plants by treatment with colchicine[J].Journal of Heredity,1937,28(12): 393-411.[40]MELCHINGER A E,BRAUNER P C,Böhm J,et al.In vivo haploid induction in maize:comparison of dif-ferent testing regimes for measuring haploid induction rates[J].Crop Science,2016,56:1127-1135.[41]MOLENAAR W S,SCHIPPRACK W,BRAUNET PC,et al.Haploid male fertility and spontaneous chro-mosome doubling evaluated in a diallel and recurrent selection experiment in maize[J].Theoretical and Ap-plied Genetics,2019,132:2273-2284.[42]魏俊杰,张晓丽,陈梅香,等.6叶期秋水仙素注射处理玉米单倍体的加倍效果研究[J].玉米科学,2007,15(4):49-51.[43]VIJAY C,WILLEM M,ALBRECHT E,et al.Dou-bled haploid technology for line development in maize:technical advances and prospects[J].Theoretical and Applied Genetics,2019,132:3227-3243.[44]WANG B,ZHU L,ZHAO B,et al.Developmentof a haploid-inducer mediated genome editing system for accelerating maize breeding[J].Molecular Plant, 2019,12(4):597-602.[45]KELLIHER T,STARR D,SU X,et al.One-stepgenome editing of elite crop germplasm during haploid induction[J].Nature Biotechnology,2019,37:287-292.责任编辑:达海莉。
作物遗传育种学科在农业科学中占有核心地位[1],其根本任务是从基因型和环境2个层面研究并形成作物持续高产、优质、高效的理论、方法和技术,是关于大田作物生产与品种遗传改良的学科。
学科的创新发展,对保障国家粮食安全具有重大的意义。
粮食产量的“十一连增”[2]离不开作物遗传育种学科的创新和进步。
近年来,随着生物技术、信息技术和新材料技术的快速发展,我国的传统作物遗传育种学科迎来了新的发展机遇,成为生命科学领域具有发展潜力的学科之一,在基础、应用基础和应用研究方面取得了突破性进展,研发了1批重大的科技成果,对国家粮食安全和农业可持续发展做出了显著贡献。
1我国作物遗传育种学科的主要研究进展1.1基因组学等新技术广泛渗透近10a ,由“Next Generation Sequencing (NGS )”技术引领的基因组学技术正在一个空前的高速度推动下迅猛发展。
目前,高通量NGS 技术已经成为生命科学领域中应用最为广泛的研究手段。
例如,完成了小麦A 、D 基因组等图谱的绘制[3,4];构建了第2代玉米单体型图谱,其中包含了5500万个SNP 标记[5];对二倍体棉花———雷蒙德氏棉全基因组进行测序,并阐述了棉花基因组的多倍化及其纤维发育[6]。
此外,基DOI :10.16318/ki.hbnykx.2015.06.018河北农业科学,2015,19(6):66-70Journal of Hebei Agricultural Sciences编辑蔡海燕我国作物遗传育种学科的发展现状与“十三五”发展重点张江丽1,董文琦2,杜晓东3*(1.中国农业科学院科技管理局,北京100081;2.河北省农林科学院,河北石家庄050051;3.河北省农林科学院农业信息与经济研究所,河北石家庄050051)摘要:作物遗传育种学科的创新发展,对促进现代农业的健康发展和保障我国粮食安全具有重大的意义。
作者总结了近年来我国在作物遗传育种学方面取得的主要进展,从基因组学、种质资源保护与利用、新基因挖掘、作物杂种优势机理及利用、分子标记育种、分子设计育种、作物细胞工程和诱变育种等方面分析了“十三五”重点发展的方向。
南方农业学报 Journal of Southern Agriculture 2023,54(11):3340-3348ISSN 2095-1191; CODEN NNXAABDOI:10.3969/j.issn.2095-1191.2023.11.020利用iCaspase9特异性诱导鸡原始生殖细胞消除的研究孙玲玲,黄振文,田奎,张嘉乐,郑基坛,濮黎萍,陆阳清*(广西大学动物科学技术学院/广西畜禽繁育与疾病防控重点实验室,广西南宁530004)摘要:【目的】制备在生殖细胞特异表达iCaspase9的鸡原始生殖细胞系(PGCs),提升外源性PGCs的生殖传递效率,为生产诱导不育的雄性或雌性受体鸡胚打下基础。
【方法】利用piggyBac转座子将iCaspase9片段整合到鸡基因组中,构建稳定转染iCaspase9片段的CAG-iCaspase9-mCherry DF-1细胞系,通过荧光微显镜观察鉴定其在iCaspase9系统诱导底物B/B二聚化配体作用下的细胞消除效果;再利用CRISPR/Cas9技术将iCaspase9片段定点插入DAZL基因第11外显子之后构建Dazl-iCaspase9-EGFP PGCs,经iCaspase9片段插入鉴定和PGCs生殖特性鉴定后,通过荧光微显镜观察鉴定其在B/B二聚化配体诱导下的消除效果。
【结果】成功建立的CAG-iCaspase9-mCherry DF-1细胞系与野生型DF-1细胞(WT DF-1)的生长增殖无显著差异(P>0.05,下同);CCK-8法测定发现,0.25~5.00 nmol/L B/B二聚化配体对WT DF-1细胞生长数量无显著影响,但CAG-iCaspase9-mCherry DF-1细胞在添加B/B二聚化配体后,细胞数量较空白对照组极显著降低(P<0.01,下同),且失去正常的细胞形态,逐渐变圆甚至凋亡。
成功建立的DAZL-iCaspase9-EGFP PGCs仍特异性高表达Cvh、Nanog、PouV、DAZL、Cdh和Ddx4等生殖细胞相关基因,在蛋白水平上也能鉴定出DAZL、SSEA1和CVH等蛋白,说明插入iCaspse9片段后并没有改变生PGCs的殖细胞特异性;在0.25 nmol/L B/B二聚化配体诱导作用下,DAZL-iCaspae9-EGFP PGCs数量显著减少,几乎全部消除,说明iCaspase9系统可诱导细胞凋亡而有效消除内源性PGCs。
基因变异Somatic cell genetic defects can not be transmitted to the next generation.对According to current researches, Single nucleotide variants (SNV) and Insertion–deletion variants (indels) are easier to be interpreted than other types of variants.对Variants in introns don’t affect the transcription regulation process.错In some genes, there are mutation hotspots which may contribute a lot to the disease pathogenicity.对Non-homologous equal crossover yields fusion genes.错PART II: FILL IN THE BLANKS.6. Human genetic variants can be classified into fivecategories.Apart from Insertion–deletion variants (indels), Block substitutions, and Inversion variants, what are the rest two categories? single nucleotide variants and copy number variants7. Effects of disease-causing mutations on function of gene product include four aspects, which are discribed as follows: loss of function, gain of function, acquisition of novel property, and abnormal expression: heterochronic or ectopic. Among them, the majority of the cause of pathology is loss of functionStrand slippage can be caused by ( ) and results in insertions and deletions. simple sequence repeats9. Splicing processes affected by mutations in introns which happen in the receptor sites or donor sites bring about intron retention, exon skipping and ( ) activation of cryptic splice sites10.( )mutations in exon coding sequences always create truncated proteins which may have a prior tendency to result in abnomal protein function. Nonsense致病基因的识别填空题Alleles of SNPs that are close together tending to be inherited together can be called as (haplotype)Transgenic animals can be set for modelling diseases and understanding of (gene function )多选题3.The 3 Generation Human Genetic Markers used for linkage analysis can be listed are follows, please sort them in chronological order:Restriction Fragment Length Polymorphism (RFLP),Microsatellite Markers (CA Repeats),Single Nucleotide Polymorphism (SNP) and HaplotypeSingle Nucleotide Polymorphism (SNP) and Haplotype,Microsatellite Markers (CA Repeats),Restriction Fragment Length Polymorphism (RFLP)Microsatellite Markers (CA Repeats),Restriction Fragment Length Polymorphism (RFLP),Single Nucleotide Polymorphism (SNP) and HaplotypeSingle Nucleotide Polymorphism (SNP) and Haplotype,Restriction Fragment Length Polymorphism (RFLP),Microsatellite Markers (CA Repeats)4.Positional cloning is a method to clone a gene based simply on knowing its position in the genome by linkage analysis without any idea of the function of that gene. We can carry out positional cloning by 3 steps, which can be listed in chronological order are follows.Fine mapping,Disease mapping,Candidate genesFine mapping,Candidate genes,Disease mappingCandidate genes,Disease mapping,Fine mappingDisease mapping,Fine mapping,Candidate genes窗体底端5.Mutation screening strategy includes ( )RT-PCR SSCP / Heteroduplex / dHPLC Sequencing / DNA chip6.Research and clinical testing of inherited monogenic diseases such as LCA can be carried out by ( )Single gene testing Targeted sequence captured panelWhole exome sequencing Whole genome sequencing是非题7.Families collection, linkage analysis and functional study of a gene are all important for position dependent candidate cloning and the identification of a candidate gene.true false8.aCGH (Array Comparative Genomic Hybridization) is a method of testing for SNP (Single Nucleotide Variation).true false9.The Genome Wide Association approach (GWAS) is a method based on the whole genome which can be applied mainly for polygenic disease. Researchers have identified a great many of susceptibility genes that may contribute to disease predisposition by GWAS.true false10.The identification of disease causative mutations can be largely assisted by the reference for related databases such as Online Mendelian Inheritance in Man (OMIM), Human Gene Mutation Database (HGMD),etc.true false遗传病的分子机理填空题DNA sequence that is homologous with a known gene but non-functional can be called as a (pseudogene ).A group of adjacent genes that are identical or related can be called as a (gene cluster).选择题3.The effects of gene mutations on protein function include:A.Loss of protein functionB.Gain of protein functionC.Novel property aquirementD.Heterochronic or Ectopic expression or both4.Which type of human globin genes DOESN’T express during prenatal period?A.αB.βC.γD.ξ5.A individual who carries the β0 /β0 genotype w ill present a phenotype called:A.Sickle cell anemia B.Hb Bart’s syndrome C.β-thalassemia trait D.β-thalassemia major6.Diseases can be due to mutations in the following different classes of proteins:A.Transport and storageB.Structure of cells and organsC.Extracellular homeostasisD.Developmental geneexpression E.Control of growth and differentiationF.Intercellular metabolism and communication判断题7.Thalassemias can arise in the following types of mutations: deletions and single nucleotide variations(SNV).true8.The mutation spectrum difference between α-thalassemia andβ-thalassemia is thatα-thalassemia is mainly caused by point mutations and β-thalassemia is mainly caused by deletions.false9.Mutations in different genes of a gene cluster result in the same disease subtype。
玉米二态性InDel位点的鉴定和分子标记开发周玲;梁帅强;林峰;吕远大【摘要】随着下一代测序技术( Next generation sequencing, NGS)的发展,可以利用高通量测序数据挖掘基因组特异的插入/缺失( InDel)位点,这使InDel分子标记的开发成为可能。
本研究利用327份玉米自交系的测序数据,通过全基因组结构变异评估发现分布于10条染色体上的25847个InDel位点。
利用PIC值≥0.30的3304个位点进行系统发育树构建和群体结构分析,其结果与利用SNP位点分析获得的结果高度一致,说明这些多态性较高的InDel位点可以作为代表性标记用于玉米遗传组成的差异分析。
采用53对PCR引物进行扩增试验,发现这些InDel引物具有位点专化特征并且其杂交种鉴定的平均杂合率为0.5692,表明这些标记能准确鉴定出玉米杂交种的杂合度。
%Next generation sequencing ( NGS) technology revolutionized the throughput of sequence data generation and enabled genome-wide identification of insertion and deletion variation ( InDel) , which could provide potential informa-tion for molecular marker development. However, large scale characterization of InDel in maize genome was poorly under-stood. Here, we evaluated 327 maize inbreed lines genomes and detected more than 25 847 biallelic InDel loci across ten chromosomes. 3 304 InDel loci with relative high PIC values were used to construct the genetic dendrogram, which was consistent with the results concluding from all called SNP loci, suggesting that these biallelic InDel loci could indiscrimi-nately describe maize genome variations. To explore the application of the InDel loci, we developed 53 InDel markers for PCR validation. The results exhibited thatthe newly developed molecular markers were not only locus specific, but had the capabilities for identifying the hybrid seed heterozygosity. Taken together, our results will particularly facilitate the develop-ment of biallelic molecular marker for studies in genetics and have potential for marker-assisted breeding.【期刊名称】《江苏农业学报》【年(卷),期】2016(032)006【总页数】9页(P1223-1231)【关键词】玉米;二态性 InDel 位点;系统发育树;群体结构【作者】周玲;梁帅强;林峰;吕远大【作者单位】江苏省农业科学院农业生物技术研究所/江苏省农业生物学重点实验室,江苏南京 210014;江苏省农业科学院农业生物技术研究所/江苏省农业生物学重点实验室,江苏南京 210014;江苏省农业科学院农业生物技术研究所/江苏省农业生物学重点实验室,江苏南京 210014;江苏省农业科学院农业生物技术研究所/江苏省农业生物学重点实验室,江苏南京 210014【正文语种】中文【中图分类】S513.3玉米起源于美洲[1],是典型的异交作物,表现为极端的近交衰退和杂种优势,因此杂种优势一直是玉米新品种选育的重要依据。
DOI: 10.3724/SP.J.1006.2022.14025甘蓝型油菜白花基因InDel连锁标记开发王瑞1,2陈雪1,2郭青青1,2周蓉1,2陈蕾1,2李加纳1,2,*1西南大学农学与生物科技学院,重庆400715;2 重庆市油菜工程技术研究中心,重庆400715摘要:碱基插入/缺失(InDel)是基因组上广泛分布的遗传变异形式。
但甘蓝型油菜白花基因InDel连锁标记还未见有关研究报道。
本研究以甘蓝型油菜双单倍体(doubled haploid, DH)纯系黄花Y05和甘蓝型油菜纯系白花W01杂交构建F2群体。
在F2群体中选取30株极端白花和30株极端纯黄花构建叶片DNA子代池,对亲本和DNA子代池进行30×重测序。
以法国甘蓝型油菜Darmor-bzh为参考序列,QTL-seq流程和PoPoolation2流程相互结合鉴定白花基因候选区间,2种方法均将白花基因定位于法国甘蓝型油菜Darmor-bzh C03染色体52~54 Mb区间。
利用IGV软件可视化白花基因候选区间插入缺失(InDel)变异位点,依据候选区间序列信息设计InDel引物,聚丙烯酰胺凝胶电泳筛选到8个与白花基因连锁共分离的InDel标记。
上述研究为甘蓝型油菜白花基因精细定位和分子标记辅助选育以及白花基因功能标记开发奠定了研究基础和工作思路。
关键词:甘蓝型油菜;重测序;白花基因;InDel标记;Development of linkage InDel markers of the white petal gene based on whole-genome re-sequencing data in Brassica napus L.WANG Rui1,2, CHEN Xue1,2, GUO Qing-Qing1,2, ZHOU Rong1,2, CHEN Lei1,2, and LI Jia-Na1,2,*1College of Agronomy and Biotechnology, Southwest University, Chongqing 400715, China; 2Chongqing Engineering Research Center for Rapeseed, Chongqing 400715, ChinaAbstract: InDel is widely distributed across the genome and occurs in a high density and large numbers in a genome. To date, the researches about linkage InDel markers of the white petal gene in B. napus L are very less yet. In this study, we constructed the F2 mapping population from the cross between DH Y05 (yellow petal) and DH W01 (white petal). Two bulks with 30 yellow petal lines and 30 white petal lines of F2 population were constructed by mixing an equal amount of DNA. Then two bulks and parents were performed 30× whole-genome re-sequencing. Darmor-bzh as the reference genome was aligned to sequence data from the two bulks and parents. QTL-seq and PoPoolation2 workflow were applied to identify the candidate region of the white petal gene. A major candidate region was identified on chromosome C03 (52–54 Mb) of Darmor-bzh. The insertion-deletion (InDel) sites can be visualized in candidate interval by Integrative Genomics Viewer (IGV). Based on these Indel variations, we used Vector and Blast to design InDel primers. Eight InDel markers closely linked to the white petal gene were screened by Polyacrylamide gel electrophoresis (PAGE). In summary, these results provide a basis for fine mapping white petal gene and InDel molecular marker located on functional genes as well as molecular marker assisted selection breeding.Keywords: Brassica napus L.; re-sequencing; white petal genes; InDel markers甘蓝型油菜(Brassica napus,AACC)属十字花科(Cruciferace)芸薹属(Brassica),是由白菜型油菜(Brassica rapa,AA)和甘本研究由高等学校学科创新引智计划(111计划)项目(B12006)资助。
Development of genome-wide insertion/deletion markers in rice based on graphic pipeline platformYang L €u1†,Xiao Cui 1†,Rui Li 1,Piaopiao Huang 1,Jie Zong 2,Danqing Yao 3,Gang Li 3,Dabing Zhang 1and Zheng Yuan 1*1State Key Laboratory of Hybrid Rice,School of Life Sciences and Biotechnology,Shanghai Jiao Tong University,Shanghai 200240,China,2NovelBioinformatics Company,Shanghai 200240,China,3Shanghai Agricultural Technology Extension and Service Center,Shanghai 201103,China.†These authors contribute equally to this work.*Correspondence:zyuan@Abstract DNA markers play important roles in plant breed-ing and genetics.The Insertion/Deletion (InDel)marker is one kind of co-dominant DNA markers widely used due to its low cost and high precision.However,the canonical way of searching for InDel markers is time-consuming and labor-intensive.We developed an end-to-end computational solution (InDel Markers Development Platform,IMDP)to identify genome-wide InDel markers under a graphic pipeline environment.IMDP constitutes assembled genome sequen-ces alignment pipeline (AGA-pipe)and next-generation re-sequencing data mapping pipeline (NGS-pipe).With AGA-pipe we are able to identify 12,944markers between the genome of rice cultivars Nipponbare and ing NGS-pipe,we reported 34,794InDels from re-sequencing data of rice cultivars Wu-Yun-Geng7and bining AGA-pipe and NGS-pipe,we developed 205,659InDels in eight japonica and nine indica cultivars and 2,681InDels showed a subgroup-speci fic pattern.Polymerase chain reaction (PCR)analysis of subgroup-speci fic markers indicated that the precision reached 90%(86of 95).Finally,to make them available to the public,we have integrated the InDels/markers information into a website (Rice InDel Marker Database,RIMD,http://202.120.45.71/).The application of IMDP in rice will facilitate ef ficiency for development of genome-wide InDel markers,in addition it can be used in other species with reference genome sequences and NGS data.Keywords:Genetic polymorphism;genome alignment;InDel marker;molecular breeding;next-generation sequencingCitation:L €uY,Cui X,Li R,Huang P,Zong J,Yao D,Li G,Zhang D,Yuan Z (2015)Development of genome-wide insertion/deletion markers in rice based on graphic pipeline platform.J Integr Plant Biol XX:XX –XX doi:10.1111/jipb.12354Edited by:Li-Jia Qu,Peking University,China Received Dec.18,2014;Accepted Mar.19,2015Available online on Mar.21,2015at /journal/jipb©2015Institute of Botany,Chinese Academy of SciencesINTRODUCTIONRice (Oryza sativa L.)is the most widely consumed food staple for humans (Goff et al.2002).The rapid growth of world population coupled with a decrease in clean water and arable land resources,have urged plant scientists to place most of their efforts on the study of increasing yield of rice and other crops,during which molecular genetic techniques such as molecular marker-assisted selection (MAS)have been widely used (Mohan et al.1997).Successful MAS dependent crop breeding relies greatly on functional genomic analysis and application of agronomically important genes (Collard and Mackill 2008),as well as ef ficient screening assays based on DNA polymorphism detection (Collard et al.2005).Molecular markers have been developed rapidly during the past three decades,from hybridization-based molecular markers,such as restriction fragment length polymorphism (RFLP)(Botstein et al.1980),to polymerase chain reaction (PCR)-based molecular markers such as simple sequence repeat (SSR)(Powell et al.1996),random ampli fied poly-morphic DNA (RAPD)(Williams et al.1990),insertion/deletion markers (InDels)(Rickert et al.2003;Shen et al.2004),and to sequencing-based molecular markers such as single-nucleo-tide polymorphisms (SNPs)(Rafalski 2002).SNPs could also be transformed to cleave ampli fied polymorphism sequence (CAPS)(Konieczny and Ausubel 1993)and derived cleaved ampli fied polymorphic sequence (dCAPS)(Neff et al.1998),both of which were based on PCR and subsequent restriction enzyme digestion.All of these markers have been proved useful tools for species identi fication,genetic variation studies and population structure analysis,phylogenetic analysis,genetic mapping,and MAS (McCouch et al.1997).Given the drawbacks of SNPs and its derived markers such as high-cost or special equipment requirement,PCR-based markers (RAPD,SSR and InDels)are more widely used than SNPs in research and breeding.SSR markers are more widely used than RAPD because of convenient detection,moderate cost,high reproducibility,and multi-allele separation capa-bility (Powell et al.1996),but are limited due to insuf ficient genome abundance and coverage as well as ambiguities in complex and heterogeneous mutation patterns (PAN et al.2008;Liu et al.2013).The InDel marker,one kind of co-dominant PCR-based markers,has a higher order of magnitude than SSRs and displays no ambiguity under the same detection condition.Molecular markers are useful tools in research and breeding,but the high cost of marker development and marker application is the major limitation for its application (Collard and Mackill 2008).The genome-wide polymorphisms analysis and development of reliable molecular markers are time-consuming and labor-intensive,which also limit the application of MAS enormously.The application of next-Free AccessNew Technologygeneration sequencing(NGS)technologies have allowed the discovery of a large number of genome-wide DNA poly-morphisms,such as SNPs and InDels in relatively shorter time (Varshney et al.2009),which reduce the cost of marker development(Gao et al.2012).As the top two largest polymorphic mutations in the genome,the high density of SNP and InDel polymorphisms make them more suitable for genome-wide marker development(PAN et al.2008).In2009, the International Rice Functional Genomics Consortium initiated a rice sequence analysis program using20japonica/ indica cultivars and identified160,000SNPs(McNally et al. 2009).Huang et al.identified approximately3.6million SNPs and performed genome-wide association studies(GWAS)for 14agronomic traits via sequencing517rice landraces(Huang et al.2010).The application of SNPs genotyping analysis, however,is still not practical for most research laboratories due to its high cost and special equipment requirement(Liu et al.2013).In contrast,InDel markers are low-cost and technically easier to use for most laboratories.Therefore, genome-wide discovery of InDel polymorphisms offers a practical approach for basic research and MAS.With the continuous release of assembled and re-sequenced genomes, InDel markers have been developed in several plant species such as Arabidopsis(Hou et al.2010;Pacurar et al.2012), Brassica rapa(Liu et al.2013),and rice(Shen et al.2004;Arai-Kichise et al.2011).Despite many successful cases being reported,less time-and labor-consuming tools are in urgent need to be developed for discovery of genome-wide InDel markers.In this study,we developed an end-to-end computational solution(InDel Marker Development Platform,IMDP)to identify genome-wide InDel markers under Laboratory of Neuro Imaging(LONI)Pipeline Processing Environment(Rex et al.2003)for both assembled genome sequences and NGS ing assembled genome sequences pipeline (AGA-pipe)and next-generation re-sequencing data pipeline (NGS-pipe)of IMDP,Nipponbare/93-11and Wu-Yun-Geng7/ Guang-La-Ai4were analyzed for InDel markers development separately.Furthermore,we unified AGA-pipe and NGS-pipe and developed2,681subgroup InDels between eight japonica cultivars and nine indica rmation of all the InDels/ markers was then constructed into a user-friendly Rice InDel Markers Database(RIMD,http://202.120.45.71/),which would benefit the rice genomic research community in future studies and molecular breeding.The IMDP we developed in this study could also be applied in other species with assembled reference genome.RESULTSOverview of IMDPIMDP was designed with two process steps:credible InDels calling and automatic high quality primer design with e-PCR specificity validation.Depending on the nature of available genomic data(complete genome sequence or NGS data), two strategies for InDels calling were used in the InDel calling step:genome alignment(AGA-pipe)and NGS reads mapping(NGS-pipe).Both of AGA-pipe and NGS-pipe provide end-to-end solutions for InDel markers develop-ment,which differ in InDel calling processes but use the same modules in primer design and e-PCR validation (Figure1A).AGA-pipe used MUMmer3to obtain homologous regions, and sequence alignment of two assembled chromosomes and the output was subsequently processed by several scripts to identify optimal InDels for primer design and e-PCR validation. For rice cultivars without assembled genome sequences, NGS-pipe could be used to map re-sequencing data to reference genome and used it as a‘bridge’to identify InDel polymorphisms(Figure S1)(Liu et al.2013).NGS-pipe contains modules for trimming,sequence quality assessment,reads mapping,mapping quality assessment,InDel calling,primer design and e-PCR validation.In particular,NGS-pipe used Pindel for InDel detection,which excels in intermediate and long InDels detection that are suitable for PCR-based marker development.In the primer design module,Primer3program was used to automatically select primers based on user-specified criteria (Table S1),such as primer size,GC content,melting temper-ature,PCR product size,and others(Rozen and Skaletsky 2000).Followed by Primer3,MFEprimer2(e-PCR,computer simulations of theoretical PCR results)was used to evaluate the primers considering the conditions such as number of binding sites of the primers in the target sequence,stability at the30-end of the primer,melting temperature,GC content and several other criteria(Qu et al.2009).The chance of PCR failure or mis-priming due to off target matches in genome could be therefore greatly reduced.Finally,all IMDP models were wrapped into LONI processing pipeline environment (Figure1B)and could be easily installed(See Materials and Methods).The pipeline environment provides graphic inter-face,allowing bench scientists to set-up the analysis project, run the analysis and monitor the process with ease.InDel markers development using AGA-pipeUsing AGA-pipe of IMDP,we analyzed InDel polymorphisms in two inbred rice cultivars,japonica cv.Nipponbare and indica cv.93-11.The genomic sequence alignment indicated good collinearity between Nipponbare and93-11genome (Figure S2).In total,we identified90,548md-InDels in Nipponbare against93-11(Nipponbare/93-11)genome by get-indels.pl program.Among them,36,066hq-InDels were selected for PCR primer design using Primer3,followed by primer validation by MFEprimer2.Finally,12,944markers were obtained and could be used for further PCR and polyacryla-mide gel electrophoresis(PAGE)assay.The density of Nipponbare/93-11hq-InDels markers was3.7per BAC,33.4 per1Mb,which covered86.6%BACs of Nipponbare genome (2,998out of3,463;Table1).97.7%hq-InDel markers in Nipponbare/93-11were6–50bp in length(Figure S3;Table S2), which could be easily detected by general PCR and PAGE assay.In agreement with other InDel studies(Shen et al.2004; Arai-Kichise et al.2011),the distribution of InDel markers we found in Nipponbare/93-11was also unevenly distributed across the Nipponbare chromosomes,especially in or near centromere and telomere regions(Figure2),and chromo-some2had the highest InDel frequency and e-PCR validated marker density(Figure2;Table1).To confirm the efficiency of AGA-pipe,we compared the InDels/markers obtained from another two studies and our study.The number of e-PCR validated InDel markers(>6bp in2L€u et al.length)in this study was 12,944,which was 11times more than the reported 1,047in another study (Zeng et al.2013).We further compared the 12,944e -PCR validated markers in this study to the 986InDels developed by Zeng et al.(2013).The data we obtained from the paper of Zeng et al.(2013)only contained information of primers,making it hard to obtain the exact locations of InDels and therefore prohibited us from comparing all of these InDels against our own dataset.We filtered the 1,047pairs of PCR primers developed through MFEprimer2,and after removing mismatching primers and primers whose ampli fication products were not on target chromosomes,only 806InDels were accuratelylocatedFigure 1.Pipeline for InDels/markers development based on genome alignment and next-generation DNA sequencing(A )The strategy of InDel Markers Development Platform (IMDP)for InDel markers development.(B )Screenshot of IMDP interface in LONI pipeline environment.Reference genome:complete genome sequence,such as Nipponbare IRGSP build 5.Query:genome or a single DNA sequence,such as 93-11genome sequence.Nucmer :a Perl script pipeline in MUMmer 3for multiple closely related nucleotide sequences alignment.BWA :software package for NGS reads mapping.Pindel :software for detecting InDels.A and B in gray circles represent perl script used in InDels detection for different data sources.Set A and Set B are InDels from genome alignment and NGS mapping respectively.Primer3,a tool for PCR primers design.e -PCR,MFEprimer,which is used to evaluate the speci ficity of PCR primers.RIMD,Rice InDel Marker Database.Graphic pipeline platform for InDel markers development 3on rice chromosomes,among which 551were identi fied in our validated hq -InDel list.Within the 551hq -InDels,236overlapped with e -PCR markers identi fied by us,while the remaining 315markers in Zeng ’s study were filtered out because of lacking precise locations rather than our criteria.The number of md -InDels discovered using our InDel discovery pipeline was 90,548,which was 1.3times as many as that in a previous report (Shen et al.2004),the number of which was 68,073.Because of lack of detailed information of Shen ’s study,we could not do moredetailedFigure 2.Distribution pattern of hq -InDels and e -PCR validated markers of Nipponbare against 93-11The y-axis represents the physical distance along each chromosome in 100kb windows.The x-axis indicates the number of hq -InDels (in yellow)and corresponding e-PCR validated markers (in green).The gray areas show centromere regions.Table 1.Identi fication of InDels/markers between Nipponbare and 93-11using AGA-pipe Chromosome Nipponbaregenome (millionbase pairs)a 93-11genome (million base pairs)b md -InDels c hq -InDels d e -PCR validated markers Covered BAC (uncovered BAC)Frequency e Chr0145.047.212,3765,0201,745376(16) 4.4Chr0236.838.110,6362,2241,671340(19) 4.6Chr0337.341.89,0321,9961,400308(21) 4.2Chr0436.134.76,3962,136872217(78) 2.9Chr0530.131.26,4694,3101,015241(45) 3.5Chr0632.132.87,4943,9671,038251(30) 3.6Chr0730.427.97,3712,5031,110259(31) 3.8Chr0828.530.46,8102,800951254(24) 3.4Chr0923.921.76,4172,967954206(17) 4.2Chr1023.722.25,7852,777751182(21) 3.6Chr1131.223.05,6372,842668168(90) 2.6Chr1227.722.96,1252,524769196(73) 2.8Total382.8373.990,54836,06612,9442,998(465)3.7Nipponbare reference genome (IRGSP build5).93-11reference genome (GeneBank accession –GCA_000004655.2).md -InDels,the lengths of InDel polymorphisms were from 6to 100bp.d hq -InDels,md -InDels with no other polymorphisms more than 6bp in length existed in the 300bp region flanking the InDel.e Frequency,markers per BAC according to Nipponbare genome.4L €uet al.comparison between our markers with the reported 68,073InDels.We also used the same default settings to identify InDels between Nipponbare and Kasalath.Kasalath is a member of the aus group of O.sativa,which has been widely used in breeding and functional genomic studies for its specific traits such as early maturity and tolerance to drought and phosphate deficiency.The genome sequence of Kasalath was recently assembled(Sakai et al.2014).Overall,94,730md-InDels were detected using get-indels.pl program and38,846 hq-InDels were identified in Nipponbare/Kasalath genome (Table S3).The density of the identified hq-InDels was10.1per 100kb(Figure S4A).The lengths of about95%InDel poly-morphisms were6–50bp,which could be easily distinguished by gel electrophoresis(Figure S4B,C).Development of InDel markers based on NGS-pipeGiven that molecular markers are cultivar-specificity,tools for developing markers of any two specified cultivars will be of much significance in basic research and breeding.We used NGS-pipe for InDel markers development based on NGS mapping and tested it using two landrace cultivars widely planted in China,Wu-Yun-Geng7(WYG7,an Oryza sativa japonica cultivar)and Guang-Lu-Ai4(GLA4,an Oryza sativa indica cultivar).We re-sequenced WYG7and the average re-sequencing depth was approximately35Âusing Illumina Genome Analyzer II platform.The NGS data of GLA4was retrieved from National Center for Biotechnology Information(NCBI) (See methods).Since re-sequencing depth affects the cover-age of a target genome and is one main evaluation criterion for NGS data quality(Sims et al.2014),wefirst checked the correlation between theoretical sequencing depth and genome coverage.It was no surprise tofind that theoretical sequencing depth and genome coverage were positively correlated(Figure3A).Regression analysis further suggested that the relationship between genome coverage and theoretical depth might meet reciprocal function y¼(aþbx)/x(a<0,b>0,x and y represent theoretical depth and coverage breadth,respectively,Figure3B).Thereciprocal Figure3.The correlation analysis between genome coverage,InDels number and theoretical sequencing depth(A)The positive correlation between genome coverage and sequencing depth of WYG7against Nipponbare reference genome. Blue dotted line shows that the coverage of WYG7NGS data in the reference genome is94.39%,close to the largest theoretical coverage(97.22%)when the sequencing depth is10.(B)The regression analysis shows that breadth of coverage(y)and theoretical depth(x)might meet the reciprocal function:y¼(aþbx)/x.(C)The total number of InDels among WYG7,GLA4and Nipponbare,indicating that the amount of InDels is positively correlated to the sequencing depth.The proportion of md-InDels is 12%to20%.(D)The distribution pattern of InDels and e-PCR validated markers in WYG7/GLA4genome when sequencing depth is 10,showing that the lengths of98.6%md-InDel markers in WYG7/GLA4are6–50bp.Graphic pipeline platform for InDel markers development5function we obtained is y¼(–0.5139þ0.9722x)/x,in which0.9722represents the largest theoretical coverage, and R2represents the correlation coefficient,respectively (Figure3B).When the sequencing depth was10,the NGS data coverage of WYG7in the reference genome was94.39% (Figure3A),close to the largest theoretical coverage(97.22%), suggesting that a cost-effective sequencing depth might be10Â.We also checked the correlation between InDels number and re-sequencing depth of WYG7and GLA4.Similar to a previous report(Arai-Kichise et al.2011),the number of InDels was positively correlated with re-sequencing depth (Figure3C).However,the percentage of md-InDels was relatively stable,ranging from12%to20%regardless of the re-sequencing depth(Figure3C;Table S4).It was not surprising tofind that InDels number of japonica/japonica cultivars (such as WYG7/Nipponbare)was much less than that of japonica/indica cultivars(such as WYG7/GLA4and Nipponbare/GLA4)(Figure3C).The number of md-InDels was34,794in WYG7/GLA4genome,33,432in GLA4/Nippon-bare genome,and6,301in WYG7/Nipponbare genome,all adequate for further e-PCR validation(Table S4).Therefore, we developed md-InDel markers in WYG7/GLA4genome with re-sequencing depth of10.Consistent with the distribution pattern of InDels in Nipponbare/93-11genome,the telomere and centromere regions of chromosomes could not be covered by md-InDels and e-PCR validated InDels in WYG7/ GLA4genome either(Figure S5).The length distribution of NGS-pipe results are similar to the pattern of AGA-pipe results,in which98.6%md-InDel markers in WYG7/GLA4were 6-50bp in length(Figure3D;Table S5).These results suggested that the NGS-pipe is efficient for InDel marker development.Development of InDel markers between japonica and indica subgroup rice cultivarsThe application of NGS technologies is a great step towards fast revealing of the genomic diversities of rice germplasms (Kilian and Graner2012),as well as facilitating the develop-ment of japonica/indica subgroup markers in rice.To develop genome-wide subgroup markers between japonica and indica cultivars,we collected NGS data of13rice cultivars in addition to WYG7,GLA4,Nipponbare,and93-11(Table S6).To avoid selecting closely related cultivars during rice domestication and breeding,we only chose the ones from diverse geo-graphic distributions in Asia or America(Table S6).All of these NGS data werefirstly processed by Trimmomatic as described above(Table S7),then compared individually with the Nipponbare reference genome sequence.The number of md-InDels detected in these cultivars varied from1,641(NK-58/ Nipponbare)to56,867(IR36/Nipponbare),demonstrating a different number and density of InDels on each chromosome between japonica and indica subgroups(Figure4A;Table S7). Totally,133,420md-InDels were obtained in all15cultivars against the Nipponbare reference genome sequence.Among them,18,309md-InDels were also found in93-11/Nipponbare (Figure S5,Table S8).It was interesting that the InDel densities of japonica/Nipponbare genomes varied significantly on all chromosomes,while the InDel densities of indica/Nipponbare genomes was very close to each other(Figure4A).The18,309 InDels were further analyzed by hierarchical cluster analysis.As expected,japonica cultivars and indica cultivars were clustered on two different branches(Figure4B).Furthermore,we identified2,681japonica/indica subgroup md-InDels,out of which1,042passed designing primers and e-PCR criteria(blue bar in Figure4C).We randomly chose95 subgroup markers to perform PCR and denatured PAGE followed by silver staining in12cultivars(see Table S6).The verification results showed that two markers($2.1%)gen-erated no polymorphism and seven markers($7.4%)gen-erated polymorphisms within japonica or indica subgroup cultivars,which suggested the accuracy of our developed japonica/indica subgroup InDel markers reached90.5% (Table S9;Figure S7).These subgroup specific markers can be used to develop key markers for rice gene mapping and molecular breeding.As mentioned above,one of the preconditions for molecular breeding is functional genomics analysis.The correlation between japonica/indica subgroup InDel markers in this study and151reported agricultural trait genes was also investigated(Table S10),such as yield,grain quality,resistance to bacteria and pest,plant height and tillering.Grain size, grain number and grain weight are three important traits contributing to grain productivity.In our analysis,25grain production quantitative trait loci(QTLs)were linked with our japonica/indica subgroup markers(Table S10),such as Ghd8/ DTH8(a QTL conferring pleiotropic effects on grain productivity,plant height,and heading date)(Yan et al. 2011),Ghd7(a QTL regulating growth,development and stress response)(Weng et al.2014),FLO2(a QTL involved in regulation of rice grain size and starch quality)(She et al. 2010),GS5(a QTL regulating grain size and yield)(Li et al.2011) (Table S11).All of those agricultural trait QTLs are very important in rice productivity,whose verification during MAS could be performed by our japonica/indica subgroup markers. In total,the japonica/indica subgroup markers appeared within one mega base of89trait genes downstream or upstream (about4cM,Table S11),which might suggest that these markers could be used for testing the89agricultural trait genes in molecular breeding.A web-based database for rice InDel markersAll of the205,659md-InDels(133,420md-InDels from NGS data against Nipponbare,90,548md-InDels from93-11against Nipponbare,Figure S6)and12,944e-PCR validated InDel markers developed in this study were deposited into a user-friendly database,RIMD(http://202.120.45.71/),which is a website developed using LAMP(Linux,Apache,MySQL and PHP)and openly accessed for researchers to browse and query rice InDels/markers(Figure5).To further facilitate the application of these InDels/markers in molecular breeding,we imported5,589agronomically important genes from China Rice Data Center(/gene/),Oryzabase (http://www.shigen.nig.ac.jp/rice/oryzabase/gene/list) (Yamazaki et al.2010)and PMRD(/)(Cui et al.2012).These genes were classified by trait ontology (Jaiswal et al.2002),including anatomy and morphology trait, biochemical trait,growth and development trait,quality trait, stature or fertility trait,stress trait,and yield trait.The RIMD database hasfive functional modules(Figure5):‘Jap/In markers’for key markers of japonica/indica subgroup cultivars;‘Search’for InDels/Markers by ID,gene locus,6L€u et al.BAC/PAC or chromosome region and the details of classical markers were shown in ‘Search result ’(Figure S8);‘Trait Genes ’for agronomically important genes (Figure S9);‘Gbrowse ’for Generic Genome browser (Stein et al.2002);‘Primer3plus ’for primer design online,‘Tools and Download ’for useful scripts,datasets developed in this research,and useful links of rice research websites in ‘More ’;‘Help ’for tutorial of RIMD and frequently asked questions.Therefore,RIMD provides a friendly web interface that allows users to easily search e-PCR validated InDels for rice MAS or other basic research,such as map-based gene cloning.RIMD also provides the pipeline and scripts developed in this study for other researchers to develop InDels in other rice cultivars or develop markers in other plants with reference genome sequences.For users who intend to obtain InDels coordinates on the latest rice reference genome (IRGSP7)(Sakai et al.2013),we provided a converted General Feature File (GFF)which can be downloaded on RIMD (http://202.120.45.71/attachments/article/10/indels_gff_to_msu7.7z ).DISCUSSIONTo increase the production of high-quality food to cope with global environmental change and population increase,marker-assisted selection (MAS)could be one of most promising and effective strategies (Collard and Mackill 2008).However,the biggest limitation of MAS in plant breeding is the cost of marker development and utilization (Collard et al.2005;Collard and Mackill 2008).High-throughput genotyping platforms based on DNA sequencing or DNA microarray have been rapidly developed in the past few years (Chen et al.2014;Yu et al.2014).These techniques generally require special equipment and complicated bio-informatics analysis.With the advantage of high density of InDels in genome and its direct application based on PCR rather than sequencing,we focused on moderate length InDel markers development for ordinary laboratory and breeding organization.Instead of traditionally time-consuming and labor-intensive approaches of developing InDel markers,which needs to employ Basic Local Alignment SearchToolFigure 4.Characterization of md -InDels and distribution of japonica /indica subgroup InDel markersThe overall number of md -InDels in indica cultivars against Nipponbare genome was more than that in japonica cultivars against Nipponbare genome.(A )The InDels density (per mega bases)of each cultivar against Nipponbare genome on each chromosome.(B )The hierarchical cluster analysis of the 18309InDels in 17cultivars genome.japonica cultivars:Canella De Ferro (CDF),Chodongji (CDJ),Haginomae Mochi (HM),Lemont (LEM),Nongken-58(NK),Omachi (Om),Wuyungeng-7(WYG)and Nipponbare (NIP).indica cultivars:Popot 165(P165),Ai-Chiao-Hong (ACH),Gie 57(G57),Guang-lu-ai 4(GL4),Guan-Yin-Tsan (GYT),IR36,JC91,93-11and Leung Pratew (LP).(C )2681md -InDels were identi fied as japonica /indica subgroup InDel markers,which distribute unevenly on 12chromosomes and are shown in blue segments (e-PCR veri fied markers)and red dots (PCR validated markers).Graphic pipeline platform for InDel markers development 7。