DNA Pattern multigrammars by Brian Mayoh
- 格式:pdf
- 大小:50.22 KB
- 文档页数:15
美国科学家成功绘制大脑语义信息功能图谱
佚名
【期刊名称】《健康大视野》
【年(卷),期】2016(000)009
【摘要】美国加州大学伯克利分校杰克·盖伦特和他的同事们,近日在英国《自然》杂志发表的一篇语言学论文中,描绘了叙事性语言含义在人类大脑中分布的详细图谱。
这项研究可能有助于深入了解语言的神经生物学基础。
该项研究分析了包含不同语义范畴(几组有相似之处的概念,比如“食物”“工具”或“有生命的物体”)的口语叙事故事所激发的大脑反应。
【总页数】1页(P13-13)
【正文语种】中文
【中图分类】R338.64
【相关文献】
1.大脑完整基因表达图谱和神经元联系图谱绘制完成 [J],
2.大脑语义信息功能图谱绘制成功 [J], ;
3.机器人成功绘制出大脑图谱 [J],
4.美国科学家绘制出迄今最完整的人类三维智力大脑结构图像 [J],
5.我校类脑智能科学与技术研究院冯建峰教授团队首次绘制大脑功能网络动态图谱[J],
因版权原因,仅展示原文概要,查看原文内容请购买。
大脑记忆奥秘在上海实验室里被破译
佚名
【期刊名称】《实验室研究与探索》
【年(卷),期】2005(24)6
【总页数】1页(P5-5)
【关键词】大脑记忆;基因组;编码单元;神经元;记忆编码
【正文语种】中文
【中图分类】R338.64
【相关文献】
1.人类首次破译“大脑记忆密码” [J], 宋铮
2.破译人类大脑的奥秘r——记西南大学心理学部邱江教授 [J], 马开
3.记忆密码被破译大脑思维可推断 [J], 吴华
4.“法兰·茜丝”神秘兰花破译美颜护肤奥秘所在——专访上海兰芙化妆品有限公司总经理、中国工商联美容化妆品商会常务理事汤春芳女士 [J], 李俪瑛
5.大脑记忆奥秘在上海实验室里被破译 [J],
因版权原因,仅展示原文概要,查看原文内容请购买。
美国科学家找到抹去大脑记忆的方法
孝文
【期刊名称】《《职业教育(上、中旬)》》
【年(卷),期】2009(000)019
【摘要】据《纽约时报》报道,假设科学家可以通过调整大脑中的某种物质,抹去某些记忆,以此帮助我们忘记长期的恐惧或精神创伤,甚至是戒掉不良习惯,这肯定是了不起的科学壮举。
【总页数】1页(P41)
【作者】孝文
【作者单位】(Missing)
【正文语种】中文
【相关文献】
1.美国科学家找到解决癌症“脑转移”的有效方法 [J], ;
2.美国科学家找到抹去大脑记忆的方法 [J], 孝文
3.美国科学家称记忆存储在DNA而不是大脑中 [J],
4.美科学家发现形成记忆的关键大脑物质,可用于研发抹去大脑记忆的药物 [J],
5.美国科学家找到预防药物伤肝新方法 [J],
因版权原因,仅展示原文概要,查看原文内容请购买。
2009年5月51日晚7点,英国大使馆文化教育处会议室,在一片瞩目和掌声中,满头银发、西装配搭休闲裤的约翰萨尔斯顿大步走来。
在用中文简单问候了“大家好”后,这位67岁的“基因图谱之父”随即展开了自己的演讲,演讲的题目名为“科学与伦理的向左走与向右走”。
基因的秘密“为什么有些人比另一些人更容易感冒?为什么有些人比另一些人更爱发脾气?除了后天的个性外,主要是先天的基因在不知不觉地影响着我们。
”作为排列出首份动物基因图谱的科学家,2002年诺贝尔生理学或医学奖得主——萨尔斯顿的开场白并没有拉出一副“高深莫测”的架势。
“每个人的身体细胞包含大约30亿个碱基对配对。
它们以生物特有的方式排列组合,构成了人体基因组,从而确定和影响每个人不同的身体特征,决定你是你而不是我。
”萨尔斯顿说,如果能确认每一个碱基对的序列,将这些密码全盘破解,则能最大程度地预防和治疗人类现有的5000多种遗传类及心血管疾病。
就目前而言,科学界对于单基因病,比如视网膜母细胞瘤,已经能实现有效的治疗。
但对于复杂的多基因病,比如癌症,仍无有效对策。
1998年,萨尔斯顿成功绘出了体长仅一毫米的透明线虫的基因组图,这是人类第一次绘出动物的基因组图。
之后,国际人类基因组计划启动,萨尔斯顿作为学科带头人,在1992年至2000年间,带领剑桥桑格研究中心的科学家终于绘制出人类基因草图。
近年来,随着基因探索的不断深入,各种基因药物、基因治疗开始走进人们的生活,随即引发了一场“利用基因改变遗传信息是否遵从伦理”的争论。
萨尔斯顿怀揣自己的主张,亦由此调整了工作重心,目前他正在担任英国曼切斯特大学科学道德及创新学院主任。
2009年1月9日,世界第一个“无癌宝宝”在英国伦敦降生。
这则新年“喜讯”,被媒体评价为在基因医学的发展中具有里程碑式的意义。
该“无癌宝宝”是一名试管婴儿,女婴的父系和母系家族几代以来,都有人患乳腺癌或卵巢癌。
为避免后代继续遗传这种基因,夫妇俩决定进行人工受孕,并在胚胎植入母体前进行基因检测。
小学上册英语第六单元暑期作业英语试题一、综合题(本题有100小题,每小题1分,共100分.每小题不选、错误,均不给分)1.My favorite activity to do with my family is ______.2. A _______ is a device used to measure the concentration of a solution.3. A wild boar has sharp ________________ (牙齿).4.What do we use to clean our teeth?A. SoapB. ToothbrushC. TowelD. Comb5.The chemical symbol for silicon is ______.6.The ________ (乡镇) has a friendly community.7.She is _____ (playing) a game.8.The chemical symbol for palladium is _______.9.I have a small _____ (花坛) in my garden.10.My aunt is going to have a __________ (宝宝) soon.11.What do you call a baby dog?A. KittenB. PuppyC. CubD. Foal12.What do we call the smallest unit of life?A. CellB. TissueC. OrganD. OrganismA13.What is the color of snow?A. BlueB. GreenC. WhiteD. YellowC14.What do we call the person who plays music?A. ArtistB. MusicianC. PainterD. WriterB15.The _______ (松鼠) gathers nuts.16.What is the capital of Brazil?A. Rio de JaneiroB. São PauloC. BrasíliaD. SalvadorC17.The ____ is often seen in parks searching for food.18.What do we call a place where people go to see animals?A. ZooB. AquariumC. CircusD. Sanctuary19.What is the main ingredient in sushi?A. RiceB. BreadC. NoodlesD. PotatoesA20.What is the name of the famous river in India?A. GangesB. YamunaC. BrahmaputraD. Godavari21.The ant has a strong ______ compared to its size.22.The chemical formula for sodium citrate is _____.23.Electrons are found in the ________ of an atom.24. A __________ (碳足迹) measures the environmental impact of chemical processes.25.I enjoy playing with my ______ (玩具车) in the living room. It goes ______ (快).26.They are going to ________ the zoo.27. A solution can be ______ or colored.28.The primary elements found in organic molecules are carbon, hydrogen, and_______.29.The chemical formula for magnesium oxide is __________.30.The _____ (train) is leaving the station.31.I enjoy learning about different ______ (职业). It helps me think about my future career.32.The teacher _____ us a new song. (teaches)33.What is the opposite of 'clean'?A. DirtyB. NeatC. ClearD. PureA34.I want to _______ (学习) about physics.35. A __________ is a fundamental building block of matter.36.I found a ________ in my pocket.37.I saw a _______ (小老虎) playing with its mother.38.The ______ grows in forests.39.The squirrel is _____ nuts for winter. (collecting)40.What is the name of the superhero who wears a cape and flies?A. BatmanB. Spider-ManC. SupermanD. Iron ManC41.What do we call a large group of fish swimming together?A. SchoolB. FlockC. PodD. SwarmA42.I love my new ________ that's shaped like a dinosaur.43.The first known civilization was in __________ (美索不达米亚).44.The puppy is _____ in the yard. (playing)45.My sister's favorite animal is a _____.46.The ________ (teacher) is very nice.47.I need to ___ (study/rest) for my exam.48.My mom loves to __________ (和家人聚会).49.What do we call a person who studies the environment?A. Environmental ScientistB. EcologistC. ConservationistD. All of the above50.My brother enjoys building ____ (models) of cars.51.The ancient Romans built roads to connect their ______ (城市).52.The ________ (project) has clear objectives.53. A _______ is a mixture that appears uniform throughout.54.I enjoy learning new ______ (技能) and hobbies to expand my interests.55.My sister is very __________. (聪明)56.What is the name of the planet we live on?A. MarsB. JupiterC. EarthD. VenusC57.What do you call the person who fixes cars?A. TeacherB. DoctorC. MechanicD. ChefC58.The _______ (小长颈鹿) eats leaves from tall trees.59.I think being kind is important. I show kindness by __________.60.How many days are in a leap year?A. 365B. 366C. 367D. 368B61.What do we call the study of the earth's physical features?A. GeographyB. GeologyC. AstronomyD. MeteorologyA62.What is the freezing point of water?A. 0 degrees CelsiusB. 100 degrees CelsiusC. 32 degrees FahrenheitD. Both A and CD63.The rain is ______ on the roof. (falling)64.The stars are _______ (在夜空中).65.Chemicals can be found in our _____.66.The ______ is known for her fashion sense.67.What is the main ingredient in bread?A. RiceB. FlourC. SugarD. YeastB68.The fox is known for its _________. (狡猾)69.Certain plants can thrive in ______ (低光) conditions.70.The ancient Romans used ________ to build their cities.71.The chemical formula for sodium acetate is _______.72. A ______ is often used to represent the composition of a compound.73.Plant diversity is important for a healthy ______. (植物多样性对健康的生态系统至关重要。
Why Your DNA Isn't Your Destiny DNA不是一切(中英对照)By John Cloud, TIME magazine Wednesday, Jan. 06, 2010中文翻译:Quintessence [/blog/QuintEssence/]瑞典北方,遥远而辽阔的冰天雪地中,似乎不会是顶尖遗传科学的发轫地。
瑞典国境最北的北博滕省(Norrbotten)寥无人烟,平均每平方公里只有两个人。
然而,透过这一小撮人,我们却能更加认识基因在日常生活中的作用。
The remote, snow-swept expanses of northern Sweden are an unlikely place to begin a story about cutting-edge genetic science. The kingdom's northernmost county, Norrbotten, is nearly free of human life; an average of just six people live in each square mile. And yet this tiny population can reveal a lot about how genes work in our everyday lives.十九世纪的北博滕省对外交通不便,因此若当地收成欠佳,居民只好挨饿。
由于难以事先预测荒年,使得情况更加严峻。
比如说,1800、1812、1821、1836、1856年灾情惨重;但是1801、1822、1828、1844、1863年却是五个大丰年,使得之前挨饿的居民忽然能够暴饮暴食好几个月。
Norrbotten is so isolated that in the 19th century, if the harvest was bad, people starved. The starving years were all the crueler for their unpredictability. For instance, 1800, 1812, 1821, 1836 and 1856 were years of total crop failure and extreme suffering. But in 1801, 1822, 1828, 1844 and 1863, the land spilled forth such abundance that the same people who had gone hungry in previous winters were able to gorge themselves for months.白葛恩(Lars Olov Bygren)博士是一名预防医学家,目前任职于斯德哥尔摩声望卓著的卡洛林斯卡医学研究院(Karolinska Institute)。
植物干细胞调控研究新进展中国细胞生物学学报Chinese Journal of Cell Biology 2015, 37(7): 1021–1028DOI: 10.11844/cjcb.2015.07.0024收稿日期: 2015-01-15 接受日期: 2015-04-07973计划前期研究专项(批准号: 2014CB160306)、重庆市教委创新团队建设基金(批准号: KJTD201307)和重庆师范大学引进人才启动基金项目(批准号: 12XLR36)资助的课题*通讯作者。
Tel: 023-********, E-mail: hanmazhang@/doc/f017810486.html, Received: January 15, 2015 Accepted: April 7, 2015This work was supported by the National Grand Fundamental Research Pre-973 Program of China (Grant No.2014CB160306), the Innovation T eam Fund of the Education Department of Chongqing Municipality (Grant No.KJTD201307) and a Start-Up Fund from Chongqing Normal University (Grant No.12XLR36)*Corresponding author. Tel: +86-23-65912976, E-mail: hanmazhang@/doc/f017810486.html, 网络出版时间: 2015-07-01 16:53 URL: /doc/f017810486.html,/kcms/detail/31.2035.Q. 20150701.1653.001.html植物干细胞调控研究新进展赵中华南文斌梁永书张汉马*(植物环境适应分子生物学重庆市重点实验室, 重庆师范大学生命科学学院, 重庆 401331)摘要植物干细胞是植物胚后发育形成各种组织和器官的细胞来源和信号调控中心, 其调控机理是植物学研究的重要内容。
dna甲基化的书籍摘要:1.DNA 甲基化的概念和重要性2.DNA 甲基化的书籍推荐3.书籍的内容概述和特点4.读者评价和建议正文:DNA 甲基化是指DNA 上的一种化学修饰,这种修饰可以在不改变DNA 序列的情况下影响基因的表达。
近年来,随着研究的深入,DNA 甲基化被认为是一种重要的表观遗传调控机制,与许多生物学过程和疾病都有着密切的关系。
因此,了解DNA 甲基化的相关知识对于科研人员和学生来说十分重要。
这里,我为大家推荐几本关于DNA 甲基化的书籍,它们内容详实,既有理论知识,也有实验技术,非常适合初学者入门学习。
首先是《DNA 甲基化与表观遗传学》,这本书由我国著名表观遗传学家李蓬教授编写,系统地介绍了DNA 甲基化的发现、作用机制、调控因子和生物学功能等内容。
书中还附有许多实验方法和技术,可以帮助读者深入理解DNA 甲基化的研究方法。
另一本值得推荐的书是《表观遗传学:原理、技术和应用》,这本书由多位专家共同撰写,涵盖了表观遗传学的各个领域,包括DNA 甲基化、组蛋白修饰、非编码RNA 等。
书中既有理论知识的讲解,也有丰富的实例分析,对于想要深入学习表观遗传学的读者来说非常有价值。
这些书籍的内容概述和特点如下:《DNA 甲基化与表观遗传学》:本书分为四部分,第一部分介绍了DNA 甲基化的基本概念和发现历程;第二部分讲述了DNA 甲基化的作用机制和调控因子;第三部分则详述了DNA 甲基化在基因表达调控中的作用;第四部分为实验技术和方法部分。
《表观遗传学:原理、技术和应用》:本书分为五部分,第一部分介绍了表观遗传学的基本概念和原理;第二部分和第三部分分别讲述了DNA 甲基化和组蛋白修饰;第四部分为非编码RNA;第五部分为表观遗传学在疾病中的应用。
读者评价和建议:《DNA 甲基化与表观遗传学》适合对DNA 甲基化感兴趣的初学者,书中的内容详实,易于理解。
而《表观遗传学:原理、技术和应用》更适合想要深入学习表观遗传学的读者,涉及的内容更为广泛和深入。
"DNA Pattern multigrammars" by Brian MayohHow can one go from the sequence of nucleotides in a DNA molecule to a description of its secondary structure? A popular answer [Se93] is:use one or more grammars to parse the DNA string and the resulting derivation trees give descriptions of the secondary structure of the DNA string. As grammars usually give languages with more than one word, they can be used to parse more than one DNA string, so they can handle genetic variation. The language L generated by a grammar can cover not only the possible alleles of a gene but also the possible mutations that are viable. Biologists have devised several elegant grammars for DNAstrings, but they also hope that machine learning techniques can extract suitable grammars from a sample of "related" DNA strings [Se93].In this paper we introduce DNA pattern multigrammars and argue that they are particularly appropriate for both machine learning and the description of DNA secondary structure.#1 Grammars that describe DNA stringsProteins are long sequences of aminoacids, so their primary structures can be represented as words on 20 symbol alphabet. In a later paper we will describe how grammars can give the secondary structure of proteins. An RNA molecule is a long sequence of ribonucleotides - Adenine, Cytocine, Guanine or Uracil - so its primary structure can be represented as a word on the alphabet { a, c, g, u}. A DNA molecule is one or two long sequences of deoxyribonucleotides - Adenine, Cytocine, Guanine or Thymine - so its primary structure can be represented as one or two words on the alphabet { a, c, g, t}. As shown in figure 1, one of the two words for double stranded DNA can be ignored, because it is uniquely determined by the other word and the pairing rules: a = t, c = g, g = c, t = a .g g t a g c c c c c c c c c c c t a g g g c c a t c g g g g g g g g g g g a t c c cFig.1 Twin Strand DNA molecule and its twin wordsThe secondary structure of RNA and DNA molecules arises from the fact that these pairing rules also apply to letters in their individual "primary structure words". As shown in figure 2, the rules give crossdependencies that indicate how the molecules fold in space.g g t a g c c c c c c c c c c g a t g g gg g t a g c c c c c c c c c c c t a g g gg g tagc t a gg g c cc c cc c c cc c t a = g a t Fig.2 Crossdependencies and word dualityFor spatial reasons "reversal" is involved in the definition of matching dual words. For both DNA and RNA the dual λ of the empty word λ is just the empty word λ. For DNA the dual word W is defined by:a W = W t, cW= W g, g W= W c, t W= W aFor RNA the dual word W is defined by:a W = W u, cW= W g, g W= W c, u W= W aThese dualities should be incorporated in any grammar for describing and analysing families of DNA and RNA molecules. Among such grammars we should mention stochastic context-free grammars [SBHMSUH94] and string grammars [Se93]. In the next paragraph we introduce our rival "pattern grammars".One can define a pattern as a word W on terminal and nonterminal letters together with a partition of the occurrences of the nonterminals in W. When displaying patterns we display the partitition using the convention:• λ is the empty pattern•nonterminals with the same adornment are equivalent,•unadorned nonterminals are only equivalent to themselves.Thus a b S c b T a c S' b b T' is a pattern where the last two nonterminals are equivalent to each other and the first two are only equivalent tothemselves. If W1 (W2 , W3) are terminal words derivable from S (T, both S and T), then one can substitute in the pattern to get the terminal word: a b W1 c b W2 a c W3 b b W3To get an RNA or DNA pattern we need only add: nonterminals may ormay not be underlined; dual words are substituted for underlined patterns. Def.1. A DNA (RNA) pattern multigrammarΓ consists of ∆, anonterminal alphabet, and for each S in ∆ a set ΠS of DNA(RNA) patterns over ∆ and {a, c, g, t} ({a, c, g, u }).Def.2. A DNA (RNA) pattern language L is generated by a DNA (RNA) pattern multigrammar Γ if L= L S for some S in ∆ where L S is the S-component of the least fixed point of the language function Φ defined by W in ΦS[....Lδ...] iffW = u0 v1 u1 v2 u2.....v n u nfor some pattern u 0 δ1 u 1 δ2 u 2.....δn u n in ΠS and terminal words u 0,u 1,u 2.....u n,v 1,v 2....v n such that v i in L δ i and•δi eq δj implies v i = v j•δi eq δj implies v i = v j•δi eq δj implies v i = v j .Here ....L δ.. ...represents a tuple of pattern sets L δ one for each δ in ∆.Conventions: Any word can be substituted for an underlined space in a pattern. The ornament & indicates that any word may be substituted but the same word must be substituted at other occurrences of & and the dual of the word must be substituted at other occurrences of & . Both these conventions are used in the one pattern DNAparsing multigrammar in figure 3. Repeat -> _ & _ & _g g g cc c cc cc_g t a g c t ag c c c & & _ _ Repeat g g t a g c c c c c c c c c c c t a g g g_ & _ & _Fig.3 DNA Parsing with a one pattern grammar#2 Grammars describing DNA secondary structure.Every cell in a multicellular organism like us has identical DNA molecules, but most cells are very different; hair is very unlike skin,muscle and brain cells. The reason for these differences is that in each cell some of the genes encoded in the DNA are active and some are passive.Sometimes this genetic switching between active and passive genes corresponds to different secondary structures of the DNA. As grammars can be ambiguous, allow for alternative derivations of the same word, they can neatly capture alternative secondary structures of DNA and RNA molecules. A simple example of this is given by the two pattern grammar:InvertedRepeat -> _ & _ & _Attenuator -> _ & _ & _ & _Clearly any word that can be derived from the attenuator pattern can also be derived from the repeat pattern in two different ways. Figure 4 gives an example of this simple example of possible genetic switching.g g g c c c c cc c _ g t a g cta g c c c& & _ _ g g g g t a g c c c c c &Attenuator_ g g t a g c c c c c c c c c c c t a g g g g g g g t a g c c c c c_ & _ &g g t a g c c c c c c c c c c c t a g g g g g g g t a g c c c c c_ & _ & c c c t a tag c c c g gg g g g g t a g c c c c c c c c c c cAttenuator_ _ _ _ & & & g g t a g c c c c c c c c c c c t a g g g g g g g t a g c c c c c_ &Fig.4 Genetic SwitchingAnother common secondary structure is the PseudoKnot; it is prominent in the RNA pattern multigrammar for tobacco mosaic virus RNA:PseudoKnot -> & _ ¨& _ & _ ¨&tmv_3prime -> PseudoKnot PseudoKnot PseudoKnot _ & _ ¨& &'_ InvertedRepeat &' ¨& _ & InvertedRepeat PseudoKnot _InvertedRepeat -> & _ &Notice that one has the alternative derivation of" _ & _ ¨& &' _ InvertedRepeat &' ¨& _ & " from the Pseudoknot pattern. Figure 5 shows the secondary structure of a simple PseudoKnot and the more intricate tmvRNA.Fig.5 Pseudoknots and Tobacco Mosaic Virus RNA (177 nucleotides) The string grammar for transfer RNA [Se 93p92] begins with:tRNA(Codon) ---> AcceptorArm, _, DArm, _, ~DArm, _, AnticodonArm, _ ,~AnticodonArm, _, TpsiCArm, _, ~TpsiCarm, _, ~AcceptorArm, _.The RNA pattern multigrammar for transfer RNA begins with: tRNA -> & CloverLeaf &CloverLeaf -> _ Darm _ AntiCodonArm _ TψCarm _Figure 6 gives the full multigrammar and the derivation for tRNA; figure 7 shows that biological reality is more complex than the simple examples in this paper.tRNA -> & CloverLeaf &CloverLeaf -> _ Darm _ AntiCodonArm _ TψCarm _Darm -> & _ &AntiCodonArm -> & _ Codon _ &TψCarm -> & _ &Codon -> Nucleotide Nucleotide NucleotideFig. 6 grammatical derivation of transfer RNAFig. 7 Real transfer RNAa) spatially realisticsecondary structureb)corrected cloverleafsecondary structurec) tertiary structure#3 Genetic variation: alleles and viri.One can consider DNA or RNA strings of length N as points in a sequence space of dimension N. As there are only 4N such points in this space, its topology is very different from our familiar Euclidean space; in particular the largest possible distance (= number of mutations) between two points is N, and there are many "shortest" paths between points, not just one "straight line". Any DNA or RNA pattern language corresponds to a set of points in this strange sequence space; the DNA or RNA strings in any population correspond to a multiset of points in this strange sequence space, as the same string may occur many times in the population. Consider the population of people now living in Denmark. The corresponding (multi)set of sequence space points is included in the (multi)set for the people who have ever lived in Denmark, which is included in the (multi)set for the people who will ever live in Denmark. This is because there is genetic variation in people, genes have alleles. Traditionally one assumes that the genetic variation of a species is given by a sequence space point for the socalled "wild type" and a cluster of nearby points, but this will only be true if "nearby" does not mean "few mutations away from". There is much evidence that the traditional view is totally inadequate for influenza, HIV and other viri, because they have vast populations and high mutation rates [Ei92]. Eigen has introduced the term quasispecies for populations of viri and other organisms with very dynamic genetic variation. In sequence space quasispecies will give cloudy, fractal multisets; species will give clean, compact multisets; nevertheless both should be amenable to description by DNA and RNA pattern multigrammars.Fig.8 Population dynamicsa) perfect replication of wild typeb) highly imperfect replicationc) perfect replication with allelesd) quasispeciesViri seem to be a promising area for grammatical approaches to DNA and RNA analysis. The chemical and biological details of genetic switching, we described earlier, were first worked out for the bacteriophage λ (a virus that preys on bacteria) [Pt86]. The phage λswitches between two forms: a virulent form, in which bursts the bacteria, and a dormant form where it reproduces when the bacteria cell divides. The two forms differ in which genes are expressed and which are repressed. It is not difficult to devise a pattern multigrammar for phage λin which the two forms correspond to alternative derivations of its RNA sequence:phage -> early1 cloff early2 headon tailon lysisonphage -> early1 clon early2 headoff tailoff lysisoffearly1 -> recombination C111Nearly2 -> cro C11replication Qand patterns for cro and the other genes - cloff (headon, tailon, lysison) has the same pattern as clon (headoff, tailoff, lysisoff)phage -> early1 cloff early2headon tailon lysisonphage -> early1 clon early2headoff tailoff lysisoffFig.9 Genetic Switchingin the phage λ#4 Machine learning of DNA grammars.Where do grammars come from? There are many machine methods for inventing grammars that fit positive and negative examples of the "language" to be described by the grammar [OP91,CO94]. Some of these methods allow expert knowledge to preadapt the grammar search space. Among the methods with this advantage, the one that seems most appropriate to the hunt for RNA and DNA grammars seems to be genetic algorithms , even although it is not mentioned in the latest survey of machine learning applied to gene recognition [CS94]. Genetic algorithms can be used if one can measure the fitness of a candidate RNA/DNA grammar. The discussion of sequence spaces and quasispecies in the last section suggests a suitable fitness measure: how well does the language generated by the grammar include the multiset of positive examples and exclude the multiset of negative examples.The basic idea of genetic algorithms is shown in figure 10: a population of phenotypes is constructed from a population of genotypes, the fitness of the phenotypes is evaluated, their fitness influences the evolution of a new genotype population and a new "life-cycle" starts. Figure 10 also shows a version of the usual evolutionary operators, crossover and mutation, that is appropriate if we make the reasonable assumption:Genotype = Phenotype = DNA/RNA pattern multigrammarso the construction phase is trivial.Fig 10 Genetic algorithms & grammar evolutionary operatorsAny attempt to learn DNA/RNA pattern multigrammars by genetic programming will only be successful if wise decisions have been made on various crucial questions.•genotype representation Do we want PRE,SUF,LIG operations on grammars [Se93], or do sorted lists of patterns suffice?•crossovers How do we ensure that useful structural building blocks are not destroyed? Should we insist that all grammars contain such common building blocks as: internal repeats, Pseudoknots etc.?•mutations Do we need more than changing the underlining and ornamentation (equivalence relation) of patterns?•fitness Clearly recognition of positive (negative) examples should be rewarded (punished), but should precise and detailed grammars be rewarded (remember the alternative derivation in the tobacco virus grammar) ?•examples Wise choice of negative examples is essential, but shouldall positive examples be used? Coevolution as in [Hi90] but using sampling and spatial rotations to get new positive and negative examples. #5 Conclusion.DNA and RNA pattern multigrammars are expressive and learnable; they may be useful in analysing DNA and RNA molecules.References[CO94] R.C.Carrasco, J.Oncina (eds.) "Grammatical inference and applications" Springer LNAI 862(1994)[CS94] M.W.Craven, J.W.Shavlik "Machine learning approaches to gene recognition" IEEE Expert???(1994)2-10[Ei92] M.Eigen "Steps toward life", Oxford U.P.1992[Ei93] M.Eigen "Viral quasispecies", Sci.Amer.1993 july 32-40[Hi90] W.D.Hillis "Co-evolving parasites improve simulated evolution as an optimisation procedure" in S.Forrest(Ed.) "Emergent computation" pp228-234, North Holland 1990.[KSQMSWSR74] S.H.Kim, F.L.Suddath, F.L.Quigley, A.McPherson, J.L.Sussman, A.H.J.Wang, N.C.Seegman,A.Rich "Three-dimensional tertiary structure of Yeast phenylalanine transfer RNA", Science 185(1974) 435-439[OP91] G.C.Overton, J.A.Pastor "A platform for applying multiple machine learning strategies to the task of understanding gene structure" IEEE???(1991) 450 -457[Pt86] M.Ptashne "A Genetic Switch: gene control and phage λ"Cell Press and Blackwell Scientific Publications, 1986. [SBHMSUH94] Y.Sakakibara, M.Brown, R.Hughey, I.S.Mian, K.Sjolander, R.C.Underwood, D.Haussler "Recent methods for RNA modelling using stochastic context-free grammars" Springer LNCS 807(1994) 289-306[Se93] D.B.Searls "The computational linguistics of biological sequences" in Artificial Intelligence and Molecular Biology chapter 2, pages 47-120. AAAI Press 1993.[Wa77] J.D.Watson "Molecular biology of the gene", W.A.Benjamin, Inc.1977。