基于语料库的语篇分析范式研究
- 格式:pdf
- 大小:222.43 KB
- 文档页数:5
《语料库与批判话语分析》篇一一、引言在当今社会,语言不仅是人们交流的工具,更是社会文化、意识形态和权力关系的反映。
因此,对语言的研究显得尤为重要。
语料库和批判话语分析作为两种重要的语言研究方法,为人们提供了深入探讨语言背后隐藏的社会、文化和心理层面的手段。
本文将分别介绍语料库和批判话语分析的概念、特点及两者在语言研究中的应用,并探讨它们之间的互动关系。
二、语料库的概念与特点1. 语料库的概念语料库是一种大规模的、结构化的语言数据集合,用于语言学、语言教育、翻译等领域的研究。
它通过收集、整理和分析大量的语言实例,为研究者提供了丰富的语言数据资源。
2. 语料库的特点(1)大规模性:语料库包含大量的语言实例,可以反映语言的真实使用情况。
(2)结构化:语料库中的数据经过整理和标注,便于研究者进行数据分析和提取。
(3)客观性:语料库提供的数据具有客观性,可以避免主观臆断和偏见。
三、批判话语分析的概念与特点1. 批判话语分析的概念批判话语分析是一种以社会、文化和意识形态为背景的语言分析方法,旨在揭示语言背后的权力关系、意识形态和社会不平等。
它通过对文本、话语和交流过程的分析,揭示出语言使用中的社会、文化和心理层面的意义。
2. 批判话语分析的特点(1)社会性:批判话语分析关注语言与社会、文化和意识形态的关系。
(2)批判性:批判话语分析注重揭示语言背后的权力关系和意识形态。
(3)综合性:批判话语分析需要综合考虑文本、语境、交际者等多方面的因素。
四、语料库与批判话语分析在语言研究中的应用1. 语料库在语言研究中的应用(1)语言描写与对比:通过语料库,研究者可以收集大量语言实例,对不同语言进行描写和对比,揭示语言的特征和规律。
(2)语言教学与翻译:语料库为语言教学和翻译提供了丰富的数据资源,有助于提高教学效果和翻译质量。
(3)社会语言学研究:语料库可以反映社会的语言使用情况,为社会语言学研究提供数据支持。
2. 批判话语分析在语言研究中的应用(1)揭露社会不平等:批判话语分析通过分析文本和交际过程,揭露语言背后的权力关系和社会不平等。
《语料库与批判话语分析》篇一一、引言在当今社会,语言不仅是人们交流的工具,更是社会现象的反映和文化的载体。
随着语言学研究的深入发展,语料库和批判话语分析作为两种重要的研究方法,在语言研究中发挥着越来越重要的作用。
语料库为研究者提供了大量的语言实例,使得语言研究更加客观、科学;而批判话语分析则注重从社会、文化、政治等多角度对语言进行解读,揭示语言背后的深层含义。
本文将分别介绍语料库和批判话语分析的原理、方法及实践应用,并探讨二者在语言研究中的互补性。
二、语料库的原理与方法1. 语料库的原理语料库是一种大规模的语言实例集合,它通过收集、整理、标注等方式,将语言使用情况以数据的形式呈现出来。
语料库的建立为语言研究提供了丰富的数据资源,使得研究者可以通过统计分析等方法,客观地了解语言的特征和规律。
2. 语料库的方法语料库的方法主要包括语料收集、标注、检索和分析等步骤。
首先,研究者需要根据研究目的和需求,选择合适的语料来源,如报刊杂志、网络论坛等。
然后,对收集到的语料进行标注和整理,以便进行后续的检索和分析。
最后,通过使用计算机软件等工具,对标注好的语料进行统计分析,得出研究结论。
三、批判话语分析的原理与方法1. 批判话语分析的原理批判话语分析是一种从社会、文化、政治等多角度对语言进行解读的方法。
它认为语言不仅是交流的工具,更是社会现象的反映和文化的载体。
因此,批判话语分析注重揭示语言背后的深层含义和意识形态。
2. 批判话语分析的方法批判话语分析的方法主要包括文本解读、语境分析和互文性分析等。
首先,研究者需要对文本进行细致的解读,了解文本的主题、内容和表达方式。
其次,通过分析文本产生的语境,如社会背景、文化传统等,揭示文本背后的深层含义。
最后,通过分析文本之间的互文性关系,探讨文本与其他文本之间的联系和影响。
四、语料库与批判话语分析的实践应用1. 语料库在语言研究中的应用语料库在语言研究中的应用广泛,如词汇研究、句法研究、语用研究等。
基于语料库的儿童文学的语篇分析本文将使用语料库检索软件,对美国作家弗兰克·鲍姆(Frank Baum)创作的儿童文学作品《绿野仙踪》的文本特征、主要内容等方面进行分析,以展示语料库检索软件在文学分析方面的强大功能,提高英语专业大学生对英语文学学习的兴趣,促进他们的英语学习。
标签:语料库;《绿野仙踪》;文本分析1简介随着近年来信息技术的发展以及计算机的普及和语料库研究的渐渐升温,国内外不少学者将语料库研究方法应用到文学领域,利用语料库检索软件(Concordance)对文学语篇进行分析,如Conrad、杨建枚、张厚振等。
他们的研究大胆创新,十分具有操作性,为后来的文学研究者带来很大的启示。
并且语料库研究方法也为英语专业的学习带来了非常大的便利,也节省了资源与时间,是一种高效的研究方法。
《绿野仙踪》又名《奇妙的奥兹男巫》,是美国作家弗兰克·鲍姆创作发表的奇幻冒险童话故事集,共十四本。
问世百年以来被翻译成多种语言出版,根据《绿野仙踪》故事改编的动画片和电影更是不计其数。
国内常见的《绿野仙踪》是这个系列的第一本。
《绿野仙踪》主要讲述了小女孩“Dorothy”和她的小狗“Toto”被龙卷风吹到了一个奇妙的“Oz”(奥兹国),小女孩为了能回到自己的家,经历了一系列有趣又惊现的事情,最后安全回家的故事。
2基于词表的语篇基本情况分析基于语料库的语言研究一般采取定性与定量相结合的研究方法,要进行定量研究就要涉及文本检索和数据统计。
Wordsmith软件中的Wordlist工具可以对语篇的基本信息进行统计,自动生成词表(图1),它可以提供文本中的简略统计数据,从而有助于分析文本的总体统计特征和基本情况。
词表的主要统计特征有:文件的字节数(bytes)、形符数(tokens)、类符数(types)、类符形符比(type/图1)The Wizard of OZ的文本统计信息截图(token ratio)、标准化类符形符比(standard type/tokenratio)、平均词长(meanword length)、句子数(sentences)等。
基于语料库的功能语篇分析——奥巴马总统2013年就职演说[Abstract]This paper mainly used corpus and Halliday’s three meta-functions theories through the discussions of the transitivity system, modality system and theme system to find out Obama’s discourse strategies and intentions in 2013 inaugural speech. This speech not only adopts so many long sentences to reserve its seriousness and formality but also deploys massive simple words to cater for a large audience. The recurrent emphases on freedom and equality indicate his political stance and incentive strategies. Material process verbs occupy an overwhelming part in the transitivity system, which makes the speech more convincing and practical. The application of numerous first person pronouns, high value model verbs and radialized thematic progression patterns effectively motivate the audience’s passion. Corpus can provide important data support for the study of functional discourse, hence proved an effective approach in discourse analysis.[Key words]systemic-functional grammar,corpus linguistics,discourse analysisI. IntroductionThe year 2013 witnessed some American economic recoveries from the financial crisis, Obama’s inaugural speech once again caught thegeneral public’s attention after his first term as president. In order to generalize the implied discourse strategies and intentions between the lines of the very speech, the author adopted both quantitative and qualitative methods respectively through corpus-based approach and systemic-functional approach. We call the discourse analysis under the systemic functional grammar functional discourse analysis.\[1\] Ⅱ. Theoretical Framework2.1 Corpus linguisticsThe approach to linguistic study based on corpus and corpus linguistics theories can be called corpus linguistic approach, which resorts to modern computer techniques and relies on empirical data-based or data-driven method.\[2\] Compared with the traditional linguistic research methods, the advantages of corpus linguistics analysis methods are reflected in the following aspects: (1) to analyze the pattern of natural discourse in an empirical way; (2) to collect large scale natural language materials as the sources for analysis; (3) to have automatic data analysis with the help of computers; (4) to show a better picture with both quantitative and qualitative analyses.\[3\]2.2 Systemic-functional grammarHalliday gives a discourse analysis through the following threemajor functions of the language, which are called meta-functions: the ideational or content-bearing function; the interpersonal function, indicating the writer’s or speaker’s attitude; and the textual function, enabling a speaker to arrange his or his utterances in such a way that it makes sense in context and delivers messages.\[4\] The ideational function is realized through transitivity system and voice system. The interpersonal function is embodied by mood system and modality system. And the textual function can be represented by the theme system, information system and cohesion system, which can reveal the main point the writer or speaker is arriving at and also the progression of the textual information.\[5\]Ⅲ. Research MethodologyFirst, the author used the Wordlist function of Wordsmith Tool 4.0 to gain the words frequency and the statistics of the whole text. Next, CLAWS (POS tagger) was adopted to tag the parts of speech for all the words in the text for the convenience of abstracting the desirable words later. Then, he put the tagged text into Wordsmith, used its Concond function, and searched for all the verbs. According to Halliday’s transitivity system theory in ideational function theory. The author abstracted the verbs and discussed them within 6 processes: material process, mental process, relational process, verbal process, behavioralprocess and existential process. And based on his modality system in interpersonal function theory, the author abstracted all the modal verbs in the text and analyzed them from three levels depending on the degree. Finally, the author randomly selected from the whole text one part which consisted of several sentences for the analysis of rheme system in textual function theory. The thematic progression patten was studied together with the cohesive devices.Ⅳ. Data Analysis and Discussions4.1 OverviewPicture 1Picture 2With the assistance of Wordsmith Tool 4.0, it is found that in Obama’s 2013 inaugural speech there are altogether 2,135 words with 774 types and 85 sentences. The type/token ratio is 36%, which is a relatively low rate, meaning there are not so many unfamiliar words and it is easier for Obama to reach a large audience. The average length of a sentence consists of 25 words, which shows that it is a formal and serious discourse with many long sentences. From picture 2, it is perceived that there mainly exist short words with 2-5 characters, this is also to cater for the large audience.Picture 3The top 50 frequent words are selected out from the original text. From these data it can be concluded that the use of first personal pronouns are frequently used (like “we”, “us”, “our”), creating some effects that the president is just one of the audience, and he will make efforts with all the others in the construction the United States. Likewise, it merits attention on the abundant occurrences of semantic field:“people”, “America”, “country”, “citizens”, “together”, which is for the same purpose as mentioned above, proving that it is a typical provocative speech. In addition, the words like “equal” and “freedom” are highlighted, manifesting Obama’s persistence on the American spirits that would recall the majority of people to support him.4.2 The transitivity systemDiagram 1The data for transitivity systemThe ideational function is embodied by transitivity, which divides people’s activities into 6 different processes: material process, mental process, relational process, verbal process, behavioral process and existential process, among which the first three processes are commonly noticed. From this pie chart, it is clear to notice that material process is quite predominant in the whole text with a share of 75%, sufficient use of which makes the discourse sounds more objective and convincing, and which also indicates that Obama is trying to figure out more specificmoves to solve the current problems for the second term other than give a mere emotional inspiration. The process involves at least one participant as an actor (see Table 1). The second main process in this text is the relational process, which shows Obama has a clear mind of what the situations are and what need to be resolved (see Table 2). Some mental processes enable Obama to better reach the audience and make all the listeners feel amiable and natural, such as the uses of “determine”, “believe”, “understand” (see Table 3). A few verbal processes would reinforce Obama’s mood so that the speech sounds more inspiring, such as the use of “say”, “declare”, “tell”, but this does not counter much since Obama’s focus is on the actions. There are no behavioral and existential processes.We, the people,stillbelievethat enduring security and lastingpeace do not require perpetual war4.3 The modality systemModal manifestations are various, such as modal verbs, modaladjuncts and metaphors of modality. This paper mainly discusses the use of modal verbs, which serve different purposes of the speaker and in Halliday’s view they can be classified in to three levels according to intensity (see table 4). Modal verbs have some functions, expressing the following meanings: prediction of future events, personal intension, willingness or wish, ability, permission, hypothesis, possibility, certainty, obligation or requirement, desirability.\[6\]Table 4High valuemust, have toMedium valuewill, would, shouldLow valuemay, might, can, couldDiagram 2The data for the Modality SystemAbundant applications of high value modal verbs like “must” (see e.g.4) and medium value ones like “will” (see e.g.5) indicate Obama’s confidence for the recovery of American economy and determination to address other urgent issues confronting his country.E.g.4: We must act, knowing that our work will be better.E.g.5: We will respond to the threat of climate change.4.4 The theme systemTheme is the starting point of information, and rheme is the explanation of theme and gives unknown information. The relationship between theme and rheme is called thematic progression, the four patterns of which are commonly seen: radialized pattern, centralized pattern, continuous pattern, and crossover pattern. The following paragraph is a randomly chosen one.(1)We [T1], the people, still believe that every citizen deserves a basic measure of security and dignity[R1].(2)We [T2]must make the hard choices to reduce the cost of health care and the size of our deficit[R2].(3) But we [T3]reject the belief [R3](4) that America[T4]must choose between caring for the generation that built this country and investing in the generation that will build its future[R4].(5) For we [T5]remember the lessons of our past[R5],(6) when twilight years [T6]were spent in poverty [R6],(7) and parents [T7]of a child with a disability had nowhere to turn[R7].Picture 4Picture 5This paragraph discusses Obama’s attitudes towards the security and dignity of every citizen especially for the care of the old and the young. The cohesive devices are personal repetitions (“we”), references (“that”, “when”), and conjunctives (“but”, “for”). The thematic progression patterns are radialized pattern (see picture 4) and continuous pattern (see picture 5). The radialized thematic progression pattern is obviously noticed throughout the whole discourse, because this is for the purpose to making parallel sentences so that the speech may sounds much more inspiring and affirmative, which conforms to the typical style of Obama’s speech. In this way, the sentences develop fluently and cohesively in the discourse.Ⅴ. ConclusionIt can be concluded that Obama’s inaugural speech in 2013 reveals his some typical speech strategies with the abundant use of first personal references and radialized thematic progression pattern for motivating the masses to support him, the preference for material process verbs to promote his proposals and reinforce the speech effects, the adept use of modal verbs for expressing his authority and determination as a president,and also the application of easy and short words to reach an audience as large as possible. All the speech strategies are to serve for motivating the great masses to support his future policies on some important issues like employment, medical insurance, and economic recovery. Also, the study shows that corpus and system-functional grammar are effective approaches to discourse analysis.【References 】[1]Guo Wen, H. 2001. Discourse Analysis Theory and Practice [M].Shanghai Foreign Language Education Press.\[2\] Shan, S. 2008. Discourse analysis to speeches by American president — a corpus-based study on radio addresses by President George W Bush[D].Shandong Normal University.\[3\] Biber, D. S. Conrad, R. Reppen. 1998. Corpus Linguistics: Investigating Structure and Use[M].Cambridge University Press.\[4\] Halliday, M. A. K. 1973. Explorations in the Functions of Language[M].London: Edward Arnold.\[5\] Xinhua, K., Jia, X. 2012. A Functional Discourse Analysis of Obama’s speech in Arizona State University[J].Overseas English, 12.\[6\] Leech, G. 1994. A communicative Grammar of English[M].London: Longman Group UK Limited.\[7\] Haifeng, L.2012. Functional Discourse Analysis of “U.S. President Barack Obama’s speech at the G20 summit in Cannes press conference”[J].Overseas English, 15.基于语料库的功能语篇分析——奥巴马总统2013年就职演说[摘要]本文基于语料库的研究方法,主要运用韩礼德的三个元功能理论中的及物性系统、情态系统和主位系统来研究奥巴马2013的就职演说如何实现其演讲策略。
基于语料库对美国总统奥巴马每周电台演讲的语篇分析政治语篇是语片的一种特殊形式,诸如政客的讲演,政府公告,政策条文,议会辩论,政党策略等等都属于政治语篇的范畴。
政治语篇是从社会政治层面对语篇的一种划分,其中大都包含了语篇作者的政见观点。
本研究的研究对象是美国现任总统奥巴马每周电台演讲的转写文本。
此类语篇兼顾了政治语篇和政治演讲的特点,属于政治语篇的一种特殊形式。
因此此研究具有十分重要的研究意义,有助于我们了解此类政治语篇的语言特点和发掘语篇背后隐藏的政治观点。
自20世纪50年代被提出以来,当代话语分析理论得到了长足发展,已成为当代语言学的一个重要分支。
当前,话语分析主要是以系统功能语法,批评语言学,语用学,言语行为理论等为理论基础进行深层次的分析。
然而这种定性分析的分析方法其研究的深度和广度深受研究者的影响,其研究的主观性是不可避免的。
相比之下,建立在数据和定量分析基础上的语料库语言学更加客观,可以在很大程度上弥补话语分析理论的不足。
本研究引入了基于语料库的话语分析方法,根据语料库的建库原则建立了自建语料库CPOWA(包含了奥巴马总统自2010年5月执政至2011年12月31日共83篇每周电台演讲的演讲稿)。
在对其进行观察、检索、分析的基础上,作者尝试回答下列问题:(1)在词汇,短语,句子和衔接层面上此类文本有何特征?(2)文中体现的对某些特定对象的观点和看法是什么?在此研究中基于语料库的话语分析方法被运用在对自建语料库CPOWA中语篇的词汇、短语、句子和衔接层面的分析上。
最后作者对研究的成果,意义及局限性进行了总结。
当然,基于语料库的话语分析方法到目前为止发展还是不成熟的,其对语篇的研究大体还是集中在词汇层面,在索引分析的理论和技术层面还有待进一步的研究和提高。
《语料库研究》篇一一、引言语料库作为一种资源丰富的语言数据集合,已成为语言学、语言学研究以及相关领域的热点研究对象。
它能够为语言分析、语言教学、翻译、词典编纂等多个领域提供支持。
本文将介绍语料库研究的重要性,并就当前语料库研究的现状进行梳理,进而分析其中存在的挑战和问题,并探讨未来的发展趋势。
二、语料库研究的现状1. 语料库类型及建设随着技术的进步,语料库建设日趋成熟。
根据不同领域和用途,语料库可大致分为通用型和专用型。
其中,通用型语料库如COCA、BNC等,涵盖了广泛的语言使用场景;专用型语料库则针对特定领域或主题进行收集,如法律、医学等。
此外,还有多媒体语料库和口语语料库等类型。
在建设过程中,研究者需考虑语料库的规模、代表性、时效性等因素。
2. 语料库应用领域语料库在多个领域得到了广泛应用。
在语言学领域,语料库为语言研究提供了丰富的数据支持;在翻译领域,语料库可帮助提高翻译的准确性和效率;在词典编纂方面,语料库为词汇的收集和释义提供了有力支持。
此外,在语言教学、自然语言处理等领域,语料库也发挥着重要作用。
三、当前挑战与问题尽管语料库研究取得了显著成果,但仍面临诸多挑战和问题。
首先,在语料库建设方面,如何确保数据的代表性和真实性是一个亟待解决的问题。
此外,随着技术的发展,如何利用人工智能等手段对语料库进行智能化处理和利用也是一大挑战。
其次,在应用方面,如何将语料库与实际需求相结合,提高应用效果也是一个难题。
此外,不同领域和行业对语料库的需求存在差异,如何满足这些不同需求也是一项挑战。
四、未来展望面对未来的发展,语料库研究将呈现以下几个趋势:1. 多样化与个性化:随着用户需求的多样化与个性化发展,未来的语料库将更加关注用户需求和实际应用场景的差异。
研究者需要设计更多类型的语料库来满足不同领域和行业的需求。
2. 智能化与自动化:人工智能技术的不断发展将促进语料库的智能化和自动化处理。
例如,利用自然语言处理技术对语料进行自动标注、分类和分析等操作,提高处理效率和准确性。
基于语料库的研究范式是一种以语料库为基础,通过对大量真实语言数据的分析和处理来研究语言现象、语言使用和语言变化的方法。
这种范式通常包括以下几个步骤:
1. 语料库建设:收集大量的语言数据,并建立语料库。
这些数据可以来自不同的来源,如文学作品、新闻媒体、社交媒体等。
2. 语料处理:对语料库中的数据进行预处理,包括文本清洗、分词、词性标注等。
3. 语料分析:使用各种统计和分析方法来处理语料库中的数据。
这可能包括频率分析、关键词提取、主题建模等。
4. 结论得出:根据语料分析的结果,得出关于语言现象、语言使用和语言变化的结论。
这些结论可以为语言学、文学、文化等领域的研究提供有益的启示和证据。
基于语料库的研究范式具有以下优点:
1. 大量的语言数据支持:语料库可以包含大量的真实语言数据,使得研究者可以对语言现象进行深入的研究和分析。
2. 定量与定性相结合:基于语料库的研究范式可以将定性和定量的方法相结合,从而更全面地了解语言现象的本质和规律。
3. 跨学科性:基于语料库的研究范式可以应用于多个学科领域,如语言学、文学、文化学等,使得不同学科之间的交流和合作更加便捷。
总之,基于语料库的研究范式是一种重要的语言研究方法,可以帮助我们更好地了解语言的本质和规律,进一步拓展和丰富世界
文化多样性。