InfoScipedia
A Free Service of IGI Global Publishing House
Below please find a list of definitions for the term that
you selected from multiple scholarly research resources.

What is Corpus

New Technological Applications for Foreign and Second Language Learning and Teaching
A large collection of texts selected in a systematic way and stored as an electronic database.
Published in Chapter:
Corpus-Informed Pedagogy in a Language Course: Design, Implementation, and Evaluation
Nina Vyatkina (University of Kansas, USA)
DOI: 10.4018/978-1-7998-2591-3.ch015
Abstract
Data-Driven Learning (DDL), or a corpus-based method of language teaching and learning, has been developing rapidly since the turn of the century and has been shown to be effective and efficient. Nevertheless, DDL is still not widely used in regular classrooms for a number of reasons. One of them is that few workable pedagogical frameworks have been suggested for integrating DDL into language courses and curricula. This chapter describes an exemplar of a practical application of such a pedagogical framework to a high-intermediate university-level German as a foreign language course with a significant DDL component. The Design-Based Research approach is used as the main methodological framework. The chapter concludes with a discussion of wider pedagogical implications.
Focus on Text Messages: A Review of Studies in French
A set of collected text messages.
Web-Based English Writing Courses for Graduate Students
A collection of naturally occurring language text, chosen to characterize the state or variety of a language.
Sharpening Students' Critical Literacy Skills Through Corpus-Based Instruction: Addressing the Issue of Language Sexism
Derived from Latin, where it originally meant ‘body’, a corpus is a large body of texts which can be stored and processed in electronic form. Given its size, it constitutes a representative sample of language, while its machine-readable format allows annotation, as well as various types of analysis based on the criteria set and the tools used (e.g., part-of-speech tagging, frequency counts, key-word-in-context concordances). Specialized software allows processing of the data that a corpus contains. (Plural form: corpora)
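Two of the analyses named in this definition, word frequencies and key-word-in-context (KWIC), can be illustrated with a minimal Python sketch. The three-sentence corpus and the function names `tokenize`, `word_frequencies`, and `kwic` are assumptions made up for this example; they do not come from the cited chapter, and real corpus software is far more elaborate.

```python
import re
from collections import Counter

# Hypothetical three-text corpus, invented purely for illustration.
corpus = [
    "The corpus is stored in electronic form.",
    "A corpus can be annotated and analysed with software.",
    "Software tools count how often each word occurs in the corpus.",
]

def tokenize(text):
    """Lowercase the text and split it into word tokens."""
    return re.findall(r"[a-z']+", text.lower())

def word_frequencies(texts):
    """Count how often each word token occurs across all texts."""
    counts = Counter()
    for text in texts:
        counts.update(tokenize(text))
    return counts

def kwic(texts, keyword, window=3):
    """Return (left context, keyword, right context) for every hit."""
    hits = []
    for text in texts:
        tokens = tokenize(text)
        for i, token in enumerate(tokens):
            if token == keyword:
                left = " ".join(tokens[max(0, i - window):i])
                right = " ".join(tokens[i + 1:i + 1 + window])
                hits.append((left, token, right))
    return hits

frequencies = word_frequencies(corpus)
concordance = kwic(corpus, "corpus")

# Print the concordance with the keyword aligned in one column.
for left, word, right in concordance:
    print(f"{left:>25} | {word} | {right}")
```

Aligning the keyword in a single column is what lets a reader scan its typical left and right neighbours at a glance.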
Automated Essay Scoring Systems
A reference collection of texts used to establish a stylistic or knowledge base for automated essay scoring (AES).
Concordancing 2.0: On Custom-Made Corpora in the Classroom
A collection of linguistic data, either written texts or a transcription of recorded speech, which can be used as a starting-point of linguistic description or as a means of verifying hypotheses about a language.
Entropy, Chaos, and Language
A specific compilation of textual data that follows certain criteria, usually with added metadata of linguistic value.
Corpora in the Classroom and Beyond
A collection of annotated or unannotated texts used for linguistic analysis.
Managing Corporate Terminology as an Internationalization Strategy: An Overview
A collection of texts that are selected according to specific criteria and constitute a language sample.
Amplifying Participant Voices Through Text Mining
A collection of documents for use in text mining analysis.
Sharing Corpus Resources in Language Learning
A collection of naturally occurring data collected for the purpose of a linguistic investigation. A corpus may include materials representing various modes, registers and text types, and it may be possible to isolate these subsets of data, and analyze them separately or contrast them. Such a subdivision of a corpus is known as a subcorpus. A parallel corpus contains texts and translations of those texts, and is compiled in order to analyze and study translations.
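As a rough illustration of isolating a subcorpus, the Python sketch below filters a small corpus by metadata. The records, the metadata fields `mode` and `register`, and the helper `subcorpus` are all hypothetical and chosen only for this example.

```python
# Hypothetical corpus records; the metadata fields are assumptions.
corpus = [
    {"id": 1, "mode": "written", "register": "academic", "text": "The results are discussed below."},
    {"id": 2, "mode": "spoken", "register": "casual", "text": "Yeah, I guess so."},
    {"id": 3, "mode": "written", "register": "news", "text": "The council met on Monday."},
    {"id": 4, "mode": "spoken", "register": "academic", "text": "So, today we look at syntax."},
]

def subcorpus(records, **criteria):
    """Return the records whose metadata match every given criterion."""
    return [
        record for record in records
        if all(record.get(field) == value for field, value in criteria.items())
    ]

spoken = subcorpus(corpus, mode="spoken")
written_academic = subcorpus(corpus, mode="written", register="academic")

# The subsets can now be analyzed separately or contrasted.
print([record["id"] for record in spoken])
print([record["id"] for record in written_academic])
```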
Statistical Modelling of Highly Inflective Languages
A large collection of texts, usually in electronic form. A corpus has greater value if it is tokenized (segmented into sentences, words, etc.) and linguistically annotated (for example, POS-tagged and lemmatized).
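The tokenization step mentioned in this definition can be sketched in Python as follows. This is a deliberately naive, regex-based illustration on an invented text; the function names `split_sentences` and `split_words` are assumptions, and POS tagging and lemmatization are not shown because they normally rely on trained, language-specific tools.

```python
import re

# Hypothetical raw text, invented for illustration.
raw_text = "Corpora are large. They are usually stored electronically! Are they annotated?"

def split_sentences(text):
    """Naively split after '.', '!' or '?' when followed by whitespace."""
    return [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]

def split_words(sentence):
    """Split a sentence into word tokens and single punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", sentence)

# One list of tokens per sentence.
tokenized = [split_words(sentence) for sentence in split_sentences(raw_text)]

for tokens in tokenized:
    print(tokens)
```

A rule this simple would mis-split abbreviations such as "e.g." or "Dr.", which is one reason dedicated tokenizers are used in practice, particularly for highly inflective languages.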
The Integration of Corpus Tools in the Design and Implementation of a Novel Analytical Model for the Learning of K12 Classrooms
Any collection of more than one text; more specifically, a large collection of natural texts in machine-readable form, compiled and considered to be representative of a variety or a genre of a language.
Big Data Visualization Tools and Techniques
A collection of written texts, literary works, or aggregated data on a particular subject matter, or the entire textual aggregation of works by a specific author.
First Person Pronouns in Online Diary Writing
A collection of written texts or transcriptions of spoken language. Now understood to be an electronic collection.
Latent Dirichlet Allocation Approach for Analyzing Text Documents
A corpus is a collection consisting of two or more documents.
Less Is More in College Students' Writing: Extremely Short Stories as a Bridge to Academic Writing
Written and spoken language that is collected and stored (on computers) for use in the study of language and compilation of dictionaries.
Computer-Assisted Language Learning in East Asia
A collection of naturally occurring language text, chosen to characterize the state or variety of a language (Sinclair, 1991).
Individual Differences among Student Teachers in Taking an Online Corpus Linguistics Course: A Multiple Case Study
A corpus (plural corpora) is a collection of electronically stored texts from written or spoken language which is representative of a genre.
Mispronunciation Detection Using Neural Networks for Second Language Learners
A language dataset selected systematically and stored as an electronic database.
Estimating Importance From Web Reviews Through Textual Description and Metrics Extraction
A dataset about a specific subject. Widely used in computational linguistics.
An Evaluation of Preposition Representation in the Omani Basic Education ELT Textbooks: Focus on Grades 1-4
A large collection of written or spoken texts which form the basis for corpus linguistics.
Develop a Neural Model to Score Bigram of Words Using Bag-of-Words Model for Sentiment Analysis
A corpus is an original repository or online dataset that is used in most NLP projects.