Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora
Author: J. Beal
Publisher: Springer
Total Pages: 270
Release: 2007-07-12
Genre: Language Arts & Disciplines
ISBN: 0230223206

A range of electronic corpora has become accessible via the WWW and CD-ROM. This coincides with improvements in standards governing the collecting, encoding and archiving of such data. This book develops similar standards for enriching and preserving 'unconventional' data': the fragmentary texts and voices left to us as accidents of history.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora
Author: Karen P. Corrigan
Publisher: Springer
Total Pages: 378
Release: 2016-09-19
Genre: Language Arts & Disciplines
ISBN: 1137386452

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora
Author: J. Beal
Publisher: Springer
Total Pages: 266
Release: 2007-06-27
Genre: Language Arts & Disciplines
ISBN: 0230223931

A range of electronic corpora is increasingly accessible via the WWW and CD-ROM. This development coincided with improved standards governing the collecting, encoding and archiving of such data. This book looks at developing similar standards for enriching and preserving unconventional data: dialects, child language and bilingual databases.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora
Author: Karen P. Corrigan
Publisher: Palgrave Macmillan
Total Pages: 359
Release: 2016-09-27
Genre: Language Arts & Disciplines
ISBN: 9781137386441

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.

The Routledge Handbook of English Language and Digital Humanities

The Routledge Handbook of English Language and Digital Humanities
Author: Svenja Adolphs
Publisher: Routledge
Total Pages: 693
Release: 2020-04-16
Genre: Language Arts & Disciplines
ISBN: 1000049787

The Routledge Handbook of English Language and Digital Humanities serves as a reference point for key developments related to the ways in which the digital turn has shaped the study of the English language and of how the resulting methodological approaches have permeated other disciplines. It draws on modern linguistics and discourse analysis for its analytical methods and applies these approaches to the exploration and theorisation of issues within the humanities. Divided into three sections, this handbook covers: sources and corpora; analytical approaches; English language at the interface with other areas of research in the digital humanities. In covering these areas, more traditional approaches and methodologies in the humanities are recast and research challenges are re-framed through the lens of the digital. The essays in this volume highlight the opportunities for new questions to be asked and long-standing questions to be reconsidered when drawing on the digital in humanities research. This is a ground-breaking collection of essays offering incisive and essential reading for anyone with an interest in the English language and digital humanities.

The Handbook of Language Variation and Change

The Handbook of Language Variation and Change
Author: J. K. Chambers
Publisher: John Wiley & Sons
Total Pages: 628
Release: 2018-05-01
Genre: Language Arts & Disciplines
ISBN: 1119457084

Reflecting a multitude of developments in the study of language change and variation over the last ten years, this extensively updated second edition features a number of new chapters and remains the authoritative reference volume on a core research area in linguistics. A fully revised and expanded edition of this acclaimed reference work, which has established its reputation based on its unrivalled scope and depth of analysis in this interdisciplinary field Includes seven new chapters, while the remainder have undergone thorough revision and updating to incorporate the latest research and reflect numerous developments in the field Accessibly structured by theme, covering topics including data collection and evaluation, linguistic structure, language and time, language contact, language domains, and social differentiation Brings together an experienced, international editorial and contributor team to provides an unrivalled learning, teaching and reference tool for researchers and students in sociolinguistics

Building a National Corpus

Building a National Corpus
Author: Dawn Knight
Publisher: Springer Nature
Total Pages: 192
Release: 2021-10-08
Genre: Language Arts & Disciplines
ISBN: 3030818586

This book aims to provide a micro-level, working model of a methodological approach and practical guidelines for building a corpus, informed by the work on the CorCenCC project (Corpws Cenedlaethol Cymraeg Cyfoes - the National Corpus of Contemporary Welsh). It focuses specifically on the development of detailed design frames for corpora across communicative modes (spoken, written and e-language), and the practical processes involved in the planning, collection, transcription, collation and (re)presentation of language data. The book is designed to be of significant value and relevance to those interested in critically engaging with corpus methodology. Although Welsh is the language under discussion, the processes and approaches discussed in the building of CorCenCC can be applied to a lesser or greater extent to other language contexts. This book provides a working model, and an account of how to build a corpus dataset from which step by step guidelines for creating other linguistic corpora in any language can be easily extrapolated. It will be of value to students and scholars of minority languages and corpus linguistics.

Creating and Digitizing Language Corpora

Creating and Digitizing Language Corpora
Author: Karen P. Corrigan
Publisher: Palgrave Macmillan
Total Pages: 0
Release: 2016-09-27
Genre: Language Arts & Disciplines
ISBN: 9781137386441

This book unites a range of approaches to the collection and digitization of diverse language corpora. Its specific focus is on best practices identified in the exploitation of these resources in landmark impact initiatives across different parts of the globe. The development of increasingly accessible digital corpora has coincided with improvements in the standards governing the collection, encoding and archiving of ‘Big Data’. Less attention has been paid to the importance of developing standards for enriching and preserving other types of corpus data, such as that which captures the nuances of regional dialects, for example. This book takes these best practices another step forward by addressing innovative methods for enhancing and exploiting specialized corpora so that they become accessible to wider audiences beyond the academy.