Corpus Manager: A tool for multilingual corpus analysis

George Kouklakis, Georgios Mikros, George Markopoulos, Ilias Koutsis

Research output: Contribution to conferencePaperpeer-review

Abstract

The vast amount of electronic texts in the web facilitates the creation of megacorpora at a very short time. However, as the corpus size increases, the complexity of its management is becoming a serious problem limiting its functionality. Furthermore, a great deal of corpus linguistics research is based on the quantitative comparison of small corpus samples, which are drawn from a bigger general language corpus based on specific criteria such as authorship, topic, genre, register etc (Biber 1993). For these reasons a number of tools have already been developed and aim to organize and handle texts in corpora (e.g. Christ 1994, Holmes-Higgin et al. 1994). However, most of the developed systems have a significant learning curve and limited flexibility regarding the metadata which can be used as subcorpus selection criteria.
Original languageEnglish
Number of pages4
Publication statusPublished - 2007
Externally publishedYes
EventCorpus Linguistics Conference 2007 - Birmingham, United Kingdom
Duration: 27 Jul 200730 Jul 2007

Conference

ConferenceCorpus Linguistics Conference 2007
Country/TerritoryUnited Kingdom
CityBirmingham
Period27/07/0730/07/07

Fingerprint

Dive into the research topics of 'Corpus Manager: A tool for multilingual corpus analysis'. Together they form a unique fingerprint.

Cite this