Now showing items 1-3 of 3
Semi-Automated Elicitation Corpus Generation
In this document we will describe a semi-automated process for creating elicitation corpora. An elicitation corpus is translated by a bilingual consultant in order to produce high quality word aligned sentence pairs. The ...
The MILE Corpus for Less Commonly Taught Languages
(Association for Computational Linguistics, 2006)
This paper describes a small, structured English corpus that is designed for translation into Less Commonly Taught Languages (LCTLs), and a set of re-usable tools for creation of similar corpora. The corpus systematically ...
Parallel Reverse Treebanks for the Discovery of Morpho-Syntactic Markings
This paper describes a corpus of syntactic structures and associated sentences. However, it is not a traditional treebank. The syntactic structures are created first and are then associated with sentences in a human language. ...