Now showing items 1-4 of 4
Semi-Automated Elicitation Corpus Generation
In this document we will describe a semi-automated process for creating elicitation corpora. An elicitation corpus is translated by a bilingual consultant in order to produce high quality word aligned sentence pairs. The ...
The MILE Corpus for Less Commonly Taught Languages
(Association for Computational Linguistics, 2006)
This paper describes a small, structured English corpus that is designed for translation into Less Commonly Taught Languages (LCTLs), and a set of re-usable tools for creation of similar corpora. The corpus systematically ...
Automatic Learning of Grammatical Encoding
Avenue (Probst et al., 2002, Monson et al., 2004, Lavie et al., 2003, Font-Llitjoset al.,2005)1 is a machine translation system that automatically learns translation rules between two languages. In the Avenue scenario, one ...
Parallel Reverse Treebanks for the Discovery of Morpho-Syntactic Markings
This paper describes a corpus of syntactic structures and associated sentences. However, it is not a traditional treebank. The syntactic structures are created first and are then associated with sentences in a human language. ...