Now showing items 1-6 of 6
Montage: Leveraging advances in grammar engineering, linguistic ontologies, and mark-up for the documentation of underdescribed languages
(Language Resources and Evaluation Conference, 2004)
The Montage project aims to develop a suite of software tools which will assist field linguists in organizing and analyzing the data they collect while at the same time producing resources which are easily discoverable ...
Creating a database and query-tools for the TELL multi-speaker linguistic corpus
(University of Pennsylvania, 2001)
The Turkish Electronic Living Lexicon (TELL; http://socrates.berkeley.edu:7037) represents the first large-scale effort to collect transcribed recordings of a large number of partial morphological paradigms (more than ...
Parallel Reverse Treebanks for the Discovery of Morpho-Syntactic Markings
This paper describes a corpus of syntactic structures and associated sentences. However, it is not a traditional treebank. The syntactic structures are created first and are then associated with sentences in a human language. ...
Semi-Automated Elicitation Corpus Generation
In this document we will describe a semi-automated process for creating elicitation corpora. An elicitation corpus is translated by a bilingual consultant in order to produce high quality word aligned sentence pairs. The ...
The MILE Corpus for Less Commonly Taught Languages
(Association for Computational Linguistics, 2006)
This paper describes a small, structured English corpus that is designed for translation into Less Commonly Taught Languages (LCTLs), and a set of re-usable tools for creation of similar corpora. The corpus systematically ...
Automatic Learning of Grammatical Encoding
Avenue (Probst et al., 2002, Monson et al., 2004, Lavie et al., 2003, Font-Llitjoset al.,2005)1 is a machine translation system that automatically learns translation rules between two languages. In the Avenue scenario, one ...