Now showing items 1-6 of 6
Montage: Leveraging advances in grammar engineering, linguistic ontologies, and mark-up for the documentation of underdescribed languages
(Language Resources and Evaluation Conference, 2004)
The Montage project aims to develop a suite of software tools which will assist field linguists in organizing and analyzing the data they collect while at the same time producing resources which are easily discoverable ...
Parallel Reverse Treebanks for the Discovery of Morpho-Syntactic Markings
This paper describes a corpus of syntactic structures and associated sentences. However, it is not a traditional treebank. The syntactic structures are created first and are then associated with sentences in a human language. ...
Semi-Automated Elicitation Corpus Generation
In this document we will describe a semi-automated process for creating elicitation corpora. An elicitation corpus is translated by a bilingual consultant in order to produce high quality word aligned sentence pairs. The ...
The MILE Corpus for Less Commonly Taught Languages
(Association for Computational Linguistics, 2006)
This paper describes a small, structured English corpus that is designed for translation into Less Commonly Taught Languages (LCTLs), and a set of re-usable tools for creation of similar corpora. The corpus systematically ...
Automatic Learning of Grammatical Encoding
Avenue (Probst et al., 2002, Monson et al., 2004, Lavie et al., 2003, Font-Llitjoset al.,2005)1 is a machine translation system that automatically learns translation rules between two languages. In the Avenue scenario, one ...
Creating a database and query-tools for the TELL multi-speaker linguistic corpus
(University of Pennsylvania, 2001)
The Turkish Electronic Living Lexicon (TELL; http://socrates.berkeley.edu:7037) represents the first large-scale effort to collect transcribed recordings of a large number of partial morphological paradigms (more than ...