Now showing items 51-60 of 70
The central question which I will be trying to answer here is where principles governing the linear organization of linguistic constituents “belong” in a standard componential model of grammar.
The MILE Corpus for Less Commonly Taught Languages
(Association for Computational Linguistics, 2006)
This paper describes a small, structured English corpus that is designed for translation into Less Commonly Taught Languages (LCTLs), and a set of re-usable tools for creation of similar corpora. The corpus systematically ...
Automatic Learning of Grammatical Encoding
Avenue (Probst et al., 2002, Monson et al., 2004, Lavie et al., 2003, Font-Llitjoset al.,2005)1 is a machine translation system that automatically learns translation rules between two languages. In the Avenue scenario, one ...
Deconstructing descriptive grammars
(University of Hawai'i Press, 2012)
Much work within digital linguistics has focused on the problem of developing concrete methods and general principles for encoding data structures designed for non-digital media into digital formats. This work has been ...
‘Community’ collaboration in Africa: Experiences from Northwest Cameroon
A prominent feature of the literature on language documentation has been the importance of designing documentary projects in ways that allow speaker communities to benefit from the work of an outside researcher. Canonical ...
(Oxford University, 2008)
Certain grammatical patterns are found again and again in the languages of the world. Some of these patterns recur so frequently that they are given the label “universal”. Explaining the source of such patterns is clearly ...
Data and language documentation
(Cambridge University Press, 2011)
The topic of this chapter is the relationship between data and language documentation. Unlike many fields of study, concerns regarding data collection and manipulation play a central role in our understanding of, and ...
Modeling contested categorization in linguistic databases
A fundamental problem in the design of linguistic databases is finding effective ways to encode content which is of a contested nature—that is, content which involves data for which there is no general consensus on how ...