• Login
    View Item 
    •   UBIR Home
    • Publications
    • Language Sciences (public)
    • View Item
    •   UBIR Home
    • Publications
    • Language Sciences (public)
    • View Item
    JavaScript is disabled for your browser. Some features of this site may not work without it.

    Assessing agreement level between forced alignment models with data from endangered language documentation corpora

    Thumbnail
    View/Open
    assessing_agreement_2012.pdf (115.3Kb)
    Date
    2012
    Author
    DiCanio, Christian
    Nam, Hosung
    Whalen, Douglas H.
    Bunnell, H. Timothy
    Amith, Jonathan D.
    Castillo Garcia, Rey
    Metadata
    Show full item record
    Abstract
    Automatic forced alignment between transcriptions has achieved high levels of agreement for languages with large corpora, but the technique holds great promise for work on all languages. Here, we apply two forced alignment programs to data from an endangered Mixtecan language of Mexico. Both yielded a majority of boundaries within 20 ms of hand-labeled ones. Phonemes with fairly steady-state elements (e.g. nasals, fricatives) were more accurately labeled than others. Forced alignment thus may increase efficiency of labeling texts from smaller languages, at least in cases where the phoneme inventories are similar to those of the languages of the training.
    URI
    http://hdl.handle.net/10477/41243
    Collections
    • Language Sciences (public)

    To add content to the repository or for technical support: Contact Us
     

     

    Browse

    All of UBIRCommunities & CollectionsBy Issue DateAuthorsTitlesSubjectsTypesThis CollectionBy Issue DateAuthorsTitlesSubjectsTypes

    My Account

    LoginRegister

    To add content to the repository or for technical support: Contact Us