Part of Speech Induction from Distributional Features: Balancing Vocabulary and Context
Datla,V.V. ; Louwerse,M.M. ; Lin,King-Ip
Datla,V.V.
Louwerse,M.M.
Lin,King-Ip
Abstract
Past research on grammar induction has found promising results in predicting parts-of-speech from n-grams using a fixed vocabulary and a fixed context. In this study, we investigated grammar induction whereby we varied vocabulary size and context size. Results indicated that as context increased for a fixed vocabulary, overall accuracy initially increased but then leveled off. Importantly, this increase in accuracy did not occur at the same rate across all syntactic categories. We also address the dynamic relation between context and vocabulary in terms of grammar induction in an unsupervised methodology. We formulate a model that represents a relationship between vocabulary and context for grammar induction. Our results concur with what has been called the word spurt phenomenon in the child language acquisition literature.
Description
Date
2014-05-03
Journal Title
Journal ISSN
Volume Title
Publisher
AAAI Press
Research Projects
Organizational Units
Journal Issue
Keywords
Citation
Datla, V V, Louwerse, M M & Lin, K-I 2014, Part of Speech Induction from Distributional Features : Balancing Vocabulary and Context. in W Eberle & C Boonthum-Denecke (eds), Proceedings of the Twenty-Seventh International Florida Artificial Intelligence Research Society Conference. AAAI Press, pp. 28-32, Twenty-Seventh International Florida Artificial Intelligence Research Society Conference, Florida, United States, 3/05/14.
