Automatically Filtering Irrelevant Words for Applications in Language Acquisition


Sohsah G. N., Akkurt E., Safarli I., Unal M. E., Guzey O.

13th International Conference on Machine Learning and Applications (ICMLA), Michigan, United States Of America, 3 - 06 December 2014, pp.557-561, (Full Text) identifier identifier

  • Publication Type: Conference Paper / Full Text
  • Doi Number: 10.1109/icmla.2014.113
  • City: Michigan
  • Country: United States Of America
  • Page Numbers: pp.557-561
  • Acibadem Mehmet Ali Aydinlar University Affiliated: No

Abstract

Building one's vocabulary in a language is an important component of language acquisition. Children learn their native language by being immersed in the language that is used in their environment. However, in second language acquisition, learners are often exposed to vocabulary that is selected by others specifically to aid language acquisition such as textbooks and word-lists. In this paper, we are presenting a machine learning based method for automatically selecting words that are relevant to the language acquisition task. The word relevancy is determined using data collected from 30 practicing English as a Second Language teachers for this purpose. We demonstrate the viability of this approach by using words from two major corpora, although in practice any corpora such as Google Books corpus can be utilized.