UNSPECIFIED ANNOTATION MODEL FOR LOANWORDS IN INDONESIAN CORPUS : A LOCAL GRAMMAR FRAMEWORK. Proceeding of International Conference on Language Maintenance and Shift III (ISSN 2088-6799) . ISSN 2088-6799
Microsoft Word - Published Version 91Kb |
Abstract
There is a considerable number for loanwords in Indonesian language as it has been, or even continuously, in contact with other languages. The contact takes place via different media; one of them is via machine readable medium. As the information in different languages can be obtained by a mouse click these days, the contact becomes more and more intense. This paper aims at proposing an annotation model and lexical resource for loanwords in Indonesian. The lexical resource is applied to a corpus by a corpus processing software called UNITEX. This software works under local grammar framework (Gross, 1993 & 1997). The lexical resource has already been tested on a small corpus to perform automatic retrieval of loanwords in Indonesian. The automatic retrieval aims at identifying the loanwords from different languages. The queries demonstrated in this paper allows the users not only to retrieve loanwords, but also show etymological information about the loanwords, mainly the donor language, and in some cases, the language that introduce the loanwords to Indonesian.
Item Type: | Article |
---|---|
Subjects: | P Language and Literature > P Philology. Linguistics |
Divisions: | Faculty of Humanities > Department of English |
ID Code: | 39597 |
Deposited By: | INVALID USER |
Deposited On: | 23 Jul 2013 10:27 |
Last Modified: | 23 Jul 2013 10:27 |
Repository Staff Only: item control page