Title

Developing an Online Corpus of Formosan Languages

DOI

10.6519/TJL.2008.6(2).4

Authors

宋麗梅(Li-May Sung);蘇以文(Lily I-Wen Su);謝富惠(Fu-Hui Hsieh);林哲民(Zhe-Min Lin)

Keywords

Formosan languages ; Taiwan ; corpus ; database normalization ; discourse ; intonation unit (IU) ; Pear story ; Frog story ; cross-referencing retrievability ; multilingual search ; interoperability

Journal

Taiwan Journal of Linguistics

Volume and Issue / Publication Date

Vol. 6, No. 2 (2008/12/01)

Pages

79 - 117

Language

English

English Abstract

Information technologies have now matured to the point where researchers can build repositories of language resources, especially for endangered languages. The development of an online corpus platform, made possible by recent advances in data storage, character encoding and web technology, has profound consequences for the accessibility, quantity, quality and interoperability of linguistic field data. This is of particular significance for the Formosan languages of Taiwan, many of which are on the verge of extinction. In response to this growing problem, the NTU Corpus of Formosan Languages was established to document, and thus preserve, valuable linguistic data as well as relevant ethnological and cultural information. This paper introduces some of the theoretical bases behind this initiative, as well as the procedures, transcription conventions, database normalization, in-house system and three special features involved in creating this corpus.

Subject Classification: Humanities > Linguistics
References
  1. Zeitoun, Elizabeth,Ching-Hua Yu.(2005).The Formosan language archive: Linguistic analysis and language processing.Computational Linguistics and Chinese Language Processing,10(2),167-200.
  2. Berman, Ruth,Dan Slobin.(1994).Relating Events in Narrative: A Crosslinguistic Developmental Study.Hillsdale, N.J.:Lawrence Erlbaum Associates.
  3. Brenzinger, M. (ed.)(1992).Language Death: Factual and Theoretical Explorations with Special Reference to East Africa.Berlin:Mouton de Gruyter.
  4. Brill, Eric.(1995).Transformation-based error-driven learning and natural language processing: A case study in part-of-speech tagging.Computational Linguistics,21(4),543-565.
  5. Chafe, Wallace L.(1994).Discourse, Consciousness and Time.Chicago:University of Chicago Press.
  6. Chafe, Wallace L. (ed.)(1980).The Pear Stories: Cognitive, Cultural, and Linguistic Aspects of Narrative Production.Norwood, N.J.:Ablex Publishing Corp.
  7. Chafe, Wallace L.,Russell S. Tomlin (ed.)(1987).Coherence and Grounding in Discourse.Amsterdam:John Benjamins.
  8. Chen, Hsueh-hua.(2007).Studies on the Development of Indigenous Languages.Taipei:Council of Indigenous Peoples.
  9. Couper-Kuhlen, E.,Thompson, S.,E. Couper-Kuhlen (ed.),B. Kortmann(2000).Cause-condition-concession-contrast: Cognitive and Discourse Perspectives.Berlin:Mouton de Gruyter.
  10. Crawford, J.(1995).Endangered native American languages: What is to be done and why.The Bilingual Research Journal,19(1),17-38.
  11. DeLancey, Scott.(1981).An interpretation of split ergativity and related patterns.Language,57(3),626-657.
  12. Diamond, Jared M.(2000).Linguistics: Taiwan's gift to the world.Nature,403,709-710.
  13. Du Bois, J. W.(1987).The discourse basis of ergativity.Language,63,805-855.
  14. Du Bois, J. W.,John Haiman (ed.)(1985).Iconicity in Syntax.Amsterdam:Benjamins.
  15. Du Bois, J. W,Stephan Schuetze-Coburn,Susanna Cumming,Danae Paolino.,J. A. Edwards (ed.),M. D. Lampert(1993).Outline of discourse transcription. Talking Data: Transcription and Coding for Language Research.Hillsdale, N.J.:Lawrence Erlbaum Associates.
  16. Heine, Bernd.(1997).Cognitive Foundations of Grammar.Oxford/New York:Oxford University Press.
  17. Himmelmann, Nikolaus.,Jost Gippert (ed.),Nikolaus P. Himmelmann,Ulrike Mosel(2006).Essentials of Language Documentation.Berlin:Mouton de Gruyter.
  18. Hopper, Paul J.,Sandra A. Thompson.(1980).Transitivity in grammar and discourse.Language,56,251-299.
  19. Hopper, Paul J.,Sandra A. Thompson.(1984).The discourse basis for lexical categories in Universal Grammar.Language,60(4),702-752.
  20. Huang, Shuanfan,Jozsef Szakos,Li-May Sung.(1999).NSC Technical Report.National Taiwan University.
  21. Huang, Shuanfan,Lily I-wen Su,Li-May Sung.(2004).NSC Technical Report.National Taiwan University.
  22. Huang, Shuanfan,Lily I-wen Su,Li-May Sung.(2002).NSC Technical Report.National Taiwan University.
  23. Huang, Shuanfan,Lily I-wen Su,Li-May Sung.(2001).NSC Technical Report.National Taiwan University.
  24. Huang, Shuanfan,Lily I-wen Su,Li-May Sung.(2003).NSC Technical Report.National Taiwan University.
  25. Huang, Shuanfan,Li-May Sung.(2005).NSC Technical Report.National Taiwan University.
  26. Krauss, M.(1992).The world's languages in crisis.Language,68,6-10.
  27. Lerner, Gene H.,Takagi, T.(1999).On the place of linguistic resources in the organization of talk-in-interaction.Journal of Pragmatics,31,49-75.
  28. Lin, Zhemin.(2005).National Taiwan University.
  29. Lin, Zhemin,Li-May Sung.(2004).Tiny corpus applications with transformation-based error-driven learning: Evaluations of automatic grammar induction and partial parsing of SaiSiyat.Proceedings of the 18th Pacific Asia Conference on Language, Information and Computation,Tokyo:
  30. Mayer, Mercer.(1980).Frog, Where Are You?.New York:Dial Books.
  31. Ono, Tsuyoshi,S. A. Thompson.,P. W. Davis (ed.)(1995).Alternative Linguistics: Descriptive and Theoretical Modes.Amsterdam:John Benjamins.
  32. Robins, R. H.,Uhlenbeck, E. (eds.)(1991).Endangered Languages.Oxford:Berg.
  33. Sacks, Harvey,Emanuel A. Schegloff,Gail Jefferson.(1974).A simplest system for the organization of turn-taking for conversation.Language,50,696-735.
  34. Schegloff, Emanuel A.(1988).Pre-sequences and Indirection.Journal of Pragmatics,12,55-62.
  35. Schegloff, Emanuel A.(1980).Preliminaries to preliminaries: "Can I ask you a question?".Sociological Inquiry,50,104-151.
  36. Schegloff, Emanuel A.,E. Ochs (ed.),E. Schegloff,S. Thompson(1996).Interaction and Grammar.New York:Cambridge University Press.
  37. Schegloff, Emanuel A.,Lauren B. Resnick (ed.),John M. Levine,Stephanie D. Teasley(1991).Perspectives on Socially Shared Cognition.Washington D.C.:American Psychological Association.
  38. Schmidt, A.(1990).The Loss of Australia's Aboriginal Language Heritage.Canberra:Aboriginal Studies Press.
  39. Strömqvist, Sven,Ludo Verhoeven. (ed.)(2003).Relating Events in Narrative, Volume 2: Typological and Contextual Perspectives.Hillsdale, N.J.:Lawrence Erlbaum Associates.
  40. Su, Lily I-wen,Li-May Sung,Shuping Huang,Fuhui Hsieh,Zhemin Lin.(2008).NTU Corpus of Formosan Languages: A state-of-the-art report.Corpus Linguistics and Linguistic Theory,4-2,291-294.
  41. Tao, Hongyin.(1996).Units in Mandarin Conversation: Prosody, Discourse and Grammar.Amsterdam:John Benjamins.
  42. Tuan, Yi-Fu.(1974).Topophilia: A Study of Environmental Perception, Attitudes, and Values.Englewood Cliffs, N.J.:Prentice Hall.
  43. Zeitoun, Elizabeth,Ching-hua Yu,Cui-xia Weng.(2003).The Formosan language archive: Development of a multimedia tool to salvage the languages and oral traditions of the indigenous tribes of Taiwan.Oceanic Linguistics,42(1),218-232.
Cited By
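  1. 賴惠玲,劉吉軒,葉秋杏.(2021).The construction of a Taiwan Hakka corpus and a preliminary study of Hakka lexical usage [臺灣客語語料庫建置與客語詞彙使用初探].數位典藏與數位人文,8,75-131.
  2. 葉美利.(2014).A preliminary study of the syntax, morphology, and semantics of Saisiyat demonstratives [賽夏語指示詞的句法、構詞與語意初探].臺灣語文研究,9(1),55-78.
  3. (2016).A synchronic reconstruction of the modal semantic evolution of the auxiliary 會: A comparative study across languages and dialects [助動詞「會」情態語義演變共時構擬─基於跨語言/方言的比較研究].語言暨語言學,17(2),195-233.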