Title

分析華語口語語料庫高頻詞之特點並對TOCFL詞表提出建議

Parallel title

Analyzing the Features of the High-Frequency Words on Chinese Spoken Corpus and Offering the Word-recruiting Suggestion to TOCFL Wordlist

Authors

楊惠媚(Hui-Mei Yang);陳浩然(Howard Hao-Jan Chen);潘依婷(I-Ting Pan)

Keywords

華語學習 ; 華語詞表 ; 口語語料庫 ; 高頻詞 ; Chinese learning ; Chinese wordlist ; Spoken corpus ; high-frequency words

Journal

華語文教學研究

Volume/issue and publication date

Vol. 12, No. 1 (2015/03/01)

Pages

1 - 44

Language

Traditional Chinese

Chinese abstract

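The main function of a wordlist is to list the vocabulary that learners should acquire, so it can serve as a reference for second-language vocabulary teaching. The TOCFL wordlist, a well-known Chinese wordlist in Taiwan, is widely used for testing and by learners. However, its word selection relies mainly on word frequencies from written-language corpora, and spoken data make up a comparatively small share. Written and spoken vocabulary belong to different registers, so language teaching should not be limited to written language, and word selection should also cover spoken vocabulary. To examine whether some high-frequency spoken words could be recommended for inclusion in the TOCFL wordlist, so that written and spoken data are more evenly balanced in the list, this study collected dialogue from Chinese-language TV series and films as native-speaker spoken data, derived high-frequency spoken words by ranking word frequencies, and compared them with the TOCFL wordlist. The comparison showed that 713 high-frequency spoken words are not included in the TOCFL wordlist. This study proposes the 238 words with the highest frequency and the strongest spoken-keyword characteristics as reference vocabulary for revising the TOCFL wordlist. It also classifies the 713 high-frequency spoken words according to six features of spoken vocabulary compiled from previous literature and identifies the following characteristics: (1) spoken vocabulary mostly appears as word chunks and compounds; (2) spoken vocabulary contains a relatively large number of multi-syllable idiomatic expressions; (3) chunks formed with 「不」 (bu) and 「沒」 (mei) are especially numerous. Finally, based on word frequency and the features of spoken vocabulary, this study recommends words that could be added to the TOCFL wordlist, in the hope of increasing the share of spoken vocabulary and thereby the richness and diversity of the list, and of providing Chinese language teaching with a vocabulary reference closer to actual spoken communication.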

English abstract

A wordlist serves as a reference for second language teaching and helps second language learners judge which words they need to acquire. The TOCFL wordlist is one of the common learning materials for learners preparing for a Chinese proficiency test. However, the words in the TOCFL wordlist were selected largely on the basis of written-corpus frequencies, whereas words drawn from spoken data were limited. Because written and spoken vocabulary belong to different registers, it is necessary to include words from both registers and to emphasize the differences in teaching. To balance the proportions of written and spoken words in the TOCFL wordlist, this study first built a native-speaker spoken corpus from the dialogue of Mandarin movies and TV series, and then compiled a frequency-ranked list of high-frequency spoken words as a proposed supplement to the TOCFL wordlist. Comparing this spoken wordlist with the TOCFL wordlist showed that 713 high-frequency spoken words were not covered in the TOCFL wordlist. We then suggested 238 of these words, those with the highest frequency and the strongest spoken-keyword characteristics, for inclusion in the TOCFL wordlist. The 713 high-frequency spoken words were further classified according to six features of spoken vocabulary drawn from previous literature, and the key findings are as follows: (1) most of the items are word chunks or compounds, (2) the spoken vocabulary contains many multi-syllable idiomatic expressions, and (3) a large number of chunks are formed with bu and mei. We hope that this spoken wordlist can increase the proportion of spoken words in the TOCFL wordlist and offer learners and teachers a vocabulary reference closer to real oral communication.
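The comparison step described in the abstract (rank spoken-corpus words by frequency, then check them against an existing wordlist) can be illustrated with a minimal sketch. The record does not state which tools the authors used, so everything below is an assumption for illustration: the function name `high_frequency_gaps`, the toy dialogue lines, the toy reference list, and the premise that the dialogue has already been word-segmented into space-separated tokens.

```python
from collections import Counter


def high_frequency_gaps(segmented_lines, reference_wordlist, top_n):
    """Return (word, count) pairs for the top_n most frequent words
    that do not appear in the reference wordlist.

    Hypothetical helper: assumes each line is already word-segmented,
    with tokens separated by whitespace.
    """
    counts = Counter()
    for line in segmented_lines:
        counts.update(line.split())
    reference = set(reference_wordlist)
    return [(word, n) for word, n in counts.most_common(top_n)
            if word not in reference]


# Toy data for illustration only; not taken from the study.
dialogue = [
    "沒 關係 我 沒 問題",
    "不 好意思 我 不 知道",
    "沒 關係 不 好意思",
]
wordlist = ["我", "知道", "問題", "不", "沒"]

gaps = high_frequency_gaps(dialogue, wordlist, top_n=10)
for word, n in gaps:
    print(word, n)
```

In a real setting the segmentation step and the frequency cutoff would strongly affect which chunks surface, so this sketch only shows the shape of the frequency-versus-wordlist comparison, not the study's actual procedure.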

Subject classification: Humanities > Linguistics
Social Sciences > Education
References
  1. Chen, Keh-Jiann,Bai, Ming-Hong(1998).Unknown Word Detection for Chinese by a Corpus-based Learning Method.International Journal of Computational Linguistics and Chinese Language Processing,3(1),27-44.
  2. 楊惠媚、陳浩然、潘依婷(2014)。兩岸華語詞表之比較及選詞建議。華語文教學研究,11(1),67-98。
  3. (2004).Vocabulary in a Second Language: Selection, Acquisition, and Testing.
  4. (1983).Language and communication.
  5. Berber-Sardinha, T.(2000).Comparing corpora with WordSmith Tools: How large must the reference corpus be?.Proceedings of the workshop on Comparing corpora-Volume 9
  6. Biber, Douglas(1988).Variation Across Speech and Writing.Cambridge:Cambridge University Press.
  7. Biber, Douglas,Conrad, Susan,Cortes, Viviana(2004).If you look at...: Lexical bundles in university teaching and textbooks.Applied Linguistics,25(3),371-405.
  8. Biber, Douglas,Finegan, Edward(1991).On the Exploitation of Computerized Corpora in Variation Studies.English Corpus Linguistics,London:
  9. Carter, Ronald(2004).Language and Creativity: The Art of Common Talk.London:Routledge.
  10. Carter, Ronald,McCarthy, Michael,Hughes, Rebecca(2002).Exploring Grammar in Context:Upper-intermediate and Advanced.Ernst Klett Sprachen.
  11. Danescu-Niculescu-Mizil, Cristian,Lee, Lillian(2011).Chameleons in Imagined Conversations: A new Approach to Understanding Coordination of Linguistic Style in Dialogs.Proceedings of the 2nd Workshop on Cognitive Modeling and Computational Linguistics
  12. Davies, Mark,Gardner, Dee(2013).A Frequency Dictionary of American English: Word Sketches, Collocates and Thematic Lists.Routledge.
  13. Greenbaum, Sidney,Svartvik, Jan(1990).The London-Lund Corpus of Spoken English.Lund University Press.
  14. Halliday, Michael. A. K.(1989).Spoken and Written Language.Oxford University Press.
  15. Halliday, Michael. A. K.(ed.),Gibbons, John(ed.),Nicholas, Howard(ed.)(1990).Learning, Keeping and Using Language: Selected papers from the Eighth World Congress of Applied Linguistics, Sydney
  16. Kilgarriff, Adam(1997).I don't believe in word senses.Computers and the Humanities,31(2),91-113.
  17. Kilgarriff, Adam,Rychlý, Pavel,Smrž, Pavel,Tugwell, David(2004).The Sketch Engine.Proc. EURALEX,Lorient, France:
  18. Krishnamurthy, Ramesh(2003).Language as Chunks, Not Words.JALT2003 Proceedings,Tokyo. Japan:
  19. McCarthy, Michael(2004).Touchstone: From corpus to course book.Cambridge University Press.
  20. McCarthy, Michael,Carter, Ronald(2001).Size isn't everything: Spoken English, Corpus, and the Classroom.TESOL Quarterly,35(2),337-340.
  21. McCarthy, Michael,O'Dell, Felicity(2003).English Vocabulary in Use. Advanced.Cambridge:Cambridge University Press.
  22. McCarthy, Michael,O'Dell, Felicity(2003).English Idioms in Use. Intermediate to Upper-intermediate. with Answers.Cambridge:Cambridge University Press.
  23. Nation, I. S. P.(2001).Learning Vocabulary in Another Language.Cambridge University Press.
  24. Nation, Paul,Waring, Robert(1997).Vocabulary Size, Text Coverage and Word Lists.Vocabulary: Description, Acquisition and Pedagogy,14,6-19.
  25. Nattinger, James R.,DeCarrico, Jeanette. S.(1992).Lexical Phrases and Language Teaching.Oxford University Press.
  26. Phillips, Martin(1989).Lexical Structure of Text.English Language Research.
  27. Saussure, Ferdinand de,Bally, Charles(ed.),Sechehaye, Albert(ed.),Riedlinger, Albert(ed.),Baskin, Wade(Trans.)(1959).Course in General Linguistics.New York:Philosophical Library.
  28. Shirato, J.(2005).A Corpus-based Analysis of Basic Spoken Vocabulary in EFL Textbook Conversations.Hokusei Gakuen University Graduate School Literature Review,2,15-31.
  29. Willis, Dave(1990).The Lexical Syllabus: A New Approach to Language Teaching.Collins ELT.
  30. Wray, Alison(2000).Formulaic Sequences in Second Language Teaching: Principle and Practice.Applied Linguistics,21(4),463-489.
  31. Wray, Alison(2002).The transition to language.
  32. 尹惠貞(2006)。北京語言大學=Beijing Language and Culture University。
  33. 王希杰(1991)。語言學百題。上海=Shanghai:上海教育出版社=Shanghai Educational Press。
  34. 王芳智編(1990)。漢語口語學。山西教育出版社=Shanxi Educational Pub. Co.。
  35. 朱亞軍、田宇(2000)。現代漢語詞綴的性質及其分類研究。學術交流,2000(2),134-137。
  36. 朱慶明(2005)。現代漢語實用語法分析。北京=Beijing:清華大學出版社=Qing Hua University Press。
  37. 吳麗君(2004)。口語詞彙與書面語詞匯教學研究。雲南師範大學學報(對外漢語教學與研究版),2004(3),14-19。
  38. 宋婧婧(2013)。漢語口語與書面語詞彙使用對比分析─基於傳媒語料庫。廈門理工學院學報,21(3),88-92。
  39. 李世文、陳秋梅(1993)。中文口語與書寫語的比較研究。教學與研究,15,63-96。
  40. 李如龍(2007)。關注漢語口語詞彙與書面語詞匯的研究。陝西師範大學學報,36(2),110-116。
  41. 李如龍、吳茗(2005)。略論對外漢語辭彙教學的兩個原則。語言教學與研究,2005(2),41-47。
  42. 車豔秋、楊虹、曹明(2013)。外語焦慮與大學英語口語教學設計。遼寧行政學院學報,2013(11),81-83。
  43. 周祖謨(1959)。漢語詞彙講話。人民教育出版社=People's Education Press。
  44. 胡元江(2011)。口語產出中的詞塊研究:回顧與展望。外語教學理論與實踐,2011(2),55-63。
  45. 徐立人(2011)。中央大學網路學習科技研究所=Graduate Institute of Network Learning Technology, National Central University。
  46. 徐海美(2009)。詞彙組塊在二語習得的實踐分析。吉林工程技術師範學院學報,25(1),44-46。
  47. 徐素萍(2012)。淺議建立與現代漢語課程配套的口語語料庫的意義。長沙大學學報,26(1),125-126。
  48. 高名凱、石安石(2002)。語言學概論。北京=Beijing:中華書局=ZhongHua Book Company。
  49. 常敬宇(1986)。漢語口語詞彙的特點。邏輯與語言學習,1986(4),36-37。
  50. 張文賢、路雲、李曉琪(2012)。基於口語語料庫的漢語口語自動化考試詞表的研製。中文教學現代化學報,1(2),37-44。
  51. 張金蘭(2012)。對外華語文教學中的書面語教學—以臺灣華語文教材為主所做的探討。政治大學華語文教學中心「錦華工作坊」
  52. 張莉萍、陳鳳儀(2005)。華語詞彙分級初探。第六屆漢語詞彙語義學研討會論文集
  53. 張莉萍、陳鳳儀(2007)。華語文能力測驗發展現況。外語能力測驗之動向與展望國際研討會論文集
  54. 曹合建編(2008)。基於語料庫的商務英語研究。北京=Beijing:對外經濟貿易大學出版社=University of International Business and Economics Press。
  55. 曹煒(2003)。現代漢語口語詞和書面語詞的差異初探。語言教學與研究,2003(6),39-44。
  56. 陳露、韋漢(2005)。英語口語語料庫在英語口語教學中的作用。外語電化教學,103,23-26。
  57. 楊俊萱(1984)。口語和書面語。語言教學與研究,1984(1),137-146。
  58. 葉蜚聲、徐通鏘(1993)。語言學綱要。臺北市=Taipei:書林=Bookman Bookstore。
  59. 翟瑩(2012)。淺談幼師英語詞塊教學。成功(教育),2012(5),104-104。
  60. 劉玫芳(2013)。國立高雄師範大學=National Kaohsiung Normal University。
  61. 錢旭菁(2008)。漢語詞塊研究初探。北京大學學報(哲學社會科學版),45(5),139-146。
  62. 謝智香(2011)。論現代漢語口語詞的特點。西南石油大學學報(社會科學版),2011(3),103-106。
  63. 顧安達、朱志平(2005)。口語和書面語教學目標的衝突與漢語教學的課程改革。海外華文教育,2005(2),42-47。
Cited by
  1. (2024)。影響中文文本可讀性之句子難度因素探究。華語文教學研究,21(3),1-30。