


The Revision of the Chinese Linguistic Inquiry and Word Count Dictionary 2015




林瑋芳(Wei-Fang Lin);黃金蘭(Chin-Lan Huang);林以正(Yi-Cheng Lin);李嘉玲(Chia-Ling Lee);James W. Pennebaker


LIWC (Linguistic Inquiry and Word Count) ; text analysis ; big data analytics




Issue 45 (2020/10/01)


73 - 118




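Research that uses automated analysis of everyday language to reveal psychological characteristics has attracted considerable attention in recent years, and Linguistic Inquiry and Word Count (LIWC) is one analysis tool widely favored by scholars. LIWC has gone through several revisions; most recently, the LIWC2007 dictionary was substantially expanded and pruned, and the latest version was officially released in 2015. The purpose of this research is to build a corresponding Chinese dictionary for the LIWC2015 dictionary, test its reliability and validity, and introduce the related applied literature. Study 1 took the Chinese LIWC2007 dictionary as its basis and, following the changes in the LIWC2015 dictionary, added and deleted the corresponding categories and supplemented the word entries. Study 2 collected blog texts on a variety of topics, split each text into two subtexts by odd- and even-numbered sentences, ran a LIWC analysis on each subtext, and examined the correlation of usage rates in each category as a reliability analysis. Study 3 compared writing differences between 50 posts each from the Hate and Sad boards of the Ptt bulletin board system to test the validity of CLIWC2015. This article also reviews the literature on LIWC and big data analysis and explains the strengths and limitations of LIWC. We hope that the completed revision of CLIWC2015 will provide a useful research tool for exploring the psychological characteristics of Chinese-language users.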


Automated analysis of natural language in daily use has been shown in the literature to be effective in capturing psychological characteristics. Linguistic Inquiry and Word Count (LIWC), developed by Pennebaker and his research team, is one of the most commonly used text analysis tools in the social sciences. The essential assumption of LIWC is that the frequencies of word usage in certain categories serve as language markers that index individuals' inner thoughts and psychological processes. LIWC contains two parts: the computer software and the dictionary. The software calculates the frequency of words in each category. The dictionary is the key that classifies words into categories. LIWC2015 is the latest dictionary and is based on a significant revision of its predecessor, the LIWC2007 dictionary. The aim of the current study is to develop a corresponding Chinese version of the LIWC2015 dictionary (CLIWC2015) and demonstrate its reliability and validity. Starting from the Chinese LIWC2007 dictionary, we built CLIWC2015 by adding and deleting categories to match the LIWC2015 dictionary. The details of this process are described in Study 1. CLIWC2015 contains a total of 10,795 words in 79 categories, including 25 linguistic process categories and 54 psychological process categories. Study 2 collected 100 texts from blog posts on various topics. The average total word count per post was 1,290 in Study 2. To calculate reliability, the sentences in each text were first numbered in order, and then the odd- and even-numbered sentences were grouped into two subtexts. LIWC indices were calculated for each subtext, and the correlation coefficients between the corresponding subtexts for each language category were used for the reliability analyses. The results showed strong correlations for all word categories except one punctuation category, which counts the frequency of dashes.
One possible explanation is that dashes are not a commonly used punctuation mark in blog posts, which could have lowered the reliability. To examine validity, Study 3 collected 100 posts from the Ptt bulletin board system, 50 from the "hate" board and 50 from the "sad" board. The average total word count per post was 164 in Study 3. The linguistic features of the two sets were compared. Consistent with our hypotheses, "hate" board posts used significantly more anger, swear, and netspeak words, as well as more exclamation marks. In contrast, "sad" board posts used significantly more first-person singular pronouns, more sad, anxiety, and cognitive words, and more words of higher cognitive complexity. Across Studies 2 and 3, our findings supported the reliability and validity of CLIWC2015. Unlike traditional content analysis, which requires a great deal of time and effort, one of the most important strengths of LIWC is its ability to analyze huge text files rapidly. In recent years, a growing body of research has applied LIWC to big data. In the last part of this article, we also discuss the implications of using CLIWC2015 and its applications in Chinese culture and big data analytics.
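The odd/even split-half procedure described for Study 2 can be illustrated with a minimal Python sketch. It is not the authors' implementation: the toy dictionary `TOY_DICT`, the function names, and the choice of a plain Pearson correlation are all assumptions made for illustration, and the texts are assumed to be already segmented into sentences and word tokens.

```python
from math import sqrt

# Hypothetical toy dictionary (category -> words); NOT the real CLIWC2015 entries.
TOY_DICT = {
    "anger": {"hate", "angry"},
    "sad": {"sad", "cry"},
}


def category_rates(tokens, dictionary):
    """Percentage of tokens that fall in each dictionary category."""
    total = len(tokens)
    if total == 0:
        return {cat: 0.0 for cat in dictionary}
    return {
        cat: 100.0 * sum(tok in words for tok in tokens) / total
        for cat, words in dictionary.items()
    }


def split_odd_even(sentences):
    """Group the 1st, 3rd, ... and the 2nd, 4th, ... sentences into two subtexts."""
    return sentences[0::2], sentences[1::2]


def pearson_r(xs, ys):
    """Pearson correlation; returns NaN when either variable has no variance."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    if sxx == 0 or syy == 0:
        return float("nan")
    return sxy / sqrt(sxx * syy)


def split_half_reliability(texts, dictionary):
    """texts: list of texts; each text is a list of sentences (lists of tokens).

    Returns, per category, the correlation across texts between the
    odd-sentence rate and the even-sentence rate.
    """
    odd_rates, even_rates = [], []
    for sentences in texts:
        odd, even = split_odd_even(sentences)
        odd_rates.append(category_rates([t for s in odd for t in s], dictionary))
        even_rates.append(category_rates([t for s in even for t in s], dictionary))
    return {
        cat: pearson_r([r[cat] for r in odd_rates], [r[cat] for r in even_rates])
        for cat in dictionary
    }


if __name__ == "__main__":
    # Three made-up, pre-tokenized texts, used only to exercise the functions.
    toy_texts = [
        [["i", "hate", "this"], ["so", "angry", "now"], ["i", "cry"], ["fine"]],
        [["nice", "day"], ["good", "food"], ["sad", "song"], ["i", "cry", "again"]],
        [["hate", "hate", "it"], ["angry", "angry"], ["ok"], ["ok", "then"]],
    ]
    print(split_half_reliability(toy_texts, TOY_DICT))
```

A category that rarely occurs, such as the dash category mentioned in the abstract, yields rates near zero in most subtexts, so its correlation across texts becomes unstable; the NaN branch in `pearson_r` covers the extreme case where a rate never varies at all.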
