


Supporting Browsing of Blogs via Text Clustering and Concept Analysis Techniques: A Task-Oriented Evaluation Approach


吳怡瑾(I-Chin Wu);方友杉(Yu-Shang Fang);喻欣凱(Hsin-Kai Yu)


Blog ; Concept hierarchy ; Formal concept analysis ; Hierarchical agglomerative clustering ; Task-oriented evaluation approach




Vol. 4, No. 1 (2009/12/01)


133 - 164




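From a knowledge management perspective, blogs offer a new mode of knowledge exchange: the blogosphere gives authors and readers an actively interactive relationship that promotes the flow of knowledge. Because most blog content consists of informal notes on daily life, a blog, unlike a typical website, cannot have its topics and overall structure planned at design time. This study combines hierarchical clustering with concept association analysis to help blog authors and readers reorganize articles correctly and browse them effectively. Using the articles in users' blogs as examples, the study designs a hierarchical document clustering method that takes the user's viewpoint into account, so that clustering is flexible and blog articles are reorganized from the user's perspective. The study also applies formal concept analysis to construct and display the hierarchical relationships among the keywords of each cluster, mainly to assist in naming the clusters and to help readers search and browse more effectively. Finally, the study evaluates the effectiveness of the method through questionnaires and simulated user search tasks (a task-oriented approach). The empirical results show that combining clustering with the presentation of concept associations helps both blog authors and readers understand blog content, improves browsing efficiency, and saves search time.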
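The clustering step described above can be sketched as follows. This is a minimal illustration, not the authors' method: the abstract does not state the term weighting, the linkage criterion, or how the user's viewpoint is encoded, so TF-IDF weights, cosine similarity, average linkage, and a user-chosen similarity `threshold` are all assumptions made here. The sample posts are hypothetical.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return one {term: weight} dict per tokenized document (assumed TF-IDF weighting)."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        tf = Counter(doc)
        vectors.append({t: tf[t] * math.log(n / df[t]) for t in tf})
    return vectors

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def hac(docs, threshold):
    """Average-link agglomerative clustering.

    Merging stops once the most similar pair of clusters falls below
    `threshold`; a lower threshold yields fewer, broader clusters.
    """
    vecs = tfidf_vectors(docs)
    clusters = [[i] for i in range(len(docs))]

    def sim(a, b):
        total = sum(cosine(vecs[i], vecs[j]) for i in a for j in b)
        return total / (len(a) * len(b))

    while len(clusters) > 1:
        best, pair = -1.0, None
        for x in range(len(clusters)):
            for y in range(x + 1, len(clusters)):
                s = sim(clusters[x], clusters[y])
                if s > best:
                    best, pair = s, (x, y)
        if best < threshold:
            break
        x, y = pair
        clusters[x] = clusters[x] + clusters[y]
        del clusters[y]
    return clusters

# Hypothetical tokenized blog posts.
posts = [
    ["travel", "japan", "food"],
    ["travel", "japan", "hotel"],
    ["python", "code", "bug"],
    ["python", "code", "test"],
]
groups = hac(posts, threshold=0.2)
```

Exposing the threshold is one simple way to let a user trade off cluster granularity; the study's own mechanism for flexible clustering may differ.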


Blogs are an online tool that encourages information exchange and knowledge sharing. However, blog users often face two challenges. First, authors often categorize blog content vaguely or inadequately. Second, many popular blogs sort content by date. Consequently, a user who lacks the right keywords for retrieval must browse the content in chronological order until the relevant item is found. This study addressed these problems by using hierarchical agglomerative clustering and formal concept analysis to re-classify blog content. To evaluate the effectiveness of the proposed automated text clustering solution, we conducted a user task-oriented evaluation. The results showed that the technique can help authors (bloggers) define new categories and refine existing ones. In addition, the concept hierarchy applied in each category helped blog users quickly discover the information they needed.
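To illustrate how formal concept analysis yields a concept hierarchy, the sketch below enumerates the formal concepts of a small context. It assumes that posts act as objects and their keywords as attributes; the abstract does not specify how the study builds its context, and the function name `formal_concepts` and the sample data are hypothetical.

```python
def formal_concepts(context):
    """Enumerate formal concepts of a context given as {object: set(attributes)}.

    Returns (extent, intent) pairs, ordered from the most general concept
    (largest extent) to the most specific.
    """
    all_attrs = set().union(*context.values())
    # Every concept intent is an intersection of object intents; the full
    # attribute set corresponds to the empty intersection.
    intents = {frozenset(all_attrs)}
    for attrs in context.values():
        intents |= {frozenset(attrs) & intent for intent in intents}
    concepts = []
    for intent in intents:
        # The extent is every object that carries all attributes in the intent.
        extent = frozenset(o for o in context if intent <= context[o])
        concepts.append((extent, intent))
    return sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[1])))

# Hypothetical cluster of posts with their extracted keywords.
context = {
    "post1": {"travel", "japan"},
    "post2": {"travel", "japan", "food"},
    "post3": {"travel", "hotel"},
}
concepts = formal_concepts(context)
for extent, intent in concepts:
    print(sorted(extent), sorted(intent))
```

In this example the concept with intent {travel} covers all three posts and sits above the concept with intent {travel, japan}, which covers two of them. A general keyword such as "travel" would therefore be a natural candidate for a cluster name, with narrower keywords beneath it guiding browsing.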

Subject classification: Humanities > Library and Information Science
Social Sciences > Communication