Title

Automatic Categorization of Patent Documents for R&D Knowledge Self-organization

DOI

10.6504/JOM.2006.23.04.01

Authors

Amy J. C. Trappey; Charles V. Trappey; Evan C. H. Hsieh

Keywords

Knowledge management ; Document categorization ; Classification ; Neural networks ; Patent analysis

Journal

管理學報

Volume and Issue / Publication Date

Vol. 23, No. 4 (2006/08/01)

Pages

413 - 424

Language

English

English Abstract

The World Intellectual Property Organization (WIPO) reports that ninety to ninety-five percent of all R&D refers to existing patent documents. A company can reduce costs and shorten development time by effectively using existing knowledge, as disclosed in the global patent corpus and in the intellectual property news media. Patent information therefore plays an important role in the era of knowledge-based economies. However, owing to the dramatic increase in the number of patent documents, people have difficulty reading, organizing, and fully utilizing them. Patent documents also use distinctive technical and legal vocabularies that hinder an adequate understanding of patent claims. The consistent and effective organization of important ontological content from documents, such as patents and intellectual property information, is therefore a significant issue in R&D technical knowledge management. This paper introduces an intelligent ontology-based knowledge categorization approach that replaces labor-intensive methods when the number of documents requiring analysis exceeds manual processing capacity. The approach requires an artificial neural network (ANN) and pre-constructed ontology schemas for the given domains. The system extracts the features of a document using morphological analysis and sentence analysis. These features are then matched with the classes and relationships of the domain ontology and passed as input to the ANN model. The ANN model is trained and tested on the given documents and their assigned categories, which are based on an ontological analysis of the content. Two cases, covering chemical mechanical polishing (CMP) patent documents and IP news clippings, demonstrate the categorization approach for R&D knowledge self-organization.
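The pipeline described in the abstract (extract features, match them against a domain ontology, feed the result to an ANN) can be illustrated with a minimal sketch. Everything concrete here is an assumption for illustration and is not taken from the paper: the toy `ONTOLOGY`, the two category labels, the training sentences, the crude plural stripping that stands in for morphological analysis, and the network size and learning rate.

```python
import math
import random
import re

# Hypothetical toy ontology (concept -> terms); the paper's schemas are not given here.
ONTOLOGY = {
    "polishing_pad": ["pad"],
    "slurry": ["slurry", "abrasive"],
    "licensing": ["license", "royalty"],
    "litigation": ["lawsuit", "infringement"],
}
CONCEPTS = sorted(ONTOLOGY)
CATEGORIES = ["cmp_patent", "ip_news"]  # assumed labels
TERM_TO_CONCEPT = {t: c for c, terms in ONTOLOGY.items() for t in terms}


def extract_features(text):
    """Count ontology term matches per concept (order follows CONCEPTS)."""
    counts = dict.fromkeys(CONCEPTS, 0.0)
    for token in re.findall(r"[a-z]+", text.lower()):
        # Crude plural stripping as a stand-in for morphological analysis.
        stem = token[:-1] if token.endswith("s") and len(token) > 3 else token
        concept = TERM_TO_CONCEPT.get(stem)
        if concept is not None:
            counts[concept] += 1.0
    return [counts[c] for c in CONCEPTS]


class TinyMLP:
    """One tanh hidden layer with a softmax output, trained by plain SGD."""

    def __init__(self, n_in, n_hidden, n_out, seed=0):
        rng = random.Random(seed)
        self.w1 = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_hidden)]
        self.b1 = [0.0] * n_hidden
        self.w2 = [[rng.uniform(-0.5, 0.5) for _ in range(n_hidden)] for _ in range(n_out)]
        self.b2 = [0.0] * n_out

    def forward(self, x):
        h = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
             for row, b in zip(self.w1, self.b1)]
        z = [sum(w * hi for w, hi in zip(row, h)) + b
             for row, b in zip(self.w2, self.b2)]
        m = max(z)
        e = [math.exp(v - m) for v in z]
        total = sum(e)
        return h, [v / total for v in e]

    def train_step(self, x, target, lr=0.1):
        h, p = self.forward(x)
        # Gradient of cross-entropy with respect to the output logits.
        dz = [pk - (1.0 if k == target else 0.0) for k, pk in enumerate(p)]
        # Backpropagate to the hidden layer before w2 is updated.
        dh = [sum(dz[k] * self.w2[k][j] for k in range(len(dz))) * (1 - h[j] ** 2)
              for j in range(len(h))]
        for k, row in enumerate(self.w2):
            for j in range(len(row)):
                row[j] -= lr * dz[k] * h[j]
            self.b2[k] -= lr * dz[k]
        for j, row in enumerate(self.w1):
            for i in range(len(row)):
                row[i] -= lr * dh[j] * x[i]
            self.b1[j] -= lr * dh[j]
        return -math.log(p[target])


def train(examples, epochs=200, seed=0):
    model = TinyMLP(len(CONCEPTS), 3, len(CATEGORIES), seed=seed)
    data = [(extract_features(text), label) for text, label in examples]
    losses = [sum(model.train_step(x, y) for x, y in data) for _ in range(epochs)]
    return model, losses


# Invented training sentences; labels index into CATEGORIES.
TRAIN = [
    ("A polishing pad and abrasive slurry for planarizing wafers.", 0),
    ("The slurry composition reduces pad wear.", 0),
    ("The firm filed an infringement lawsuit over the patent.", 1),
    ("A royalty license settled the dispute.", 1),
]
model, losses = train(TRAIN)


def classify(text):
    _, probs = model.forward(extract_features(text))
    return CATEGORIES[probs.index(max(probs))]
```

Calling `classify("The lawsuit alleges infringement.")` returns one of the two assumed labels. Keeping the ontology match as a separate step means the network input has one dimension per ontology concept, independent of vocabulary size.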

Subject Classification: Social Sciences > Management
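Cited By
  1. 張富雄 (2012). A product innovation design process integrating patent multidimensional scaling analysis and TRIZ methodology. Master's thesis, Department of Industrial Engineering and Management, National Taipei University of Technology, pp. 1-104.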