题名

Synta-semantic Paeser: A Stusy of Linguistic Interface

并列篇名

語法暨語意剖析器:跨語言學介面之研究

DOI

10.7084/LIS.201112.0173

作者

盧慧娟(Hui-Chuan Lu);吳奕賢(Yi-Hsien Wu);張展毓(Cheng-Yu Chang)

关键词

剖析器 ; 語意角色 ; 規則為本 ; parser ; thematic role ; rule-based

期刊名称

語文與國際研究

卷期/出版年月

8期(2011 / 12 / 01)

页次

171 - 191

内容语文

西班牙文

中文摘要

為有別於語意學剖析研究趨勢的現況,以助益於語意角色的自動分析為目的,本論文旨在著手開發一個以語意角色、語法功能和詞類之關係互動為基礎,跨語言學子層次之剖析器。此一剖析器之建構主要以規則為本的基礎,輔以其它現有工其所分析出的成果,透過所撰寫的電腦程式處理根據語言現象所歸納出之規則的整合與應用,以期獲致跨層面之語言學訊息。此一工具可區分不同類型子句及轉換構詞語法和語法語意訊息,便利使用者進行自動化的剖析工作。

英文摘要

In the last decade, analysis of parts of speech (POS) has reached a relatively stable stage while semantics still requires further studies and advanced techniques in the field of corpus linguistics. The field of semantics has focused more on the meaning of words. These phenomena are even more obvious in languages other than English. To differentiate from the present situation and facilitate the automatic parsing of thematic roles, this paper presents a developed parser based on the interaction among the thematic roles, grammatical functions, and parts of speech on three different linguistic levels, morphology, syntax, and semantics. The proposed syntax-semantic parser is created through computational programming, combining the design rules and the results obtained by other existing tools such as FreeLing and WordRrference, As a result this parser is capable of automatically analyzing different types of clauses and transforming the morphosyntatic information into the syntasemantic by importing data through users' interface. This study is limited to a few functions due to the difficulties of natural language processing and Spanish linguistics encountered during the collaboration of investigators from two research fields, information engineering and Spanish linguistics. Nevertheless, the results applied to the error detector and the suggestion system that were developed based on the learner' corpus the investigators established over several years, have academic value in the future.

主题分类 人文學 > 語言學
人文學 > 外國文學
参考文献
  1. 3LB: Building a syntactic-semantic-trees-based database. http://www.dlsi.ua.es/projectes/3lb/index-en.html
  2. WordRefrence.com Online Language Dictionaries. http://www.wordreference.com/
  3. The Perl Programming Language. http://www.perl.org/
  4. UAM Spanish Treebank, Laboratorio de Lingüística Informática Universidad Autónoma de Madrid. http://www.lllf.uam.es/-sandoval/UAMTreebank.html
  5. Acosta, O.(2009).Automatic extraction of lexical relations from analytical definitions using a constraint grammar.22nd Canadian Conference on Artificial Intelligence: Advances in Artificial Intelligence,Kelowna, Canada:
  6. Al Zamil, M. G. H.,Can, A. B.(2011).ROLEX-SP: Rules of lexical syntactic patterns for free text categorization.Knowledge-Based Systems,24(1),58-65.
  7. Azevedo, M. M.(1992).Introducción a la lingüística española.New Jersey, United States:Prentice Hall.
  8. Elenius, K.(1990).Comparing a connectionist and a rule based model for assigning parts-of-speech.Proceedings of International Conference on Acoustics, Speech, and Signal Processing
  9. Jackson, P.(1998).Introduction to expert systems.Boston, United States:Addison Wesley.
  10. Lu, H.-C.(2010).An annotated Taiwanese Learners' Corpus of Spanish, CATE.Corpus Linguistics and Linguistic Theory,6(2),297-300.
  11. Mahar, J. A.,Memon, G. Q.(2010).Rule based part of speech tagging of Sindhi language.Proceedings of International Conference on Signal Acquisition and Processing
  12. Mangassarian, H.,Artail, H.(2007).A general framework for subjective information extraction from unstructured English text.Data & Knowledge Engineering,62(2),352-367.
  13. Moreno, A.,Grishman, R.,López, S.,Sánchez, F.,Sekine, S.(2000).A treebank of Spanish and its application to parsing.Proceedings of the Second Conference on Language Resources and Evaluation (LREC2000)
  14. Moreno, A.,López, S.,Sánchez, F.,Grishman, R.(2003).Developing a syntactic annotation scheme and tools for a Spanish treebank.Treebanks: Building and using annotated corpora,Norwell, USA:
  15. Padró, L.,Collado, M.,Reese, S.,Lloberes, M.,Castellón, I.(2010).FreeLing 2.1: Five years of open-source language processing tools.Proceedings of the Seventh International Conference on Language Resources and Evaluation (LREC'10)
  16. Schmid, H.(1995).Improvements in part-of-speech tagging with an application to German.Proceedings of the ACL SIGDAT-Workshop
  17. Schmid, H.(1994).Probabilistic part-of-speech tagging using decision trees.Proceedings of the International Conference on New Methods in Language Processing
  18. Tanaka, T.,Bond, F.,Oepen, S.,Fujita, S.(2005).High precision treebanking. Blazing useful trees using POS information.Proceedings of the 43th Meeting of the Association for Computational Linguistics
  19. Yan, J.,Bracewell, D. B.,Ren, F.,Kuroiwa, S.(2005).Automatically determining semantic relations in Chinese sentences.Proceedings of 2005 IEEE International Conference on Natural Language Processing and Knowledge Engineering