摘要
本文讨论了汉语树库构建的若干基础问题,包括一个适合于自动分析和人工标注的汉语句法标记集、汉语树库加工处理规范和人机互助的树库加工模型,介绍了一个已经实现的汉语自动句法标注系统,和在此基础上进行的一些树库构建实验,最后提出了构建大规模汉语树库的设想。
:In this paper,some basic issues on building a Chinese treebank, including a Chinese syntactic tagset available for automatic analyzing and manual annotation, a working standard for Chinese treebank construction,and a manmachine mutually dependent corpus processing model,are discussed.Then, an automatic syntactic tagging system for the Chinese language is proposed and some experimental results are given.Moreover,some ideas for building a large scale Chinese treebank are also discussed.
出处
《中文信息学报》
CSCD
北大核心
1997年第4期42-51,共10页
Journal of Chinese Information Processing
基金
国家自然科学基金
关键词
树库
语料加工模型
语料库语言学
数据库系统
:Treebank,Syntactic tagset,Working Standard for Treebank Construction,Corpus Processing Model, Corpus Linguistics.