Abstract: This paper presents an automatic document summarization algorithm based on the latent semantic analysis (LSA) model and supplemented by a domain ontology. Building on the traditional statistical singular value decomposition (SVD) approach, the method uses the domain ontology to add document topic identification and concept similarity computation, which gives a better formal description of a document's main content. Guided by the identified topic and the concept similarities, it combines statistical methods with heuristic rules to extract key sentences from the document as the summary. Experiments show that this improves summary quality.
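The abstract does not give the algorithm's details, so the following is only a minimal sketch of generic LSA-style sentence extraction, not the authors' method. It assumes a plain term-by-sentence count matrix, approximates the leading singular values and right singular vectors by power iteration, and scores each sentence by its singular-value-weighted length in the reduced topic space. The `concept_weight` dictionary is a hypothetical stand-in for ontology-derived concept similarity; all function names and parameters here are illustrative assumptions.

```python
import math
import re


def term_sentence_matrix(sentences, concept_weight=None):
    """Build a term-by-sentence count matrix (rows: terms, columns: sentences).

    concept_weight is a hypothetical stand-in for ontology guidance: it maps a
    term to a multiplier, so terms tied to domain concepts count for more.
    """
    concept_weight = concept_weight or {}
    tokenized = [re.findall(r"\w+", s.lower()) for s in sentences]
    vocab = sorted({tok for toks in tokenized for tok in toks})
    return [
        [concept_weight.get(term, 1.0) * toks.count(term) for toks in tokenized]
        for term in vocab
    ]


def top_singular_pairs(matrix, k, iters=200):
    """Approximate the k largest singular values and right singular vectors.

    Runs power iteration with deflation on the Gram matrix A^T A. This is
    adequate for a handful of sentences, not for large inputs.
    """
    n = len(matrix[0])
    gram = [[sum(row[i] * row[j] for row in matrix) for j in range(n)]
            for i in range(n)]
    pairs = []
    for _topic in range(min(k, n)):
        # Fixed, non-uniform start vector, normalized to unit length.
        v = [1.0 / (i + 1) for i in range(n)]
        start_norm = math.sqrt(sum(x * x for x in v))
        v = [x / start_norm for x in v]
        for _step in range(iters):
            w = [sum(gram[i][j] * v[j] for j in range(n)) for i in range(n)]
            norm = math.sqrt(sum(x * x for x in w))
            if norm < 1e-12:
                break
            v = [x / norm for x in w]
        # Rayleigh quotient: eigenvalue of A^T A, i.e. a squared singular value.
        gv = [sum(gram[i][j] * v[j] for j in range(n)) for i in range(n)]
        lam = sum(v[i] * gv[i] for i in range(n))
        if lam <= 1e-12:
            break  # no further topics carry any weight
        pairs.append((math.sqrt(lam), v))
        # Deflate so the next pass finds the next-largest component.
        for i in range(n):
            for j in range(n):
                gram[i][j] -= lam * v[i] * v[j]
    return pairs


def sentence_scores(matrix, k):
    """Score each sentence by its weighted length across the top-k topics."""
    pairs = top_singular_pairs(matrix, k)
    n = len(matrix[0])
    return [math.sqrt(sum((sigma * v[i]) ** 2 for sigma, v in pairs))
            for i in range(n)]


def summarize(sentences, n=1, k=2, concept_weight=None):
    """Return the n highest-scoring sentences in their original order."""
    matrix = term_sentence_matrix(sentences, concept_weight)
    if not matrix:
        return sentences[:n]
    scores = sentence_scores(matrix, k)
    best = sorted(range(len(sentences)), key=lambda i: -scores[i])[:n]
    return [sentences[i] for i in sorted(best)]
```

Calling `summarize(sentences, n=2, concept_weight={"ontology": 3.0})` would favor sentences that mention the up-weighted term. A real system would derive such weights from the domain ontology and add the heuristic rules the abstract mentions, neither of which is specified here.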