基于类间可分性DAG-SVM的文本分类

黄振龙; 郑骏; 胡文心

基于类间可分性DAG-SVM的文本分类

1.
华东师范大学计算中心,上海 200062

详细信息

中图分类号: TP39
计量
- 文章访问数: 2389
- HTML全文浏览量: 52
- PDF下载量: 2129
- 被引次数: 0
出版历程
- 收稿日期: 2012-05-01
- 修回日期: 2012-08-01
- 刊出日期: 2013-05-25

Text classification based on inter-class separability DAG-SVM

1.
Computer Center, East China Normal University, Shanghai 200062, China

摘要

摘要: 本方法采用了以类间分布和类间中心距离作为依据，对有向无环图结构进行调整，以解决传统的DAG-SVM多分类结构固定、单个节点位置随意引起的误差累积严重的缺陷.实验表明，该改进后的DAG-SVM文本分类方法，对文本分类准确率有一定的提高.
- 文本分类 /
- 支持向量机 /
- DAG-SVM /
- 类间可分性
Abstract: This paper took an improved algorithm based on inter-class separability directed acyclic graph support vector machine (DAG-SVM) for text classification.The method has adjusted the DAG structure according to inter-class distribution and the distance between centers. It has solved the problems of fixed structure and random single node location in traditional DAG-SVM multi-classification method.The experiments show that the algorithm has improved the accuracy.
- text classification /
- support vector machine /
- DAG-SVM /
- inter-class separability

HTML全文

参考文献(1)

[1]

［1］ VAPNIK V N. The Nature of Statistical Learning Theory ［M］. New York: Springer-Verlag, 1995.

［2］朱树先,张仁杰.支持向量机核函数选择的研究［J］.科学技术与工程,2008,8(16):4513-4517.

［3］ BOTTOU L, CORTES C, DENKER J, et al. Comparison of Classifier Methods: A Case Study in Handwritten Digit Recognition. Computer Vision & Image Processing ［C］//Proceedings of the 12th IAPR International Conference. Jerusalem: ［s.n.］,1994: 77-87.

［4］ DEBNATH R, TAKAHIDE N, TAKAHASHI H. A decision based one-against-one method for multi-class support vector machine［J］. PATTERN ANALYSIS & APPLICATIONS, 2004,7: 164-175.

［5］ PLATT J C. Fast Training of Support Vector Machines Using Sequential Minimal Optimization ［M］. ［s.l.］: MIT Press, 1998.

［6］张学工.关于统计学习理论与支持向量机［J］.自动化学报,2000,26(1):32-42.

［7］ TAKAHASHI F, ABE S. Decision-Tree-Based Multi-Class Support Vector Machines［C］//Proceeding of ICONIP’02. Singapore: IEEE Press, 2002.

［8］ Sogou Lab Data.文本分类语料库［DB/OL］.［2012-12-24］. http://www.sogou.com/labs/dl/c.html.

［9］ CHANG C C, LIN C J. LIBSVM: a library for support vector machines［J］. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): 1-27.

［10］ ICTCLAS.汉语分词系统［DB/OL］.［2012-12-23］. http://www.ictclas.org/.

［11］ YANG Y M, PEDERSEN J O. A Comparative Study on Feature Selection in Text Categorization［C］. International Conference on Machine Learning. 1997.

［12］ POWERS D M W. Evaluation: from precision, recall and F-factor to ROC, informedness, markedness & correlation［J］. Journal of Machine Learning Technologies, 2004, 2: 37-63.

［13］ KOHAVI R. A study of cross-validation and bootstrap for accuracy estimation and model selection［C］. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. 1995, 2(12): 1137-1143.

施引文献

资源附件(0)

访问统计