Text classification based on inter-class separability DAG-SVM
-
摘要: 本方法采用了以类间分布和类间中心距离作为依据,对有向无环图结构进行调整,以解决传统的DAG-SVM多分类结构固定、单个节点位置随意引起的误差累积严重的缺陷.实验表明,该改进后的DAG-SVM文本分类方法,对文本分类准确率有一定的提高.Abstract: This paper took an improved algorithm based on inter-class separability directed acyclic graph support vector machine (DAG-SVM) for text classification.The method has adjusted the DAG structure according to inter-class distribution and the distance between centers. It has solved the problems of fixed structure and random single node location in traditional DAG-SVM multi-classification method.The experiments show that the algorithm has improved the accuracy.
-
Key words:
- text classification /
- support vector machine /
- DAG-SVM /
- inter-class separability
-
[1] [1] VAPNIK V N. The Nature of Statistical Learning Theory [M]. New York: Springer-Verlag, 1995.[2] 朱树先,张仁杰.支持向量机核函数选择的研究[J].科学技术与工程,2008,8(16):4513-4517.[3] BOTTOU L, CORTES C, DENKER J, et al. Comparison of Classifier Methods: A Case Study in Handwritten Digit Recognition. Computer Vision & Image Processing [C]//Proceedings of the 12th IAPR International Conference. Jerusalem: [s.n.],1994: 77-87.[4] DEBNATH R, TAKAHIDE N, TAKAHASHI H. A decision based one-against-one method for multi-class support vector machine[J]. PATTERN ANALYSIS & APPLICATIONS, 2004,7: 164-175.[5] PLATT J C. Fast Training of Support Vector Machines Using Sequential Minimal Optimization [M]. [s.l.]: MIT Press, 1998.[6] 张学工.关于统计学习理论与支持向量机[J].自动化学报,2000,26(1):32-42.[7] TAKAHASHI F, ABE S. Decision-Tree-Based Multi-Class Support Vector Machines[C]//Proceeding of ICONIP’02. Singapore: IEEE Press, 2002.[8] Sogou Lab Data.文本分类语料库[DB/OL].[2012-12-24]. http://www.sogou.com/labs/dl/c.html.[9] CHANG C C, LIN C J. LIBSVM: a library for support vector machines[J]. ACM Transactions on Intelligent Systems and Technology, 2011, 2(3): 1-27.[10] ICTCLAS.汉语分词系统[DB/OL].[2012-12-23]. http://www.ictclas.org/.[11] YANG Y M, PEDERSEN J O. A Comparative Study on Feature Selection in Text Categorization[C]. International Conference on Machine Learning. 1997.[12] POWERS D M W. Evaluation: from precision, recall and F-factor to ROC, informedness, markedness & correlation[J]. Journal of Machine Learning Technologies, 2004, 2: 37-63.[13] KOHAVI R. A study of cross-validation and bootstrap for accuracy estimation and model selection[C]. Proceedings of the Fourteenth International Joint Conference on Artificial Intelligence. 1995, 2(12): 1137-1143.
点击查看大图
计量
- 文章访问数: 2389
- HTML全文浏览量: 52
- PDF下载量: 2129
- 被引次数: 0