摘要
支持向量机是Vapnik等人基于统计学习理论提出的一种新型机器学习方法。由于采用了结构风险最小化原则替代经验风险最小化原则,使它能较好地解决小样本学习问题。又由于采用了核函数思想,使它能把非线性问题转化为线性问题来解决并降低了算法的复杂度,而且它具有很好的通用性,鲁棒性,有效性,计算简单等特点,所以支持向量机己被广泛应用于人脸识别,手写识别,模式分类等领域。但是作为一种新兴技术,支持向量机在很多应用领域的研究还有待探索和完善。支持向量机的训练算法在对大规模的数据集进行训练时,训练时间往往过长、算法复杂而难以实现。
本文主要对支持向量机训练算法进行了探讨。首先对支持向量机的基本理论进行了介绍。然后对支持向量机的集中训练算法进行了一般描述。接下来重点介绍了一种比较成功的算法:SMO算法。这种算法都很大地提高了大规模训练集的训练速度。本文的主要工作有:
(1)系统地论述了支持向量机的理论基础,包括统计学习理论基础和支持向量机理论基础。
(2)对支持向量机的训练算法做系统的介绍。
(3)着重介绍了SVM训练算法中比较成功的SMO算法做详细的描述,尤其是算法的推导过程。
关键词:支持向量机;统计学习理论;最优超平面;核函数;二次规划;SMO
ABSTRACT
Support Vector Machine (SVM) is a new method of Machine Learning which is proposed by Vapnik and his group based on the statistical learning theory. It can solve small samples learning problems better by using structural risk minimization in place of experiential risk minimization. Because SVM adopts the thought of kernel function, it can change the nonlinear problems into linear problems to reduce the complexity of algorithm. It also has some advantages in the generality, robustness, effectiveness, simple calculation. So it has been widely used in face recognition, handwriting recognition, pattern classification and other fields.But as a new technique,SVM also have many shortcomings that need to be researched, such as: it costs too much time when training large-scale data set and its alogrithm is so difficult to implentmenion.
In this paper, we main discuss the training alogrithm of SVM. At first, this paper introduced the basic concept of SVM theory. Then we gived a general description of the training alogrithm of SVM. As following, we paid main attention to the successful training alogrithms一SMO. The SMO alogrithm improved the trianing speed on large-scale training set greatly.
The main work of this article:
(1)Systematically described the basic concept of SVM theory,including Statistical learning theory and Support vector machine theory.
(2)Systematically introduced the training alogrithm of SVM.
(3)Detailed introduction of SMO algorithm which is one of the successful SVM training algorithm, especially for the alogrithm derivation.
Keywords:Support vector machine; statistical learning theory; optimal hyperplane; nuclear function; quadratic programming; SMO
目 录