Online Learning with Kernels
- 19 July 2004
- journal article
- Published by Institute of Electrical and Electronics Engineers (IEEE) in IEEE Transactions on Signal Processing
- Vol. 52 (8) , 2165-2176
- https://doi.org/10.1109/tsp.2004.830991
Abstract
Kernel-based algorithms such as support vector machines have achieved considerable success in various problems in the batch setting, where all of the training data is available in advance. Support vector machines combine the so-called kernel trick with the large margin idea. There has been little use of these methods in an online setting suitable for real-time applications. In this paper, we consider online learning in a reproducing kernel Hilbert space. By considering classical stochastic gradient descent within a feature space and the use of some straightforward tricks, we develop simple and computationally efficient algorithms for a wide range of problems such as classification, regression, and novelty detection. In addition to allowing the exploitation of the kernel trick in an online setting, we examine the value of large margins for classification in the online setting with a drifting target. We derive worst-case loss bounds, and moreover, we show the convergence of the hypothesis to the minimizer of the regularized risk functional. We present some experimental results that support the theory and illustrate the power of the new algorithms for online novelty detection.
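The idea of stochastic gradient descent in a feature space can be illustrated with a minimal sketch. It is not taken from the paper. It assumes a hinge loss with margin 1, a Gaussian kernel, a squared-norm regularizer with weight `lam`, a fixed learning rate `eta`, and a fixed budget on stored terms as one possible "straightforward trick" for bounding cost. All names (`OnlineKernelClassifier`, `gaussian_kernel`, `budget`) are hypothetical.

```python
import math
from collections import deque


def gaussian_kernel(x, z, gamma=1.0):
    """k(x, z) = exp(-gamma * ||x - z||^2) for points given as tuples."""
    return math.exp(-gamma * sum((a - b) ** 2 for a, b in zip(x, z)))


class OnlineKernelClassifier:
    """Hypothesis f(x) = sum_i a_i * k(x_i, x), updated one example at a time.

    Assumed objective (not quoted from the paper): hinge loss plus
    (lam / 2) * ||f||^2, minimized by one gradient step per example.
    """

    def __init__(self, eta=0.5, lam=0.01, budget=100, kernel=gaussian_kernel):
        self.eta = eta          # learning rate
        self.lam = lam          # regularization weight
        self.kernel = kernel
        # Each term is [coefficient, point]; the oldest term is dropped
        # automatically once the budget is exceeded.
        self.terms = deque(maxlen=budget)

    def decision(self, x):
        """Evaluate the current hypothesis at x."""
        return sum(a * self.kernel(z, x) for a, z in self.terms)

    def update(self, x, y):
        """One gradient step on example (x, y) with y in {-1, +1}."""
        margin = y * self.decision(x)
        # The regularizer's gradient shrinks every existing coefficient.
        decay = 1.0 - self.eta * self.lam
        for term in self.terms:
            term[0] *= decay
        # The hinge loss has a nonzero gradient only when the margin is violated.
        if margin < 1.0:
            self.terms.append([self.eta * y, x])
        return margin


if __name__ == "__main__":
    clf = OnlineKernelClassifier()
    # Toy stream: the label is the sign of a one-dimensional input.
    stream = [((-2.0,), -1), ((2.0,), 1), ((-1.0,), -1), ((1.0,), 1)] * 10
    for point, label in stream:
        clf.update(point, label)
    print(clf.decision((2.0,)), clf.decision((-2.0,)))
```

Because the regularizer shrinks old coefficients geometrically, the oldest terms contribute least, which is why dropping them under a budget is a plausible way to keep per-step cost bounded in this sketch.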