A scalable modular convex solver for regularized risk minimization
- 12 August 2007
- proceedings article
- Published by Association for Computing Machinery (ACM)
- pp. 727-736
- https://doi.org/10.1145/1281192.1281270
Abstract
A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different regularizers. Examples include linear Support Vector Machines (SVMs), Logistic Regression, Conditional Random Fields (CRFs), and Lasso amongst others. This paper describes the theory and implementation of a highly scalable and modular convex solver which solves all these estimation problems. It can be parallelized on a cluster of workstations, allows for data-locality, and can deal with regularizers such as l1 and l2 penalties. At present, our solver implements 20 different estimation problems, can be easily extended, scales to millions of observations, and is up to 10 times faster than specialized solvers for many applications. The open source code is freely available as part of the ELEFANT toolbox.
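The modularity described in the abstract can be illustrated with a minimal sketch, assuming an objective of the form J(w) = (lam/2) ||w||^2 + (1/m) sum_i loss(<w, x_i>, y_i), where the loss is a pluggable module and the optimizer never needs to know which one is in use. The abstract does not state the optimization method, so plain subgradient descent with a 1/(lam t) step size is swapped in here purely for illustration. All names (`hinge_loss`, `logistic_loss`, `regularized_risk`, `minimize`, `TOY_DATA`) are hypothetical and are not the ELEFANT API.

```python
import math
from typing import Callable, List, Sequence, Tuple

# (features x, label y in {-1, +1})
Example = Tuple[Sequence[float], float]
# A loss module maps (score f = <w, x>, label y) to (loss value, d loss / d f).
Loss = Callable[[float, float], Tuple[float, float]]


def hinge_loss(f: float, y: float) -> Tuple[float, float]:
    """Hinge loss max(0, 1 - y f), the risk used by linear SVMs."""
    margin = 1.0 - y * f
    return (margin, -y) if margin > 0.0 else (0.0, 0.0)


def logistic_loss(f: float, y: float) -> Tuple[float, float]:
    """Logistic loss log(1 + exp(-y f)), the risk used by logistic regression."""
    z = -y * f
    # Numerically stable evaluation of log(1 + exp(z)).
    value = max(z, 0.0) + math.log1p(math.exp(-abs(z)))
    # Numerically stable sigmoid(z).
    if z >= 0.0:
        sigma = 1.0 / (1.0 + math.exp(-z))
    else:
        e = math.exp(z)
        sigma = e / (1.0 + e)
    return value, -y * sigma


def regularized_risk(w: Sequence[float], data: Sequence[Example],
                     loss: Loss, lam: float) -> Tuple[float, List[float]]:
    """Return J(w) and one (sub)gradient of J at w, assuming an l2 regularizer."""
    m = len(data)
    value = 0.5 * lam * sum(wi * wi for wi in w)
    grad = [lam * wi for wi in w]
    for x, y in data:
        f = sum(wi * xi for wi, xi in zip(w, x))
        loss_value, dloss = loss(f, y)
        value += loss_value / m
        for j, xj in enumerate(x):
            grad[j] += dloss * xj / m
    return value, grad


def minimize(data: Sequence[Example], loss: Loss,
             lam: float = 0.1, steps: int = 200) -> Tuple[List[float], float]:
    """Subgradient descent (illustrative stand-in solver); returns the best iterate seen."""
    w = [0.0] * len(data[0][0])
    best_w, best_value = list(w), float("inf")
    for t in range(1, steps + 1):
        value, grad = regularized_risk(w, data, loss, lam)
        if value < best_value:
            best_w, best_value = list(w), value
        step = 1.0 / (lam * t)
        w = [wi - step * gi for wi, gi in zip(w, grad)]
    return best_w, best_value


# Hypothetical, linearly separable toy data for demonstration only.
TOY_DATA: List[Example] = [
    ([1.0, 2.0], 1.0), ([2.0, 1.5], 1.0),
    ([-1.0, -2.0], -1.0), ([-2.0, -1.0], -1.0),
]

if __name__ == "__main__":
    for name, loss in (("hinge", hinge_loss), ("logistic", logistic_loss)):
        w, value = minimize(TOY_DATA, loss)
        print(name, w, value)
```

Switching between an SVM-style and a logistic-regression-style estimator only changes the `loss` argument; `regularized_risk` and `minimize` stay untouched. An l1 penalty, which the abstract also mentions, would need a different regularizer term and is not covered by this sketch.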