A scalable modular convex solver for regularized risk minimization

12 August 2007

proceedings article
Published by Association for Computing Machinery (ACM)

p. 727-736
https://doi.org/10.1145/1281192.1281270

Abstract

A wide variety of machine learning problems can be described as minimizing a regularized risk functional, with different algorithms using different notions of risk and different regularizers. Examples include linear Support Vector Machines (SVMs), Logistic Regression, Conditional Random Fields (CRFs), and Lasso amongst others. This paper describes the theory and implementation of a highly scalable and modular convex solver which solves all these estimation problems. It can be parallelized on a cluster of workstations, allows for data-locality, and can deal with regularizers such as l1 and l2 penalties. At present, our solver implements 20 different estimation problems, can be easily extended, scales to millions of observations, and is up to 10 times faster than specialized solvers for many applications. The open source code is freely available as part of the ELEFANT toolbox.

Keywords

This publication has 15 references indexed in Scilit:

Training linear SVMs in linear time
Published by Association for Computing Machinery (ACM) ,2006
Large scale semi-supervised linear SVMs
Published by Association for Computing Machinery (ACM) ,2006
Decoding by Linear Programming
IEEE Transactions on Information Theory, 2005
Online Ranking by Projecting
Neural Computation, 2005
Shallow parsing with conditional random fields
Published by Association for Computational Linguistics (ACL) ,2003
Tools for privacy preserving distributed data mining
ACM SIGKDD Explorations Newsletter, 2002
Estimating the Support of a High-Dimensional Distribution
Neural Computation, 2001
A Limited Memory Algorithm for Bound Constrained Optimization
SIAM Journal on Scientific Computing, 1995
Robust linear programming discrimination of two linearly inseparable sets
Optimization Methods and Software, 1992
Linear and Nonlinear Separation of Patterns by Linear Programming
Operations Research, 1965