L0Learn: Fast Best Subset Selection

Hussein Hazimeh and Rahul Mazumder

Introduction

L0Learn is a highly efficient framework for solving L0-regularized regression (and soon classification) problems. It can (approximately) solve the following three problems, where the squared error loss is penalized by combinations of the L0, L1, and L2 norms:

The optimization is done using coordinate descent and local combinatorial search over a grid of regularization parameter(s) values. Many computational tricks and heuristics are used to speed up the algorithms and improve the solution quality. These heuristics include warm starts, active set convergence, correlation screening, greedy cycling order, and efficient methods for updating the residuals through exploiting sparsity and problem dimensions. Moreover, we employed a new computationally efficient method for dynamically selecting the regularization parameter λ in the path. We describe the details of the algorithms in our paper: Fast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithms (arXiv link).

The toolkit is implemented in C++11 and can often run faster than popular sparse learning toolkits (see our experiments in the paper above). We also provide an easy-to-use R interface; see the section below for installation and usage of the R package.

R Package Installation and Usage

The latest version of L0Learn (v1.0.9) can be installed from Github using

library(devtools)
install_github("hazimehh/L0Learn")

The previous version (v1.0.8) can be installed from CRAN using

install.packages("L0Learn",repos ="https://cran.r-project.org")

For a tutorial, please refer to L0Learn's Vignette. For a detailed description of the API, check the Reference Manual.

For users who have been using L0Learn before July 1, 2018, please check this Wiki page for more information on the changes introduced in the new CRAN version.

Citing L0Learn

If you find L0Learn useful in your research, please consider citing the following paper:

@ARTICLE{2018arXiv180301454H,
   author = {{Hazimeh}, H. and {Mazumder}, R.},
   title = "{Fast Best Subset Selection: Coordinate Descent and Local Combinatorial Optimization Algorithms}",
   journal = {ArXiv e-prints},
   archivePrefix = "arXiv",
   eprint = {1803.01454},
   primaryClass = "stat.CO",
   keywords = {Statistics - Computation, Mathematics - Optimization and Control, Statistics - Machine Learning},
   year = 2018,
   month = mar,
   adsurl = {http://adsabs.harvard.edu/abs/2018arXiv180301454H},
}

Name		Name	Last commit message	Last commit date
Latest commit History 309 Commits
R		R
man		man
misc		misc
src		src
vignettes		vignettes
.Rbuildignore		.Rbuildignore
.astylerc		.astylerc
.gitignore		.gitignore
.travis.yml		.travis.yml
DESCRIPTION		DESCRIPTION
LICENSE		LICENSE
NAMESPACE		NAMESPACE
README.md		README.md
cleanup		cleanup
configure		configure
configure.ac		configure.ac

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

L0Learn: Fast Best Subset Selection

Hussein Hazimeh and Rahul Mazumder

Introduction

R Package Installation and Usage

Citing L0Learn

About

Releases

Packages

Languages

License

elliotthwang/L0Learn

Folders and files

Latest commit

History

Repository files navigation

L0Learn: Fast Best Subset Selection

Hussein Hazimeh and Rahul Mazumder

Introduction

R Package Installation and Usage

Citing L0Learn

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages