Skip to content

BouchardLab/pyuoi

Repository files navigation

PyUoI logo

Actions Status codecov Documentation Status PyPI Anaconda-Server Badge DOI

PyUoI contains implementations of Union of Intersections framework for a variety of penalized generalized linear models as well as dimensionality reductions techniques such as column subset selection and non-negative matrix factorization. In general, UoI is a statistical machine learning framework that leverages two concepts in model inference:

  1. Separating the selection and estimation problems to simultaneously achieve sparse models with low-bias and low-variance parameter estimates.
  2. Stability to perturbations in both selection and estimation.

PyUoI is designed to function similarly to scikit-learn, as it often builds upon scikit-learn's implementations of the aforementioned algorithms.

Further details on the UoI framework can be found in the NeurIPS paper (Bouchard et al., 2017).

Installation

PyUoI is available for Python 3 on PyPI:

pip install pyuoi

and through conda-forge:

conda install pyuoi -c conda-forge

Requirements

Runtime

PyUoI requires

  • numpy>=1.14
  • h5py>=2.8
  • scikit-learn>=0.24

and optionally

  • pycasso
  • mpi4py

to run.

Develop

To develop PyUoI you will additionally need

  • cython

to build from source and

  • pytest
  • flake8

to run the tests and check formatting.

PyUoI has been built and tested on Python 3.9.18 with

  • numpy==1.26.1
  • h5py==3.10.0
  • scikit-learn==1.3.1
  • cython==3.0.4
  • pytest==7.4.2
  • flake8==6.1.0

Features

PyUoI is split up into two modules, with the following UoI algorithms:

  • linear_model (generalized linear models)
    • Lasso penalized linear regression UoILasso.
    • Elastic-net penalized linear regression (UoIElasticNet).
    • Logistic regression (Bernoulli and multinomial) (UoILogistic).
    • Poisson regression (UoIPoisson).
  • decomposition (dimensionality reduction)
    • Column subset selection (UoICSS).
    • Non-negative matrix factorization (UoINMF).

Similar to scikit-learn, each UoI algorithm has its own Python class.

Documentation

Please see our ReadTheDocs page for an introduction to Union of Intersections, usage of PyUoI, and the API.

Copyright

PyUoI Copyright (c) 2019, The Regents of the University of California, through Lawrence Berkeley National Laboratory (subject to receipt of any required approvals from the U.S. Dept. of Energy). All rights reserved.

If you have questions about your rights to use or distribute this software, please contact Berkeley Lab's Innovation & Partnerships Office at [email protected] referring to " PyUoI" (LBNL Ref 2019-157)."

NOTICE. This software was developed under funding from the U.S. Department of Energy. As such, the U.S. Government has been granted for itself and others acting on its behalf a paid-up, nonexclusive, irrevocable, worldwide license in the Software to reproduce, prepare derivative works, and perform publicly and display publicly. The U.S. Government is granted for itself and others acting on its behalf a paid-up, nonexclusive, irrevocable, worldwide license in the Software to reproduce, prepare derivative works, distribute copies to the public, perform publicly and display publicly, and to permit others to do so.