Skip to content

Latest commit

 

History

History
230 lines (188 loc) · 7.73 KB

README.md

File metadata and controls

230 lines (188 loc) · 7.73 KB

randUTV

Authors

  • Per-Gunnar Martinsson, Dept. of Applied Mathematics, University of Colorado at Boulder, 526 UCB, Boulder, CO 80309-0526, USA.

  • Gregorio Quintana-Orti, Depto. de Ingenieria y Ciencia de Computadores, Universitat Jaume I, 12.071 Castellon, Spain.

  • Nathan Heavner, Dept. of Applied Mathematics, University of Colorado at Boulder, 526 UCB, Boulder, CO 80309-0526, USA.

Correspondence

Please send correspondence about the code to Gregorio Quintana-Ortí: [email protected]

Correspondence about the paper should be sent to Per-Gunnar J. Martinsson: [email protected]

License

New 3-clause BSD. See file License.txt for more details.

Disclaimer

This code is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY EXPRESSED OR IMPLIED.

Description

This code repository contains several implementations of an algorithm for computing a so called UTV factorization efficiently.

Given a matrix A, the algorithm randUTV computes a factorization A = U T Vt, where U and V have orthonormal columns, Vt is the transpose of V, and T is triangular (either upper or lower, whichever is preferred).

The algorithm randUTV is developed primarily to be a fast and easily parallelized alternative to algorithms for computing the Singular Value Decomposition (SVD). randUTV provides accuracy very close to that of the SVD for problems such as low-rank approximation, solving ill-conditioned linear systems, determining bases for various subspaces associated with the matrix, etc. Moreover, randUTV produces highly accurate approximations to the singular values of A. Unlike the SVD, the randomized algorithm proposed builds a UTV factorization in an incremental, single-stage, and non-iterative way, making it possible to halt the factorization process once a specified tolerance has been met.

The new code can be downloaded from https://github.com/flame/randutv/.

The algorithm was originally implemented using the FLAME/C API with a variation of the compact WY transform we call the UT transform.

However, an implementation that uses the original compact WY transform is also supplied so that many routines from LAPACK can be employed in a seamless fashion.

Both implementations will eventually be included in the libflame library: https://github.com/flame/libflame/

We will appreciate feedback from the community on the use of this code.

Performance benefit

alt tag

Several experiments demonstrate that in comparison to column pivoted QR, which is another factorization that is often used as a relatively economic alternative to the SVD, randUTV compares favorably in terms of speed while providing far higher accuracy.

Citing this work

We ask those who benefit from this work to cite the following reports and articles:

@article{Martinsson:2019:RBR:3314951.3242670,
 author = {Martinsson, P. G. and Quintana-Ort\'{\i}, G. and Heavner, N.},
 title = {randUTV: A Blocked Randomized Algorithm for Computing a Rank-Revealing UTV Factorization},
 journal = {ACM Trans. Math. Softw.},
 issue_date = {March 2019},
 volume = {45},
 number = {1},
 month = mar,
 year = {2019},
 issn = {0098-3500},
 pages = {4:1--4:26},
 articleno = {4},
 numpages = {26},
 url = {http://doi.acm.org/10.1145/3242670},
 doi = {10.1145/3242670},
 acmid = {3242670},
 publisher = {ACM},
 address = {New York, NY, USA},
 keywords = {Numerical linear algebra, high performance, randomized methods, rank-revealing matrix factorization, singular value decomposition},
}

@ARTICLE{2017arXiv170300998M,
  author        = {{Martinsson}, P.-G. and {Quintana-Orti}, G. 
                   and {Heavner}, N.},
  title         = "{randUTV: A blocked randomized algorithm for computing 
                    a rank-revealing UTV factorization}",
  journal       = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint        = {1703.00998},
  primaryClass  = "math.NA",
  keywords      = {Mathematics - Numerical Analysis},
  year          = 2017,
  month         = mar,
  adsurl        = {http://adsabs.harvard.edu/abs/2017arXiv170300998M},
  adsnote       = {Provided by the SAO/NASA Astrophysics Data System}
}

Details

We offer two variants of the code for computing the randUTV factorization, each one in a different directory:

  • LAPACK-compatible pure C code: This code uses compact WY transformations. The sources are stored in the lapack_compatible_sources folder.

  • LAPACK-like libflame code: This code uses compact UT transformations and it resembles the algorithm in the paper. The sources are stored in the libflame_sources folder.

The matlab folder contains the sources of the NoFLA_UTV_WY_blk_var2 routine (LAPACK-compatible code) for computing the randUTV factorization modified to be employed from Matlab. The original code was adapted to Matlab by Alex Barnett, Flatiron Institute.

Details of LAPACK-compatible pure C code:

The new code contains the following main routine:

int NoFLA_UTV_WY_blk_var2(
        int m_A, int n_A, double * buff_A, int ldim_A,
        int build_u, int m_U, int n_U, double * buff_U, int ldim_U,
        int build_v, int m_V, int n_V, double * buff_V, int ldim_V,
        int nb_alg, int pp, int n_iter ) {
//
// randUTV: It computes the UTV factorization of matrix A.
//
// Main features:
//   * BLAS-3 based.
//   * Compact WY transformations are used instead of UT transformations.
//   * No use of libflame.
//
// Matrices A, U, and V must be stored in column-order.
// If provided, matrices U,V must be square and with the right dimensions.
//
// Arguments:
// ----------
// m_A:      Number of rows of matrix A.
// n_A:      Number of columns of matrix A.
// buff_A:   Address of data in matrix A. Matrix to be factorized.
// ldim_A:   Leading dimension of matrix A.
// build_u:  If build_u==1, matrix U is built.
// m_U:      Number of rows of matrix U.
// n_U:      Number of columns of matrix U.
// buff_U:   Address of data in matrix U.
// ldim_U:   Leading dimension of matrix U.
// build_v:  If build_v==1, matrix V is built.
// m_V:      Number of rows of matrix V.
// n_V:      Number of columns of matrix V.
// buff_V:   Address of data in matrix V.
// ldim_V:   Leading dimension of matrix V.
// nb_alg:   Block size. Usual values for nb_alg are 32, 64, etc.
// pp:       Oversampling size. Usual values for pp are 5, 10, etc.
// n_iter:   Number of "power" iterations. Usual values are 2.
//

This routine is stored in the NoFLA_UTV_WY_blk_var2.c file. The simple_test.c file contains a main program to test it.

Details of LAPACK-like libflame code:

The new code contains the following main routine:

FLA_Error FLA_UTV_UT_blk_var1( FLA_Obj A, int build_u, FLA_Obj U, 
              int build_v, FLA_Obj V, int nb_alg, int pp, int n_iter ) {
//
// randUTV: It computes the UTV factorization of matrix A.
//
// Main features:
//   * BLAS-3 based.
//   * Use of Householder UT block transformations.
//   * Use of libflame.
//
// Matrices A, U, and V must be stored in column-order.
// If provided, matrices U,V must be square and with the right dimensions.
//
// Arguments:
// ----------
// A:       (input)  Matrix to be factorized.
//          (output) Matrix factorized.
// build_u: (input)  If build_u==1, matrix U is built.
// U:       (output) Matrix U of the factorization.
// build_v: (input)  If build_v==1, matrix V is built.
// V:       (output) Matrix V of the factorization.
// nb_alg:  (input)  Block size. Usual values for nb_alg are 32, 64, etc.
// pp:      (input)  Oversampling size. Usual values for pp are 5, 10, etc.
// n_iter:  (input)  Number of "power" iterations. Usual values are 2.
//

This routine is stored in the FLA_UTV_UT_blk_var1.c file. The simple_test.c file contains a main program to test it.