Divide-and-Conquer Matrix Factorization
Lester Mackey   Ameet Talwalkar   Michael I. Jordan

Divide-Factor-Combine (DFC) is a parallel divide-and-conquer framework for noisy matrix factorization problems, e.g., matrix completion and robust matrix factorization. DFC divides a large-scale matrix factorization task into smaller subproblems, solves each subproblem in parallel using an arbitrary base matrix factorization algorithm, and combines the subproblem solutions using techniques from randomized matrix approximation.


      


Our experiments with collaborative filtering, video background modeling, and simulated data demonstrate the near-linear to super-linear speed-ups attainable with this approach. Moreover, our analysis shows that DFC enjoys high probability recovery guarantees comparable to those of its base algorithm.

More details about this work can be found in our paper:

       Divide-and-Conquer Matrix Factorization (pdf, long version)
       L. Mackey, A. Talwalkar and M.I. Jordan
       Neural Information Processing Systems (NIPS) 2011

If you would like to experiment with DFC, you can download our compiled code (README, compiled.tar.gz) or our underlying matlab code (matlab.tar).