arXiv:1307.0803 [cs.LG]

Data Fusion by Matrix Factorization

Published 2013-07-02, Version 1

For most problems in science and engineering we can obtain data that describe the system from various perspectives and record the behaviour of its individual components. Heterogeneous data sources can be collectively mined by data fusion. Fusion can focus on a specific target relation and exploit directly associated data together with data on the context or additional constraints. In this paper we describe a data fusion approach with penalized matrix tri-factorization that simultaneously factorizes data matrices to reveal hidden associations. The approach can directly consider any data sets that can be expressed as a matrix, including those from attribute-based representations, ontologies, associations and networks. We demonstrate its utility on a gene function prediction problem in a case study with eleven different data sources. Our fusion algorithm compares favourably to state-of-the-art multiple kernel learning and achieves higher accuracy than can be obtained from any single data source alone.
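
To make the term "matrix tri-factorization" concrete, the following minimal sketch approximates a single relation matrix R as a product of three factors, G1 S G2^T. It is not the paper's algorithm: the penalties, constraint matrices, and simultaneous factorization of several data matrices mentioned in the abstract are all omitted, and plain alternating least squares is swapped in as the solver. The function name `tri_factorize` and its parameters are hypothetical, chosen only for illustration.

```python
import numpy as np


def tri_factorize(R, k1, k2, n_sweeps=20, seed=0):
    """Approximate R (n x m) as G1 @ S @ G2.T by alternating least squares.

    Illustrative assumption: unconstrained, unpenalized, single matrix.
    """
    rng = np.random.default_rng(seed)
    n, m = R.shape
    G1 = rng.standard_normal((n, k1))  # latent factors for row objects
    G2 = rng.standard_normal((m, k2))  # latent factors for column objects
    errors = []
    for _ in range(n_sweeps):
        # Each update is the exact least-squares solution for one block
        # with the other two held fixed, so the error cannot increase.
        S = np.linalg.pinv(G1) @ R @ np.linalg.pinv(G2.T)
        G1 = R @ np.linalg.pinv(S @ G2.T)
        G2 = (np.linalg.pinv(G1 @ S) @ R).T
        # Relative Frobenius reconstruction error after this sweep.
        errors.append(np.linalg.norm(R - G1 @ S @ G2.T) / np.linalg.norm(R))
    return G1, S, G2, errors


# Toy data (synthetic, not from the paper): an exactly rank-3 matrix.
rng = np.random.default_rng(1)
R = rng.standard_normal((30, 3)) @ rng.standard_normal((3, 20))
G1, S, G2, errors = tri_factorize(R, k1=3, k2=3)
print("relative reconstruction error:", errors[-1])
```

In a fusion setting, several relation matrices would share the same object-type factors (for example, one G per object type), which is what lets information flow between data sources. The sketch above shows only the single-matrix building block.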

Comments: Preprint, 13 pages, 3 Figures, 3 Tables

Categories: cs.LG, cs.AI, cs.DB, stat.ML

Subjects: 15A83, 15A23, 40C05, H.2.8, G.1.3, I.2.6, 65F30, H.3.3

Keywords: matrix factorization, gene function prediction problem, achieves higher accuracy, state-of-the-art multiple kernel, fusion algorithm compares

Related articles:

arXiv:2310.12688 [cs.LG] (Published 2023-10-19)

Compression of Recurrent Neural Networks using Matrix Factorization

Lucas Maison, Hélion du Mas des Bourboux, Thomas Courtat

arXiv:1501.07320 [cs.LG] (Published 2015-01-29)

Tensor Factorization via Matrix Factorization

Volodymyr Kuleshov, Arun Tejasvi Chaganty, Percy Liang

arXiv:1707.05926 [cs.LG] (Published 2017-07-19)