A variable selection method for simultaneous component based data integration
Gu,Z. ; Van Deun,K.
Gu,Z.
Van Deun,K.
Abstract
The integration of multiblock high throughput data from multiple sources is one of the major challenges in several disciplines including metabolomics, computational biology, genomics, and clinical psychology. A main challenge in this line of research is to obtain interpretable results 1) that give an insight into the common and distinctive sources of variations associated to the multiple and heterogeneous data blocks and 2) that facilitate the identification of relevant variables. We present a novel variable selection method for performing data integration, providing easily interpretable results, and recovering underlying data structure such as common and distinctive components. The flexibility and applicability of this method are showcased via numerical simulations and an application to metabolomics data.
Description
Date
2016-11-15
Journal Title
Journal ISSN
Volume Title
Publisher
Research Projects
Organizational Units
Journal Issue
Keywords
Citation
Gu, Z & Van Deun, K 2016, 'A variable selection method for simultaneous component based data integration', Chemometrics & Intelligent Laboratory Systems, vol. 158, pp. 187-199. https://doi.org/10.1016/j.chemolab.2016.07.013
