Our lab specialises computational methods and software developments, as well as the application of our methods and tools to biological data sets generated by our collaborators.

Development of data integration methods using multivariate projection-based methodologies

Our dimension reduction methods are based on the Projection to Latent Structures algorithm (PLS, a term we prefer to Partial Least Squares regression, Wold et al. 2001) that are combined with LASSO regularization to identify important biological features or biomarkers in large-scale biological data sets.  Our latest frameworks include DIABLO (Singh et al. in prep) to integration multiple data sets measured on the same N samples (N-integration); MINT (Rohart et al. 2017a) to integrate independent studies measured on the same P variables / genes (P-integration) and mixMC (Lê Cao et al. 2016) for the multivariate analysis of microbial communities.

Specifically, we are currently developing novel multivariate methodologies to

  • integrate multiple ‘omics time course data
  • integrate genotype (SNP) data

Development of the mixOmics R toolkit package (

mixOmics is one of the few R package dedicated to the integration of multiple ‘omics data (19 novel methodologies implemented so far, amongst which 13 were developed by our lab) and with an increasing uptake from the research community. The package has been downloaded > 29K times in 2017, (R CRAN package download logs). Programming developments are on-going for interactive web interfaces,  and efficient programming for large-scale studies. Check our our recent publication (Rohart et al. 2017b) and a poster that gives an overview of this large project. The mixOmics team run multiple day workshops for an introduction to multivariate projection-based methods for data integration using mixOmics, see our website for news and tutorials.

Development and application of multivariate methods for microbiome studies

There are major statistical and computational challenges in analysing microbial communities that currently hinder the potential of microbiome research to substantially advance biomedical understanding. We are currently expanding mixMC to better characterise and understand important microbiome-host interactions. Some of our methods developments aim at addressing batch effects in microbiome experiments and analyse scarce temporal sampling in time course studies.

We analyse microbiome datasets from our collaborators for a wide range of studies, including investigating the role of gut and oral microbiome in spondyloarthropathy diseases, the development of intestinal or salivary microbiota in toddlers and infants, investigating the gut-brain crosstalk in Huntington’s disease.