ARDC Research Link Australia

ORCID Profile
Orcid icon. 0000-0003-4174-7373

Current Organisation
CNRS

Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.

Publications

Publication

Efficient toolkit implementing best practices for principal component analysis of population genetic data

Publisher: Cold Spring Harbor Laboratory

Date: 14-11-2019

DOI: 10.1101/841452

Abstract: Principal Component Analysis (PCA) of genetic data is routinely used to infer ancestry and control for population structure in various genetic analyses. However, conducting PCA analyses can be complicated and has several potential pitfalls. These pitfalls include (1) capturing Linkage Disequilibrium (LD) structure instead of population structure, (2) projected PCs that suffer from shrinkage bias, (3) detecting s le outliers, and (4) uneven population sizes. In this work, we explore these potential issues when using PCA, and present efficient solutions to these. Following applications to the UK Biobank and the 1000 Genomes project datasets, we make recommendations for best practices and provide efficient and user-friendly implementations of the proposed solutions in R packages bigsnpr and bigutilsr. For ex le, we find that PC19 to PC40 in the UK Biobank capture complex LD structure rather than population structure. Using our automatic algorithm for removing long-range LD regions, we recover 16 PCs that capture population structure only. Therefore, we recommend using only 16-18 PCs from the UK Biobank to account for population structure confounding. We also show how to use PCA to restrict analyses to in iduals of homogeneous ancestry. Finally, when projecting in idual genotypes onto the PCA computed from the 1000 Genomes project data, we find a shrinkage bias that becomes large for PC5 and beyond. We then demonstrate how to obtain unbiased projections efficiently using bigsnpr. Overall, we believe this work would be of interest for anyone using PCA in their analyses of genetic data, as well as for other omics data.

Publication

Diagnostic tools for approximate Bayesian computation using the coverage property

Publisher: Wiley

Date: 11-05-2202

DOI: 10.1111/ANZS.12087

Publication

A comparative review of dimension reduction methods in approximate bayesian computation

Publisher: Institute of Mathematical Statistics

Date: 05-2013

DOI: 10.1214/12-STS406

Publication

Integrating deep learning CT-scan model, biological and clinical variables to predict severity of COVID-19 patients

Publisher: Springer Science and Business Media LLC

Date: 27-01-2021

DOI: 10.1038/S41467-020-20657-4

Abstract: The SARS-COV-2 pandemic has put pressure on intensive care units, so that identifying predictors of disease severity is a priority. We collect 58 clinical and biological variables, and chest CT scan data, from 1003 coronavirus-infected patients from two French hospitals. We train a deep learning model based on CT scans to predict severity. We then construct the multimodal AI-severity score that includes 5 clinical and biological variables (age, sex, oxygenation, urea, platelet) in addition to the deep learning model. We show that neural network analysis of CT-scans brings unique prognosis information, although it is correlated with other markers of severity (oxygenation, LDH, and CRP) explaining the measurable but limited 0.03 increase of AUC obtained when adding CT-scan information to clinical variables. Here, we show that when comparing AI-severity with 11 existing severity scores, we find significantly improved prognosis performance AI-severity can therefore rapidly become a reference scoring approach.

Publication

BIBLIOBICLUSTER: A Bicluster Algorithm for Bibliometrics

Publisher: Springer International Publishing

Date: 2021

DOI: 10.1007/978-3-030-79150-6_22

Related Organisations

Organisation

CNRS

Location: France

View Organisation

Related Funding Activities

No related grants have been discovered for Michael Blum.

Michael Blum

Researcher

Related Links

Publications

Efficient toolkit implementing best practices for principal component analysis of population genetic data

Diagnostic tools for approximate Bayesian computation using the coverage property

A comparative review of dimension reduction methods in approximate bayesian computation

Integrating deep learning CT-scan model, biological and clinical variables to predict severity of COVID-19 patients

BIBLIOBICLUSTER: A Bicluster Algorithm for Bibliometrics

Related Organisations

CNRS

Related Funding Activities

ARDC NEWSLETTER SIGNUP