ARDC Research Link Australia

ORCID Profile
Orcid icon. 0000-0001-9326-9954

Current Organisation
University of California, Irvine

Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.

Publications

Publication

MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments

Publisher: Public Library of Science (PLoS)

Date: 24-06-2022

DOI: 10.1371/JOURNAL.PCBI.1010241

Abstract: Hi-C is a genome-wide chromosome conformation capture technology that detects interactions between pairs of genomic regions and exploits higher order chromatin structures. Conceptually Hi-C data counts interaction frequencies between every position in the genome and every other position. Biologically functional interactions are expected to occur more frequently than transient background and artefactual interactions. To identify biologically relevant interactions, several background models that take biases such as distance, GC content and mappability into account have been proposed. Here we introduce MaxHiC, a background correction tool that deals with these complex biases and robustly identifies statistically significant interactions in both Hi-C and capture Hi-C experiments. MaxHiC uses a negative binomial distribution model and a maximum likelihood technique to correct biases in both Hi-C and capture Hi-C libraries. We systematically benchmark MaxHiC against major Hi-C background correction tools including Hi-C significant interaction callers (SIC) and Hi-C loop callers using published Hi-C, capture Hi-C, and Micro-C datasets. Our results demonstrate that 1) Interacting regions identified by MaxHiC have significantly greater levels of overlap with known regulatory features (e.g. active chromatin histone marks, CTCF binding sites, DNase sensitivity) and also disease-associated genome-wide association SNPs than those identified by currently existing models, 2) the pairs of interacting regions are more likely to be linked by eQTL pairs and 3) more likely to link known regulatory features including known functional enhancer-promoter pairs validated by CRISPRi than any of the existing methods. We also demonstrate that interactions between different genomic region types have distinct distance distributions only revealed by MaxHiC. MaxHiC is publicly available as a python package for the analysis of Hi-C, capture Hi-C and Micro-C data.

Publication

DEMNUni: massive neutrinos and the bispectrum of large scale structures

Publisher: IOP Publishing

Date: 05-03-2018

DOI: 10.1088/1475-7516/2018/03/003

Publication

Somatic point mutations are enriched in non-coding RNAs with possible regulatory function in breast cancer

Publisher: Springer Science and Business Media LLC

Date: 07-06-2022

DOI: 10.1038/S42003-022-03528-0

Abstract: Non-coding RNAs (ncRNAs) form a large portion of the mammalian genome. However, their biological functions are poorly characterized in cancers. In this study, using a newly developed tool, SomaGene, we analyze de novo somatic point mutations from the International Cancer Genome Consortium (ICGC) whole-genome sequencing data of 1,855 breast cancer s les. We identify 1030 candidates of ncRNAs that are significantly and explicitly mutated in breast cancer s les. By integrating data from the ENCODE regulatory features and FANTOM5 expression atlas, we show that the candidate ncRNAs significantly enrich active chromatin histone marks (1.9 times), CTCF binding sites (2.45 times), DNase accessibility (1.76 times), HMM predicted enhancers (2.26 times) and eQTL polymorphisms (1.77 times). Importantly, we show that the 1030 ncRNAs contain a much higher level (3.64 times) of breast cancer-associated genome-wide association (GWAS) single nucleotide polymorphisms (SNPs) than genome-wide expectation. Such enrichment has not been seen with GWAS SNPs from other cancers. Using breast cell line related Hi-C data, we then show that 82% of our candidate ncRNAs (1.9 times) significantly interact with the promoter of protein-coding genes, including previously known cancer-associated genes, suggesting the critical role of candidate ncRNA genes in the activation of essential regulators of development and differentiation in breast cancer. We provide an extensive web-based resource ( www.ihealthe.unsw.edu.au/research ) to communicate our results with the research community. Our list of breast cancer-specific ncRNA genes has the potential to provide a better understanding of the underlying genetic causes of breast cancer. Lastly, the tool developed in this study can be used to analyze somatic mutations in all cancers.

Publication

Somatic point mutations are enriched in long non-coding RNAs with possible regulatory function in breast cancer

Publisher: Cold Spring Harbor Laboratory

Date: 20-07-2021

DOI: 10.1101/2021.07.19.453012

Abstract: De novo somatic point mutations identified in breast cancer are predominantly non-coding and typically attributed to altered regulatory elements such as enhancers and promoters. However, while the non-coding RNAs (ncRNAs) form a large portion of the mammalian genome, their biological functions are mostly poorly characterized in cancers. In this study, using a newly developed tool, SomaGene, we reanalyze de novo somatic point mutations from the International Cancer Genome Consortium (ICGC) whole-genome sequencing data of 1,855 breast cancers. We identify 929 candidates of ncRNAs that are significantly and explicitly mutated in breast cancer s les. By integrating data from the ENCODE regulatory features and FANTOM5 expression atlas, we show that the candidate ncRNAs in breast cancer s les significantly enrich for active chromatin histone marks (1.9 times), CTCF binding sites (2.45 times), DNase accessibility (1.76 times), HMM predicted enhancers (2.26 times) and eQTL polymorphisms (1.77 times). Importantly, we show that the 929 ncRNAs contain a much higher level (3.64 times) of breast cancer-associated genome-wide association (GWAS) single nucleotide polymorphisms (SNPs) than genome-wide expectation. Such enrichment has not been seen with GWAS SNPs from other diseases. Using breast tissue related Hi-C data we then show that 82% of our candidate ncRNAs (1.9 times) significantly interact with the promoter of protein-coding genes, including previously known cancer-associated genes, suggesting the critical role for candidate ncRNA genes in activation of essential regulators of development and differentiation in breast cancer. We provide an extensive web-based resource ( ncrna.ictic.sharif.edu ), to communicate our results with the research community. Our list of breast cancer-specific ncRNA genes has the potential to provide a better understanding of the underlying genetic causes of breast cancer. Lastly, the tool developed in this study can be used in the analysis of somatic mutations in all cancers.

Related Organisations

Organisation

University Of California, Irvine

Location: United States of America

View Organisation

Related Funding Activities

No related grants have been discovered for Narges Rezaie.

Narges Rezaie

Researcher

Related Links

Publications

MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments

DEMNUni: massive neutrinos and the bispectrum of large scale structures

Somatic point mutations are enriched in non-coding RNAs with possible regulatory function in breast cancer

Somatic point mutations are enriched in long non-coding RNAs with possible regulatory function in breast cancer

Related Organisations

University Of California, Irvine

Related Funding Activities

ARDC NEWSLETTER SIGNUP