ORCID Profile
0000-0001-8780-894X
Current Organisations
Garvan Institute of Medical Research
,
University of Queensland
Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.
In Research Link Australia (RLA), "Research Topics" refer to ANZSRC FOR and SEO codes. These topics are either sourced from ANZSRC FOR and SEO codes listed in researchers' related grants or generated by a large language model (LLM) based on their publications.
Neurosciences not elsewhere classified | Neurogenetics | Cell Neurochemistry | Neurosciences
Publisher: Springer Science and Business Media LLC
Date: 13-11-2012
DOI: 10.1038/NBT.2024
Publisher: Elsevier BV
Date: 03-2020
Publisher: Elsevier BV
Date: 02-2022
DOI: 10.1016/J.GIM.2021.09.020
Abstract: Branchpoint elements are required for intron removal, and variants at these elements can result in aberrant splicing. We aimed to assess the value of branchpoint annotations generated from recent large-scale studies to select branchpoint-abrogating variants, using hereditary cancer genes as model. We identified branchpoint elements in 119 genes associated with hereditary cancer from 3 genome-wide experimentally-inferred and 2 predicted branchpoint data sets. We then identified variants that occur within branchpoint elements from public databases. We compared conservation, unique variant observations, and population frequencies at different nucleotides within branchpoint motifs. Finally, selected minigene assays were performed to assess the splicing effect of variants at branchpoint elements within mismatch repair genes. There was poor overlap between predicted and experimentally-inferred branchpoints. Our analysis of cancer genes suggested that variants at -2 nucleotide, -1 nucleotide, and branchpoint positions in experimentally-inferred canonical motifs are more likely to be clinically relevant. Minigene assay data showed the -2 nucleotide to be more important to branchpoint motif integrity but also showed fluidity in branchpoint usage. Data from cancer gene analysis suggest that there are few high-risk alleles that severely impact function via branchpoint abrogation. Results of this study inform a general scheme to prioritize branchpoint motif variants for further study.
Publisher: Oxford University Press (OUP)
Date: 21-09-2009
DOI: 10.1093/BFGP/ELP038
Abstract: Genome-wide analyses of the eukaryotic transcriptome have revealed that the majority of the genome is transcribed, producing large numbers of non-protein-coding RNAs (ncRNAs). This surprising observation challenges many assumptions about the genetic programming of higher organisms and how information is stored and organized within the genome. Moreover, the rapid advances in genomics have given little opportunity for biologists to integrate these emerging findings into their intellectual and experimental frameworks. This problem has been compounded by the perception that genome-wide studies often generate more questions than answers, which in turn has led to confusion and controversy. In this article, we address common questions associated with the phenomenon of pervasive transcription and consider the indices that can be used to evaluate the function (or lack thereof) of the resulting ncRNAs. We suggest that many lines of evidence, including expression profiles, conservation signatures, chromatin modification patterns and examination of increasing numbers of in idual cases, argue in favour of the widespread functionality of non-coding transcription. We also discuss how informatic and experimental approaches used to analyse protein-coding genes may not be applicable to ncRNAs and how the general perception that protein-coding genes form the main informational output of the genome has resulted in much of the misunderstanding surrounding pervasive transcription and its potential significance. Finally, we present the conceptual implications of the majority of the eukaryotic genome being functional and describe how appreciating this perspective will provide considerable opportunity to further understand the molecular basis of development and complex diseases.
Publisher: Wiley
Date: 15-05-2019
DOI: 10.1002/PROS.23823
Abstract: The androgen‐regulated gene TMPRSS2 to the ETS transcription factor gene ERG fusion is the most common genomic alteration acquired during prostate tumorigenesis and biased toward men of European ancestry. In contrast, African American men present with more advanced disease, yet their tumors are less likely to acquire TMPRSS2‐ERG . Data for Africa is scarce. RNA was made available for genomic analyses from 181 prostate tissue biopsy cores from Black South African men, 94 with and 87 without pathological evidence for prostate cancer. Reverse transcription polymerase chain reaction was used to screen for the TMPRSS2‐ERG fusion, while transcript junction coordinates and isoform frequencies, including novel gene fusions, were determined using targeted RNA sequencing. Here we report a frequency of 13% for TMPRSS2‐ERG in tumors from Black South Africans. Present in 12/94 positive versus 1/87 cancer negative prostate tissue cores, this suggests a 92.62% predictivity for a positive cancer diagnosis ( P = 0.0031). At a frequency of almost half that reported for African Americans and roughly a quarter of that reported for men of European ancestry, acquisition of TMPRSS2‐ERG appears to be inversely associated with aggressive prostate cancer. Further support was provided by linking the presence of TMPRSS2‐ERG to low‐grade disease in younger patients ( P = 0.0466), with higher expressing distal ERG fusion junction coordinates. Only the second study of its kind for the African continent, we support a link between TMPRSS2‐ERG status and prostate cancer racial health disparity beyond the borders of the United States. We call for urgent evaluation of androgen deprivation therapy within Africa.
Publisher: The American Association of Immunologists
Date: 15-06-2009
Abstract: Previous research into the molecular mechanisms that underlie Ag-specific CD8+ T cell differentiation and function has largely focused on the role of proteins. However, it is now apparent that the mammalian genome expresses large numbers of long (& nt) nonprotein-coding RNAs (ncRNAs), and there is increasing evidence that these RNAs have important regulatory functions, particularly in the regulation of epigenetic processes underpinning cell differentiation. In this study, we show that CD8+ T cells express hundreds of long ncRNAs, many of which are lymphoid-specific and/or change dynamically with lymphocyte differentiation or activation. Numerous ncRNAs surround or overlap immunologically important protein-coding genes and can be predicted to function via a range of regulatory mechanisms. The overlap of many long ncRNAs expressed in CD8+ T cells with microRNAs and small interfering RNAs further suggests that long ncRNAs may be processed into and exert their effects via smaller functional species. Finally, we show that the majority of long ncRNAs expressed in CD8+ T cells harbor signatures of evolutionary conservation, secondary structures, and/or regulated promoters, further supporting their functionality. Taken together, our findings represent the first systematic discovery of long ncRNAs expressed in CD8+ T cells and suggest that many of these transcripts are likely to play a role in adaptive immunity.
Publisher: Wiley
Date: 30-10-2010
DOI: 10.1002/PATH.2638
Abstract: For 50 years the term 'gene' has been synonymous with regions of the genome encoding mRNAs that are translated into protein. However, recent genome-wide studies have shown that the human genome is pervasively transcribed and produces many thousands of regulatory non-protein-coding RNAs (ncRNAs), including microRNAs, small interfering RNAs, PIWI-interacting RNAs and various classes of long ncRNAs. It is now clear that these RNAs fulfil critical roles as transcriptional and post-transcriptional regulators and as guides of chromatin-modifying complexes. Here we review the biology of ncRNAs, focusing on the fundamental mechanisms by which ncRNAs facilitate normal development and physiology and, when dysfunctional, underpin disease. We also discuss evidence that intergenic regions associated with complex diseases express ncRNAs, as well as the potential use of ncRNAs as diagnostic markers and therapeutic targets. Taken together, these observations emphasize the need to move beyond the confines of protein-coding genes and highlight the fact that continued investigation of ncRNA biogenesis and function will be necessary for a comprehensive understanding of human disease.
Publisher: Cold Spring Harbor Laboratory
Date: 25-10-2011
Abstract: Human mitochondrial long noncoding RNAs (lncRNAs) have not been described to date. By analysis of deep-sequencing data we have identified three lncRNAs generated from the mitochondrial genome and confirmed their expression by Northern blotting and strand-specific qRT–PCR. We show that the abundance of these lncRNAs is comparable to their complementary mRNAs and that nuclear-encoded mitochondrial proteins involved in RNA processing regulate their expression. We also identify the 5′ and 3′ transcript ends of the three lncRNAs and show that mitochondrial RNase P protein 1 (MRPP1) is important for the processing of these transcripts. Finally, we show that mitochondrial lncRNAs form intermolecular duplexes and that their abundance is cell- and tissue-specific, suggesting a functional role in the regulation of mitochondrial gene expression.
Publisher: ZappyLab, Inc.
Date: 08-07-2020
DOI: 10.17504/PROTOCOLS.IO.BIC3KAYN
Abstract: Metagenome sequins are a set of synthetic DNA controls that reflect the sequence complexity, GC content, phylogenetic ersity and abundance of a natural microbial community. The sequins are ‘spiked-in’ to your DNA s le, which together undergo to library preparation, sequencing and analysis. The sequins can then be distinguished from you s le DNA in the output library by their synthetic sequence, and analyzed as internal controls. Sequins are compatible with all standards library preparation and sequencing methods. This protocol describes the laboratory steps required to re-suspend and spike the sequins into your DNA s le, as well as the bioinformatic steps required to analyze sequins in your output library. For further details on the design, validation and use of sequins, we refer users to ‘Synthetic microbe communities provide internal reference standards for metagenome sequencing and analysis’ by Hardwick et. al., (2018) Nature Communications.
Publisher: Springer Science and Business Media LLC
Date: 2013
Publisher: Cold Spring Harbor Laboratory
Date: 05-2016
Abstract: Targeted RNA sequencing (CaptureSeq) uses oligonucleotide probes to capture RNAs for sequencing, providing enriched read coverage, accurate measurement of gene expression, and quantitative expression data. We applied CaptureSeq to refine transcript annotations in the current murine GRCm38 assembly. More than 23,000 regions corresponding to putative or annotated long noncoding RNAs (lncRNAs) and 154,281 known splicing junction sites were selected for targeted sequencing across five mouse tissues and three brain subregions. The results illustrate that the mouse transcriptome is considerably more complex than previously thought. We assemble more complete transcript isoforms than GENCODE, expand transcript boundaries, and connect interspersed islands of mapped reads. We describe a novel filtering pipeline that identifies previously unannotated but high-quality transcript isoforms. In this set, 911 GENCODE neighboring genes are condensed into 400 expanded gene models. Additionally, 594 GENCODE lncRNAs acquire an open reading frame (ORF) when their structure is extended with CaptureSeq. Finally, we validate our observations using current FANTOM and Mouse ENCODE resources.
Publisher: Springer Science and Business Media LLC
Date: 05-09-2011
Publisher: Springer Science and Business Media LLC
Date: 11-2021
Publisher: Springer Science and Business Media LLC
Date: 04-05-2021
Publisher: Elsevier BV
Date: 04-2022
DOI: 10.1016/J.TIPS.2022.01.008
Abstract: The human genome expresses vast numbers of long noncoding RNAs (lncRNA) that fulfil erse roles in gene regulation, cell biology, development, and human disease. These roles are often mediated by sequence motifs and secondary structures bound by proteins and can regulate epigenetic, transcriptional, and translational pathways. These functional domains can be further optimised and engineered into RNA devices that are widely used in synthetic biology. We propose that natural lncRNA structures can be explored and exploited for the rational design and assembly of synthetic RNA therapies. This potential has been enabled by advances in the stability, immunogenicity, manufacture, and delivery of other RNA-based therapies, from which we can anticipate the pharmacological properties of lncRNA therapies that have not yet otherwise entered clinical trials.
Publisher: Proceedings of the National Academy of Sciences
Date: 15-01-2008
Abstract: A major proportion of the mammalian transcriptome comprises long RNAs that have little or no protein-coding capacity (ncRNAs). Only a handful of such transcripts have been examined in detail, and it is unknown whether this class of transcript is generally functional or merely artifact. Using in situ hybridization data from the Allen Brain Atlas, we identified 849 ncRNAs (of 1,328 examined) that are expressed in the adult mouse brain and found that the majority were associated with specific neuroanatomical regions, cell types, or subcellular compartments. Examination of their genomic context revealed that the ncRNAs were expressed from erse places including intergenic, intronic, and imprinted loci and that many overlap with, or are transcribed antisense to, protein-coding genes of neurological importance. Comparisons between the expression profiles of ncRNAs and their associated protein-coding genes revealed complex relationships that, in combination with the specific expression profiles exhibited at both regional and subcellular levels, are inconsistent with the notion that they are transcriptional noise or artifacts of chromatin remodeling. Our results show that the majority of ncRNAs are expressed in the brain and provide strong evidence that the majority of processed transcripts with no protein-coding capacity function intrinsically as RNAs.
Publisher: Elsevier BV
Date: 08-2017
Publisher: Cold Spring Harbor Laboratory
Date: 07-2013
Abstract: An expansive functionality and complexity has been ascribed to the majority of the human genome that was unanticipated at the outset of the draft sequence and assembly a decade ago. We are now faced with the challenge of integrating and interpreting this complexity in order to achieve a coherent view of genome biology. We argue that the linear representation of the genome exacerbates this complexity and an understanding of its three-dimensional structure is central to interpreting the regulatory and transcriptional architecture of the genome. Chromatin conformation capture techniques and high-resolution microscopy have afforded an emergent global view of genome structure within the nucleus. Chromosomes fold into complex, territorialized three-dimensional domains in concert with specialized subnuclear bodies that harbor concentrations of transcription and splicing machinery. The signature of these folds is retained within the layered regulatory landscapes annotated by chromatin immunoprecipitation, and we propose that genome contacts are reflected in the organization and expression of interweaved networks of overlapping coding and noncoding transcripts. This pervasive impact of genome structure favors a preeminent role for the nucleoskeleton and RNA in regulating gene expression by organizing these folds and contacts. Accordingly, we propose that the local and global three-dimensional structure of the genome provides a consistent, integrated, and intuitive framework for interpreting and understanding the regulatory and transcriptional complexity of the human genome.
Publisher: Cold Spring Harbor Laboratory
Date: 12-11-2021
DOI: 10.1101/2021.11.11.467877
Abstract: Neurons live for the lifespan of the in idual and underlie our ability for lifelong learning and memory. However, aging alters neuron morphology and function resulting in age-related cognitive decline. It is well established that epigenetic alterations are essential for learning and memory, yet few neuron-specific genome-wide epigenetic maps exist into old age. Comprehensive mapping of H3K4me3 and H3K27ac in mouse neurons across lifespan revealed plastic H3K4me3 marking that differentiates neuronal age linked to known characteristics of cellular and neuronal aging. We determined that neurons in old age recapitulate the H3K27ac enrichment at promoters, enhancers and super enhancers from young adult neurons, likely representing a re-activation of pathways to maintain neuronal output. Finally, this study identified new characteristics of neuronal aging, including altered rDNA regulation and epigenetic regulatory mechanisms. Collectively, these findings indicate a key role for epigenetic regulation in neurons, that is inextricably linked with aging.
Publisher: Springer Science and Business Media LLC
Date: 22-03-2019
DOI: 10.1038/S41467-019-09272-0
Abstract: Chirality is a property describing any object that is inequivalent to its mirror image. Due to its 5′–3′ directionality, a DNA sequence is distinct from a mirrored sequence arranged in reverse nucleotide-order, and is therefore chiral. A given sequence and its opposing chiral partner sequence share many properties, such as nucleotide composition and sequence entropy. Here we demonstrate that chiral DNA sequence pairs also perform equivalently during molecular and bioinformatic techniques that underpin genetic analysis, including PCR lification, hybridization, whole-genome, target-enriched and nanopore sequencing, sequence alignment and variant detection. Given these shared properties, synthetic DNA sequences mirroring clinically relevant or analytically challenging regions of the human genome are ideal controls for clinical genomics. The addition of synthetic chiral sequences (sequins) to patient tumor s les can prevent false-positive and false-negative mutation detection to improve diagnosis. Accordingly, we propose that sequins can fulfill the need for commutable internal controls in precision medicine.
Publisher: Springer Science and Business Media LLC
Date: 03-2013
DOI: 10.1038/NSMB.2480
Abstract: Genomes of complex organisms encode an abundance and ersity of long noncoding RNAs (lncRNAs) that are expressed throughout the cell and fulfill a wide variety of regulatory roles at almost every stage of gene expression. These roles, which encompass sensory, guiding, scaffolding and allosteric capacities, derive from folded modular domains in lncRNAs. In this erse functional repertoire, we focus on the well-characterized ability for lncRNAs to function as epigenetic modulators. Many lncRNAs bind to chromatin-modifying proteins and recruit their catalytic activity to specific sites in the genome, thereby modulating chromatin states and impacting gene expression. Considering this regulatory potential in combination with the abundance of lncRNAs suggests that lncRNAs may be part of a broad epigenetic regulatory network.
Publisher: Informa UK Limited
Date: 09-2011
Abstract: Mammalian mitochondrial DNA is transcribed as precursor polycistronic transcripts containing 13 mRNAs, 2 rRNAs, punctuated by 22 tRNAs. The mechanisms involved in the excision of mitochondrial tRNAs from these polycistronic transcripts have remained largely unknown. We have investigated the roles of ELAC2, mitochondrial RNase P proteins 1 and 3, and pentatricopeptide repeat domain protein 1 in the processing of mitochondrial polycistronic transcripts. We used a deep sequencing approach to characterize the 5' and 3' ends of processed mitochondrial transcripts and provide a detailed map of mitochondrial tRNA processing sites affected by these proteins. We show that MRPP1 and MRPP3 process the 5' ends of tRNAs and the 5' unconventional, non tRNA containing site of the CO1 transcript. By contrast, we find that ELAC2 and PTCD1 affect the 3' end processing of tRNAs. Finally, we found that MRPP1 is essential for transcript processing, RNA modification, translation and mitochondrial respiration.
Publisher: Springer Science and Business Media LLC
Date: 10-05-2021
DOI: 10.1186/S13059-021-02369-0
Abstract: Pseudogenes are gene copies presumed to mainly be functionless relics of evolution due to acquired deleterious mutations or transcriptional silencing. Using deep full-length PacBio cDNA sequencing of normal human tissues and cancer cell lines, we identify here hundreds of novel transcribed pseudogenes expressed in tissue-specific patterns. Some pseudogene transcripts have intact open reading frames and are translated in cultured cells, representing unannotated protein-coding genes. To assess the biological impact of noncoding pseudogenes, we CRISPR-Cas9 delete the nucleus-enriched pseudogene PDCL3P4 and observe hundreds of perturbed genes. This study highlights pseudogenes as a complex and dynamic component of the human transcriptional landscape.
Publisher: Cold Spring Harbor Laboratory
Date: 20-04-2010
DOI: 10.1261/RNA.2019810
Abstract: In humans, more than 30,000 chimeric transcripts originating from 23,686 genes have been identified. The mechanisms and association of chimeric transcripts arising from chromosomal rearrangements with cancer are well established, but much remains unknown regarding the biogenesis and importance of other chimeric transcripts that arise from nongenomic alterations. Recently, a SLC45A3 – ELK4 chimera has been shown to be androgen-regulated, and is overexpressed in metastatic or high-grade prostate tumors relative to local prostate cancers. Here, we characterize the expression of a KLK4 cis sense–antisense chimeric transcript, and show other ex les in prostate cancer. Using non-protein-coding microarray analyses, we initially identified an androgen-regulated antisense transcript within the 3′ untranslated region of the KLK4 gene in LNCaP cells. The KLK4 cis -NAT was validated by strand-specific linker-mediated RT-PCR and Northern blotting. Characterization of the KLK4 cis -NAT by 5′ and 3′ rapid lification of cDNA ends (RACE) revealed that this transcript forms multiple fusions with the KLK4 sense transcript. Lack of KLK4 antisense promoter activity using reporter assays suggests that these transcripts are unlikely to arise from a trans -splicing mechanism. 5′ RACE and analyses of deep sequencing data from LNCaP cells treated ±androgens revealed six high-confidence sense–antisense chimeras of which three were supported by the cDNA databases. In this study, we have shown complex gene expression at the KLK4 locus that might be a hallmark of cis sense–antisense chimeric transcription.
Publisher: Springer Science and Business Media LLC
Date: 27-03-2019
DOI: 10.1038/S41467-019-09374-9
Abstract: Fusion genes are a major cause of cancer. Their rapid and accurate diagnosis can inform clinical action, but current molecular diagnostic assays are restricted in resolution and throughput. Here, we show that targeted RNA sequencing (RNAseq) can overcome these limitations. First, we establish that fusion gene detection with targeted RNAseq is both sensitive and quantitative by optimising laboratory and bioinformatic variables using spike-in standards and cell lines. Next, we analyse a clinical patient cohort and improve the overall fusion gene diagnostic rate from 63% with conventional approaches to 76% with targeted RNAseq while demonstrating high concordance for patient s les with previous diagnoses. Finally, we show that targeted RNAseq offers additional advantages by simultaneously measuring gene expression levels and profiling the immune-receptor repertoire. We anticipate that targeted RNAseq will improve clinical fusion gene detection, and its increasing use will provide a deeper understanding of fusion gene biology.
Publisher: Springer Science and Business Media LLC
Date: 06-08-2018
DOI: 10.1038/S41467-018-05555-0
Abstract: The complexity of microbial communities, combined with technical biases in next-generation sequencing, pose a challenge to metagenomic analysis. Here, we develop a set of internal DNA standards, termed “sequins” (sequencing spike-ins), that together constitute a synthetic community of artificial microbial genomes. Sequins are added to environmental DNA s les prior to library preparation, and undergo concurrent sequencing with the accompanying s le. We validate the performance of sequins by comparison to mock microbial communities, and demonstrate their use in the analysis of real metagenome s les. We show how sequins can be used to measure fold change differences in the size and structure of accompanying microbial communities, and perform quantitative normalization between s les. We further illustrate how sequins can be used to benchmark and optimize new methods, including nanopore long-read sequencing technology. We provide metagenome sequins, along with associated data sets, protocols, and an accompanying software toolkit, as reference standards to aid in metagenomic studies.
Publisher: Cold Spring Harbor Laboratory
Date: 03-05-2019
DOI: 10.1101/626440
Abstract: Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from errors introduced at each step of next generation sequencing (NGS). For NGS to successfully improve patient lives, discriminating between true mutations and artifacts is crucial. We systematically interrogated somatic mutations in paired tumor-normal cell lines to identify factors affecting detection reproducibility and accuracy. Different types of s les with varying input amount and tumor purity were processed using multiple library construction protocols. Whole-genome and whole-exome sequencing were carried out at six sequencing centers followed by processing with nine bioinformatics pipelines to evaluate their reproducibility. We identified artifacts due to s le and library processing and evaluated the capabilities and limitations of bioinformatics tools for artifact detection and removal. By examining the interaction and effect of various wet lab and computational parameters concomitantly, here we recommend actionable best practices for mutation detection in clinical applications using NGS technologies.
Publisher: Springer Science and Business Media LLC
Date: 12-01-2022
DOI: 10.1186/S13059-021-02579-6
Abstract: Next-generation sequencing (NGS) can identify mutations in the human genome that cause disease and has been widely adopted in clinical diagnosis. However, the human genome contains many polymorphic, low-complexity, and repetitive regions that are difficult to sequence and analyze. Despite their difficulty, these regions include many clinically important sequences that can inform the treatment of human diseases and improve the diagnostic yield of NGS. To evaluate the accuracy by which these difficult regions are analyzed with NGS, we built an in silico decoy chromosome, along with corresponding synthetic DNA reference controls, that encode difficult and clinically important human genome regions, including repeats, microsatellites, HLA genes, and immune receptors. These controls provide a known ground-truth reference against which to measure the performance of erse sequencing technologies, reagents, and bioinformatic tools. Using this approach, we provide a comprehensive evaluation of short- and long-read sequencing instruments, library preparation methods, and software tools and identify the errors and systematic bias that confound our resolution of these remaining difficult regions. This study provides an analytical validation of diagnosis using NGS in difficult regions of the human genome and highlights the challenges that remain to resolve these difficult regions.
Publisher: University of Queensland Library
Date: 2008
DOI: 10.14264/177175
Publisher: Frontiers Media SA
Date: 12-04-2019
Publisher: Proceedings of the National Academy of Sciences
Date: 06-03-2007
Abstract: A genomic region on distal mouse chromosome 1 and its syntenic human counterpart 1q23–42 show strong evidence of harboring lupus susceptibility genes. We found evidence of linkage at 1q32.2 in a targeted genome scan of 1q21–43 in 126 lupus multiplex families containing 151 affected sibpairs (nonparametric linkage score 2.52, P = 0.006). A positional candidate gene at 1q32.2, complement receptor 2 ( CR2 ), is also a candidate in the murine Sle1c lupus susceptibility locus. To explore its role in human disease, we analyzed 1,416 in iduals from 258 Caucasian and 142 Chinese lupus simplex families and demonstrated that a common three-single-nucleotide polymorphism CR2 haplotype (rs3813946, rs1048971, rs17615) was associated with lupus susceptibility ( P = 0.00001) with a 1.54-fold increased risk for the development of disease. Single-nucleotide polymorphism 1 (rs3813946), located in the 5′ untranslated region of the CR2 gene, altered transcriptional activity, suggesting a potential mechanism by which CR2 could contribute to the development of lupus. Our findings reveal that CR2 is a likely susceptibility gene for human lupus at 1q32.2, extending previous studies suggesting that CR2 participates in the pathogenesis of systemic lupus erythematosus.
Publisher: Elsevier BV
Date: 07-2017
DOI: 10.1016/J.TIG.2017.04.004
Abstract: The combination of pervasive transcription and prolific alternative splicing produces a mammalian transcriptome of great breadth and ersity. The majority of transcribed genomic bases are intronic, antisense, or intergenic to protein-coding genes, yielding a plethora of short and long non-protein-coding regulatory RNAs. Long noncoding RNAs (lncRNAs) share most aspects of their biogenesis, processing, and regulation with mRNAs. However, lncRNAs are typically expressed in more restricted patterns, frequently from enhancers, and exhibit almost universal alternative splicing. These features are consistent with their role as modular epigenetic regulators. We describe here the key studies and technological advances that have shaped our understanding of the dimensions, dynamics, and biological relevance of the mammalian noncoding transcriptome.
Publisher: Cold Spring Harbor Laboratory
Date: 05-01-2015
Abstract: During the splicing reaction, the 5′ intron end is joined to the branchpoint nucleotide, selecting the next exon to incorporate into the mature RNA and forming an intron lariat, which is excised. Despite a critical role in gene splicing, the locations and features of human splicing branchpoints are largely unknown. We use exoribonuclease digestion and targeted RNA-sequencing to enrich for sequences that traverse the lariat junction and, by split and inverted alignment, reveal the branchpoint. We identify 59,359 high-confidence human branchpoints in ,000 genes, providing a first map of splicing branchpoints in the human genome. Branchpoints are predominantly adenosine, highly conserved, and closely distributed to the 3′ splice site. Analysis of human branchpoints reveals numerous novel features, including distinct features of branchpoints for alternatively spliced exons and a family of conserved sequence motifs overlapping branchpoints we term B-boxes, which exhibit maximal nucleotide ersity while maintaining interactions with the keto-rich U2 snRNA. Different B-box motifs exhibit ergent usage in vertebrate lineages and associate with other splicing elements and distinct intron–exon architectures, suggesting integration within a broader regulatory splicing code. Lastly, although branchpoints are refractory to common mutational processes and genetic variation, mutations occurring at branchpoint nucleotides are enriched for disease associations.
Publisher: Springer Science and Business Media LLC
Date: 05-02-2010
Abstract: Long non-protein-coding RNAs (ncRNAs) are emerging as important regulators of cellular differentiation and are widely expressed in the brain. Here we show that many long ncRNAs exhibit dynamic expression patterns during neuronal and oligodendrocyte (OL) lineage specification, neuronal-glial fate transitions, and progressive stages of OL lineage elaboration including myelination. Consideration of the genomic context of these dynamically regulated ncRNAs showed they were part of complex transcriptional loci that encompass key neural developmental protein-coding genes, with which they exhibit concordant expression profiles as indicated by both microarray and in situ hybridization analyses. These included ncRNAs associated with differentiation-specific nuclear subdomains such as Gomafu and Neat1 , and ncRNAs associated with developmental enhancers and genes encoding important transcription factors and homeotic proteins. We also observed changes in ncRNA expression profiles in response to treatment with trichostatin A, a histone deacetylase inhibitor that prevents the progression of OL progenitors into post-mitotic OLs by altering lineage-specific gene expression programs. This is the first report of long ncRNA expression in neuronal and glial cell differentiation and of the modulation of ncRNA expression by modification of chromatin architecture. These observations explicitly link ncRNA dynamics to neural stem cell fate decisions, specification and epigenetic reprogramming and may have important implications for understanding and treating neuropsychiatric diseases.
Publisher: Springer Science and Business Media LLC
Date: 09-2021
Publisher: American Society of Clinical Oncology (ASCO)
Date: 12-2023
DOI: 10.1200/PO.18.00297
Abstract: Before anaplastic lymphoma kinase (ALK) inhibitors, treatment options for ALK-positive inflammatory myofibroblastic tumors (AP-IMTs) were unsatisfactory. We retrospectively analyzed the outcome of patients with AP-IMT treated with crizotinib to document response, toxicity, survival, and features associated with relapse. The cohort comprised eight patients with AP-IMT treated with crizotinib and surgery. Outcome measures were progression-free and overall survival after commencing crizotinib, treatment-related toxicities, features associated with relapse, outcome after relapse, and outcome after ceasing crizotinib. The median follow-up after commencing crizotinib was 3 years (range, 0.9 to 5.5 years). The major toxicity was neutropenia. All patients responded to crizotinib. Five were able to discontinue therapy without recurrence (median treatment duration, 1 year range, 0.2 to 3.0 years) one continues on crizotinib. Two critically ill patients with initial complete response experienced relapse while on therapy. Both harbored RANBP2-ALK fusions and responded to alternative ALK inhibitors one ultimately died as a result of progressive disease, whereas the other remains alive on treatment. Progression-free and overall survival since commencement of crizotinib is 0.75 ± 0.15% and 0.83 ± 0.15%, respectively. We confirm acceptable toxicity and excellent disease control in patients with AP-IMT treated with crizotinib, which may be ceased without recurrence in most. Relapses occurred in two of three patients with RANBP2-ALK translocated IMT, which suggests that such patients require additional therapy.
Publisher: Springer Science and Business Media LLC
Date: 09-03-2015
DOI: 10.1038/NMETH.3321
Abstract: We compared quantitative RT-PCR (qRT-PCR), RNA-seq and capture sequencing (CaptureSeq) in terms of their ability to assemble and quantify long noncoding RNAs and novel coding exons across 20 human tissues. CaptureSeq was superior for the detection and quantification of genes with low expression, showed little technical variation and accurately measured differential expression. This approach expands and refines previous annotations and simultaneously generates an expression atlas.
Publisher: Springer Science and Business Media LLC
Date: 23-06-2013
DOI: 10.1038/NG.2677
Publisher: Springer Science and Business Media LLC
Date: 16-04-2021
DOI: 10.1186/S13059-021-02316-Z
Abstract: Oncopanel genomic testing, which identifies important somatic variants, is increasingly common in medical practice and especially in clinical trials. Currently, there is a paucity of reliable genomic reference s les having a suitably large number of pre-identified variants for properly assessing oncopanel assay analytical quality and performance. The FDA-led Sequencing and Quality Control Phase 2 (SEQC2) consortium analyze ten erse cancer cell lines in idually and their pool, termed S le A, to develop a reference s le with suitably large numbers of coding positions with known (variant) positives and negatives for properly evaluating oncopanel analytical performance. In reference S le A, we identify more than 40,000 variants down to 1% allele frequency with more than 25,000 variants having less than 20% allele frequency with 1653 variants in COSMIC-related genes. This is 5–100× more than existing commercially available s les. We also identify an unprecedented number of negative positions in coding regions, allowing statistical rigor in assessing limit-of-detection, sensitivity, and precision. Over 300 loci are randomly selected and independently verified via droplet digital PCR with 100% concordance. Agilent normal reference S le B can be admixed with S le A to create new s les with a similar number of known variants at much lower allele frequency than what exists in S le A natively, including known variants having allele frequency of 0.02%, a range suitable for assessing liquid biopsy panels. These new reference s les and their admixtures provide superior capability for performing oncopanel quality control, analytical accuracy, and validation for small to large oncopanels and liquid biopsy assays.
Publisher: Elsevier BV
Date: 02-2018
DOI: 10.1016/J.CELS.2017.12.005
Abstract: The human transcriptome is so large, erse, and dynamic that, even after a decade of investigation by RNA sequencing (RNA-seq), we have yet to resolve its true dimensions. RNA-seq suffers from an expression-dependent bias that impedes characterization of low-abundance transcripts. We performed targeted single-molecule and short-read RNA-seq to survey the transcriptional landscape of a single human chromosome (Hsa21) at unprecedented resolution. Our analysis reaches the lower limits of the transcriptome, identifying a fundamental distinction between protein-coding and noncoding gene content: almost every noncoding exon undergoes alternative splicing, producing a seemingly limitless variety of isoforms. Analysis of syntenic regions of the mouse genome shows that few noncoding exons are shared between human and mouse, yet human splicing profiles are recapitulated on Hsa21 in mouse cells, indicative of regulation by a deeply conserved splicing code. We propose that noncoding exons are functionally modular, with alternative splicing generating an enormous repertoire of potential regulatory RNAs and a rich transcriptional reservoir for gene evolution.
Publisher: Elsevier BV
Date: 10-2012
Publisher: ZappyLab, Inc.
Date: 17-02-2019
DOI: 10.17504/PROTOCOLS.IO.X8CFRSW
Abstract: RNA sequencing can measure both gene or isoform expression, and reconstruct novel and complex spliced isoforms. However, the sheer size and complexity of the transcriptome, as well as technical bias, can confound analysis with RNA-seq. To assess the impact of these variables, we have developed a set of RNA sequins that represent synthetic genes that act as internal controls during RNA sequencing. Each RNA sequin represents an in idual isoform, with multiple isoforms forming artificial gene loci that are encoded within the in silico chromosome (chrIS). By modulating the relative abundance of in idual sequin isoforms we can emulate alternative splicing, whilst modulating the abundance of multiple isoforms we can emulate gene expression. Accordingly, RNA sequins are mixed at different concentration to emulate differences in gene expression and alternative splicing. By sequentially diluting sequins, we can establish a reference ladder across a range of gene expressions. We formulate multiple alternative mixtures that differ in the concentration of in idual sequins. By comparing mixtures, we can emulate differential gene expression and alternative splicing between s les. By contrast, RNA sequins with invariant concentrations between mixtures provide static scaling factors that enable quantitative normalization between multiple RNAseq libraries. The RNA sequin mixture is added to a user’s RNA s le at a fractional concentration prior to library preparation. The combined s le and the sequins then together undergo sequencing. The sequins can then be distinguished in the output library by their synthetic sequence, and analyzed as internal controls. For further detailed background on the design, validation and use of sequins, we refer users to ‘Spliced synthetic genes as internal controls in RNA sequencing experiments’ by Hardwick et al., (2016) NAture Methods DOI:10.1038/nmeth.3958\\
Publisher: American Association for the Advancement of Science (AAAS)
Date: 28-03-2008
Abstract: The past few years have revealed that the genomes of all studied eukaryotes are almost entirely transcribed, generating an enormous number of non–protein-coding RNAs (ncRNAs). In parallel, it is increasingly evident that many of these RNAs have regulatory functions. Here, we highlight recent advances that illustrate the ersity of ncRNA control of genome dynamics, cell biology, and developmental programming.
Publisher: SAGE Publications
Date: 16-11-2021
DOI: 10.1177/15353702211052280
Abstract: There is an evolution and increasing need for the utilization of emerging cellular, molecular and in silico technologies and novel approaches for safety assessment of food, drugs, and personal care products. Convergence of these emerging technologies is also enabling rapid advances and approaches that may impact regulatory decisions and approvals. Although the development of emerging technologies may allow rapid advances in regulatory decision making, there is concern that these new technologies have not been thoroughly evaluated to determine if they are ready for regulatory application, singularly or in combinations. The magnitude of these combined technical advances may outpace the ability to assess fit for purpose and to allow routine application of these new methods for regulatory purposes. There is a need to develop strategies to evaluate the new technologies to determine which ones are ready for regulatory use. The opportunity to apply these potentially faster, more accurate, and cost-effective approaches remains an important goal to facilitate their incorporation into regulatory use. However, without a clear strategy to evaluate emerging technologies rapidly and appropriately, the value of these efforts may go unrecognized or may take longer. It is important for the regulatory science field to keep up with the research in these technically advanced areas and to understand the science behind these new approaches. The regulatory field must understand the critical quality attributes of these novel approaches and learn from each other's experience so that workforces can be trained to prepare for emerging global regulatory challenges. Moreover, it is essential that the regulatory community must work with the technology developers to harness collective capabilities towards developing a strategy for evaluation of these new and novel assessment tools.
Publisher: Springer Science and Business Media LLC
Date: 18-02-2015
DOI: 10.1038/NATURE14248
Publisher: Springer Science and Business Media LLC
Date: 16-04-2021
DOI: 10.1186/S13059-021-02315-0
Abstract: Targeted sequencing using oncopanels requires comprehensive assessments of accuracy and detection sensitivity to ensure analytical validity. By employing reference materials characterized by the U.S. Food and Drug Administration-led SEquence Quality Control project phase2 (SEQC2) effort, we perform a cross-platform multi-lab evaluation of eight Pan-Cancer panels to assess best practices for oncopanel sequencing. All panels demonstrate high sensitivity across targeted high-confidence coding regions and variant types for the variants previously verified to have variant allele frequency (VAF) in the 5–20% range. Sensitivity is reduced by utilizing VAF thresholds due to inherent variability in VAF measurements. Enforcing a VAF threshold for reporting has a positive impact on reducing false positive calls. Importantly, the false positive rate is found to be significantly higher outside the high-confidence coding regions, resulting in lower reproducibility. Thus, region restriction and VAF thresholds lead to low relative technical variability in estimating promising biomarkers and tumor mutational burden. This comprehensive study provides actionable guidelines for oncopanel sequencing and clear evidence that supports a simplified approach to assess the analytical performance of oncopanels. It will facilitate the rapid implementation, validation, and quality control of oncopanels in clinical use.
Publisher: Oxford University Press (OUP)
Date: 22-03-2011
DOI: 10.1093/NAR/GKR110
Publisher: Springer Science and Business Media LLC
Date: 08-08-2016
DOI: 10.1038/NMETH.3958
Abstract: RNA sequencing (RNA-seq) can be used to assemble spliced isoforms, quantify expressed genes and provide a global profile of the transcriptome. However, the size and ersity of the transcriptome, the wide dynamic range in gene expression and inherent technical biases confound RNA-seq analysis. We have developed a set of spike-in RNA standards, termed 'sequins' (sequencing spike-ins), that represent full-length spliced mRNA isoforms. Sequins have an entirely artificial sequence with no homology to natural reference genomes, but they align to gene loci encoded on an artificial in silico chromosome. The combination of multiple sequins across a range of concentrations emulates alternative splicing and differential gene expression, and it provides scaling factors for normalization between s les. We demonstrate the use of sequins in RNA-seq experiments to measure s le-specific biases and determine the limits of reliable transcript assembly and quantification in accompanying human RNA s les. In addition, we have designed a complementary set of sequins that represent fusion genes arising from rearrangements of the in silico chromosome to aid in cancer diagnosis. RNA sequins provide a qualitative and quantitative reference with which to navigate the complexity of the human transcriptome.
Publisher: Springer Science and Business Media LLC
Date: 08-08-2016
DOI: 10.1038/NMETH.3957
Abstract: The identification of genetic variation with next-generation sequencing is confounded by the complexity of the human genome sequence and by biases that arise during library preparation, sequencing and analysis. We have developed a set of synthetic DNA standards, termed 'sequins', that emulate human genetic features and constitute qualitative and quantitative spike-in controls for genome sequencing. Sequencing reads derived from sequins align exclusively to an artificial in silico reference chromosome, rather than the human reference genome, which allows them them to be partitioned for parallel analysis. Here we use this approach to represent common and clinically relevant genetic variation, ranging from single nucleotide variants to large structural rearrangements and copy-number variation. We validate the design and performance of sequin standards by comparison to ex les in the NA12878 reference genome, and we demonstrate their utility during the detection and quantification of variants. We provide sequins as a standardized, quantitative resource against which human genetic variation can be measured and diagnostic performance assessed.
Publisher: Springer Science and Business Media LLC
Date: 11-07-2010
DOI: 10.1038/NSMB.1841
Abstract: We have recently shown that transcription initiation RNAs (tiRNAs) are derived from sequences immediately downstream of transcription start sites. Here, using cytoplasmic and nuclear small RNA high-throughput sequencing datasets, we report the identification of a second class of nuclear-specific approximately 17- to 18-nucleotide small RNAs whose 3' ends map precisely to the splice donor site of internal exons in animals. These splice-site RNAs (spliRNAs) are associated with highly expressed genes and show evidence of developmental stage- and region-specific expression. We also show that tiRNAs are localized to the nucleus, are enriched at chromatin marks associated with transcription initiation and possess a 3'-nucleotide bias. Additionally, we find that microRNA-offset RNAs (moRNAs), the miR-15/16 cluster previously linked to oncosuppression and most small nucleolar RNA (snoRNA)-derived small RNAs (sdRNAs) are enriched in the nucleus, whereas most miRNAs and two H/ACA sdRNAs are cytoplasmically enriched. We propose that nuclear-localized tiny RNAs are involved in the epigenetic regulation of gene expression.
Publisher: Springer Science and Business Media LLC
Date: 19-06-2019
DOI: 10.1038/S41596-019-0175-1
Abstract: Next-generation sequencing (NGS) has been widely adopted to identify genetic variants and investigate their association with disease. However, the analysis of sequencing data remains challenging because of the complexity of human genetic variation and confounding errors introduced during library preparation, sequencing and analysis. We have developed a set of synthetic DNA spike-ins-termed 'sequins' (sequencing spike-ins)-that are directly added to DNA s les before library preparation. Sequins can be used to measure technical biases and to act as internal quantitative and qualitative controls throughout the sequencing workflow. This step-by-step protocol explains the use of sequins for both whole-genome and targeted sequencing of the human genome. This includes instructions regarding the dilution and addition of sequins to human DNA s les, followed by the bioinformatic steps required to separate sequin- and s le-derived sequencing reads and to evaluate the diagnostic performance of the assay. These practical guidelines are accompanied by a broader discussion of the conceptual and statistical principles that underpin the design of sequin standards. This protocol is suitable for users with standard laboratory and bioinformatic experience. The laboratory steps require ~1-4 d and the bioinformatic steps (which can be performed with the provided ex le data files) take an additional day.
Publisher: Springer Science and Business Media LLC
Date: 21-09-2023
Publisher: Springer Science and Business Media LLC
Date: 04-04-2014
Abstract: RNA sequencing (RNAseq) s les the majority of expressed genes infrequently, owing to the large size, complex splicing and wide dynamic range of eukaryotic transcriptomes. This results in sparse sequencing coverage that can hinder robust isoform assembly and quantification. RNA capture sequencing (CaptureSeq) addresses this challenge by using oligonucleotide probes to capture selected genes or regions of interest for targeted sequencing. Targeted RNAseq provides enhanced coverage for sensitive gene discovery, robust transcript assembly and accurate gene quantification. Here we describe a detailed protocol for all stages of RNA CaptureSeq, from initial probe design considerations and capture of targeted genes to final assembly and quantification of captured transcripts. Initial probe design and final analysis can take less than 1 d, whereas the central experimental capture stage requires ∼7 d.
Publisher: Elsevier BV
Date: 11-2014
Publisher: Wiley
Date: 09-07-2012
DOI: 10.1002/WRNA.1128
Abstract: The human mitochondrial transcriptome, although produced from a small and compact genome, has revealed surprising complexity in its composition and regulation. Wide variation between in idual tRNAs, mRNAs, and rRNAs indicate the importance of post-transcriptional processing, maturation, and degradation mechanisms in the regulation of mitochondrial gene expression. RNA-binding proteins play essential roles in controlling the mitochondrial transcriptome from its synthesis to its destruction and have evolved unique features to complement the unusual features of mitochondrial RNAs. Recent studies have shown how changes in mitochondrial RNAs and their binding proteins can have significant effects on human health. This opens new avenues for investigation of mitochondrial RNA-binding proteins and the mechanisms by which they regulate mitochondrial gene expression.
Publisher: Bioscientifica
Date: 16-01-2008
DOI: 10.1677/JME-07-0160
Abstract: RNA is emerging as a major component of the regulatory circuitry that underpins the development and physiology of complex organisms. Here we review recent evidence that suggests that RNA may supplement endocrine and paracrine signaling by small molecules and proteins, and act as an efficient and evolutionarily flexible source of sequence-specific information transfer between cells, both locally and systemically. As such, RNA signaling may play a central but previously hidden role in multicellular ontogeny, homeostasis, and transmitted epigenetic memory.
Publisher: Springer Science and Business Media LLC
Date: 04-07-0012
DOI: 10.1038/S41467-020-17445-5
Abstract: Standard units of measurement are required for the quantitative description of nature however, few standard units have been established for genomics to date. Here, we have developed a synthetic DNA ladder that defines a quantitative standard unit that can measure DNA sequence abundance within a next-generation sequencing library. The ladder can be spiked into a DNA s le, and act as an internal scale that measures quantitative genetics features. Unlike previous spike-ins, the ladder is encoded within a single molecule, and can be equivalently and independently synthesized by different laboratories. We show how the ladder can measure erse quantitative features, including human genetic variation and microbial abundance, and also estimate uncertainty due to technical variation and improve normalization between libraries. This ladder provides an independent quantitative unit that can be used with any organism, application or technology, thereby providing a common metric by which genomes can be measured.
Publisher: Elsevier BV
Date: 2021
DOI: 10.2139/SSRN.3830017
Publisher: Elsevier BV
Date: 10-2014
Publisher: Springer Science and Business Media LLC
Date: 16-01-2013
Abstract: Actinobacteria form a major bacterial phylum that includes numerous human pathogens. Actinobacteria are primary contributors to carbon cycling and also represent a primary source of industrial high value products such as antibiotics and biopesticides. Consistent with other members of the actinobacterial phylum, Saccharopolyspora erythraea undergo a transitional switch. This switch is characterized by numerous metabolic and morphological changes. We performed RNA sequencing to analyze the transcriptional changes that occur during growth of Saccharopolyspora erythraea in batch culture. By sequencing RNA across the fermentation time course, at a mean coverage of 4000X, we found the vast majority of genes to be prominently expressed, showing that we attained close to saturating sequencing coverage of the transcriptome. During the metabolic switch, global changes in gene expression influence the metabolic machinery of Saccharopolyspora erythraea, resetting an entirely novel gene expression program. After the switch, global changes include the broad repression of half the genes regulated by complex transcriptional mechanisms. Paralogous transposon clusters, delineate these transcriptional programs. The new transcriptional program is orchestrated by a bottleneck event during which mRNA levels are severely restricted by targeted mRNA degradation. Our results, which attained close to saturating sequencing coverage of the transcriptome, revealed unanticipated transcriptional complexity with almost one third of transcriptional content originating from un-annotated sequences. We showed that the metabolic switch is a sophisticated mechanism of transcriptional regulation capable of resetting and re-synchronizing gene expression programs at extraordinary speed and scale.
Publisher: Springer Science and Business Media LLC
Date: 08-11-2021
Publisher: Springer Science and Business Media LLC
Date: 19-06-2017
DOI: 10.1038/NRG.2017.44
Abstract: Next-generation sequencing (NGS) provides a broad investigation of the genome, and it is being readily applied for the diagnosis of disease-associated genetic features. However, the interpretation of NGS data remains challenging owing to the size and complexity of the genome and the technical errors that are introduced during s le preparation, sequencing and analysis. These errors can be understood and mitigated through the use of reference standards - well-characterized genetic materials or synthetic spike-in controls that help to calibrate NGS measurements and to evaluate diagnostic performance. The informed use of reference standards, and associated statistical principles, ensures rigorous analysis of NGS data and is essential for its future clinical use.
Publisher: Oxford University Press (OUP)
Date: 28-10-2017
DOI: 10.1093/BIOINFORMATICS/BTX688
Abstract: The branchpoint element is required for the first lariat-forming reaction in splicing. However current catalogues of human branchpoints remain incomplete due to the difficulty in experimentally identifying these splicing elements. To address this limitation, we have developed a machine-learning algorithm—branchpointer—to identify branchpoint elements solely from gene annotations and genomic sequence. Using branchpointer, we annotate branchpoint elements in 85% of human gene introns with sensitivity (61.8%) and specificity (97.8%). In addition to annotation, branchpointer can evaluate the impact of SNPs on branchpoint architecture to inform functional interpretation of genetic variants. Branchpointer identifies all published deleterious branchpoint mutations annotated in clinical variant databases, and finds thousands of additional clinical and common genetic variants with similar predicted effects. This genome-wide annotation of branchpoints provides a reference for the genetic analysis of splicing, and the interpretation of noncoding variation. Branchpointer is written and implemented in the statistical programming language R and is freely available under a BSD license as a package through Bioconductor. Supplementary data are available at Bioinformatics online.
Publisher: Public Library of Science (PLoS)
Date: 28-11-2008
Publisher: American Society of Hematology
Date: 07-02-2022
DOI: 10.1182/BLOODADVANCES.2021005741
Abstract: Leukemia stem cells (LSCs) are linked to relapse in acute myeloid leukemia (AML). The LSC17 gene expression score robustly captures LSC stemness properties in AML and can be used to predict survival outcomes and response to therapy, enabling risk-adapted, upfront treatment approaches. The LSC17 score was developed and validated in a research setting. To enable widespread use of the LSC17 score in clinical decision making, we established a laboratory-developed test (LDT) for the LSC17 score that can be deployed broadly in clinical molecular diagnostic laboratories. We extensively validated the LSC17 LDT in a College of American Pathologists/Clinical Laboratory Improvements Act (CAP/CLIA)-certified laboratory, determining specimen requirements, a synthetic control, and performance parameters for the assay. Importantly, we correlated values from the LSC17 LDT to clinical outcome in a reference cohort of patients with AML, establishing a median assay value that can be used for clinical risk stratification of in idual patients with newly diagnosed AML. The assay was established in a second independent CAP/CLIA-certified laboratory, and its technical performance was validated using an independent cohort of patient s les, demonstrating that the LSC17 LDT can be readily implemented in other settings. This study enables the clinical use of the LSC17 score for upfront risk-adapted management of patients with AML.
Publisher: Springer Science and Business Media LLC
Date: 20-12-2117
Publisher: Cold Spring Harbor Laboratory
Date: 18-06-2008
Abstract: The transcriptional networks that regulate embryonic stem (ES) cell pluripotency and lineage specification are the subject of considerable attention. To date such studies have focused almost exclusively on protein-coding transcripts. However, recent transcriptome analyses show that the mammalian genome contains thousands of long noncoding RNAs (ncRNAs), many of which appear to be expressed in a developmentally regulated manner. The functions of these remain untested. To identify ncRNAs involved in ES cell biology, we used a custom-designed microarray to examine the expression profiles of mouse ES cells differentiating as embryoid bodies (EBs) over a 16-d time course. We identified 945 ncRNAs expressed during EB differentiation, of which 174 were differentially expressed, many correlating with pluripotency or specific differentiation events. Candidate ncRNAs were identified for further characterization by an integrated examination of expression profiles, genomic context, chromatin state, and promoter analysis. Many ncRNAs showed coordinated expression with genomically associated developmental genes, such as Dlx1 , Dlx4 , Gata6 , and Ecsit . We examined two novel developmentally regulated ncRNAs, Evx1as and Hoxb5/6as , which are derived from homeotic loci and share similar expression patterns and localization in mouse embryos with their associated protein-coding genes. Using chromatin immunoprecipitation, we provide evidence that both ncRNAs are associated with trimethylated H3K4 histones and histone methyltransferase MLL1, suggesting a role in epigenetic regulation of homeotic loci during ES cell differentiation. Taken together, our data indicate that long ncRNAs are likely to be important in processes directing pluripotency and alternative differentiation programs, in some cases through engagement of the epigenetic machinery.
Publisher: Springer Science and Business Media LLC
Date: 29-01-2021
DOI: 10.1038/S41598-021-81760-0
Abstract: DNA synthesis in vitro has enabled the rapid production of reference standards. These are used as controls, and allow measurement and improvement of the accuracy and quality of diagnostic tests. Current reference standards typically represent target genetic material, and act only as positive controls to assess test sensitivity. However, negative controls are also required to evaluate test specificity. Using a pair of chimeric A/B RNA standards, this allowed incorporation of positive and negative controls into diagnostic testing for the Severe Acute Respiratory Syndrome Coronavirus-2 (SARS-CoV-2). The chimeric standards constituted target regions for RT-PCR primer robe sets that are joined in tandem across two separate synthetic molecules. Accordingly, a target region that is present in standard A provides a positive control, whilst being absent in standard B, thereby providing a negative control. This design enables cross-validation of positive and negative controls between the paired standards in the same reaction, with identical conditions. This enables control and test failures to be distinguished, increasing confidence in the accuracy of results. The chimeric A/B standards were assessed using the US Centres for Disease Control real-time RT-PCR protocol, and showed results congruent with other commercial controls in detecting SARS-CoV-2 in patient s les. This chimeric reference standard design approach offers extensive flexibility, allowing representation of erse genetic features and distantly related sequences, even from different organisms.
Publisher: Oxford University Press (OUP)
Date: 12-11-2010
DOI: 10.1093/NAR/GKQ1158
Publisher: Wiley
Date: 2009
DOI: 10.1002/BIES.080099
Abstract: There is increasing evidence that dynamic changes to chromatin, chromosomes and nuclear architecture are regulated by RNA signalling. Although the precise molecular mechanisms are not well understood, they appear to involve the differential recruitment of a hierarchy of generic chromatin modifying complexes and DNA methyltransferases to specific loci by RNAs during differentiation and development. A significant fraction of the genome-wide transcription of non-protein coding RNAs may be involved in this process, comprising a previously hidden layer of intermediary genetic information that underpins developmental ontogeny and the differences between species, ecotypes and in iduals. It is also evident that RNA editing is a primary means by which hardwired genetic information in animals can be altered by environmental signals, especially in the brain, indicating a dynamic RNA-mediated interplay between the transcriptome, the environment and the epigenome. Moreover, RNA-directed regulatory processes may also transfer epigenetic information not only within cells but also between cells and organ systems, as well as across generations.
Publisher: Springer Science and Business Media LLC
Date: 10-04-2019
Publisher: Springer Science and Business Media LLC
Date: 03-2009
DOI: 10.1038/NRG2521
Abstract: In mammals and other eukaryotes most of the genome is transcribed in a developmentally regulated manner to produce large numbers of long non-coding RNAs (ncRNAs). Here we review the rapidly advancing field of long ncRNAs, describing their conservation, their organization in the genome and their roles in gene regulation. We also consider the medical implications, and the emerging recognition that any transcript, regardless of coding potential, can have an intrinsic function as an RNA.
Publisher: Cold Spring Harbor Laboratory
Date: 02-11-2010
Abstract: The complexity of the eukaryotic transcriptome is generated by the interplay of transcription initiation, termination, alternative splicing, and other forms of post-transcriptional modification. It was recently shown that RNA transcripts may also undergo cleavage and secondary 5′ capping. Here, we show that post-transcriptional cleavage of RNA contributes to the ersification of the transcriptome by generating a range of small RNAs and long coding and noncoding RNAs. Using genome-wide histone modification and RNA polymerase II occupancy data, we confirm that the vast majority of intraexonic CAGE tags are derived from post-transcriptional processing. By comparing exonic CAGE tags to tissue-matched PARE data, we show that the cleavage and subsequent secondary capping is regulated in a developmental-stage- and tissue-specific manner. Furthermore, we find evidence of prevalent RNA cleavage in numerous transcriptomic data sets, including SAGE, cDNA, small RNA libraries, and deep-sequenced size-fractionated pools of RNA. These cleavage products include mRNA variants that retain the potential to be translated into shortened functional protein isoforms. We conclude that post-transcriptional RNA cleavage is a key mechanism that expands the functional repertoire and scope for regulatory control of the eukaryotic transcriptome.
Publisher: Elsevier BV
Date: 11-2013
DOI: 10.1016/J.CELREP.2013.09.036
Abstract: Human mitochondrial DNA is transcribed as long polycistronic transcripts that encompass each strand of the genome and are processed subsequently into mature mRNAs, tRNAs, and rRNAs, necessitating widespread posttranscriptional regulation. Here, we establish methods for massively parallel sequencing and analyses of RNase-accessible regions of human mitochondrial RNA and thereby identify specific regions within mitochondrial transcripts that are bound by proteins. This approach provides a range of insights into the contribution of RNA-binding proteins to the regulation of mitochondrial gene expression.
Publisher: Springer Science and Business Media LLC
Date: 12-2017
Publisher: Oxford University Press (OUP)
Date: 10-2009
DOI: 10.1093/NAR/GKN617
Publisher: Oxford University Press (OUP)
Date: 27-01-2017
DOI: 10.1093/BIOINFORMATICS/BTX038
Abstract: Spike-in controls are synthetic nucleic-acid sequences that are added to a user’s s le and constitute internal standards for subsequent steps in the next generation sequencing workflow. The Anaquin software toolkit can be used to analyze the performance of spike-in controls at multiple steps during RNA sequencing or genome sequencing analysis, providing useful diagnostic statistics, data visualization and s le normalization. The software is implemented in C ++/R and is freely available under BSD license. The source code is available from tudent-t/Anaquin, binaries and user manual from www.sequin.xyz/software and R package from ackages/Anaquin Supplementary data are available at Bioinformatics online.
Publisher: Cold Spring Harbor Laboratory
Date: 04-2011
DOI: 10.1261/RNA.2528811
Abstract: Long noncoding RNAs (lncRNAs) are increasingly recognized to play major regulatory roles in development and disease. To identify novel regulators in breast biology, we identified differentially regulated lncRNAs during mouse mammary development. Among the highest and most differentially expressed was a transcript ( Zfas1 ) antisense to the 5′ end of the protein-coding gene Znfx1 . In vivo, Zfas1 RNA is localized within the ducts and alveoli of the mammary gland. Zfas1 intronically hosts three previously undescribed C/D box snoRNAs (SNORDs): Snord12 , Snord12b , and Snord12c . In contrast to the general assumption that noncoding SNORD-host transcripts function only as vehicles to generate snoRNAs, knockdown of Zfas1 in a mammary epithelial cell line resulted in increased cellular proliferation and differentiation, while not substantially altering the levels of the SNORDs. In support of an independent function, we also found that Zfas1 is extremely stable, with a half-life h. Expression analysis of the SNORDs revealed these were expressed at different levels, likely a result of distinct structures conferring differential stability. While there is relatively low primary sequence conservation between Zfas1 and its syntenic human ortholog ZFAS1 , their predicted secondary structures have similar features. Like Zfas1 , ZFAS1 is highly expressed in the mammary gland and is down-regulated in breast tumors compared to normal tissue. We propose a functional role for Zfas1/ ZFAS1 in the regulation of alveolar development and epithelial cell differentiation in the mammary gland, which, together with its dysregulation in human breast cancer, suggests ZFAS1 as a putative tumor suppressor gene.
Publisher: SAGE Publications
Date: 16-11-2008
Abstract: Current research exploring the molecular basis of memory focuses mainly on proteins despite recent genomic studies reporting the abundant transcription of non-protein-coding RNA (ncRNA). Although ncRNAs are involved in a erse range of biological processes, they are particularly prevalent within the nervous system, where they contribute towards the complexity and function of the mammalian brain. In this review, we apply recent advances in ncRNA biology to predict a critical role for ncRNAs in the molecular mechanisms underlying memory formation and maintenance. We describe the role of ncRNAs in regulating the translation, stability, and editing of mRNA populations in response to synaptic activity during memory formation and the role of ncRNAs in the epigenetic and transcriptional programs that underlie long-term memory storage. We also consider ncRNAs acting as an additional avenue of communication between neurons by their intercellular trafficking. Taken together, the emerging evidence suggests a central role for ncRNAs in memory formation and provokes novel research directions in this field. NEUROSCIENTIST 14(5):434—445, 2008. DOI: 10.1177/1073858408319187
Publisher: American Association for Cancer Research (AACR)
Date: 14-08-2017
DOI: 10.1158/0008-5472.CAN-16-2550
Abstract: Despite intensive multimodal treatment of sarcomas, a heterogeneous group of malignant tumors arising from connective tissue, survival remains poor. Candidate-based targeted treatments have demonstrated limited clinical success, urging an unbiased and comprehensive analysis of oncogenic signaling networks to reveal therapeutic targets and personalized treatment strategies. Here we applied mass spectrometry–based phosphoproteomic profiling to the largest and most heterogeneous set of sarcoma cell lines characterized to date and identified novel tyrosine phosphorylation patterns, enhanced tyrosine kinases in specific subtypes, and potential driver kinases. ALK was identified as a novel driver in the Aska-SS synovial sarcoma (SS) cell line via expression of an ALK variant with a large extracellular domain deletion (ALKΔ2–17). Functional ALK dependency was confirmed in vitro and in vivo with selective inhibitors. Importantly, ALK immunopositivity was detected in 6 of 43 (14%) of SS patient specimens, one of which exhibited an ALK rearrangement. High PDGFRα phosphorylation also characterized SS cell lines, which was accompanied by enhanced MET activation in Yamato-SS cells. Although Yamato-SS cells were sensitive to crizotinib (ALK/MET-inhibitor) but not pazopanib (VEGFR/PDGFR-inhibitor) monotherapy in vitro, synergistic effects were observed upon drug combination. In vivo, both drugs were in idually effective, with pazopanib efficacy likely attributable to reduced angiogenesis. MET or PDGFRα expression was detected in 58% and 84% of SS patients, respectively, with coexpression in 56%. Consequently, our integrated approach has led to the identification of ALK and MET as promising therapeutic targets in SS. Cancer Res 77(16) 4279–92. ©2017 AACR.
Publisher: Elsevier BV
Date: 08-2011
Location: United States of America
Location: Germany
Start Date: 06-2021
End Date: 12-2024
Amount: $649,910.00
Funder: Australian Research Council
View Funded Activity