ARDC Research Link Australia

ORCID Profile
Orcid icon. 0000-0002-4806-5140

Current Organisation
Barcelona Supercomputing Center

Does something not look right? The information on this page has been harvested from data sources that may not be up to date. We continue to work with information providers to improve coverage and quality. To report an issue, use the Feedback Form.

Publications

Publication

Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data

Publisher: Cold Spring Harbor Laboratory

Date: 26-06-2023

DOI: 10.1101/2023.06.23.546284

Abstract: The tremendous rate with which data is generated and analysis methods emerge makes it increasingly difficult to keep track of their domain of applicability, assumptions, and limitations and consequently, of the efficacy and precision with which they solve specific tasks. Therefore, there is an increasing need for benchmarks, and for the provision of infrastructure for continuous method evaluation. APAeval is an international community effort, organized by the RNA Society in 2021, to benchmark tools for the identification and quantification of the usage of alternative polyadenylation (APA) sites from short-read, bulk RNA-sequencing (RNA-seq) data. Here, we reviewed 17 tools and benchmarked eight on their ability to perform APA identification and quantification, using a comprehensive set of RNA-seq experiments comprising real, synthetic, and matched 3′-end sequencing data. To support continuous benchmarking, we have incorporated the results into the OpenEBench online platform, which allows for seamless extension of the set of methods, metrics, and challenges. We envisage that our analyses will assist researchers in selecting the appropriate tools for their studies. Furthermore, the containers and reproducible workflows generated in the course of this project can be seamlessly deployed and extended in the future to evaluate new methods or datasets.

Publication

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Publisher: Springer Science and Business Media LLC

Date: 19-11-2019

DOI: 10.1186/S13059-019-1835-8

Abstract: The Critical Assessment of Functional Annotation (CAFA) is an ongoing, global, community-driven effort to evaluate and improve the computational annotation of protein function. Here, we report on the results of the third CAFA challenge, CAFA3, that featured an expanded analysis over the previous CAFA rounds, both in terms of volume of data analyzed and the types of analysis performed. In a novel and major new development, computational predictions and assessment goals drove some of the experimental assays, resulting in new functional annotations for more than 1000 genes. Specifically, we performed experimental whole-genome mutation screening in Candida albicans and Pseudomonas aureginosa genomes, which provided us with genome-wide experimental data for genes associated with biofilm formation and motility. We further performed targeted assays on selected genes in Drosophila melanogaster , which we suspected of being involved in long-term memory. We conclude that while predictions of the molecular function and biological process annotations have slightly improved over time, those of the cellular component have not. Term-centric prediction of experimental annotations remains equally challenging although the performance of the top methods is significantly better than the expectations set by baseline methods in C. albicans and D. melanogaster , it leaves considerable room and need for improvement. Finally, we report that the CAFA community now involves a broad range of participants with expertise in bioinformatics, biological experimentation, biocuration, and bio-ontologies, working together to improve functional annotation, computational function prediction, and our ability to manage big data in the era of large experimental screens.

Publication

Packaging research artefacts with RO-Crate

Publisher: IOS Press

Date: 20-07-2022

DOI: 10.3233/DS-210053

Abstract: An increasing number of researchers support reproducibility by including pointers to and descriptions of datasets, software and methods in their publications. However, scientific articles may be ambiguous, incomplete and difficult to process by automated systems. In this paper we introduce RO-Crate, an open, community-driven, and lightweight approach to packaging research artefacts along with their metadata in a machine readable manner. RO-Crate is based on Schema.org annotations in JSON-LD, aiming to establish best practices to formally describe metadata in an accessible and practical way for their use in a wide variety of situations. An RO-Crate is a structured archive of all the items that contributed to a research outcome, including their identifiers, provenance, relations and annotations. As a general purpose packaging approach for data and their metadata, RO-Crate is used across multiple areas, including bioinformatics, digital humanities and regulatory sciences. By applying “just enough” Linked Data standards, RO-Crate simplifies the process of making research outputs FAIR while also enhancing research reproducibility. An RO-Crate for this article11 o/doi/10.5281/zenodo.5146227 is archived at 0.5281/zenodo.5146227.

Publication

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Publisher: Springer Science and Business Media LLC

Date: 07-09-2016

DOI: 10.1186/S13059-016-1037-6

Related Organisations

Organisation

Barcelona Supercomputing Center

Location: Spain

View Organisation

Organisation

Bioalma

Location: Spain

View Organisation

Organisation

Centro Nacional De Biotecnología

Location: Spain

View Organisation

Organisation

Centro Nacional De Investigaciones Oncológicas

Location: Spain

View Organisation

Organisation

Universidad Autónoma De Madrid

Location: Spain

View Organisation

Organisation

Universidad De Málaga

Location: Spain

View Organisation

Related Funding Activities

No related grants have been discovered for José María Fernández González.

José María Fernández González

Researcher

Related Links

Publications

Extensible benchmarking of methods that identify and quantify polyadenylation sites from RNA-seq data

The CAFA challenge reports improved protein function prediction and new functional annotations for hundreds of genes through experimental screens

Packaging research artefacts with RO-Crate

An expanded evaluation of protein function prediction methods shows an improvement in accuracy

Related Organisations

Barcelona Supercomputing Center

Bioalma

Centro Nacional De Biotecnología

Centro Nacional De Investigaciones Oncológicas

Universidad Autónoma De Madrid

Universidad De Málaga

Related Funding Activities

ARDC NEWSLETTER SIGNUP