Related Concept Videos
Ribosome Profiling
Applications of ribosome profiling
Ribosome profiling has many applications, including in vivo monitoring of translation inside a particular organ or tissue type and quantifying new protein synthesis levels.
The technique...
RNA-seq
Before the discovery of RNA-seq, microarray-based methods and Sanger sequencing were used for transcriptome analysis. However, while...
Improving Translational Accuracy
Genome Annotation and Assembly
lncRNA - Long Non-coding RNAs
You might also read
Related Articles
Articles linked to this work by shared authors, journal, and citation graph.
Mining single-cell data for cell type-disease associations.
Pushing the boundaries of rare disease diagnostics with the help of the first Undiagnosed Hackathon.
Time-course RNAseq data of murine AB1 mesothelioma and Renca renal cancer following immune checkpoint therapy.
SAMStat 2: quality control for next generation sequencing data.
Temporally restricted activation of IFNβ signaling underlies response to immune checkpoint therapy in mice.
OmicsTransformer: Self-Supervised Masked Consistency and Uncertainty-Aware Fusion for Robust Multi-Omics Prediction.
Computational Tool Choice Impacts CRISPR Spacer-Proto spacer Detection.
ARISE: RNA-Anchored Shared-Edge Topology and Hierarchical Fusion for Spatial Multi-Omics Integration.
Interactive exploration of biobank-scale ancestral recombination graphs with Lorax.
PepMCP: A Graph-Based Membrane Contact Probability Predictor for Membrane-Lytic Antimicrobial Peptides.
ARGscape: A modular, interactive tool for manipulation of spatiotemporal ancestral recombination graphs.
Related Experiment Video
Updated: Jun 3, 2025

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
Published on: March 1, 2024
SampleExplorer: using language models to discover relevant transcriptome data.
Wee Loong Chin1,2,3, Timo Lassmann3
1National Centre for Asbestos Related Diseases, QEII Medical Centre, Nedlands, WA 6009, Australia.
SampleExplorer enhances biomedical research by enabling efficient discovery of relevant transcriptomic datasets. This tool leverages RNA-sequencing metadata and a language model to improve data retrieval for replication and verification studies.
Area of Science:
- Biomedical Research
- Bioinformatics
- Computational Biology
Background:
- Transcriptomics, particularly RNA-sequencing (RNA-seq), is a cornerstone of modern biomedical research.
- Large public repositories contain vast amounts of RNA-seq data with associated metadata, crucial for study understanding and replication.
- Existing metadata is underutilized for discovering relevant datasets, hindering efficient data reuse.
Purpose of the Study:
- To introduce SampleExplorer, a novel tool designed to facilitate the discovery of relevant transcriptomic datasets.
- To enable researchers to search for data using both text-based queries and gene set information.
- To improve the identification and accessibility of gene expression datasets within large public repositories.
Main Methods:
- SampleExplorer embeds sample metadata and utilizes a transformer-based language model for data retrieval.
- The tool was benchmarked using the ARCHS4 database to assess its effectiveness.
- Implementation details and algorithmic descriptions are available in supplementary materials.
Main Results:
- SampleExplorer effectively retrieves biologically relevant samples from large-scale transcriptomic data.
- The tool provides an efficient method for discovering relevant gene expression datasets.
- It enhances sample and dataset identification across diverse experimental contexts.
Conclusions:
- SampleExplorer offers a powerful solution for leveraging existing transcriptomic data.
- The tool supports replication and verification studies by improving data discoverability.
- It represents a significant advancement in utilizing RNA-seq metadata for research.

