Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

A data-driven clustering method for time course gene expression data.

Ping Ma1, Cristian I Castillo-Davis, Wenxuan Zhong

  • 1Department of Statistics, Harvard University, Cambridge, MA 02138, USA.

Nucleic Acids Research
|March 3, 2006
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics.

Advances in neural information processing systems·2026
Same author

Phenotypic prediction of missense variants via deep contrastive learning.

Nature biomedical engineering·2026
Same author

DEDUCE: statistical inference on disease-associated genes uncovers tissue-disease associations.

NAR genomics and bioinformatics·2026
Same author

Designing strongly coupled polaritonic structures via statistical machine learning.

Proceedings of the National Academy of Sciences of the United States of America·2025
Same author

JADE: Joint Alignment and Deep Embedding for Multi-Slice Spatial Transcriptomics.

bioRxiv : the preprint server for biology·2025
Same author

Participation bias in the estimation of heritability and genetic correlation.

Proceedings of the National Academy of Sciences of the United States of America·2025
Same journal

Correction to 'scSuperAnnotator: A platform for benchmarking comparison and visualizing automated cellular annotation methods for scRNA-seq data'.

Nucleic acids research·2026
Same journal

Correction to 'Differentiable partition function calculation for RNA'.

Nucleic acids research·2026
Same journal

Deployment of non-canonical splicing in tunicate genomes is mediated by divergent U2AF function and changing m6A modification in U1 and U6 snRNA.

Nucleic acids research·2026
Same journal

Bacillus subtilis DnaB forms multiple protein-protein interactions essential for DNA replication initiation.

Nucleic acids research·2026
Same journal

Multiple forms of protein-protein and DNA binding are exhibited by BrxC from the BREX phage restriction system.

Nucleic acids research·2026
Same journal

Biosynthesis of glycosylated 5-hydroxycytosine in the DNA of diverse viruses.

Nucleic acids research·2026
See all related articles

This study introduces Smoothing Spline Clustering (SSC) to discover gene expression patterns over time. The method identifies novel, biologically meaningful gene expression curves and their functions in model organisms.

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Systems Biology

Background:

  • Gene expression is a dynamic, continuous process over time.
  • Identifying shared temporal expression patterns (functional forms) is crucial but challenging.
  • Existing methods often require pre-specifying the number of patterns or their shapes.

Purpose of the Study:

  • To develop a novel computational approach for discovering gene expression patterns and their underlying functions directly from data.
  • To identify and characterize distinct temporal gene expression profiles without prior assumptions on cluster number or functional form.
  • To provide a robust method that accounts for biological variability, measurement error, and missing data in gene expression time-series.

Main Methods:

  • Developed Smoothing Spline Clustering (SSC), an algorithm that models gene expression as continuous functions (curves).

Related Experiment Videos

  • SSC handles within-cluster gene expression variability, experimental noise, and missing data.
  • The method generates a 'mean curve' with confidence bands for visual summary and goodness-of-fit assessment.
  • Main Results:

    • Applied SSC to Drosophila melanogaster and Caenorhabditis elegans life-cycle gene expression data.
    • Discovered 17 unique expression patterns in D. melanogaster and 16 in C. elegans.
    • Identified novel and known patterns, with most showing significant biological function enrichment.

    Conclusions:

    • SSC is an effective method for discovering biologically meaningful gene expression patterns and functions from time-series data.
    • The approach offers a flexible and robust alternative to traditional clustering methods for gene expression analysis.
    • The freely available SSClust software facilitates the application of this method in diverse biological studies.