Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Comparing the Survival Analysis of Two or More Groups

Comparing the Survival Analysis of Two or More Groups

Survival analysis is a cornerstone of medical research, used to evaluate the time until an event of interest occurs, such as death, disease recurrence, or recovery. Unlike standard statistical methods, survival analysis is particularly adept at handling censored data—instances where the event has not occurred for some participants by the end of the study or remains unobserved. To address these unique challenges, specialized techniques like the Kaplan-Meier estimator, log-rank test, and Cox...

Cluster Sampling Method

Cluster Sampling Method

Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical Inference Techniques in Hypothesis Testing: Parametric Versus Nonparametric Data

Statistical inference techniques, paramount in hypothesis testing, differentiate into two broad categories: parametric and nonparametric statistics.
Parametric statistics, as the name suggests, assumes that data follow a specific distribution, often a normal distribution. This assumption enables robust hypothesis testing and estimation. Parametric methods, like the Student's t-test or Goodness-of-fit test, are frequently employed in biostatistics due to their robustness. For instance, comparing...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Distributions to Estimate Population Parameter

Distributions to Estimate Population Parameter

The accurate values of population parameters such as population proportion, population mean, and population standard deviation (or variance) are usually unknown. These are fixed values that can only be estimated from the data collected from the samples. The estimates of each of these parameters are sample proportion, the sample mean, and sample standard deviation (or variance). To obtain the values of these sample statistics, data are required that have particular distribution and central...

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Detection of an arbitrary number of communities in a block spin Ising model.

PloS one·2026

Same author

COVID-19 clinical footprint to infer about mortality.

Journal of the Royal Statistical Society. Series A, (Statistics in Society)·2024

Same author

Latent Nested Nonparametric Priors (with Discussion).

Bayesian analysis·2022

Same author

Prior Sensitivity Analysis in a Semi-Parametric Integer-Valued Time Series Model.

Entropy (Basel, Switzerland)·2020

Same author

Using posterior predictive distributions to analyse epidemic models: COVID-19 in Mexico City.

Physical biology·2020

Same author

Two-group Poisson-Dirichlet mixtures for multiple testing.

Biometrics·2020

Same journal

GMSA: A Graph Matching and Point Cloud Registration-Based Method for Spatial Transcriptomics Data Alignment.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Investigations on Multiple Protein Scaffold Filling.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Cell Type Prediction for Single-Cell RNA Sequencing Utilizing Unsupervised Domain Adaptation and Semi-Supervised Learning.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

PPIGAN: Prediction of Protein-Protein Interactions Using Generative Adversarial Networks.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Deep Structure-Enhanced Cell Clustering Model for Single-Cell RNA Sequencing Data.

Journal of computational biology : a journal of computational molecular cell biology·2026

Same journal

Asymmetric Drug-Drug Interaction Prediction Based on Generative Adversarial Networks and Knowledge Graph.

Journal of computational biology : a journal of computational molecular cell biology·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 27, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

A Bayesian nonparametric approach for comparing clustering structures in EST libraries.

Antonio Lijoi¹, Ramsés H Mena, Igor Prünster

¹Department of Economics and Quantitative Methods, University of Pavia, Pavia, Italy.

Journal of Computational Biology : a Journal of Computational Molecular Cell Biology

|December 2, 2008

Summary

This summary is machine-generated.

This study introduces a Bayesian nonparametric approach using the Poisson-Dirichlet process to analyze clustering in Expressed Sequence Tags (ESTs) data. It evaluates cDNA library redundancy and compares library compatibility, aiding in data quality assessment.

Related Experiment Videos

Last Updated: Jun 27, 2026

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Large-scale Reconstructions and Independent, Unbiased Clustering Based on Morphological Metrics to Classify Neurons in Selective Populations

Published on: February 15, 2017

Area of Science:

Bioinformatics
Computational Biology
Statistical Genetics

Background:

Expressed Sequence Tags (ESTs) are crucial for gene discovery and analysis.
Evaluating cDNA library redundancy and comparing library structures are essential for accurate biological inference.
Existing methods may not fully capture the complex clustering mechanisms within EST data.

Purpose of the Study:

To develop a robust method for assessing cDNA library redundancy.
To compare the clustering structures of different EST libraries.
To evaluate the impact of error correction on EST data and assess library compatibility.

Main Methods:

Utilizing a Bayesian nonparametric approach for data analysis.
Employing the two-parameter Poisson-Dirichlet (PD) process as a specific nonparametric model.
Implementing a full Bayesian analysis with a described computational algorithm.

Main Results:

The proposed method effectively evaluates cDNA library redundancy.
Numerical results demonstrate the comparison of library clustering structures.
The approach assesses the effect of error correction and the compatibility of EST libraries.

Conclusions:

The Bayesian nonparametric method, specifically the PD process, provides a powerful framework for analyzing EST data clustering.
This approach enhances the understanding of library redundancy and compatibility.
The findings support improved data quality assessment and comparative analysis of biological libraries.