Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Probability Histograms

Probability Histograms

A probability histogram is a visual representation of a probability distribution. Similar a typical histogram, the probability histogram consists of contiguous (adjoining) boxes. It has both a horizontal axis and a vertical axis. The horizontal axis is labeled with what the data represents. The vertical axis is labeled with probability. Each rectangular bar in the histogram is 1 unit wide, which suggests that the area under each bar equals the probability, P(x), where x is 1, 2, 3, and so on.

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

Probability Distributions

Probability Distributions

The probability of a random variable x is the likelihood of its occurrence. A probability distribution represents the probabilities of a random variable using a formula, graph, or table. There are two types of probability distribution– discrete probability distribution and continuous probability distribution.
A discrete probability distribution is a probability distribution of discrete random variables. It can be categorized into binomial probability distribution and Poisson probability...

Binomial Probability Distribution

Binomial Probability Distribution

A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...

DNA Microarrays

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...

Probability in Statistics

Probability in Statistics

Probability is the likelihood of an event occurring. The term event is defined as a collection of results of a procedure. An event is a simple event when an outcome cannot be divided into simpler parts.
An example of a simple event is a coin toss. The result of a coin toss is either a head or a tail. Here, head and tail are two simple events. These two simple events make up the sample space. Further, the probability of an event occurring falls within the range of 0 to 1. The probability of an...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

Laser Light Scattering-Enhanced Deep Computer Vision Method for the Detection of Trace Mineral Oil in Vegetable Oils.

Analytical chemistry·2026

Same author

An image-based food quality analysis framework driven by large language models.

Analytical methods : advancing methods and applications·2026

Same author

Multiple carbon quantum dots-enhanced excitation-emission matrix fluorescence strategy for accurate vintage discrimination of Anhua dark tea.

Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy·2026

Same author

The mechanism of auxin driving <i>Xanthium strumarium</i> invasion.

Frontiers in plant science·2025

Same author

Data-driven discovery of near-infrared type I photosensitizers for RNA-targeted tumor photodynamic therapy.

Chemical science·2025

Same author

Information Encryption and Decryption Based on Excitation-Emission Matrix Fluorescence Hyperspectral Imaging and Multiway Chemometrics.

Analytical chemistry·2025

Same journal

Pixel-wise assessment of industrial compost transformation by NIR hyperspectral imaging and chemometrics: an early-warning tool for process monitoring.

Talanta·2026

Same journal

Bifunctional covalent organic framework for rapid isolation of extracellular vesicles and proteomics-based biomarker discovery.

Talanta·2026

Same journal

Machine learning-assisted smart electrochemical platform: High-sensitivity simultaneous detection of rutin and luteolin in biological samples.

Talanta·2026

Same journal

Simultaneous separation of ezetimibe and its seven stereoisomers by supercritical fluid chromatography on a polysaccharide-based chiral stationary phase: Optimization and thermodynamic analysis.

Talanta·2026

Same journal

A novel borondifluoro indolenine-functionalized red-shift ratiometric fluorescent probe for detection of hypochlorous acid in drug-induced liver injury.

Talanta·2026

Same journal

A cost-performance index for nano-optical biosensor evaluation: Systematic evaluation of europium-salicylate luminescent platforms for GPC3-targeted early HCC diagnosis.

Talanta·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Variable selection using probability density function similarity for support vector machine classification of

Li-Juan Tang¹, Jian-Hui Jiang, Hai-Long Wu

¹State Key Laboratory of Chemo/Biosensing and Chemometrics, College of Chemistry and Chemical Engineering, Hunan University, Changsha 410082, PR China.

|June 30, 2009

Summary

This summary is machine-generated.

This study introduces a novel gene mining approach for microarray data analysis, identifying significant genes for disease subtyping. The method enhances cancer classification accuracy when combined with support vector machines.

Related Experiment Videos

Last Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

Bioinformatics
Computational Biology
Genomics

Background:

Microarray data analysis often suffers from irrelevant or redundant gene information, necessitating effective variable selection methods.
Identifying genes crucial for discriminating specific disease subtypes is challenging yet vital for accurate diagnosis.

Purpose of the Study:

To develop a novel gene mining approach for identifying significant genes informative for individual disease classes.
To integrate this gene mining method with a support vector machine for improved disease classification.

Main Methods:

A new gene mining approach based on probability density function similarity was proposed.
Significant genes were identified for each class, focusing on discriminative power for individual subtypes.
A support vector machine with local kernel transform was constructed using the selected significant genes.

Main Results:

The proposed gene mining method successfully identified significant genes for each cancer type.
The combined approach demonstrated satisfactory performance in both training and prediction across two public cancer datasets.
This method allows for the selection of genes highly relevant to specific disease subtypes.

Conclusions:

The novel gene mining approach effectively addresses challenges in microarray data analysis by selecting informative genes.
The integration with support vector machines provides a robust method for accurate disease classification, particularly for cancer subtypes.
This strategy enhances the utility of microarray data for personalized medicine and diagnostic applications.