Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Probability Histograms01:17

Probability Histograms

A probability histogram is a visual representation of a probability distribution. Similar a typical histogram, the probability histogram consists of contiguous (adjoining) boxes. It has both a horizontal axis and a vertical axis. The horizontal axis is labeled with what the data represents. The vertical axis is labeled with probability. Each rectangular bar in the histogram is 1 unit wide, which suggests that the area under each bar equals the probability, P(x), where x is 1, 2, 3, and so on.
Expected Frequencies in Goodness-of-Fit Tests01:19

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).
Probability Distributions01:32

Probability Distributions

The probability of a random variable x  is the likelihood of its occurrence. A probability distribution represents the probabilities of a random variable using a formula, graph, or table. There are two types of probability distribution– discrete probability distribution and continuous probability distribution.
A discrete probability distribution is a probability distribution of discrete random variables. It can be categorized into binomial probability distribution and Poisson probability...
Binomial Probability Distribution01:15

Binomial Probability Distribution

A binomial distribution is a probability distribution for a procedure with a fixed number of trials, where each trial can have only two outcomes.
The outcomes of a binomial experiment fit a binomial probability distribution. A statistical experiment can be classified as a binomial experiment if the following conditions are met:
There are a fixed number of trials. Think of trials as repetitions of an experiment. The letter n denotes the number of trials.
There are only two possible outcomes,...
DNA Microarrays02:34

DNA Microarrays

Microarrays are high-throughput and relatively inexpensive assays that can be automated to analyze large quantities of data at a time. They are used in genome-wide studies to compare gene or protein expression under two varied conditions, such as healthy and diseased states. Microarrays consist of glass or silica slides on which probe molecules are covalently attached through surface functionalization. Most commonly, the slides are prepared through the chemisorption of silanes to silica...
Probability in Statistics01:14

Probability in Statistics

Probability is the likelihood of an event occurring. The term event is defined as a collection of results of a procedure. An event is a simple event when an outcome cannot be divided into simpler parts.
An example of a simple event is a coin toss. The result of a coin toss is either a head or a tail. Here, head and tail are two simple events. These two simple events make up the sample space. Further, the probability of an event occurring falls within the range of 0 to 1. The probability of an...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Laser Light Scattering-Enhanced Deep Computer Vision Method for the Detection of Trace Mineral Oil in Vegetable Oils.

Analytical chemistry·2026
Same author

An image-based food quality analysis framework driven by large language models.

Analytical methods : advancing methods and applications·2026
Same author

Multiple carbon quantum dots-enhanced excitation-emission matrix fluorescence strategy for accurate vintage discrimination of Anhua dark tea.

Spectrochimica acta. Part A, Molecular and biomolecular spectroscopy·2026
Same author

The mechanism of auxin driving <i>Xanthium strumarium</i> invasion.

Frontiers in plant science·2025
Same author

Data-driven discovery of near-infrared type I photosensitizers for RNA-targeted tumor photodynamic therapy.

Chemical science·2025
Same author

Information Encryption and Decryption Based on Excitation-Emission Matrix Fluorescence Hyperspectral Imaging and Multiway Chemometrics.

Analytical chemistry·2025
Same journal

Pixel-wise assessment of industrial compost transformation by NIR hyperspectral imaging and chemometrics: an early-warning tool for process monitoring.

Talanta·2026
Same journal

Bifunctional covalent organic framework for rapid isolation of extracellular vesicles and proteomics-based biomarker discovery.

Talanta·2026
Same journal

Machine learning-assisted smart electrochemical platform: High-sensitivity simultaneous detection of rutin and luteolin in biological samples.

Talanta·2026
Same journal

Simultaneous separation of ezetimibe and its seven stereoisomers by supercritical fluid chromatography on a polysaccharide-based chiral stationary phase: Optimization and thermodynamic analysis.

Talanta·2026
Same journal

A novel borondifluoro indolenine-functionalized red-shift ratiometric fluorescent probe for detection of hypochlorous acid in drug-induced liver injury.

Talanta·2026
Same journal

A cost-performance index for nano-optical biosensor evaluation: Systematic evaluation of europium-salicylate luminescent platforms for GPC3-targeted early HCC diagnosis.

Talanta·2026
See all related articles

Related Experiment Video

Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Variable selection using probability density function similarity for support vector machine classification of

Li-Juan Tang1, Jian-Hui Jiang, Hai-Long Wu

  • 1State Key Laboratory of Chemo/Biosensing and Chemometrics, College of Chemistry and Chemical Engineering, Hunan University, Changsha 410082, PR China.

Talanta
|June 30, 2009
PubMed
Summary
This summary is machine-generated.

This study introduces a novel gene mining approach for microarray data analysis, identifying significant genes for disease subtyping. The method enhances cancer classification accuracy when combined with support vector machines.

Related Experiment Videos

Last Updated: Jun 22, 2026

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Genomics

Background:

  • Microarray data analysis often suffers from irrelevant or redundant gene information, necessitating effective variable selection methods.
  • Identifying genes crucial for discriminating specific disease subtypes is challenging yet vital for accurate diagnosis.

Purpose of the Study:

  • To develop a novel gene mining approach for identifying significant genes informative for individual disease classes.
  • To integrate this gene mining method with a support vector machine for improved disease classification.

Main Methods:

  • A new gene mining approach based on probability density function similarity was proposed.
  • Significant genes were identified for each class, focusing on discriminative power for individual subtypes.
  • A support vector machine with local kernel transform was constructed using the selected significant genes.

Main Results:

  • The proposed gene mining method successfully identified significant genes for each cancer type.
  • The combined approach demonstrated satisfactory performance in both training and prediction across two public cancer datasets.
  • This method allows for the selection of genes highly relevant to specific disease subtypes.

Conclusions:

  • The novel gene mining approach effectively addresses challenges in microarray data analysis by selecting informative genes.
  • The integration with support vector machines provides a robust method for accurate disease classification, particularly for cancer subtypes.
  • This strategy enhances the utility of microarray data for personalized medicine and diagnostic applications.