Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Correlation and Regression00:53

Correlation and Regression

1.3K
In statistics, correlation describes the degree of association between two variables. In the subfield of linear regression, correlation is mathematically expressed by the correlation coefficient, which describes the strength and direction of the relationship between two variables. The coefficient is symbolically represented by 'r' and ranges from -1 to +1. A positive value indicates a positive correlation where the two variables move in the same direction. A negative value suggests a...
1.3K
Multiple Regression01:25

Multiple Regression

3.0K
Multiple regression assesses a linear relationship between one response or dependent variable and two or more independent variables. It has many practical applications.
Farmers can use multiple regression to determine the crop yield based on more than one factor, such as water availability, fertilizer, soil properties, etc. Here, the crop yield is the response or dependent variable as it depends on the other independent variables. The analysis requires the construction of a scatter plot...
3.0K
Coefficient of Correlation01:12

Coefficient of Correlation

6.2K
The correlation coefficient, r, developed by Karl Pearson in the early 1900s, is numerical and provides a measure of strength and direction of the linear association between the independent variable x and the dependent variable y.
If you suspect a linear relationship between x and y, then r can measure how strong the linear relationship is.
What the VALUE of r tells us:
The value of r is always between –1 and +1: –1 ≤ r ≤ 1.
The size of the correlation r indicates the...
6.2K
Frequency-dependent Selection01:21

Frequency-dependent Selection

22.1K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
22.1K
Spearman's Rank Correlation Test01:20

Spearman's Rank Correlation Test

840
Spearman's rank correlation test, also known as Spearman's rho, is a nonparametric method for assessing the strength and direction of association between two variables. This test is particularly valuable when the data distribution is unknown or when the assumption of normality does not hold. Named after the English psychologist and statistician Dr. Charles Edward Spearman, it serves as the nonparametric counterpart to Pearson's correlation coefficient.
Spearman's test calculates...
840
Random Sampling Method01:09

Random Sampling Method

11.2K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
11.2K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Hydrophilic Janus Micelles From an ABC Triblock Copolymer.

Angewandte Chemie (International ed. in English)·2026
Same author

Transferrin receptor 1-targeted polymersomes therapy for colorectal cancer.

Materials today. Bio·2025
Same author

Membrane interactions of Ocellatins. Where do antimicrobial gaps stem from?

Amino acids·2021
Same journal

Research on a Regional Availability Evaluation Model for Road-Area High-Entropy Energy Based on Synergy Factors.

Entropy (Basel, Switzerland)·2026
Same journal

Atmospheric Turbulence Channel Modeling and Performance Analysis of a CO-ZP-OFDM Coherent Optical Communication System for UAV Air-to-Ground Scenarios.

Entropy (Basel, Switzerland)·2026
Same journal

Information Geometry and Asymptotic Theory for SMML Estimators.

Entropy (Basel, Switzerland)·2026
Same journal

Correlation Entropy and Power-Law Kinetics.

Entropy (Basel, Switzerland)·2026
Same journal

Research on the Contagion of Systemic Financial Risk Under the Impact of Climate Risks-From the Perspective of Complex Networks and Machine Learning.

Entropy (Basel, Switzerland)·2026
Same journal

The Statistical-Mechanical Meaning of the Wave Function of Quantum Mechanics.

Entropy (Basel, Switzerland)·2026
See all related articles

Related Experiment Video

Updated: Jul 15, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K

Distance Correlation-Based Feature Selection in Random Forest.

Suthakaran Ratnasingam1, Jose Muñoz-Lopez1

  • 1Department of Mathematics, California State University, San Bernardino, CA 92407, USA.

Entropy (Basel, Switzerland)
|September 28, 2023
PubMed
Summary
This summary is machine-generated.

Distance correlation offers a robust alternative to Pearson correlation for detecting all dependencies, not just linear ones. Our new filter method using distance correlation excels in Random Forest regression for high-dimensional, nonlinear data.

Keywords:
Pearson correlationdistance correlationfeature selectionrandom forest

More Related Videos

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

798

Related Experiment Videos

Last Updated: Jul 15, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.5K
A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments
08:12

A Psychophysics Paradigm for the Collection and Analysis of Similarity Judgments

Published on: March 1, 2022

2.6K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

798

Area of Science:

  • Statistics
  • Machine Learning
  • Data Mining

Background:

  • Pearson correlation coefficient (ρ) is limited to linear relationships between variables.
  • Existing feature selection methods may not capture complex, nonlinear dependencies.
  • There is a need for methods that can identify all types of variable dependencies.

Purpose of the Study:

  • To propose a novel filter method for feature selection in Random Forest regression.
  • To utilize distance correlation as a criterion for identifying relevant features.
  • To evaluate the proposed method's performance against existing techniques.

Main Methods:

  • Implementing a filter method based on distance correlation for feature selection.
  • Utilizing Random Forest regression as the predictive model.
  • Conducting extensive simulation studies across various data settings.
  • Comparing prediction mean squared error (MSE) with existing methods.

Main Results:

  • The proposed distance correlation filter method is competitive with existing approaches.
  • The method significantly outperforms other techniques in high-dimensional (p≥300) datasets with nonlinear relationships.
  • The method demonstrates practical applicability through real-world data examples.

Conclusions:

  • Distance correlation is a powerful tool for feature selection, capturing nonlinear dependencies.
  • The proposed filter method enhances Random Forest regression performance, especially in high-dimensional, complex datasets.
  • This approach offers a valuable advancement for statistical modeling and machine learning applications.