Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Scaling01:26

Scaling

331
In designing and analyzing filters, resonant circuits, or circuit analysis at large, working with standard element values like 1 ohm, 1 henry, or 1 farad can be convenient before scaling these values to more realistic figures. This approach is widely utilized by not employing realistic element values in numerous examples and problems; it simplifies mastering circuit analysis through convenient component values. The complexity of calculations is thereby reduced, with the understanding that...
331
Calibration Curves: Linear Least Squares01:20

Calibration Curves: Linear Least Squares

2.5K
A calibration curve is a plot of the instrument's response against a series of known concentrations of a substance. This curve is used to set the instrument response levels, using the substance and its concentrations as standards. Alternatively, or additionally, an equation is fitted to the calibration curve plot and subsequently used to calculate the unknown concentrations of other samples reliably.
For data that follow a straight line, the standard method for fitting is the linear...
2.5K
Residuals and Least-Squares Property01:11

Residuals and Least-Squares Property

7.9K
The vertical distance between the actual value of y and the estimated value of y. In other words, it measures the vertical distance between the actual data point and the predicted point on the line
If the observed data point lies above the line, the residual is positive, and the line underestimates the actual data value for y. If the observed data point lies below the line, the residual is negative, and the line overestimates the actual data value for y.
The process of fitting the best-fit...
7.9K
Linear Approximation in Frequency Domain01:26

Linear Approximation in Frequency Domain

144
Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....
144
Regression Toward the Mean01:52

Regression Toward the Mean

6.5K
Regression toward the mean (“RTM”) is a phenomenon in which extremely high or low values—for example, and individual’s blood pressure at a particular moment—appear closer to a group’s average upon remeasuring. Although this statistical peculiarity is the result of random error and chance, it has been problematic across various medical, scientific, financial and psychological applications. In particular, RTM, if not taken into account, can interfere when...
6.5K
Range Rule of Thumb to Interpret Standard Deviation01:13

Range Rule of Thumb to Interpret Standard Deviation

9.4K
The range rule of thumb in statistics helps us calculate a dataset's minimum and maximum values with known standard deviation. This rule is based on the concept that 95% of all values in a dataset lie within two standard deviations from the mean.
For instance, the range rule of thumb can be used to find the tallest and the shortest student in a class, given the mean student height and standard deviation. If the mean student height is 1.6 m and the standard deviation, s is 0.05 m, the height...
9.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Variational Garrote for Statistical Physics-based Sparse and Robust Variable Selection.

ArXiv·2026
Same author

Multiplicative Learning.

ArXiv·2025
Same author

Annealing approach to root finding.

Physical review. E·2024
Same author

Annealing approach to root-finding.

ArXiv·2024
Same author

Mirror Descent of Hopfield Model.

Neural computation·2023
Same author

Inference of stochastic time series with missing data.

Physical review. E·2021
Same journal

Erratum: Low-dimensional model for adaptive networks of spiking neurons [Phys. Rev. E 111, 014422 (2025)].

Physical review. E·2026
Same journal

Disentangling the effects of many-body forces on depletion interactions.

Physical review. E·2026
Same journal

Charge transport and mode transition in dual-energy electron beam diodes.

Physical review. E·2026
Same journal

Optimization of multisite reactions in complex compartmentalized media.

Physical review. E·2026
Same journal

Origin of geometric cohesion in nonconvex granular materials: Interplay between interdigitation and rotational constraints enhancing frictional stability.

Physical review. E·2026
Same journal

Interaction of walkers with a standing Faraday wave.

Physical review. E·2026
See all related articles

Related Experiment Video

Updated: Sep 22, 2025

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines
08:27

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines

Published on: January 5, 2024

1.3K

Scale-invariant representation of machine learning.

Sungyeop Lee1, Junghyo Jo2,3

  • 1Department of Physics and Astronomy, Seoul National University, Seoul 08826, Korea.

Physical Review. E
|May 20, 2022
PubMed
Summary
This summary is machine-generated.

Machine learning models compress frequent data into internal codes and identify outliers, following power law distributions. This scale-invariant representation optimizes data grouping for learning accuracy.

More Related Videos

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K
Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine
07:05

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine

Published on: October 27, 2016

9.3K

Related Experiment Videos

Last Updated: Sep 22, 2025

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines
08:27

Author Spotlight: Efficient Image Recognition Using Directional Gradient Histogram Technique and Support Vector Machines

Published on: January 5, 2024

1.3K
Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data
14:27

Identification of Disease-related Spatial Covariance Patterns using Neuroimaging Data

Published on: June 26, 2013

15.8K
Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine
07:05

Visualization Method for Proprioceptive Drift on a 2D Plane Using Support Vector Machine

Published on: October 27, 2016

9.3K

Area of Science:

  • Machine Learning
  • Information Theory
  • Data Science

Background:

  • Machine learning's success stems from structured data representation.
  • Similar data share compressed internal representations for classification or clustering.
  • Observed power law distributions in internal codes/labels of learning models.

Purpose of the Study:

  • Derive the process behind power law emergence in machine learning.
  • Explain the information-theoretic basis of scale-invariant data representation.
  • Connect data grouping uncertainty to learning accuracy.

Main Methods:

  • Analysis of internal representations in supervised and unsupervised learning.
  • Mathematical derivation of power law emergence.
  • Information-theoretic interpretation of data compression and outlier differentiation.

Main Results:

  • Identified power law distributions in machine learning's internal data representations.
  • Demonstrated that these distributions naturally arise from learning processes.
  • Linked scale-invariant representations to maximally uncertain data groupings.

Conclusions:

  • Power laws are a natural outcome of machine learning data compression and outlier detection.
  • Scale-invariant representations represent a state of maximal uncertainty for a given accuracy.
  • This finding provides a theoretical foundation for understanding machine learning behavior.