Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Ranks01:02

Ranks

Unlike parametric methods, nonparametric statistics are ideal for nominal and ordinal data, requiring fewer assumptions about the population's nature or distribution. This makes nonparametric methods easier to apply and interpret, as they do not depend on parameters like mean or standard deviation. One common approach in nonparametric analysis is to sort data according to a specific criterion. For instance, we might arrange weather data from hottest to coldest days in a month or rank cities...
Friedman Two-way Analysis of Variance by Ranks01:21

Friedman Two-way Analysis of Variance by Ranks

Friedman's Two-Way Analysis of Variance by Ranks is a nonparametric test designed to identify differences across multiple test attempts when traditional assumptions of normality and equal variances do not apply. Unlike conventional ANOVA, which requires normally distributed data with equal variances, Friedman's test is ideal for ordinal or non-normally distributed data, making it particularly useful for analyzing dependent samples, such as matched subjects over time or repeated measures from...
Ordinal Level of Measurement00:55

Ordinal Level of Measurement

The way a set of data is measured is called its level of measurement. Correct statistical procedures depend on a researcher being familiar with levels of measurement. For analysis, data are classified into four levels of measurement—nominal, ordinal, interval, and ratio.
Data measured using an ordinal scale are similar to nominal scale data, but there is one major difference. The ordinal scale data can be ordered. An example of ordinal scale data is a list of the top five national parks in the...
Review and Preview01:10

Review and Preview

In statistics, several tools are used to interpret the data. Measures of central tendency represent the characteristics of the data, such as mean, median, and mode. Additionally, measures of variance like standard deviation and range are used to find the spread of data from the mean. Relative standing measures the distance between data locations. Commonly used measures of relative standings are percentile, z score, and quartiles.
Percentiles are a type of fractile that partition data into...
Selected Data About Geographic Locations01:25

Selected Data About Geographic Locations

Geographic Information Systems (GIS) rely on two core types of data: spatial data and attribute data.Spatial DataSpatial data defines the physical location of features within a coordinate system, typically expressed in terms of latitude and longitude. It provides precise positioning for elements like roads, rivers, or buildings.Attribute DataAttribute data complements spatial data by adding descriptive information about these features. For example, a road's spatial data includes its start and...
Lattice Centering and Coordination Number02:33

Lattice Centering and Coordination Number

The structure of a crystalline solid, whether a metal or not, is best described by considering its simplest repeating unit, which is referred to as its unit cell. The unit cell consists of lattice points that represent the locations of atoms or ions. The entire structure then consists of this unit cell repeating in three dimensions. The three different types of unit cells present in the cubic lattice are illustrated in Figure 1.
Types of Unit Cells
Imagine taking a large number of identical...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Refining the Definition for "Low Risk" in Pulmonary Arterial Hypertension: Time to Reduce Morbidity and Mortality.

JACC. Heart failure·2026
Same author

Reply to Blette and Kawut: Surrogate Endpoints are Risky Business in Pulmonary Arterial Hypertension.

American journal of respiratory and critical care medicine·2026
Same author

szKendall: spatial-structural-zero-aware dissimilarity measures for subtype discovery using single cell Hi-C data.

Nature communications·2026
Same author

Study of risk assessment scores as surrogate endpoints in pulmonary arterial hypertension trials.

American journal of respiratory and critical care medicine·2026
Same author

Can random walking on a Hi-C contact matrix lead to data quality improvement? An assessment.

PloS one·2025
Same author

Novel Spatial-Structural-Zero-Aware Dissimilarity Measures for Subtype Discovery Using Single Cell Hi-C Data.

bioRxiv : the preprint server for biology·2025
Same journal

Balanced mediated pathway detection in genomic data.

Statistical applications in genetics and molecular biology·2026
Same journal

Annealed variational mixtures for disease subtyping and biomarker discovery.

Statistical applications in genetics and molecular biology·2026
Same journal

Performance of the permutation test approach with base calling errors for detecting changes in variant allele frequencies in ctDNA for a single patient.

Statistical applications in genetics and molecular biology·2026
Same journal

BLOG: Bayesian longitudinal omics with group constraints.

Statistical applications in genetics and molecular biology·2026
Same journal

AI-driven risk prediction and categorization in cystic fibrosis leveraging AttentiveLSTM and Fox Wolf Optimizer.

Statistical applications in genetics and molecular biology·2026
Same journal

Perfect collinearity not created equal: measuring and visualizing the severity of multi-collinearity of modern omics data.

Statistical applications in genetics and molecular biology·2026
See all related articles

Related Experiment Video

Updated: Jun 13, 2026

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

Space oriented rank-based data integration.

Shili Lin1

  • 1The Ohio State University, USA. shili@stat.ohio-state.edu

Statistical Applications in Genetics and Molecular Biology
|May 4, 2010
PubMed
Summary
This summary is machine-generated.

Integrating omics data requires considering the unique spaces of each data source. New algorithms, inspired by Borda counts or Markov chains, explicitly account for these spaces, improving data integration and reducing bias.

More Related Videos

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Related Experiment Videos

Last Updated: Jun 13, 2026

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues
07:08

Optimization of Synthetic Proteins: Identification of Interpositional Dependencies Indicating Structurally and/or Functionally Linked Residues

Published on: July 14, 2015

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering
09:43

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Area of Science:

  • Bioinformatics
  • Computational Biology
  • Systems Biology

Background:

  • Integrating data from multiple omics platforms presents a significant challenge in complex systems research.
  • Differences in underlying data spaces across platforms can lead to inefficient data use, biases, and suboptimal results.

Purpose of the Study:

  • To propose novel heuristic algorithms for integrating ranked lists from omics data.
  • To explicitly address the challenge of differing underlying data spaces in multi-platform omics integration.

Main Methods:

  • Development of two classes of space-oriented heuristic algorithms.
  • Algorithms are inspired by Borda count methods or Markov chain principles.
  • Explicitly incorporating the underlying spaces of individual ranked lists into the integration process.

Main Results:

  • The proposed algorithms effectively integrate ranked lists from omics data by considering underlying spaces.
  • Demonstrated successful application in aggregating results from heterogeneous gene expression studies (cDNA and Affymetrix).
  • Highlighted the importance of accounting for platform-specific data spaces, such as between cDNA and Affymetrix platforms.

Conclusions:

  • Space-oriented algorithms offer a more effective approach to multi-platform omics data integration.
  • Explicitly considering underlying data spaces mitigates biases and improves the efficiency of omics data analysis.
  • The developed methods provide a robust framework for combining diverse omics datasets.