You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Same author

An IoT-Enabled E-Nose for Remote Detection and Monitoring of Airborne Pollution Hazards Using LoRa Network Protocol.

Sensors (Basel, Switzerland)·2023

Same author

Federated blockchain system (FBS) for the healthcare industry.

Scientific reports·2023

Same author

Personal Digital Twin: A Close Look into the Present and a Step towards the Future of Personalised Healthcare Industry.

Sensors (Basel, Switzerland)·2022

Same author

Blockchain-Based Digital Twins Collaboration for Smart Pandemic Alerting: Decentralized COVID-19 Pandemic Alerting Use Case.

Computational intelligence and neuroscience·2022

Same author

An Optimized Hybrid Deep Learning Model to Detect COVID-19 Misleading Information.

Computational intelligence and neuroscience·2021

Same author

Exploiting Reused-Based Sharing Work Opportunities in Big Data Multiquery Optimization with Flink.

Big data·2021

Same journal

Big Data-Driven Video Anomaly Detection Using VideoMAE for Visual Analytics in CCTV Surveillance.

Big data·2026

Same journal

Agentic Artificial Intelligence-Driven Explainable Deep Learning for Deciphering Noncoding Pathogenic Mechanisms of Delirium Through Genomic Big Data Integration.

Big data·2026

Same journal

Personalized Driven Instruction Through Explainable Agentic AI in Multicultural Higher Education Environments.

Big data·2026

Same journal

Big Data-Driven Explainable Agentic AI Decision Frameworks for Enterprise Innovation in FinTech Ecosystems.

Big data·2026

Same journal

An Edge-Enabled Low-Latency Cross-Lingual Speech-to-Text Framework for Efficient Human-Robot Interaction.

Big data·2026

Same journal

DS²PT: A Deep Two-Stage Patent Text Segmentation Framework Informed by Low-Latency Neural Network Characteristics.

Big data·2026

Related Experiment Video

Updated: Dec 29, 2025

Executing Complexity-Increasing Queries in Relational MySQL and NoSQL MongoDB and EXist Size-Growing ISO/EN 13606 Standardized EHR Databases

Published on: March 19, 2018

SOOM: Sort-Based Optimizer for Big Data Multi-Query.

Radhya Sahal¹,², Mohammed H Khafagy³, Fatma A Omara¹

¹Faculty of Computers and Information, Cairo University, Cairo, Egypt.

January 31, 2020

Summary

This summary is machine-generated.

Optimizing Big Data multi-queries by reusing shared sort operations significantly reduces execution time and data movement. The SOOM system enhances previous methods by exploiting both explicit and implicit sorts, improving efficiency.

Keywords:

Big Data; aggregation; multi-query optimization; sharing opportunity; sort

More Related Videos

Databases to Efficiently Manage Medium Sized, Low Velocity, Multidimensional Data in Tissue Engineering

Published on: November 22, 2019

Cloud-Based Phrase Mining and Analysis of User-Defined Phrase-Category Association in Biomedical Publications

Published on: February 23, 2019

Area of Science:

Computer Science
Data Management
Database Systems

Background:

Sorting data is resource-intensive, especially in Big Data environments with multiple queries.
Shared sort operations in Big Data multi-queries incur high I/O costs due to repeated data shuffling.
Existing systems like MOTH optimize data sharing but overlook redundant network movement for sorting.

Purpose of the Study:

To address the overheads of redundant data movement in Big Data multi-queries.
To develop an optimized system for handling both explicit and implicit sort operations in multi-query scenarios.
To extend the MOTH system to exploit sharing sort opportunities for improved efficiency.

Main Methods:

Extended the Multi-Query Optimization Using Tuple Size and Histogram (MOTH) system to create the Sort-Based Optimizer over MOTH (SOOM) system.
Introduced two new modules, the query explorer and the sort exploiter, to identify and leverage sharing sort opportunities.
Integrated SOOM with the existing MOTH system to optimize multiple aggregation and sort queries.
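
The idea of sharing a sort across queries can be illustrated with a minimal Python sketch. This is not the SOOM or MOTH implementation: the `SortCache` class, the query functions, and the sample records are all hypothetical, and the sketch runs in memory rather than on a Hadoop-like cluster. It only shows that a query with an explicit sort and an aggregation query with an implicit sort (grouping needs equal keys to be adjacent) can both be served by a single sort on the same key.

```python
from itertools import groupby
from operator import itemgetter

# Hypothetical records (region, amount); not data from the paper.
RECORDS = [("east", 5), ("west", 2), ("east", 1), ("north", 7), ("west", 4)]


class SortCache:
    """Keeps one sorted copy of the data per sort key (illustrative only)."""

    def __init__(self, records):
        self.records = records
        self.sorted_by = {}   # key index -> sorted list of records
        self.sort_calls = 0   # number of sorts actually performed

    def get_sorted(self, key_index):
        # Sort only the first time a key is requested; reuse afterwards.
        if key_index not in self.sorted_by:
            self.sort_calls += 1
            self.sorted_by[key_index] = sorted(
                self.records, key=itemgetter(key_index)
            )
        return self.sorted_by[key_index]


def order_by_region(cache):
    # Explicit sort: the query itself asks for ordered output.
    return cache.get_sorted(0)


def total_per_region(cache):
    # Implicit sort: the aggregation groups adjacent equal keys, so it can
    # reuse the sorted copy instead of sorting the data a second time.
    rows_by_region = groupby(cache.get_sorted(0), key=itemgetter(0))
    return {
        region: sum(amount for _, amount in rows)
        for region, rows in rows_by_region
    }


cache = SortCache(RECORDS)
ordered = order_by_region(cache)
totals = total_per_region(cache)
print(cache.sort_calls, totals)
```

Running both queries against the same cache performs one sort instead of two. In a distributed setting the saving would come from avoiding a repeated shuffle, which this in-memory sketch does not model.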

Main Results:

The SOOM system reduced query execution time by 45% compared to naive methods and 30% compared to state-of-the-art techniques.
Reduced intermediate data size by an average of 67% relative to naive methods and 61% relative to state-of-the-art techniques.
Demonstrated improved performance on Hadoop-like infrastructures.

Conclusions:

The SOOM system effectively optimizes Big Data multi-queries by exploiting sharing sort opportunities.
Reusing intermediate sort results significantly cuts down execution time and network data transfer.
SOOM offers a substantial improvement over existing methods for Big Data query optimization.