Achieving Reliability in Cloud Computing by a Novel Hybrid Approach
View abstract on PubMed
Summary
This summary is machine-generated.This study enhances cloud computing fault tolerance by comparing machine learning algorithms and proposes a modified sequential minimal optimization (MSMO) with delta-checkpointing for improved reliability and accuracy.
Area Of Science
- Cloud Computing
- Machine Learning
- Fault Tolerance
Background
- Cloud computing (CC) offers significant benefits but faces challenges in resource allocation, security, and fault tolerance (FT).
- Existing fault tolerance methods struggle with heterogeneity, lack of standards, and the need for automation in cloud environments.
- Accurate fault prediction and system reliability are critical for robust cloud infrastructure.
Purpose Of The Study
- To evaluate the performance of various machine learning (ML) algorithms for fault prediction in cloud computing.
- To identify the most accurate and reliable ML classifier for virtual machine (VM) fault detection.
- To propose an improved fault tolerance method combining ML with delta-checkpointing.
Main Methods
- Comparison of ML algorithms including Naïve Bayes (NB), Support Vector Machine (SVM), Logistic Regression (MLR), K-Nearest Neighbor (KNN), and Random Forest (RF).
- Utilized both secondary data from an HPC system and primary data from virtual machines (VMs).
- Employed data splitting ratios (80/20, 70/30) and 5-fold cross-validation for model evaluation.
- Introduced a modified Sequential Minimal Optimization (MSMO) algorithm coupled with delta-checkpointing (D-CP).
Main Results
- Naïve Bayes excelled in CPU-Mem fault prediction, while Sequential Minimal Optimization (SMO) performed well on HDD fault prediction using secondary data.
- Random Forest showed high accuracy but poor time complexity with primary data; SMO offered a balance of accuracy and time efficiency.
- The proposed Modified Sequential Minimal Optimization (MSMO) with Delta-Checkpointing (D-CP) demonstrated enhanced accuracy, reduced fault prediction error, and improved reliability.
Conclusions
- Machine learning algorithms show promise in enhancing cloud computing fault tolerance.
- The modified MSMO algorithm combined with D-CP offers a superior approach for improving cloud system reliability and fault prediction.
- This research contributes to more robust and dependable cloud computing environments.
Related Concept Videos
Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...
Reliability and validity are two important considerations that must be made with any type of data collection. Reliability refers to the ability to consistently produce a given result. In the context of psychological research, this would mean that any instruments or tools used to collect data do so in consistent, reproducible ways.
Unfortunately, being consistent in measurement does not necessarily mean that you have measured something correctly. To illustrate this concept, consider a kitchen...
Hybridoma technology is used for the large-scale production of monoclonal antibodies. Monoclonal antibodies bind to only a single antigenic determinant or epitope. Such antibodies are used in research, diagnostics, and disease therapy. The hybridoma technology established in 1975 by Georges Köhler and Cesar Milstein was awarded the Nobel Prize in Medicine in 1984 for revolutionizing research and therapy.
Hybridoma Selection
Commonly used fusion techniques — electroporation,...
Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...
Multimachine stability analysis is crucial for understanding the dynamics and stability of power systems with multiple synchronous machines. The objective is to solve the swing equations for a network of M machines connected to an N-bus power system.
In analyzing the system, the nodal equations represent the relationship between bus voltages, machine voltages, and machine currents. The nodal equation is given by:
V is the N-vector of bus voltages, E is the M-vector of machine voltages, I is...
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...

