Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Distributed machine learning: scaling up with coarse-grained parallelism

F J Provost1, D N Hennessy

  • 1Computer Science Department, University of Pittsburgh, PA 15260, USA.

Proceedings. International Conference on Intelligent Systems for Molecular Biology
|January 1, 1994
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same journal

Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology (ISMB 2000). San Diego, California, USA. August 19-23, 2000.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2001
Same journal

Analysis of gene expression data with pathway scores.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2000
Same journal

Towards a complete map of the protein space based on a unified sequence and structure analysis of all known proteins.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2000
Same journal

Mining for putative regulatory elements in the yeast genome using gene expression data.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2000
Same journal

A multiple alignment algorithm for metabolic pathway analysis using enzyme hierarchy.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2000
Same journal

Sequence database search using jumping alignments.

Proceedings. International Conference on Intelligent Systems for Molecular Biology·2000
See all related articles

This study introduces a distributed learning system, CorPRL, to efficiently analyze large biological datasets. It leverages the invariant-partitioning property to overcome computational challenges in machine learning for big data.

Area of Science:

  • Computational Biology
  • Bioinformatics
  • Machine Learning

Background:

  • Machine learning (ML) is increasingly used in biological data analysis.
  • Scaling ML algorithms to large biological and medical datasets presents computational and memory challenges.
  • Existing methods struggle with the demands of big data in life sciences.

Purpose of the Study:

  • To address the scalability issues of machine learning in large biological datasets.
  • To develop a distributed learning system that can handle massive data volumes.
  • To introduce the invariant-partitioning property for efficient data subset analysis.

Main Methods:

  • Utilized ubiquitous workstation networks for distributed computing.
  • Introduced the invariant-partitioning property for data set partitioning.

Related Experiment Videos

  • Developed a distributed learning system named CorPRL.
  • Employed interprocess communication for cooperative learning.
  • Main Results:

    • CorPRL effectively learns from very large datasets.
    • The invariant-partitioning property enables efficient data subset analysis.
    • Demonstrated effectiveness in analyzing data from two biological databases.
    • Overcame computational search effort and memory limitations.

    Conclusions:

    • Distributed learning systems can overcome scalability challenges in big biological data.
    • The invariant-partitioning property is a key enabler for efficient distributed ML.
    • CorPRL provides a viable solution for analyzing massive biological and medical datasets.