Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Survival Tree01:19

Survival Tree

354
Survival trees are a non-parametric method used in survival analysis to model the relationship between a set of covariates and the time until an event of interest occurs, often referred to as the "time-to-event" or "survival time." This method is particularly useful when dealing with censored data, where the event has not occurred for some individuals by the end of the study period, or when the exact time of the event is unknown.
 Building a Survival Tree
Constructing a...
354
Distribution Reliability and Automation01:25

Distribution Reliability and Automation

450
Distribution reliability in electrical power systems is critical for ensuring an uninterrupted power supply to consumers at minimal cost. According to IEEE Standard Terms, reliability is the probability that a device will function without failure over a specified time period or amount of usage. For electric power distribution, this translates to maintaining continuous power supply and addressing customer concerns over power outages. Several indices, as defined by IEEE Standard 1366-2012, are...
450
Stereotype Content Model02:16

Stereotype Content Model

15.3K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.3K
Automatic Processing and Automatic Social Behavior01:28

Automatic Processing and Automatic Social Behavior

186
Automatic processing refers to the cognitive operations that occur without conscious intent or awareness, playing a fundamental role in shaping social cognition and behavior. These processes enable individuals to navigate complex social environments efficiently by relying on mental shortcuts and pre-existing knowledge structures known as schemas. One of the most influential mechanisms underlying automatic processing is priming, which subtly activates mental representations through exposure to...
186
Labeling DNA Probes03:31

Labeling DNA Probes

9.1K
DNA probes are fragments of DNA labeled with a reporter tag to enable their detection or purification. The resulting labeled DNA probes can then hybridize to target nucleic acid sequences through complementary base-pairing, and may be used to recover or identify these regions.
Radioisotopes, fluorophores, or small molecule binding partners like biotin or digoxigenin, are the most widely used reporter tags for labeling DNA probes. These labels can be attached to the probe DNA molecule via...
9.1K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genome modelling and design across all domains of life with Evo 2.

Nature·2026
Same author

Sequence modeling and design from molecular to genome scale with Evo.

Science (New York, N.Y.)·2024
Same author

Prospector Heads: Generalized Feature Attribution for Large Models & Data.

ArXiv·2024
Same author

Towards trustworthy seizure onset detection using workflow notes.

NPJ digital medicine·2024
Same author

Development and Validation of a Machine Learning System to Identify Reflux Events in Esophageal 24-Hour pH/Impedance Studies.

Clinical and translational gastroenterology·2023
Same author

Noise2Recon: Enabling SNR-robust MRI reconstruction with semi-supervised and self-supervised learning.

Magnetic resonance in medicine·2023
Same journal

Uldp-FL: Federated Learning with Across-Silo User-Level Differential Privacy.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2025
Same journal

Efficient Join Algorithms For Large Database Tables in a Multi-GPU Environment.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2024
Same journal

Models and Mechanisms for Spatial Data Fairness.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2023
Same journal

Beyond Equi-joins: Ranking, Enumeration and Factorization.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2022
Same journal

Optimal Algorithms for Ranked Enumeration of Answers to Full Conjunctive Queries.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2021
Same journal

ConTPL: Controlling Temporal Privacy Leakage in Differentially Private Continuous Data Release.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases·2019
See all related articles

Related Experiment Video

Updated: Jan 3, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.8K

Snuba: Automating Weak Supervision to Label Training Data.

Paroma Varma1, Christopher Ré1

  • 1Stanford University.

Proceedings of the VLDB Endowment. International Conference on Very Large Data Bases
|November 29, 2019
PubMed
Summary
This summary is machine-generated.

Snuba automatically generates weak supervision heuristics for deep learning tasks, significantly improving label quality and reducing manual effort. This system outperforms user-defined heuristics and semi-supervised learning methods.

More Related Videos

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.5K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.2K

Related Experiment Videos

Last Updated: Jan 3, 2026

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis
05:41

A Step-by-Step Implementation of DeepBehavior, Deep Learning Toolbox for Automated Behavior Analysis

Published on: February 6, 2020

9.8K
Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment
08:25

Combining Eye-tracking Data with an Analysis of Video Content from Free-viewing a Video of a Walk in an Urban Park Environment

Published on: May 7, 2019

9.5K
Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention
06:37

Author Spotlight: Addressing Technical and Subjective Challenges in Measuring Classroom Attention

Published on: December 15, 2023

5.2K

Area of Science:

  • Machine Learning
  • Artificial Intelligence
  • Data Science

Background:

  • Deep learning models require extensive high-quality training data, a significant bottleneck for diverse applications.
  • Weak supervision, using imperfect label sources like heuristics, is a common alternative but requires manual design for each task.
  • Manual heuristic design is time-consuming and resource-intensive, involving domain experts in repetitive tasks.

Purpose of the Study:

  • To present Snuba, a system for automatically generating heuristics in the weak supervision setting.
  • To address the challenges of manual heuristic design by automating the creation of labeling functions.
  • To improve the efficiency and effectiveness of generating training labels for large unlabeled datasets.

Main Methods:

  • Snuba utilizes a small labeled dataset to generate heuristics that label subsets of a large unlabeled dataset.
  • It iteratively generates and refines heuristics until a substantial portion of the data is labeled.
  • A statistical measure ensures the termination of the iterative process, maintaining label quality.

Main Results:

  • Snuba automatically generates heuristics in under five minutes.
  • The system achieved up to 9.74 F1 points improvement over the best user-defined heuristics.
  • Snuba outperformed semi-supervised learning approaches by up to 14.35 F1 points in real-world collaborations.

Conclusions:

  • Snuba offers an automated and efficient solution for generating high-quality weak supervision labels.
  • The system significantly reduces the time and cost associated with preparing training data for deep learning.
  • Snuba demonstrates superior performance compared to both manual heuristic design and other automated methods.