



Screen Detection from Egocentric Image Streams Leveraging Multi-View Vision Language Model.

Xueshen Li¹, Sen Shen², Xinlong Hou¹

  • ¹Department of Biomedical Engineering, Stevens Institute of Technology.

IEEE Transactions on Multimedia | February 13, 2026
Summary
This summary is machine-generated.

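Researchers developed a screen detection system for children that pairs a wearable camera with a multi-view vision language model. The method tracks screen time more accurately than conventional vision language models and object detection models, supporting research on child behavior and screen use.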

Keywords: Egocentric Image Streams; Vision Language Model; Wearable Sensor


Area of Science:

  • Child development
  • Human-computer interaction
  • Wearable technology

Background:

  • Accurate monitoring of children's screen time is crucial for research on health and social behaviors.
  • Current methods, such as self-reports and bulky sensors, lack accuracy and efficiency.
  • Existing technologies struggle to capture quantitative screen exposure data effectively.

Purpose of the Study:

  • To develop an efficient and accurate framework for monitoring children's screen exposure.
  • To introduce a novel approach combining wearable sensors and advanced AI for screen time tracking.
  • To improve the methodologies used in studying screen use and its impact on child development.

Main Methods:

  • Developed a novel screen detection framework using egocentric images from a wearable sensor called the screen time tracker (STT).
  • Utilized a multi-view vision language model (VLM) to dynamically interpret screen exposure from multiple image streams.
  • Validated the framework using a dataset of children's free-living activities.
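
The summary does not say how per-image detections become a screen-time figure, so the following is only a minimal sketch of one plausible aggregation step, not the authors' method. It assumes each capture instant carries a timestamp and several views, and that some classifier (for example, a multi-view VLM) returns a yes/no screen decision per capture. The names `Capture`, `screen_exposure_seconds`, `is_screen`, and `max_gap_s` are all hypothetical, and the classifier is replaced by a lookup-table stub.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Tuple


@dataclass(frozen=True)
class Capture:
    """One capture instant from the wearable sensor (hypothetical structure)."""
    timestamp_s: float        # capture time in seconds
    views: Tuple[str, ...]    # placeholder identifiers for the views taken at this instant


def screen_exposure_seconds(
    captures: Iterable[Capture],
    is_screen: Callable[[Capture], bool],
    max_gap_s: float = 10.0,
) -> float:
    """Estimate total screen exposure from per-capture screen decisions.

    The interval between two consecutive captures is counted as exposure
    when the earlier capture is classified as showing a screen and the gap
    is at most max_gap_s. Longer gaps are treated as missing data (assumed
    rule, e.g. the device was off) and contribute nothing.
    """
    ordered = sorted(captures, key=lambda c: c.timestamp_s)
    total = 0.0
    for current, following in zip(ordered, ordered[1:]):
        gap = following.timestamp_s - current.timestamp_s
        if gap <= max_gap_s and is_screen(current):
            total += gap
    return total


if __name__ == "__main__":
    # Stub decisions standing in for the model's output (made-up values).
    stub_labels = {0.0: True, 5.0: True, 10.0: False, 15.0: True, 100.0: True}
    demo = [Capture(t, ("view_a", "view_b")) for t in stub_labels]
    print(screen_exposure_seconds(demo, lambda c: stub_labels[c.timestamp_s]))
```

Passing the classifier in as a callable keeps the aggregation logic independent of any particular model, so the same function could be reused to compare a multi-view model against a single-view baseline on the same captures.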

Main Results:

  • The proposed multi-view VLM framework demonstrated significant improvements over conventional vision language models and object detection models.
  • The system achieved higher accuracy and efficiency in capturing quantitative screen exposure data.
  • The lightweight hardware design of the STT, combined with the VLM, offers a practical solution for monitoring screen exposure during children's free-living activities.

Conclusions:

  • The novel screen detection framework provides an accurate and efficient method for monitoring children's screen time.
  • This technology has significant potential to advance research in child behavior, screen use, and related health outcomes.
  • The combination of wearable sensors and VLM offers a promising direction for future child behavior studies.