Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Transformations of Functions III01:20

Transformations of Functions III

304
Transformations modify the graphical representation of a function without changing its fundamental form. One common transformation is reflection, which flips the graph across a designated axis. When the vertical coordinates of all points are multiplied by the negative one, the entire graph is mirrored over the horizontal axis. This transformation reverses the vertical orientation of peaks and troughs, akin to signal inversion in electrical systems, where a waveform is flipped, but the timing of...
304

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

uniGradICON: A Foundation Model for Medical Image Registration.

Medical image computing and computer-assisted intervention : MICCAI ... International Conference on Medical Image Computing and Computer-Assisted Intervention·2026
Same author

Prediction of Lobar Emphysema Progression with a CT-Based Foundational Model.

Radiology·2026
Same author

NFL-BA: Near-Field Light Bundle Adjustment for SLAM in Dynamic Lighting.

Advances in neural information processing systems·2026
Same author

Leveraging Near-Field Lighting for Monocular Depth Estimation from Endoscopy Videos.

Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision·2026
Same author

PseudoClick: Interactive Image Segmentation with Click Imitation.

Computer vision - ECCV ... : ... European Conference on Computer Vision : proceedings. European Conference on Computer Vision·2025
Same author

4D-computed tomography in the evaluation of dynamic tracheal obstruction in children: A case series.

International journal of pediatric otorhinolaryngology·2025
Same journal

Unsupervised Identification of Protein Compositions and Conformations via Implicit Content-Transformation Disentanglement.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

Recover Biological Structure from Sparse-View Diffraction Images with Neural Volumetric Prior.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

Scaling 3D Compositional Models for Robust Classification and Pose Estimation.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

UniEgoMotion: A Unified Model for Egocentric Motion Reconstruction, Forecasting, and Generation.

Proceedings. IEEE International Conference on Computer Vision·2026
Same journal

Multi-Class Cell Detection Using Spatial Context Representation.

Proceedings. IEEE International Conference on Computer Vision·2024
Same journal

Enhancing Modality-Agnostic Representations via Meta-learning for Brain Tumor Segmentation.

Proceedings. IEEE International Conference on Computer Vision·2024
See all related articles

Related Experiment Video

Updated: May 2, 2026

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K

SimpleClick: Interactive Image Segmentation with Simple Vision Transformers.

Qin Liu1, Zhenlin Xu1, Gedas Bertasius1

  • 1University of North Carolina at Chapel Hill.

Proceedings. IEEE International Conference on Computer Vision
|September 9, 2024
PubMed
Summary
This summary is machine-generated.

SimpleClick is a novel interactive image segmentation method using a plain Vision Transformer (ViT) backbone. It achieves state-of-the-art results with fewer clicks, demonstrating broad applicability.

More Related Videos

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

378

Related Experiment Videos

Last Updated: May 2, 2026

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images
04:23

A Swin Transformer-Based Model for Thyroid Nodule Detection in Ultrasound Images

Published on: April 21, 2023

1.8K
Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography
04:48

Application of Deep Learning-Based Medical Image Segmentation via Orbital Computed Tomography

Published on: November 30, 2022

2.7K
Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique
04:48

Swin-PSAxialNet: An Efficient Multi-Organ Segmentation Technique

Published on: July 5, 2024

378

Area of Science:

  • Computer Vision
  • Machine Learning
  • Artificial Intelligence

Background:

  • Interactive image segmentation methods commonly use hierarchical backbones.
  • Plain Vision Transformers (ViTs) show promise for dense prediction tasks.
  • ViTs offer a foundation model approach adaptable to downstream tasks without backbone redesign.

Purpose of the Study:

  • Introduce SimpleClick, the first interactive segmentation method utilizing a plain backbone.
  • Explore the effectiveness of plain ViTs in interactive image segmentation.
  • Develop a method for efficient object extraction with minimal user interaction.

Main Methods:

  • Leveraged a plain, non-hierarchical Vision Transformer (ViT) backbone.
  • Introduced a symmetric patch embedding layer for click encoding.
  • Utilized a masked autoencoder (MAE) pretraining strategy for the ViT backbone.

Main Results:

  • Achieved state-of-the-art performance in interactive image segmentation.
  • Obtained 4.15 NoC@90 on the SBD dataset, a 21.8% improvement.
  • Demonstrated generalizability on medical imaging datasets.

Conclusions:

  • SimpleClick offers a highly effective and efficient approach to interactive image segmentation.
  • The method's performance and generalizability highlight its potential as a practical annotation tool.
  • Plain backbones are a viable and powerful alternative for interactive segmentation architectures.