Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Experiment Videos

Efficient streaming text clustering.

Shi Zhong1

  • 1Department of Computer Science and Engineering, Florida Atlantic University, Boca Raton, FL 33431, USA. zhong@cse.fau.edu

Neural Networks : the Official Journal of the International Neural Network Society
|August 9, 2005
PubMed
Summary
This summary is machine-generated.

Related Concept Videos

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Mesoporous polydopamine and inulin hydrogel for improved deoxynojirimycin effect in type 2 diabetes mellitus management.

Materials today. Bio·2026
Same author

Remimazolam reduces State-Trait Anxiety Inventory-State Scale scores in hemorrhoid surgery with spinal-epidural anesthesia: A randomized trial.

World journal of gastrointestinal surgery·2025
Same author

Cyanidin-3-rutinoside from Mori Fructus ameliorates dyslipidemia via modulating gut microbiota and lipid metabolism pathway.

The Journal of nutritional biochemistry·2024
Same author

In reply to dexmedetomidine's role in inflammation and pain post video-assisted thoracoscopic surgery for lung.

Surgery·2024
Same author

Dexmedetomidine attenuates inflammatory response and chronic pain following video-assisted thoracoscopic surgery for lung cancer.

Surgery·2024
Same author

The gut microbiota-aromatic hydrocarbon receptor (AhR) axis mediates the anticolitic effect of polyphenol-rich extracts from <i>Sanghuangporus</i>.

iMeta·2024
Same journal

TraNce: Type-aware hypergraph neural network with biological mediators for drug repositioning.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Decentralized ADMM for factorization-based Low-rank matrix estimation.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Memristive neuromorphic circuit design inspired by the neural mechanisms of conditioned fear.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Q-learning based asynchronous Boolean control networks stabilization with data loss.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

New results on prescribed-time synchronization of complex networks via intermittent control.

Neural networks : the official journal of the International Neural Network Society·2026
Same journal

Variance-constrained multi-view ensemble broad network for imbalanced data.

Neural networks : the official journal of the International Neural Network Society·2026
See all related articles

Clustering high-dimensional text streams requires forgetting past data for adaptability. An online spherical k-means algorithm with a forgetting factor achieves efficient and adaptive text stream clustering.

Area of Science:

  • Data Mining
  • Machine Learning
  • Natural Language Processing

Background:

  • Clustering streaming data is a growing research area with many real-world applications.
  • Clustering high-dimensional streaming text data remains a significant challenge due to data volume and velocity.
  • Existing scalable clustering strategies struggle with memory limitations and multiple data scans.

Purpose of the Study:

  • To develop a fast and adaptive algorithm for clustering high-dimensional streaming text data.
  • To combine an efficient online spherical k-means (OSKM) algorithm with a scalable clustering strategy.
  • To enhance the algorithm's adaptability to evolving data streams through a forgetting mechanism.

Main Methods:

  • Modified the spherical k-means (SPKM) algorithm into an online spherical k-means (OSKM) algorithm using Winner-Take-All competitive learning for online centroid updates.

Related Experiment Videos

  • Integrated OSKM with a scalable clustering strategy designed for large datasets that exceed memory capacity.
  • Introduced a forgetting factor applying exponential decay to historical data's importance, making the algorithm adaptive.
  • Main Results:

    • The proposed OSKM algorithm demonstrated efficiency comparable to SPKM but with superior clustering quality.
    • The combined approach effectively handles large-scale text streams within limited memory constraints.
    • Experimental results confirmed the algorithm's efficiency and adaptability in clustering text streams.

    Conclusions:

    • The developed algorithm provides an efficient and adaptive solution for clustering high-dimensional text streams.
    • A key finding is that forgetting historical data is crucial for achieving adaptability in text stream clustering.
    • The research highlights the importance of dynamic weighting of data based on recency for effective stream analysis.