Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Language and Cognition01:27

Language and Cognition

874
Language serves as a bridge between ideas and communication, influencing how individuals perceive and interact with the world. Psychologists have long debated whether language shapes thought or vice versa. This discussion gained grip with Edward Sapir and Benjamin Lee Whorf in the 1940s, who proposed that language determines thought, a concept known as linguistic determinism. They suggested that the vocabulary and structure of a language influence how its speakers think and perceive reality.
874
Termination of Translation01:44

Termination of Translation

5.7K
5.7K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Time series for blind biosignal classification model.

Computers in biology and medicine·2014
Same author

Chinese unknown word recognition for PCFG-LA parsing.

TheScientificWorldJournal·2014
Same author

Unsupervised quality estimation model for English to German translation and its application in extensive supervised evaluation.

TheScientificWorldJournal·2014
Same author

A relationship: word alignment, phrase table, and translation quality.

TheScientificWorldJournal·2014
Same author

Unsupervised chunking based on graph propagation from bilingual corpus.

TheScientificWorldJournal·2014
Same author

A systematic comparison of data selection criteria for SMT domain adaptation.

TheScientificWorldJournal·2014
Same journal

The Eco-Friendly Preparation of Se, Zn, and Ag MONPs and Their Current Medical Applications and Drug Delivery for AD Diseases.

TheScientificWorldJournal·2026
Same journal

Fear of COVID-19: A Comparative Study Among University Students in Peru.

TheScientificWorldJournal·2026
Same journal

Opportunities and Challenges of Integrating Ethiopian Traditional Medicine System Into Modern Medicine: A Narrative Review.

TheScientificWorldJournal·2026
Same journal

Exploring the Antiparasitic Activity of the Sea Cucumber Isostichopus sp. aff. badionotus From the Northern Coast of Colombia Against Trypanosoma cruzi.

TheScientificWorldJournal·2026
Same journal

Kalanchoe ceratophylla (Crassulaceae): The True Identity of Sidingin, a Medicinal Plant From Sumatra, Based on Morphological and Molecular Evidence.

TheScientificWorldJournal·2026
Same journal

Genetic Variation of Chicken Growth Differentiation Factor-9 Gene and Association With Egg Characteristics: A Systematic Review.

TheScientificWorldJournal·2026
See all related articles

Related Experiment Video

Updated: Apr 28, 2026

A Bilingual Computational Workflow for Identifying Potential PLK1 Inhibitors in American Sign Language and English
14:34

A Bilingual Computational Workflow for Identifying Potential PLK1 Inhibitors in American Sign Language and English

Published on: April 3, 2026

295

iSentenizer-μ: multilingual sentence boundary detection model.

Derek F Wong1, Lidia S Chao1, Xiaodong Zeng1

  • 1NLPCT Laboratory, Department of Computer and Information Science, University of Macau, Macau.

Thescientificworldjournal
|June 3, 2014
PubMed
Summary
This summary is machine-generated.

A new multilingual sentence boundary detection (SBD) system, iSentenizer-μ, accurately processes mixed genres and languages. It uses incremental learning to adapt without retraining, outperforming existing SBD models.

More Related Videos

Examining Online Syntactic Processing of Spoken Complex Sentences in Chinese Using Dual-Modal Interference Tasks
08:32

Examining Online Syntactic Processing of Spoken Complex Sentences in Chinese Using Dual-Modal Interference Tasks

Published on: September 5, 2019

4.8K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

1.1K

Related Experiment Videos

Last Updated: Apr 28, 2026

A Bilingual Computational Workflow for Identifying Potential PLK1 Inhibitors in American Sign Language and English
14:34

A Bilingual Computational Workflow for Identifying Potential PLK1 Inhibitors in American Sign Language and English

Published on: April 3, 2026

295
Examining Online Syntactic Processing of Spoken Complex Sentences in Chinese Using Dual-Modal Interference Tasks
08:32

Examining Online Syntactic Processing of Spoken Complex Sentences in Chinese Using Dual-Modal Interference Tasks

Published on: September 5, 2019

4.8K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

1.1K

Area of Science:

  • Natural Language Processing
  • Computational Linguistics

Background:

  • Sentence boundary detection (SBD) systems are typically sensitive to training data genres and languages.
  • Retraining SBD models for new data requires discarding previous work and starting from scratch.

Purpose of the Study:

  • To introduce iSentenizer-μ, a novel multilingual SBD system.
  • To develop an adaptable SBD system capable of handling diverse text genres and languages.

Main Methods:

  • Utilized an incremental tree learning architecture, specifically the i(+)Learning algorithm.
  • Developed a system adaptable to various text topics and Roman-alphabet languages through incremental knowledge merging.
  • Designed iSentenizer-μ to revise existing models rather than requiring complete retraining.

Main Results:

  • iSentenizer-μ demonstrated high accuracy in detecting sentence boundaries across a mixture of text genres and languages.
  • The system was extensively evaluated on Danish, German, English, Spanish, Dutch, French, Italian, Portuguese, Greek, Finnish, and Swedish.
  • Outperformed two state-of-the-art SBD systems, Punkt and MaxEnt, on all tested datasets.

Conclusions:

  • The proposed iSentenizer-μ system offers a robust and adaptable solution for multilingual SBD.
  • Incremental learning enables efficient adaptation to new data, overcoming limitations of traditional retraining approaches.
  • iSentenizer-μ provides superior performance compared to existing SBD systems in diverse linguistic and topical contexts.