Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Language Development01:22

Language Development

545
Children master language quickly and with relative ease, supported by both biological predisposition and reinforcement. B. F. Skinner (1957) proposed that language is learned through reinforcement, while Noam Chomsky (1965) argued that language acquisition mechanisms are biologically determined.
The critical period for language acquisition suggests that the ability to acquire language is at its peak early in life. As people age, this proficiency decreases. Language development begins very...
545
Air-entraining Agents01:27

Air-entraining Agents

119
Air-entraining agents improve the durability and workability of concrete in climates with frequent freezing and thawing. These agents prevent cracks by introducing small air bubbles into the mix, creating spaces accommodating water expansion when temperatures drop. The air-entraining agents lower the surface tension of water, forming stable, small air bubbles. This method is more effective than having accidental large voids, as the intentional, smaller, and evenly distributed air voids improve...
119
Components of Language01:24

Components of Language

515
Language, whether spoken, signed, or written, consists of specific components: lexicon and grammar. The lexicon is the vocabulary of a language, comprising its words. Grammar is the set of rules used to convey meaning through the lexicon. For example, English grammar adds “-ed” to most verbs to indicate past tense. Words are formed by combining phonemes, which are the basic sound units of a language. Different languages have different sets of phonemes (e.g., “ah” vs.
515
Associative Learning01:27

Associative Learning

670
Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...
670
Improving Translational Accuracy02:07

Improving Translational Accuracy

12.0K
Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...
12.0K
Tip-of-the-Tongue Phenomenon01:10

Tip-of-the-Tongue Phenomenon

268
The tip-of-the-tongue (TOT) phenomenon is a cognitive experience characterized by a temporary inability to retrieve specific information from memory despite having a strong feeling of knowing the information. Although individuals cannot access the target word or detail, they frequently recall related elements, such as its initial letter, syllable count, or context. This partial retrieval often causes frustration, as one might recognize a familiar face or know that a name starts with a specific...
268

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Malignant Adult Ovarian Germ Cell Tumors: An International Multicenter Study to Identify Relevant Prognostic Risk Factors for Stage IC and Beyond.

Journal of clinical oncology : official journal of the American Society of Clinical Oncology·2026
Same author

Exploring the role of nutritional status and anthropometric factors in anemia among adolescent girls in Pekanbaru, Indonesia.

SAGE open medicine·2025
Same author

Pregnant women and healthcare workers' perceptions on tuberculosis and screening: a cross-sectional study in Indonesia.

International health·2024
Same author

Understanding stunting risk factors in Kampar Regency: Insights from mothers with stunted children (qualitative study).

SAGE open medicine·2024
Same author

Efficacy of pretomanid-containing regiments for drug-resistant tuberculosis: A systematic review and meta-analysis of clinical trials.

Narra J·2024
Same author

Sequence of COVID-19 Vaccination and COVID-19 Infection and Their Association With the Development of Active Tuberculosis: A Case-Control Study.

Cureus·2023
Same journal

Novel Parent Survey Measures Sensory Behaviors Incorporating Sensory Modality and Stimulus Intensity.

Heliyon·2026
Same journal

Expression of concern: "SQSTM1/p62 promotes the progression of gastric cancer through epithelial-mesenchymal transition" [Heliyon 10 (2024) e24409].

Heliyon·2026
Same journal

Expression of concern: "TL1A promotes metastasis and EMT process of colorectal cancer" [Heliyon 10 (2024) e24392].

Heliyon·2026
Same journal

Expression of concern: "Factors affecting timing of surgery following neoadjuvant chemoradiation for esophageal cancer" [Heliyon 9 (2023) e23212].

Heliyon·2026
Same journal

Expression of concern: "On stratified single-valued soft topogenous structures" [Heliyon 10 (2024) e27926].

Heliyon·2026
Same journal

Expression of concern: "Artifact removal and motor imagery classification in EEG using advanced algorithms and modified DNN" [Heliyon 10 (2024) e27198].

Heliyon·2026
See all related articles

Related Experiment Video

Updated: Oct 15, 2025

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition
12:49

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Published on: July 13, 2019

17.3K

Augmented words to improve a deep learning-based Indonesian syllabification.

Suyanto Suyanto1, Ade Romadhony1, Febryanti Sthevanie1

  • 1School of Computing, Telkom University, Bandung, Indonesia.

Heliyon
|October 25, 2021
PubMed
Summary
This summary is machine-generated.

This study enhances deep learning syllabification for Indonesian, a low-resource language, by using data augmentation and validation. These methods significantly reduce word error rates, improving model performance on formal words and named entities.

Keywords:
Creating acronymsFlipping onsetsIndonesian orthographic syllabificationSwapping consonant-graphemesTransposing nuclei

More Related Videos

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.7K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

584

Related Experiment Videos

Last Updated: Oct 15, 2025

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition
12:49

Transcranial Direct Current Stimulation tDCS of Wernicke's and Broca's Areas in Studies of Language Learning and Word Acquisition

Published on: July 13, 2019

17.3K
Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception
05:48

Author Spotlight: Investigating the Impact of Emotional Prosodies on Voice Recognition and Perception

Published on: August 9, 2024

1.7K
Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
09:09

Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody

Published on: September 27, 2024

584

Area of Science:

  • Computational Linguistics
  • Natural Language Processing
  • Machine Learning

Background:

  • Deep learning syllabification models excel in high-resource languages but struggle with low-resource ones.
  • Indonesian, a low-resource language, presents challenges for accurate syllabification due to limited data.
  • Existing models often yield high error rates for specific linguistic contexts like named entities.

Purpose of the Study:

  • To improve the performance of deep learning-based syllabification models for the Indonesian language.
  • To address the limitations of current models in low-resource language scenarios.
  • To enhance accuracy in syllabifying both formal Indonesian words and named entities.

Main Methods:

  • Proposed two key procedures: massive data augmentation and phonotactic-based validation.
  • Implemented a deep learning architecture combining Bidirectional Long Short-Term Memory (BiLSTM), Convolutional Neural Networks (CNN), and Conditional Random Fields (CRF).
  • Employed four data augmentation techniques: transposing nuclei, swapping consonant-graphemes, flipping onsets, and creating acronyms.

Main Results:

  • Massive data augmentation significantly expanded the Indonesian dataset by 12.8 million valid words based on phonotactic rules.
  • The enhanced BiLSTM-CNN-CRF model demonstrated significant improvements across formal words and named-entities datasets using 5-fold cross-validation.
  • Augmenting the training data effectively reduced the word error rate (WER), particularly for long formal words and named entities.

Conclusions:

  • Massive data augmentation and validation are effective strategies for improving syllabification in low-resource languages like Indonesian.
  • The proposed methods enhance the robustness and accuracy of deep learning models for complex linguistic data.
  • This approach offers a viable solution for overcoming data scarcity challenges in natural language processing tasks.