Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Classification of Signals01:30

Classification of Signals

1.1K
In signal processing, signals are classified based on various characteristics: continuous-time versus discrete-time, periodic versus aperiodic, analog versus digital, and causal versus noncausal. Each category highlights distinct properties crucial for understanding and manipulating signals.
A continuous-time signal holds a value at every instant in time, representing information seamlessly. In contrast, a discrete-time signal holds values only at specific moments, often denoted as x(n), where...
1.1K
Aggregates Classification01:29

Aggregates Classification

526
Aggregate classification is generally based on its size, petrographic characteristics, weight, and source. Size classification ranges from coarse to fine aggregates, defined by the size of the particles. Coarse aggregates are particles that do not pass through ASTM sieve No. 4, and aggregates that pass through the sieve are fine aggregates.
Petrographic classification groups aggregates based on common mineralogical characteristics. Some of the common mineral groups found in aggregates are...
526
How Data are Classified: Categorical Data01:11

How Data are Classified: Categorical Data

39.3K
A variable, usually notated by capital letters such as X and Y, is a characteristic or measurement that can be determined for each member of a population. Data are the actual values of variables. They may be numbers, or they may be words. Datum is a single value.
Data are classified based on whether they are measurable or not. Categorical data cannot be measured; instead, it can be divided into categories. For example, if Y denotes a person's party affiliation, some examples of Y include...
39.3K
Stereotype Content Model02:16

Stereotype Content Model

15.1K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.1K
Regression Analysis01:11

Regression Analysis

6.8K
Regression analysis is a statistical tool that describes a mathematical relationship between a dependent variable and one or more independent variables.
In regression analysis, a regression equation is determined based on the line of best fit– a line that best fits the data points plotted in a graph. This line is also called the regression line. The algebraic equation for the regression line is called the regression equation. It is represented as:
6.8K
One-Way ANOVA: Unequal Sample Sizes01:15

One-Way ANOVA: Unequal Sample Sizes

6.3K
One-way ANOVA can be performed on three or more samples of unequal sizes. However, calculations get complicated when sample sizes are not always the same. So, while performing ANOVA with unequal samples size, the following equation is used:
6.3K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Comprehensive assessment of adults severely ill with TB referred after triage in Tamil Nadu, India.

Public health action·2026
Same author

Sustained reduction in program-reported TB death rate in six districts following Tamil Nadu <i>Kasanoi Erappila Thittam</i> in southern India.

Global health action·2026
Same author

Amplifying performance, combustion, and emission characteristics of a CRDI engine using diesel-WCO methyl ester-dyglyme ternary fuel blends with carbon nanotubes.

Scientific reports·2026
Same author

First-ever Experience of Implementing Therapeutic Nutrition for Very Severely Undernourished Adults with TB in Routine Program Settings: A Longitudinal Descriptive Study.

Indian journal of community medicine : official publication of Indian Association of Preventive & Social Medicine·2026
Same author

Role of triage audit in an ongoing differentiated TB care initiative to reduce deaths in Tamil Nadu, India.

Public health action·2025
Same author

Impact of Brexit on Pharmaceutical Regulations: EMA vs. MHRA.

Reviews on recent clinical trials·2025
Same journal

STIF: Intuitionistic fuzzy Gaussian membership function with statistical transformation weight of evidence and information value for private information preservation.

Distributed and parallel databases·2023
Same journal

Bio-SODA UX: enabling natural language question answering over knowledge graphs with user disambiguation.

Distributed and parallel databases·2022
Same journal

Scalable probabilistic truss decomposition using central limit theorem and H-index.

Distributed and parallel databases·2022
Same journal

Subscribing to big data at scale.

Distributed and parallel databases·2022
Same journal

MICAR: multi-inhabitant context-aware activity recognition in home environments.

Distributed and parallel databases·2022
Same journal

RETRACTED ARTICLE: Application of machine learning (ML) and internet of things (IoT) in healthcare to predict and tackle pandemic situation.

Distributed and parallel databases·2021
See all related articles

Related Experiment Video

Updated: Nov 11, 2025

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.1K

Sentimental analysis from imbalanced code-mixed data using machine learning approaches.

R Srinivasan1, C N Subalalitha1

  • 1Department of Computer Science and Engineering, SRM Institute of Science and Technology, Kattankulathur, 603 203 India.

Distributed and Parallel Databases
|March 29, 2021
PubMed
Summary
This summary is machine-generated.

This study tackles class imbalance in sentiment analysis for code-mixed data, a common issue. It proposes a solution combining sampling techniques and Levenshtein distance for better sentiment classification.

Keywords:
Code-mixed dataImbalanced dataMachine learningSamplingSentimental analysis

More Related Videos

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

828

Related Experiment Videos

Last Updated: Nov 11, 2025

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model
07:15

Machine Learning Algorithms for Early Detection of Bone Metastases in an Experimental Rat Model

Published on: August 16, 2020

7.1K
Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness
03:14

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

828

Area of Science:

  • Natural Language Processing
  • Machine Learning
  • Computational Linguistics

Background:

  • Sentiment analysis is crucial for knowledge discovery across fields.
  • Class imbalance is a significant challenge in sentiment analysis, particularly with code-mixed data.
  • Existing research has largely overlooked sentiment analysis in imbalanced, code-mixed datasets.

Purpose of the Study:

  • To address the challenge of class imbalance in sentiment analysis for code-mixed text.
  • To propose and evaluate a novel approach for sentiment analysis on imbalanced code-mixed data.
  • To compare the effectiveness of various machine learning classifiers for this specific task.

Main Methods:

  • A combination of sampling techniques and Levenshtein distance metrics was employed.
  • The study evaluated multiple machine learning algorithms: Random Forest, Logistic Regression, XGBoost, Support Vector Machine, and Naïve Bayes.
  • Performance was assessed using the F1-Score metric.

Main Results:

  • The proposed method effectively handles class imbalance in code-mixed sentiment analysis.
  • Comparative analysis revealed the performance variations among different machine learning classifiers.
  • The F1-Score was utilized to quantify and compare the effectiveness of the implemented approaches.

Conclusions:

  • The developed approach offers a viable solution for sentiment analysis in challenging code-mixed, imbalanced datasets.
  • The findings provide insights into the suitability of different machine learning models for this task.
  • Further research can build upon these methods to enhance sentiment analysis accuracy in multilingual contexts.