Sentimental analysis from imbalanced code-mixed data using machine learning approaches.

R Srinivasan¹, C N Subalalitha¹

¹Department of Computer Science and Engineering, SRM Institute of Science and Technology, Kattankulathur, 603 203 India.

Distributed and Parallel Databases

March 29, 2021

Summary

This summary is machine-generated.

This study tackles class imbalance, a common problem in sentiment analysis of code-mixed data. It proposes combining sampling techniques with the Levenshtein distance to improve sentiment classification.

Keywords:

Code-mixed data; Imbalanced data; Machine learning; Sampling; Sentimental analysis

Area of Science:

Natural Language Processing
Machine Learning
Computational Linguistics

Background:

Sentiment analysis is crucial for knowledge discovery across fields.
Class imbalance is a significant challenge in sentiment analysis, particularly with code-mixed data.
Existing research has largely overlooked sentiment analysis in imbalanced, code-mixed datasets.

Purpose of the Study:

To address the challenge of class imbalance in sentiment analysis for code-mixed text.
To propose and evaluate a novel approach for sentiment analysis on imbalanced code-mixed data.
To compare the effectiveness of various machine learning classifiers for this specific task.

Main Methods:

A combination of sampling techniques and Levenshtein distance metrics was employed.
The study evaluated multiple machine learning algorithms: Random Forest, Logistic Regression, XGBoost, Support Vector Machine, and Naïve Bayes.
Performance was assessed using the F1-Score metric.
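The two building blocks named above, the Levenshtein distance and sampling, can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the summary does not say how the authors apply the distance or which sampling scheme they use, so the random oversampling shown here, the function names `levenshtein` and `random_oversample`, and the sample words are all hypothetical.

```python
import random
from collections import Counter


def levenshtein(a: str, b: str) -> int:
    """Edit distance: minimum insertions, deletions, and substitutions to turn a into b."""
    previous = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        current = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            current.append(min(previous[j] + 1,          # delete ch_a
                               current[j - 1] + 1,       # insert ch_b
                               previous[j - 1] + cost))  # substitute or match
        previous = current
    return previous[-1]


def random_oversample(texts, labels, seed=0):
    """Assumed scheme: duplicate minority-class samples until every class matches the largest."""
    rng = random.Random(seed)
    by_label = {}
    for text, label in zip(texts, labels):
        by_label.setdefault(label, []).append(text)
    target = max(len(items) for items in by_label.values())
    out_texts, out_labels = [], []
    for label, items in by_label.items():
        extra = [rng.choice(items) for _ in range(target - len(items))]
        out_texts.extend(items + extra)
        out_labels.extend([label] * target)
    return out_texts, out_labels


# Hypothetical transliterated spelling variants that differ by one character.
print(levenshtein("nalla", "nala"))

# Hypothetical imbalanced toy set: three positive samples, one negative.
texts = ["t1", "t2", "t3", "t4"]
labels = ["pos", "pos", "pos", "neg"]
_, balanced_labels = random_oversample(texts, labels)
print(Counter(balanced_labels))
```

One plausible use of the distance in code-mixed text is to group spelling variants of the same transliterated word before feature extraction. The summary does not confirm that this is the authors' procedure.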

Main Results:

The proposed method effectively handles class imbalance in code-mixed sentiment analysis.
Comparative analysis revealed the performance variations among different machine learning classifiers.
The F1-Score was utilized to quantify and compare the effectiveness of the implemented approaches.
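To see why the F1-Score suits imbalanced data better than plain accuracy, the following minimal sketch computes per-class and macro-averaged F1 from scratch. The averaging mode is an assumption, since the summary does not state which variant the study reports, and the labels and predictions are made-up toy values, not results from the paper.

```python
def f1_per_class(y_true, y_pred):
    """Return {label: F1} where F1 is the harmonic mean of precision and recall."""
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        pairs = list(zip(y_true, y_pred))
        tp = sum(t == label and p == label for t, p in pairs)
        fp = sum(t != label and p == label for t, p in pairs)
        fn = sum(t == label and p != label for t, p in pairs)
        precision = tp / (tp + fp) if tp + fp else 0.0
        recall = tp / (tp + fn) if tp + fn else 0.0
        total = precision + recall
        scores[label] = 2 * precision * recall / total if total else 0.0
    return scores


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging; each class counts equally)."""
    scores = f1_per_class(y_true, y_pred)
    return sum(scores.values()) / len(scores)


# Toy example: a classifier that always predicts the majority class.
y_true = ["pos", "pos", "pos", "neg"]
y_pred = ["pos", "pos", "pos", "pos"]
accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(accuracy, macro_f1(y_true, y_pred))
```

In this toy case accuracy is 0.75 although the minority class is never predicted, while macro F1 falls to about 0.43 because the minority class contributes an F1 of zero.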

Conclusions:

The developed approach offers a viable solution for sentiment analysis in challenging code-mixed, imbalanced datasets.
The findings provide insights into the suitability of different machine learning models for this task.
Further research can build upon these methods to enhance sentiment analysis accuracy in multilingual contexts.