Search research articles

ABOUT JoVE

Overview Leadership Blog JoVE Help Center

AUTHORS

Publishing Process Editorial Board Scope & Policies Peer Review FAQ Submit

LIBRARIANS

Testimonials Subscriptions Access Resources Library Advisory Board FAQ

RESEARCH

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments Archive

EDUCATION

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual Faculty Resource Center Faculty Site

Terms & Conditions of Use

Related Concept Videos

Passive Filters

Passive Filters

Passive filters are utilized to shape the frequency spectrum of signals across a diverse array of applications. These filters, using only passive elements like resistors (R), inductors (L), and capacitors (C), are capable of selectively allowing or blocking certain frequency ranges without the need for external power sources.
Low-Pass Filters
Low-pass filters are designed to transmit signals with frequencies lower than the cutoff frequency, ωc, and attenuate those above it. The cutoff...

Deconvolution

Deconvolution

Deconvolution, also known as inverse filtering, is the process of extracting the impulse response from known input and output signals. This technique is vital in scenarios where the system's characteristics are unknown, and they must be inferred from the observable signals.
Deconvolution involves several mathematical techniques to derive the impulse response. One common approach is polynomial division. In this method, the input and output sequences are treated as coefficients of...

Filtration

Filtration

Filtration is a physical separation process that involves passing a suspension through a porous medium to separate solids from fluids. During filtration, solids collect on the porous medium while liquids, also collectively known as the filtrate, pass through. The filtration medium is selected based on the filtration purpose, quantity, and nature of the precipitate. The general criteria for a suitable filtering medium are that it is inert, mechanically strong, nonabsorbent toward dissolved...

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for ka Estimation

One-Compartment Open Model: Wagner-Nelson and Loo Riegelman Method for k_a Estimation

This lesson introduces two critical methods in pharmacokinetics, the Wagner-Nelson and Loo-Riegelman methods, used for estimating the absorption rate constant (ka) for drugs administered via non-intravenous routes. The Wagner-Nelson method relates ka to the plasma concentration derived from the slope of a semilog percent unabsorbed time plot. However, it is limited to drugs with one-compartment kinetics and can be impacted by factors like gastrointestinal motility or enzymatic degradation.
On...

Linear Approximation in Frequency Domain

Linear Approximation in Frequency Domain

Linear systems are characterized by two main properties: superposition and homogeneity. Superposition allows the response to multiple inputs to be the sum of the responses to each individual input. Homogeneity ensures that scaling an input by a scalar results in the response being scaled by the same scalar.
In contrast, nonlinear systems do not inherently possess these properties. However, for small deviations around an operating point, a nonlinear system can often be approximated as linear....

Expected Frequencies in Goodness-of-Fit Tests

Expected Frequencies in Goodness-of-Fit Tests

A goodness-of-fit test is conducted to determine whether the observed frequency values are statistically similar to the frequencies expected for the dataset. Suppose the expected frequencies for a dataset are equal such as when predicting the frequency of any number appearing when casting a die. In that case, the expected frequency is the ratio of the total number of observations (n) to the number of categories (k).

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by

Same author

A survey on deep learning applied to medical images: from simple artificial neural networks to generative models.

Neural computing & applications·2022

Same author

Improving the text classification using clustering and a novel HMM to reduce the dimensionality.

Computer methods and programs in biomedicine·2016

Same author

A linear-RBF multikernel SVM to classify big text corpora.

BioMed research international·2015

Same author

Study of query expansion techniques and their application in the biomedical information retrieval.

TheScientificWorldJournal·2014

Same author

Improving imbalanced scientific text classification using sampling strategies and dictionaries.

Journal of integrative bioinformatics·2011

Same journal

Thymidylate synthase inhibitory drugs induce p53-dependent pathways differently.

PloS one·2026

Same journal

Top-down and bottom-up attention for joint pattern classification and reconstruction.

PloS one·2026

Same journal

Short- and long-term scaling behavior of blood pressure and pulse arrival time during sleep in healthy controls and patients with obstructive sleep apnea.

PloS one·2026

Same journal

Double DQN-based secrecy energy efficiency and fairness performance in IRS-assisted NOMA systems with friendly jamming.

PloS one·2026

Same journal

10 recommendations for strengthening citizen science for improved societal and ecological outcomes: A co-produced analysis of challenges and opportunities in the 21st century.

PloS one·2026

Same journal

Paying in public: Peer effects, impression management, and willingness to pay on digital payment platforms.

PloS one·2026

See all related articles

Search research articles

Related Experiment Video

Updated: Dec 1, 2025

A Multimodal Imaging Framework to Advance Phenotyping of Living Label-free Breast Cancer Cells

A Multimodal Imaging Framework to Advance Phenotyping of Living Label-free Breast Cancer Cells

Published on: August 22, 2025

LDA filter: A Latent Dirichlet Allocation preprocess method for Weka.

P Celard^1,2,3, A Seara Vieira^1,2,3, E L Iglesias^1,2,3

¹Computer Science Dept., Univ. of Vigo, Escuela Superior de Ingeniería Informática, Ourense, Spain.

|November 9, 2020

Summary

This summary is machine-generated.

This study introduces a new document representation method using Latent Dirichlet Allocation (LDA) topic probabilities. It achieves comparable accuracy to Bag of Words (BoW) but significantly speeds up text classification processing times.

More Related Videos

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Supervised Machine Learning for Semi-Quantification of Extracellular DNA in Glomerulonephritis

Supervised Machine Learning for Semi-Quantification of Extracellular DNA in Glomerulonephritis

Published on: June 18, 2020

Related Experiment Videos

Last Updated: Dec 1, 2025

A Multimodal Imaging Framework to Advance Phenotyping of Living Label-free Breast Cancer Cells

A Multimodal Imaging Framework to Advance Phenotyping of Living Label-free Breast Cancer Cells

Published on: August 22, 2025

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Assisted Selection of Biomarkers by Linear Discriminant Analysis Effect Size LEfSe in Microbiome Data

Published on: May 16, 2022

Supervised Machine Learning for Semi-Quantification of Extracellular DNA in Glomerulonephritis

Supervised Machine Learning for Semi-Quantification of Extracellular DNA in Glomerulonephritis

Published on: June 18, 2020

Area of Science:

Computer Science
Information Retrieval
Machine Learning

Background:

Traditional text representation methods like Bag of Words (BoW) are widely used in document classification.
Latent Dirichlet Allocation (LDA) is a generative statistical model for discovering latent topics within a corpus.
Representing documents based on LDA topic distributions offers a potential alternative to existing methods.

Purpose of the Study:

To propose and evaluate a novel document representation technique leveraging LDA topic probabilities.
To assess the impact of this LDA-based representation on the performance of various classification algorithms.
To compare the proposed method against the standard Bag of Words (BoW) representation.

Main Methods:

Developed a new text representation filter based on the probability of a document belonging to each LDA-generated topic.
Integrated the filter as an extension within the Weka software environment.
Evaluated the filter using multiple classifiers (SVM, k-NN, Naive Bayes) on diverse document corpora (OHSUMED, Reuters, 20Newsgroup, Yahoo! Answers, YELP, TREC Genomics).

Main Results:

The LDA-based document representation achieved classification accuracy comparable to the Bag of Words (BoW) method.
A significant reduction in classification processing times was observed when using the proposed LDA-based representation.
Performance was consistent across various classifiers and diverse datasets.

Conclusions:

The proposed LDA topic probability-based document representation is an effective alternative to BoW.
This method offers a valuable trade-off between accuracy and computational efficiency in text classification.
The Weka extension provides a practical tool for implementing this advanced text representation technique.