Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Stratified Sampling Method01:16

Stratified Sampling Method

14.4K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a stratified sample, divide the population into groups called strata and then take a...
14.4K
Stereotype Content Model02:16

Stereotype Content Model

15.3K
The Stereotype Content Model (SCM) was first proposed by Susan Fiske and her colleagues (Fiske, Cuddy, Glick & Xu, 2002; see also Fiske, 2012 and Fiske, 2017). The SCM specifies that when someone encounters a new group, they will stereotype them based on two metrics: warmth—or that group’s perceived intent, and how likely they are to provide help or inflict harm—and competence—or their ability to carry out that objective. Depending on the warmth-competence...
15.3K
The Representativeness Heuristic02:13

The Representativeness Heuristic

16.7K
The representative heuristic describes a biased way of thinking, in which you unintentionally stereotype someone or something. For example, you may assume that your professors spend their free time reading books and engaging in intellectual conversation, because the idea of them spending their time playing volleyball or visiting an amusement park does not fit in with your stereotypes of professors.
16.7K
Cluster Sampling Method01:20

Cluster Sampling Method

13.9K
Appropriate sampling methods ensure that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
To choose a cluster sample, divide the population into clusters (groups) and then randomly select some of the clusters. All the members from these clusters are in the cluster sample. For example, if you randomly sample four departments from your...
13.9K
Structuralism01:26

Structuralism

3.1K
Structuralism, an early psychological theory developed by Wilhelm Wundt and his student Edward Bradford Titchener, sought to dissect the human mind into its most fundamental components. Wundt's groundbreaking work in his laboratory set the stage for Titchener to define structuralism's goal as cataloging the "atoms" of the mind—sensations, images, and feelings—akin to how chemists identify elements of matter.
Titchener's approach to structuralism was unique. He...
3.1K
Systematic Sampling Method01:17

Systematic Sampling Method

12.4K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest.
Systematic sampling is one of the simplest methods...
12.4K

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Genomic analyses implicate hormonal and metabolic dysregulation in polycystic ovary syndrome.

Nature genetics·2026
Same author

Trajectory-informed gene feature selection in single-cell analysis with SEEK-VFI.

bioRxiv : the preprint server for biology·2025
Same author

Polygenic prediction of body mass index and obesity through the life course and across ancestries.

Nature medicine·2025
Same author

LACE-UP: An ensemble machine-learning method for health subtype classification on multidimensional binary data.

Proceedings of the National Academy of Sciences of the United States of America·2025
Same author

The genetic basis of endometriosis and comorbidity with other pain and inflammatory conditions.

Nature genetics·2023
Same author

A saturated map of common genetic variants associated with human height.

Nature·2022
Same journal

Layered social competition coordinates reproductive hierarchy formation in ants.

bioRxiv : the preprint server for biology·2026
Same journal

Combination epigenetic-targeted therapy increases the immunogenicity of poorly immunogenic sarcomas.

bioRxiv : the preprint server for biology·2026
Same journal

Loss of LanC-like proteins delays post-injury regeneration of aging skeletal muscles.

bioRxiv : the preprint server for biology·2026
Same journal

Integrative Transfer Network: Deep Transfer Learning Across Populations and Prediction Targets.

bioRxiv : the preprint server for biology·2026
Same journal

Confidence-supported label-free metabolic imaging with FPhaS phase autofluorescence microscopy.

bioRxiv : the preprint server for biology·2026
Same journal

Sequence-encoded autoinhibition couples mRNA decapping activity to phase separation.

bioRxiv : the preprint server for biology·2026
See all related articles

Related Experiment Video

Updated: Jan 8, 2026

Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

548

SEEK-VEC: Robust Latent Structure Discovery via Ensemble Topic Modeling.

Rebecca Danning1, Zheng Tracy Ke2, Rong Ma1

  • 1Department of Biostatistics, Harvard T.H. Chan School of Public Health.

Biorxiv : the Preprint Server for Biology
|December 22, 2025
PubMed
Summary
This summary is machine-generated.

SEEK-VEC, a novel ensemble framework, enhances topic modeling for count data by integrating multiple models to reveal hidden patterns. This method robustly identifies latent structures, outperforming existing techniques, especially with weak signals.

Related Experiment Videos

Last Updated: Jan 8, 2026

Decoding Natural Behavior from Neuroethological Embedding
08:00

Decoding Natural Behavior from Neuroethological Embedding

Published on: October 3, 2025

548

Area of Science:

  • Computational biology
  • Data science
  • Statistical modeling

Background:

  • Topic modeling is crucial for uncovering latent structure in count data.
  • Standard methods face limitations like restrictive assumptions, noise sensitivity, and topic number misspecification, especially for non-textual data.

Purpose of the Study:

  • Introduce SEEK-VEC (Spectral Ensembling of topic models with Eigenscore for K-agnostic Vocabulary Embedding and Classification), an ensemble framework for count data analysis.
  • Develop a method that automatically reinforces signal, mitigates noise, and generates a consensus low-dimensional embedding.
  • Enable variable classification, pattern discovery, and model diagnostics through prioritization and grouping scores.

Main Methods:

  • Utilizes a spectral ensembling procedure to integrate insights from multiple candidate topic models.
  • Applies eigenscore for K-agnostic vocabulary embedding and classification.
  • Generates prioritization and grouping scores for data interpretation.

Main Results:

  • SEEK-VEC demonstrates robustness in realistic settings and outperforms state-of-the-art methods, particularly with weak signal strength.
  • Successfully applied to diverse datasets including psychopathology, food preferences, and single-cell transcriptomics.
  • Reveals scientifically meaningful latent structures in real-world data.

Conclusions:

  • SEEK-VEC offers a powerful and robust approach for latent structure discovery in count data.
  • The framework effectively handles noise and avoids restrictive assumptions of traditional topic models.
  • Provides valuable tools for data classification, pattern discovery, and diagnostic analysis across various scientific domains.