Jove
Visualize
Contact Us
JoVE
x logofacebook logolinkedin logoyoutube logo
ABOUT JoVE
OverviewLeadershipBlogJoVE Help Center
AUTHORS
Publishing ProcessEditorial BoardScope & PoliciesPeer ReviewFAQSubmit
LIBRARIANS
TestimonialsSubscriptionsAccessResourcesLibrary Advisory BoardFAQ
RESEARCH
JoVE JournalMethods CollectionsJoVE Encyclopedia of ExperimentsArchive
EDUCATION
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab ManualFaculty Resource CenterFaculty Site
Terms & Conditions of Use
Privacy Policy
Policies

Related Concept Videos

Biostatistics: Overview01:20

Biostatistics: Overview

Biostatistics plays a crucial role in understanding and analyzing data in healthcare and biology. Biostatisticians conduct experiments, gather evidence, and draw meaningful conclusions using statistical methods and techniques. Different variables form the foundation of biostatistical analysis, allowing researchers to understand and interpret data effectively. These variables are classified into different types, each serving a specific purpose in statistical analysis.
Discrete variables are...

You might also read

Related Articles

Articles linked to this work by shared authors, journal, and citation graph.

Sort by
Same author

Structural Equation Modeling of Genetic and Residual Covariance Matrices for Multiple-Trait Evaluation in Beef Cattle.

Animals : an open access journal from MDPI·2026
Same author

Improving polygenic score prediction for underrepresented groups through transfer learning.

Nature communications·2026
Same author

G2P datasets: a hub for genomic datasets for predictive modeling in plants and animals.

G3 (Bethesda, Md.)·2026
Same author

Predictive models of the genetic bases underlying budding yeast fitness in multiple environments.

bioRxiv : the preprint server for biology·2025
Same author

Multi-trait/environment sparse genomic prediction using the SFSI R-package.

The plant genome·2025
Same author

Generative AI for predictive breeding: hopes and caveats.

TAG. Theoretical and applied genetics. Theoretische und angewandte Genetik·2025
Same journal

The complete sequence of the silkworm W chromosome uncovers its rapid evolution by large-scale duplications/deletions and translocation of W-linked genes.

G3 (Bethesda, Md.)·2026
Same journal

Revisiting the genome assembly of Lupinus species reveals differential diploidization after a shared whole-genome duplication.

G3 (Bethesda, Md.)·2026
Same journal

Deconstructing empirical fitness seascapes across scales of granularity.

G3 (Bethesda, Md.)·2026
Same journal

Genomes of Conopholis americana and Epifagus virginiana: Two holoparasitic plants (Orobanchaceae).

G3 (Bethesda, Md.)·2026
Same journal

"A chromosome-level reference genome for the colonial marine hydrozoan Podocoryna americana".

G3 (Bethesda, Md.)·2026
Same journal

Mating strategies in Caenorhabditis elegans populations are determined by male developmental history.

G3 (Bethesda, Md.)·2026
See all related articles

Related Experiment Video

Updated: Jun 8, 2026

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.4K

Fast analysis of biobank-size data and meta-analysis using the BGLR R-package.

Paulino Pérez-Rodríguez1, Gustavo de Los Campos2,3,4, Hao Wu2

  • 1Colegio de Postgraduados, Montecillo, Estado de México 56230, México.

G3 (Bethesda, Md.)
|December 10, 2024
PubMed
Summary
This summary is machine-generated.

New Bayesian methods in the BGLR R-package efficiently analyze large genomic datasets using sufficient statistics. This enables joint analysis of multiple biobanks without sharing individual data, improving polygenic score prediction for under-represented groups.

Keywords:
Bayesian modelsgenomic predictionmeta-analysispolygenic scoressufficient statisticssummary statistics

More Related Videos

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

8.9K
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.1K

Related Experiment Videos

Last Updated: Jun 8, 2026

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues
10:12

Droplet Barcoding-Based Single Cell Transcriptomics of Adult Mammalian Tissues

Published on: January 10, 2019

18.4K
Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases
07:41

Performing Data Mining And Integrative Analysis Of Biomarker in Breast Cancer Using Multiple Publicly Accessible Databases

Published on: May 17, 2019

8.9K
Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations
08:03

Heuristic Mining of Hierarchical Genotypes and Accessory Genome Loci in Bacterial Populations

Published on: December 7, 2021

2.1K

Area of Science:

  • Genomics
  • Bioinformatics
  • Statistical Genetics

Background:

  • Analyzing large-scale genomic datasets (n>p) is computationally intensive.
  • Existing methods struggle with massive sample sizes common in biobanks and genetic evaluations.
  • Sharing individual genotype-phenotype data across sources is often infeasible due to privacy concerns.

Purpose of the Study:

  • To develop and implement efficient computational methods for Bayesian genomic analyses using sufficient statistics.
  • To enable joint analysis of multi-source genomic data without sharing individual-level information.
  • To improve the prediction accuracy of polygenic scores, particularly for under-represented populations.

Main Methods:

  • Incorporated functionality into the BGLR R-package to generate posterior samples from sufficient statistics.
  • Developed Bayesian shrinkage and variable selection models compatible with sufficient statistics.
  • Utilized UK-Biobank, All of Us, and Hispanic Community Health Study/Study of Latinos cohort data for real-world application.

Main Results:

  • Demonstrated computational efficiency of sufficient statistics-based methods compared to individual data.
  • Successfully implemented joint analysis of multiple cohorts without data sharing.
  • Showcased improved prediction accuracy for polygenic scores in Hispanic populations.

Conclusions:

  • The BGLR R-package now supports efficient Bayesian genomic analyses using sufficient statistics.
  • Joint analysis of distributed genomic data is feasible and beneficial, especially for under-represented groups.
  • This approach enhances the utility of biobank data for developing more equitable genomic prediction models.