Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Quantifying and Rejecting Outliers: The Grubbs Test01:02

Quantifying and Rejecting Outliers: The Grubbs Test

1.5K
Sometimes, a data set can have a recorded numerical observation that greatly  deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier.  To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...
1.5K
Frequency-dependent Selection01:21

Frequency-dependent Selection

21.9K
When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.
21.9K

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Impact of Age-Related Hearing Loss on Brain Connectivity and Cognitive Performance: A Systematic Review.

Trends in hearing·2026
Same author

Fixation strength of anterior tibial tuberosity osteotomy in revision knee arthroplasty according to cerclage wire configuration: An experimental animal model.

The Knee·2026
Same author

Dissecting self-supervised learning strategies for transfer learning in MRI prostate cancer diagnosis.

Scientific reports·2026
Same author

Beyond binary classification: a pilot study of imaging-derived glioma severity modeling using T1-weighted and diffusion MRI radiomics.

Magma (New York, N.Y.)·2026
Same author

Patient complexity profiles in depression: a machine learning approach to personalized mental health.

Frontiers in psychiatry·2026
Same author

Epidemiology and severity risk factors of dengue virus infection during the 2023-2024 outbreak in Colombia.

PLoS neglected tropical diseases·2025
Same journal

Interpretable machine learning for Parkinson's disease diagnosis, staging, and biological mechanism exploration: a multicenter analysis.

BioData mining·2026
Same journal

Learning a distance for the clustering of patients with amyotrophic lateral sclerosis.

BioData mining·2026
Same journal

Multi-domain feature fusion with variational mode decomposition and hybrid LightGBM-Logistic Regression for multi-class seizure classification.

BioData mining·2026
Same journal

Large-scale transcriptomic data mining using explainable XGBoost and SHAP reveals shared biomarkers and molecular mechanisms between type-2 diabetes and triple-negative breast cancer for drug repurposing.

BioData mining·2026
Same journal

AVSeg-XAI: Deep learning framework for A/V segmentation with vascular features reveals retinal oculomics as biomarker for cardiovascular disease.

BioData mining·2026
Same journal

Navigating the uncharted: AI-driven advances in protein structure, dynamics, interactions and ligand interactions for understudied families.

BioData mining·2026
查看所有相关文章

相关实验视频

Updated: Jun 13, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K

知识倾斜的随机森林方法用于高维数据和小样本大小,用于基因表达数据的特征选择应用程序.

Erika Cantor1, Sandra Guauque-Olarte2, Roberto León3

  • 1Department of clinical epidemiology and biostatistics, Pontificia Universidad Javeriana, Bogotá, 110221, Colombia. erika.cantor@javeriana.edu.co.

BioData mining
|September 10, 2024
PubMed
概括
此摘要是机器生成的。

我们开发了一个知识倾斜的随机森林 (RF),以改善高维基因组学数据中的基因选择. 该方法整合了生物网络,提高了预测准确性和可解释性,特别是在小样本大小的情况下.

关键词:
可以解释的可解释性.选择功能选择功能选择.基因选择 基因选择高维的高维空间之前的知识 之前的知识蛋白质与蛋白质的相互作用在RNA-Seqq.随机的森林随机的森林

更多相关视频

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

658
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.2K

相关实验视频

Last Updated: Jun 13, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.4K
Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers
03:37

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

658
Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts
08:51

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

1.2K

科学领域:

  • 计算生物学 计算生物学
  • 机器学习 机器学习
  • 基因组学就是基因组学.

背景情况:

  • 高维基遗传和基因组学数据带来了诸如维度诅咒之类的挑战.
  • 传统的随机森林 (RF) 模型在高维设置中可能表现出不高的准确性,特别是在有限的样本大小的情况下.
  • 整合先前的生物知识是提高机器学习模型性能的一个有希望的策略.

研究的目的:

  • 提出一种新的知识倾斜的随机森林 (RF) 模型.
  • 提高基因选择在高维基因组学数据中的性能和可解释性.
  • 在采用小样本大小的场景中解决传统射频的局限性.

主要方法:

  • 知识倾向的RF集成生物网络 (例如,蛋白质-蛋白质相互作用网络) 作为先前知识.
  • 一个随机步行与重启算法确定基因相关性基于网络拓.
  • 基因相关性得分修改了RF算法的特征选择概率,并通过修改的Boruta算法进行了增强.

主要成果:

  • 与传统的RF和在模拟数据集上的物流拉索回归相比,知识倾斜的RF在结果预测中表现出更好的精度.
  • 该方法有效地识别了更多具有生物相关性的基因.
  • 与标准RF方法相比,观察到更好的解释性.

结论:

  • 知识倾向的RF提供了一种强大的方法来处理高维基因组学数据,克服维度的诅咒.
  • 整合先前的生物网络知识显著提高了模型的性能和可解释性.
  • 这种方法在复杂疾病中识别相关基因方面显示出前景,正如 calcific主动脉狭窄症的案例研究中验证的那样.