Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Quantifying and Rejecting Outliers: The Grubbs Test

Quantifying and Rejecting Outliers: The Grubbs Test

Sometimes, a data set can have a recorded numerical observation that greatly deviates from the rest of the data. Assuming that the data is normally distributed, a statistical method called the Grubbs test can be used to determine whether the observation is truly an outlier. To perform a two-tailed Grubbs test, first, calculate the absolute difference between the outlier and the mean. Then, calculate the ratio between this difference and the standard deviation of the sample. This...

Frequency-dependent Selection

Frequency-dependent Selection

When the fitness of a trait is influenced by how common it is (i.e., its frequency) relative to different traits within a population, this is referred to as frequency-dependent selection. Frequency-dependent selection may occur between species or within a single species. This type of selection can either be positive—with more common phenotypes having higher fitness—or negative, with rarer phenotypes conferring increased fitness.

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Impact of Age-Related Hearing Loss on Brain Connectivity and Cognitive Performance: A Systematic Review.

Trends in hearing·2026

Same author

Fixation strength of anterior tibial tuberosity osteotomy in revision knee arthroplasty according to cerclage wire configuration: An experimental animal model.

The Knee·2026

Same author

Dissecting self-supervised learning strategies for transfer learning in MRI prostate cancer diagnosis.

Scientific reports·2026

Same author

Beyond binary classification: a pilot study of imaging-derived glioma severity modeling using T1-weighted and diffusion MRI radiomics.

Magma (New York, N.Y.)·2026

Same author

Patient complexity profiles in depression: a machine learning approach to personalized mental health.

Frontiers in psychiatry·2026

Same author

Epidemiology and severity risk factors of dengue virus infection during the 2023-2024 outbreak in Colombia.

PLoS neglected tropical diseases·2025

Same journal

Interpretable machine learning for Parkinson's disease diagnosis, staging, and biological mechanism exploration: a multicenter analysis.

BioData mining·2026

Same journal

Learning a distance for the clustering of patients with amyotrophic lateral sclerosis.

BioData mining·2026

Same journal

Multi-domain feature fusion with variational mode decomposition and hybrid LightGBM-Logistic Regression for multi-class seizure classification.

BioData mining·2026

Same journal

Large-scale transcriptomic data mining using explainable XGBoost and SHAP reveals shared biomarkers and molecular mechanisms between type-2 diabetes and triple-negative breast cancer for drug repurposing.

BioData mining·2026

Same journal

AVSeg-XAI: Deep learning framework for A/V segmentation with vascular features reveals retinal oculomics as biomarker for cardiovascular disease.

BioData mining·2026

Same journal

Navigating the uncharted: AI-driven advances in protein structure, dynamics, interactions and ligand interactions for understudied families.

BioData mining·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Jun 13, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

知识倾斜的随机森林方法用于高维数据和小样本大小,用于基因表达数据的特征选择应用程序.

Erika Cantor¹, Sandra Guauque-Olarte², Roberto León³

¹Department of clinical epidemiology and biostatistics, Pontificia Universidad Javeriana, Bogotá, 110221, Colombia. erika.cantor@javeriana.edu.co.

|September 10, 2024

概括

此摘要是机器生成的。

我们开发了一个知识倾斜的随机森林 (RF),以改善高维基因组学数据中的基因选择. 该方法整合了生物网络,提高了预测准确性和可解释性,特别是在小样本大小的情况下.

关键词:

可以解释的可解释性.选择功能选择功能选择.基因选择基因选择高维的高维空间之前的知识之前的知识蛋白质与蛋白质的相互作用在RNA-Seqq.随机的森林随机的森林

更多相关视频

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

相关实验视频

Last Updated: Jun 13, 2025

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Author Spotlight: Impact of Intergenic Interactions on Disease-Identifying Dark Biomarkers

Published on: March 1, 2024

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Author Spotlight: Integrated Multi-Omics Analysis for Unveiling Multicellular Immune Signatures in Clinical Heart Attack Cohorts

Published on: September 20, 2024

科学领域:

计算生物学计算生物学
机器学习机器学习
基因组学就是基因组学.

背景情况:

高维基遗传和基因组学数据带来了诸如维度诅咒之类的挑战.
传统的随机森林 (RF) 模型在高维设置中可能表现出不高的准确性,特别是在有限的样本大小的情况下.
整合先前的生物知识是提高机器学习模型性能的一个有希望的策略.

研究的目的:

提出一种新的知识倾斜的随机森林 (RF) 模型.
提高基因选择在高维基因组学数据中的性能和可解释性.
在采用小样本大小的场景中解决传统射频的局限性.

主要方法:

知识倾向的RF集成生物网络 (例如,蛋白质-蛋白质相互作用网络) 作为先前知识.
一个随机步行与重启算法确定基因相关性基于网络拓.
基因相关性得分修改了RF算法的特征选择概率,并通过修改的Boruta算法进行了增强.

主要成果:

与传统的RF和在模拟数据集上的物流拉索回归相比,知识倾斜的RF在结果预测中表现出更好的精度.
该方法有效地识别了更多具有生物相关性的基因.
与标准RF方法相比,观察到更好的解释性.

结论:

知识倾向的RF提供了一种强大的方法来处理高维基因组学数据,克服维度的诅咒.
整合先前的生物网络知识显著提高了模型的性能和可解释性.
这种方法在复杂疾病中识别相关基因方面显示出前景,正如 calcific主动脉狭窄症的案例研究中验证的那样.