Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Synthetic Biology02:55

Synthetic Biology

5.5K
Synthetic biology is an interdisciplinary science that involves using principles from disciplines such as engineering, molecular biology, cell biology, and systems biology. It involves remodeling existing organisms from nature or constructing completely new synthetic organisms for applications such as protein or enzyme production, bioremediation, value-added macromolecule production, and the addition of desirable traits to crops, to name a few.
Golden rice
Golden rice is a genetically modified...
5.5K
Combinatorial Gene Control02:33

Combinatorial Gene Control

9.5K
Combinatorial gene control is the synergistic action of several transcriptional factors to regulate the expression of a single gene. The absence of one or more of these factors may lead to a significant difference in the level of gene expression or repression.
The expression of more than 30,000 genes is controlled by approximately 2000-3000 transcription factors. This is possible because a single transcription factor can recognize more than one regulatory sequence. The specificity in gene...
9.5K
Random Sampling Method01:09

Random Sampling Method

14.1K
Sampling is a technique to select a portion (or subset) of the larger population and study that portion (the sample) to gain information about the population. Data are the result of sampling from a population. The sampling method ensures that samples are drawn without bias and accurately represent the population. Because measuring the entire population in a study is not practical, researchers use samples to represent the population of interest. Among the various sampling methods used by...
14.1K
Synthetic Disvision of Polynomials01:28

Synthetic Disvision of Polynomials

139
Synthetic division is an efficient algorithmic approach for dividing a polynomial by a linear binomial of the form x - c, where c is a real number. This method is helpful due to its streamlined process, which avoids the more cumbersome steps involved in the traditional long division of polynomials. It simplifies computation and serves as a practical tool for evaluating polynomials and identifying their factors.To perform synthetic division, one begins by listing the coefficients of the...
139
Mechanistic Models: Compartment Models in Individual and Population Analysis01:23

Mechanistic Models: Compartment Models in Individual and Population Analysis

237
Mechanistic models are utilized in individual analysis using single-source data, but imperfections arise due to data collection errors, preventing perfect prediction of observed data. The mathematical equation involves known values (Xi), observed concentrations (Ci), measurement errors (εi), model parameters (ϕj), and the related function (ƒi) for i number of values. Different least-squares metrics quantify differences between predicted and observed values. The ordinary least...
237
Bootstrapping01:24

Bootstrapping

798
The term "bootstrap" originated in the 19th century as a metaphor for self-improvement or achieving something independently, without external assistance. This concept extends to statistical bootstrapping, a self-contained method for estimating population parameters through resampling, even though it can be computationally intensive. Developed by the American statistician Dr. Bradley Efron in 1979, bootstrapping provides a robust way to perform inference when the original sample size is...
798

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

Knowledge, attitude, and practice of the general population towards ear care and hearing health in Karnali Province, Nepal.

Journal of public health research·2025
Same author

Prevalence and causes of hearing impairment: a cross-sectional study in Karnali Province, Nepal - CORRIGENDUM.

The Journal of laryngology and otology·2025
Same author

Input Space Coverage Matters.

Computer·2025
Same author

Prevalence and causes of hearing impairment: a cross-sectional study in Karnali Province, Nepal.

The Journal of laryngology and otology·2025
Same author

Combinatorial Test Generation for Multiple Input Models with Shared Parameters.

IEEE transactions on pattern analysis and machine intelligence·2024
Same author

A Trusted Federated System to Share Granular Data Among Disparate Database Resources.

Computer·2024
Same journal

Toward Cybersecurity Testing and Monitoring of IoT Ecosystems.

SN computer science·2026
Same journal

Voxel-based Deep Regression for Enhanced Body Composition Estimation from 3D Body Scans.

SN computer science·2026
Same journal

Detecting Adverse Drug Events in Social Media: A Brief Literature Review.

SN computer science·2026
Same journal

TRAM: The Telecommunications-Related AcciMap Method.

SN computer science·2026
Same journal

To Signal or Not to Signal? A Non-cooperative Game-Theoretic Approach to Discretionary Communication Between Road Users.

SN computer science·2025
Same journal

Fast and Secure Multiparty Querying over Federated Graph Databases.

SN computer science·2025
查看所有相关文章
  1. 首页
  2. 一种用于机器学习的合成数据生成的组合方法.
  1. 首页
  2. 一种用于机器学习的合成数据生成的组合方法.

相关实验视频

A Data Integration Workflow to Identify Drug Combinations Targeting Synthetic Lethal Interactions
07:40

A Data Integration Workflow to Identify Drug Combinations Targeting Synthetic Lethal Interactions

Published on: May 27, 2021

4.5K

一种用于机器学习的合成数据生成的组合方法.

Krishna Khadka1, Jaganmohan Chandrasekaran2, Yu Lei1

  • 1Department of Computer Science and Engineering, The University of Texas at Arlington, Arlington, TX 76019 USA.

SN computer science
|January 12, 2026

在PubMed 上查看摘要

概括
此摘要是机器生成的。

这项研究引入了一种用于生成合成数据的新型组合采样方法,大大减少了用于可比机器学习模型性能所需的样本数量,并加强了隐私保护.

关键词:
组合测试试验 组合测试试验不同的隐私差异性隐私.综合数据生成的合成数据生成.变量自动编码器变量自动编码器

更多相关视频

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.3K

相关实验视频

A Data Integration Workflow to Identify Drug Combinations Targeting Synthetic Lethal Interactions
07:40

A Data Integration Workflow to Identify Drug Combinations Targeting Synthetic Lethal Interactions

Published on: May 27, 2021

4.5K
Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances
07:35

Selecting Multiple Biomarker Subsets with Similarly Effective Binary Classification Performances

Published on: October 11, 2018

7.9K
Constructing and Visualizing Models using Mime-based Machine-learning Framework
06:19

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

2.3K

科学领域:

  • 机器学习 机器学习
  • 数据 隐私 数据 隐私 数据
  • 合成数据生成 合成数据生成

背景情况:

  • 机器学习数据集经常包含敏感的个人健康和财务信息,构成隐私风险.
  • 现有的合成数据生成方法通常需要大量的样本,影响下游任务效率.
  • 目前的技术包括编码数据,在潜空间中随机采样,以及解码以生成合成数据.

研究的目的:

  • 开发一种高效的合成数据生成技术,尽量减少样本要求.
  • 增强合成数据生成方法的隐私保护能力.
  • 用合成数据提高机器学习模型的性能.

主要方法:

  • 建议采用组合方法对隐性空间进行采样,重点关注隐性维度之间的t路相互作用.
  • 这种方法的动机是发现模型预测通常是由有限数量的特征之间的相互作用驱动的.
  • 该方法通过利用这些已识别的特征相互作用来生成合成数据样本.

主要成果:

  • 与传统的随机抽样相比,组合抽样方法需要更少的合成样本来实现类似的模型性能.
  • 当与差异隐私相结合时,这种方法比随机抽样显示出较小的性能退化.
  • 经验结果证明了利用特征交互来有效生成合成数据的有效性.

结论:

  • 拟议的组合采样方法为生成高质量的合成数据提供了更有效的替代方案.
  • 这种技术在机器学习中改善了数据实用性和隐私保护之间的权衡.
  • 这些发现表明,基于特征相互作用的有针对性的采样可以显著提高合成数据生成过程.