Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Observational Learning

Observational Learning

Albert Bandura's observational learning, also known as imitation or modeling, occurs when a person observes and imitates another's behavior. It is a quicker process than operant conditioning. A well-known example is the Bobo doll study, where children who saw an adult acting aggressively towards the doll were more likely to act aggressively when left alone, compared to those who observed a nonaggressive adult. Many psychologists view observational learning as a form of latent learning...

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Model Approaches for Pharmacokinetic Data: Distributed Parameter Models

Pharmacokinetic models are mathematical constructs that represent and predict the time course of drug concentrations in the body, providing meaningful pharmacokinetic parameters. These models are categorized into compartment, physiological, and distributed parameter models.
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...

Associative Learning

Associative Learning

Associative learning is a fundamental concept in behavioral psychology, wherein a connection is established between two stimuli or events, leading to a learned response. This process is critical in understanding how behaviors are acquired and modified. Conditioning, the mechanism through which associations are formed, can be divided into two main types: classical conditioning and operant conditioning, each elucidating different aspects of associative learning.
Classical conditioning, also known...

Reinforcement

Reinforcement

Positive and negative reinforcement are key concepts in operant conditioning, a learning process where the consequences of a behavior affect the likelihood of that behavior being repeated.
Positive reinforcement occurs when a behavior is followed by the presentation of a rewarding stimulus, increasing the frequency of that behavior. For example:

Distributed Loads: Problem Solving

Distributed Loads: Problem Solving

Beams are structural elements commonly employed in engineering applications requiring different load-carrying capacities. The first step in analyzing a beam under a distributed load is to simplify the problem by dividing the load into smaller regions, which allows one to consider each region separately and calculate the magnitude of the equivalent resultant load acting on each portion of the beam. The magnitude of the equivalent resultant load for each region can be determined by calculating...

Reinforcement Schedules

Reinforcement Schedules

Positive reinforcement is a powerful method for teaching new behaviors to both animals and humans. B.F. Skinner demonstrated this with his experiments using rats in a Skinner box. When a rat pressed a lever, it received a food pellet. This immediate reward encouraged the rat to repeat the behavior. This method, where a reward follows every instance of the behavior, is known as continuous reinforcement. It is highly effective for establishing new behaviors quickly.
Once a behavior is learned,...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

An interpretable machine learning framework for classifying human and machine translations across genres.

Frontiers in artificial intelligence·2026

Same author

FairGen: preference-aligned diffusion for demographically equitable medical image synthesis.

NPJ digital medicine·2026

Same author

Greenhouse Gas Emission Fluxes in Urban Wetlands of Qinghai-Tibet Plateau.

Biology·2026

Same author

Liposomal Salvianolic Acid B Enhances ALA-PDT in Oral Leucoplakia via ROS and AKT/mTOR Signalling.

International dental journal·2026

Same author

CT and MRI features of renal inflammatory myofibroblastic tumor and its differential diagnosis from clear cell renal cell carcinoma and chromophobe renal cell carcinoma: a study of 13 cases from two centers.

BMC urology·2026

Same author

MVHumanNet++: A Large-scale Dataset of Multi-view Daily Dressing Human Captures with Richer Annotations for 3D Human Digitization.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

HardFlow: Hard-Constrained Sampling for Flow-Matching Models Via Trajectory Optimization.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Industrial Brain: Self-Evolving Neuro-Symbolic Autonomy with Causal Resilience for Cyber-Physical Systems.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Adaptive Hardness-Driven Dictionary Distillation for Incomplete Streaming View Clustering.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Mixture of Global and Local Experts with Diffusion Transformer for Controllable Face Generation.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Task-KV: Task-aware KV Cache Optimization via Semantic Differentiation of Attention Heads.

IEEE transactions on pattern analysis and machine intelligence·2026

Same journal

Achieving Text-based Person Retrieval with Any Granularity.

IEEE transactions on pattern analysis and machine intelligence·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Sep 17, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

强化学习与LLM互动对于分布式扩散模型服务.

Hongyang Du, Ruichen Zhang, Dusit Niyato

IEEE transactions on pattern analysis and machine intelligence

|June 30, 2025

概括

此摘要是机器生成的。

本研究介绍了一种交互式AI (IAI) 方法,用于分布式生成扩散模型 (GDM) 图像生成,提高体验质量 (QoE) 和效率. 通过优化资源配置,G-DDPG算法将总 QoE 提高 15%.

更多相关视频

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

相关实验视频

Last Updated: Sep 17, 2025

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Augmenting Large Language Models via Vector Embeddings to Improve Domain-Specific Responsiveness

Published on: December 6, 2024

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Constructing and Visualizing Models using Mime-based Machine-learning Framework

Published on: July 22, 2025

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems

Published on: June 13, 2025

科学领域:

人工智能的人工智能
计算机视觉计算机视觉
分布式系统分布式系统

背景情况:

分布式人工智能生成内容 (AIGC) 在体验质量 (QoE) 和能源效率方面面临挑战,特别是在生成扩散模型 (GDM) 图像生成方面.
当前的GDM框架缺乏以用户为中心的管理,以优化主观体验和资源利用.

研究的目的:

提出一种以用户为中心的新型交互式AI (IAI) 方法来管理基于GDM的分布式AIGC服务.
提高主观的体验质量 (QoE),提高人工智能产生的图像服务的能源效率.
为动态无线环境开发适应性资源分配算法.

主要方法:

重组GDM推理允许具有相似提示的用户共享denoising进程.
引入了强化学习与大型语言模型交互 (RLLI) 实时,主观QoE反使用LLM驱动的代理.
将深度决定性政策梯度 (DDPG) 算法调整为G-DDPG,以实现有效的资源配置.

主要成果:

拟议的IAI框架允许合作部署和高效的GDM推断.
RLLI有效地复制用户交互,提供个性化的QOE反.
与标准 DDPG 算法相比,G-DDPG 在总 QoE 中表现出 15% 的改善.

结论:

拟议的IAI方法,加上RLLI和G-DDPG,在分布式AIGC服务中显著提高了QOE和资源效率.
这一框架为以用户为中心的服务管理在生成性AI中提供了一个有希望的方向.
这些发现强调了整合LLM和强化学习的潜力,以优化复杂的AI系统.