Search research articles

关于 JoVE

概览领导团队博客 JoVE 帮助中心

作者

出版流程编辑委员会范围与政策同行评审常见问题投稿

图书馆员

用户评价订阅访问资源图书馆顾问委员会常见问题

研究

JoVE Journal Methods Collections JoVE Encyclopedia of Experiments 存档

教育

JoVE Core JoVE Business JoVE Science Education JoVE Lab Manual 教师资源中心教师网站

使用条款与条件

相关概念视频

Multi-input and Multi-variable systems

Multi-input and Multi-variable systems

Cruise control systems in cars are designed as multi-input systems to maintain a driver's desired speed while compensating for external disturbances such as changes in terrain. The block diagram for a cruise control system typically includes two main inputs: the desired speed set by the driver and any external disturbances, such as the incline of the road. By adjusting the engine throttle, the system maintains the vehicle's speed as close to the desired value as possible.
In the absence...

Prediction Intervals

Prediction Intervals

The interval estimate of any variable is known as the prediction interval. It helps decide if a point estimate is dependable.
However, the point estimate is most likely not the exact value of the population parameter, but close to it. After calculating point estimates, we construct interval estimates, called confidence intervals or prediction intervals. This prediction interval comprises a range of values unlike the point estimate and is a better predictor of the observed sample value, y.

Propagation of Uncertainty from Random Error

Propagation of Uncertainty from Random Error

An experiment often consists of more than a single step. In this case, measurements at each step give rise to uncertainty. Because the measurements occur in successive steps, the uncertainty in one step necessarily contributes to that in the subsequent step. As we perform statistical analysis on these types of experiments, we must learn to account for the propagation of uncertainty from one step to the next. The propagation of uncertainty depends on the type of arithmetic operation performed on...

Improving Translational Accuracy

Improving Translational Accuracy

Base complementarity between the three base pairs of mRNA codon and the tRNA anticodon is not a failsafe mechanism. Inaccuracies can range from a single mismatch to no correct base pairing at all. The free energy difference between the correct and nearly correct base pairs can be as small as 3 kcal/ mol. With complementarity being the only proofreading step, the estimated error frequency would be one wrong amino acid in every 100 amino acids incorporated. However, error frequencies observed in...

Decision Making: P-value Method

Decision Making: P-value Method

The process of hypothesis testing based on the P-value method includes calculating the P- value using the sample data and interpreting it.
First, a specific claim about the population parameter is proposed. The claim is based on the research question and is stated in a simple form. Further, an opposing statement to the claim is also stated. These statements can act as null and alternative hypotheses: a null hypothesis would be a neutral statement while the alternative hypothesis can...

Difference from Background: Limit of Detection

Difference from Background: Limit of Detection

The limit of detection (LOD) is the smallest amount of analyte that can be distinguished from the background noise. The LOD value corresponds to the concentration at which the analyte signal is three times larger than the standard deviation of the blank signal. Below this value, the analyte signal cannot be differentiated from the background noise. It is calculated by dividing the calibration slope by 3 times the standard deviation of the blank signals.
The LOD indicates the presence or absence...

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序

Same author

Utilization of Lung Cancer Registries in Learning Health Systems for Health Care Improvement.

JCO clinical cancer informatics·2025

Same author

Transbronchial cryobiopsy followed by as-needed surgical lung biopsy versus immediate surgical lung biopsy for diagnosing interstitial lung disease (the COLD study): a randomised controlled trial.

The Lancet. Respiratory medicine·2024

Same author

Minimally invasive technique for gastric GIST at challenging locations: single incision surgical gastroscopy.

Updates in surgery·2023

Same author

[Effect of melatonin on hyperoxia-induced oxidant/antioxidant imbalance in the lung of neonatal rats with chronic lung disease].

Zhongguo dang dai er ke za zhi = Chinese journal of contemporary pediatrics·2009

Same author

Phase I/II trial of AEG35156 X-linked inhibitor of apoptosis protein antisense oligonucleotide combined with idarubicin and cytarabine in patients with relapsed or primary refractory acute myeloid leukemia.

Journal of clinical oncology : official journal of the American Society of Clinical Oncology·2009

Same author

Multiplex single-nucleotide polymorphism typing by nanoparticle-coupled DNA-templated reactions.

Journal of the American Chemical Society·2009

Same journal

VideoPASTA: 7K Preference Pairs That Matter for Video-LLM Alignment.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

Synth-SBDH: A Synthetic Dataset of Social and Behavioral Determinants of Health for Clinical Text.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

X-CoT: Explainable Text-to-Video Retrieval via LLM-based Chain-of-Thought Reasoning.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

DischargeSim: A Simulation Benchmark for Educational Doctor-Patient Communication at Discharge.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

From Scores to Steps: Diagnosing and Improving LLM Performance in Evidence-Based Medical Calculations.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

Same journal

BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing·2026

查看所有相关文章

Search research articles

相关实验视频

Updated: Sep 17, 2025

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

用多提示符改进最小湾区风险解码.

David Heineman¹, Yao Dou¹, Wei Xu¹

¹School of Interactive Computing, Georgia Institute of Technology.

Proceedings of the Conference on Empirical Methods in Natural Language Processing. Conference on Empirical Methods in Natural Language Processing

|July 4, 2025

概括

此摘要是机器生成的。

微调指令的大型语言模型 (LLM) 从多提示符解码中受益,从而产生各种候选项以提高性能. 这种方法增强了最小贝叶斯风险 (MBR) 解码,以便在各种任务中生成更稳定和最佳的文本.

更多相关视频

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

A Tactile Automated Passive-Finger Stimulator TAPS

A Tactile Automated Passive-Finger Stimulator TAPS

Published on: June 3, 2009

相关实验视频

Last Updated: Sep 17, 2025

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

P300-Based Brain-Computer Interface Speller Performance Estimation with Classifier-Based Latency Estimation

Published on: September 8, 2023

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Measuring the Subjective Value of Risky and Ambiguous Options using Experimental Economics and Functional MRI Methods

Published on: September 19, 2012

A Tactile Automated Passive-Finger Stimulator TAPS

A Tactile Automated Passive-Finger Stimulator TAPS

Published on: June 3, 2009

科学领域:

自然语言处理自然语言处理.
人工智能的人工智能
机器学习机器学习

背景情况:

指令微调的大型语言模型 (LLM) 显示出强大的文本生成能力,但由于提示敏感性而遭受性能不稳定.
一个单一的提示可能不包括给定的生成任务的所有最佳策略,导致次优化结果.

研究的目的:

引入和评估一种新的多提示解码策略,以提高LLM文本生成的稳定性和性能.
调查从一个提示银行生成多个候选输出是否可以改善下游任务性能.

主要方法:

建议多提示解码,在推断时间从精心策划的提示库中生成众多候选文本输出.
采用最小贝叶斯风险 (MBR) 解码来组合这些候选者,根据训练有素的价值指标选择最终输出.

主要成果:

多提示符解码显著提高了MBR解码性能在广泛的条件文本生成任务.
与单一提示方法相比,增强的性能归因于创建了一个更多样化,更高质量的候选解决方案空间.
进一步的实验验证了跨不同LLM架构,任务和评估指标的多提示符解码的有效性.

结论:

多提示符解码提供了一种强大的方法来克服指令微调的LLM中的提示敏感性问题.
这种技术导致更稳定,最佳和多样化的文本生成,改善条件生成场景中的LLM整体实用性.