相关概念视频
Improving Translational Accuracy
Language and Cognition
Per-Unit Sequence Models
Zero-sequence currents, which are identical in magnitude and phase, generate a neutral current, resulting in voltage drops across the neutral impedance and the low-voltage winding. If the...
Model Approaches for Pharmacokinetic Data: Distributed Parameter Models
The distributed parameter models are specifically designed to account for variations and differences in some drug classes. This model is particularly useful for assessing regional concentrations of anticancer or...
Generalization, Discrimination, and Extinction
Generalization occurs when a behavior reinforced in one context is performed in similar situations. For instance, a student who studies diligently for calculus and receives excellent grades might apply the same study habits to psychology and history, expecting similar results. Generalization shows how learning in one setting can influence behavior in...
Multicompartment Models: Overview
These models offer a more comprehensive representation of drug behavior in the body than one-compartment models. They accommodate the complexity of drug distribution,...
您也可能阅读
相关文章
通过共同作者、期刊和引用图与本文相关的文章。
Evaluating the robustness and readiness of large frontier models in health AI applications.
Desert <i>Chlorella</i> Malate Synthase 1 Enhances Salt Tolerance by Promoting Soluble Sugar and Lipid Accumulation.
Hypothesis Generation via Interpretable Machine Learning: A Case Study on Risk Factors for Postradiation Therapy Lung Cancer Recurrence.
A Machine Learning Approach to Determine the Optimal Age for Total Hip Arthroplasty: When Is Risk for Adverse Outcomes Lowest?
ProtSATT: An Advanced Protein Solubility Predictor Based on Attention Mechanism.
The most important features in generalized additive models might be groups of features.
Interplay between oxygen redox and interfacial stability of Li-rich positive electrodes in sulfide-based all-solid-state batteries.
Breaking dependence on melanisation imparts diversity to a dogmatic invasion strategy of phytopathogenic fungi.
Hydroxyl-rich nanocavities on perovskite enable nearly barrierless intramolecular hydrogen transfer for nitrate electroreduction to ammonia.
Household mobility responses to weather extremes in Kyrgyzstan.
Autonomous Motion Vision with Tri-bulk-heterojunctioned Organic Adaptation Transistor.
Tissue-adhesive hydrogel optical fiber for peripheral optogenetic neuromodulation.
在培训期间,用大型语言模型增强可解释模型.
Chandan Singh1, Armin Askari2, Rich Caruana3
1Microsoft Research, Redmond, WA, USA. chansingh@microsoft.com.
Aug模型利用大语言模型 (LLM) 进行高效和可解释的预测. 这个框架为推理提供了显著的速度和内存改进,使人工智能更容易获得.
更多相关视频
05:47Evidence-based Knowledge Synthesis and Hypothesis Validation: Navigating Biomedical Knowledge Bases via Explainable AI and Agentic Systems
Published on: June 13, 2025
09:09Foreign Accent and Forensic Speaker Identification in Voice Lineups: The Influence of Acoustic Features Based on Prosody
Published on: September 27, 2024
科学领域:
- 人工智能的人工智能
- 自然语言处理自然语言处理.
- 计算神经科学是一种神经科学.
背景情况:
- 大型语言模型 (LLM) 显示出高性能,但缺乏可解释性和效率.
- 高风险领域和计算有限的设置需要可解释和高效的AI模型.
研究的目的:
- 提出Aug模型,一个框架,以创建高效和可解释的预测模型,利用LLMs的知识.
- 允许LLM知识转移仅用于推断,确保透明度和速度.
主要方法:
- 开发了Aug模型框架,在培训过程中使用LLM,但不推断.
- 在NLP中实例化的Aug模型:Aug-Linear (线性模型+LLM嵌入) 和Aug-Tree (决策树+LLM功能扩展).
- 应用Aug模型对文本分类和自然语言fMRI数据分析.
主要成果:
- 与LLMs相比,Aug模型显著提高了推断速度和内存效率 (超过1000倍).
- 在文本分类中,Aug-Linear和Aug-Tree的表现优于非增强式可解释模型.
- 高线性,具有1万倍较少的参数,超越了60亿参数GPT-J模型,同时保持透明.
结论:
- 在资源有限的环境中,Aug模型为可解释和高效的AI提供了可行的解决方案.
- 该框架成功地将LLM功能转移到更小,透明的模型中.
- 增强模型从复杂的数据中提供了有价值的科学解释,例如fMRI.
- 这种方法使先进的人工智能能力在各种科学领域的使用得到了民主化.
