Jove
Visualize
联系我们
JoVE
x logofacebook logolinkedin logoyoutube logo
关于 JoVE
概览领导团队博客JoVE 帮助中心
作者
出版流程编辑委员会范围与政策同行评审常见问题投稿
图书馆员
用户评价订阅访问资源图书馆顾问委员会常见问题
研究
JoVE JournalMethods CollectionsJoVE Encyclopedia of Experiments存档
教育
JoVE CoreJoVE BusinessJoVE Science EducationJoVE Lab Manual教师资源中心教师网站
使用条款与条件
隐私政策
政策

相关概念视频

Fast Fourier Transform01:10

Fast Fourier Transform

270
The Fast Fourier Transform (FFT) is a computational algorithm designed to compute the Discrete Fourier Transform (DFT) efficiently. By breaking down the calculations into smaller, manageable sections, the FFT significantly reduces the computational complexity involved. Direct computation of an N-point DFT requires N2 complex multiplications, whereas the FFT algorithm needs only (N/2)log⁡2N multiplications, offering a much faster performance.
The computational efficiency of the FFT becomes...
270
Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving01:29

Mechanistic Models: Compartment Models in Algorithms for Numerical Problem Solving

41
Mechanistic models play a crucial role in algorithms for numerical problem-solving, particularly in nonlinear mixed effects modeling (NMEM). These models aim to minimize specific objective functions by evaluating various parameter estimates, leading to the development of systematic algorithms. In some cases, linearization techniques approximate the model using linear equations.
In individual population analyses, different algorithms are employed, such as Cauchy's method, which uses a...
41

您也可能阅读

相关文章

通过共同作者、期刊和引用图与本文相关的文章。

排序
Same author

NN-xTB: density functional accuracy at semi empirical speed with neural network extended tight binding.

Nature communications·2026
Same author

Double-Hybrid, but Not Double-Cost: GPU-Accelerated DHDFT for the COMPAS-3 Data Set of Polybenzenoid Hydrocarbons.

Journal of chemical theory and computation·2026
Same author

Efficient Algorithms for GPU Accelerated Evaluation of the DFT Exchange-Correlation Functional.

Journal of chemical theory and computation·2025
Same author

High-Performance, High-Angular-Momentum J Engine on Graphics Processing Units.

Journal of chemical theory and computation·2025
Same author

Acceleration of Self-Consistent Field Calculations Using Basis Set Projection and Many-Body Expansion as Initial Guess Methods.

Journal of chemical theory and computation·2025
Same author

An Efficient RI-MP2 Algorithm for Distributed Many-GPU Architectures.

Journal of chemical theory and computation·2024
Same journal

Nuclear Gradients from Auxiliary-Field Quantum Monte Carlo and Their Applications in ML-Driven Geometry Optimization and Transition State Search.

Journal of chemical theory and computation·2026
Same journal

Correction to "Cluster-in-Molecule Local Correlation Method with an Accurate Distant Pair Correction for Large Systems".

Journal of chemical theory and computation·2026
Same journal

Machine-Learned Force Fields for Lattice Dynamics at Coupled-Cluster Level Accuracy.

Journal of chemical theory and computation·2026
Same journal

Systematic Molecularity-Dependent Entropy Errors in Continuum/RRHO Solution Thermochemistry: Origin and Correction.

Journal of chemical theory and computation·2026
Same journal

After 100 Years of Quantum Mechanics: Toward a Constructive Observation-Centered Perspective.

Journal of chemical theory and computation·2026
Same journal

Sample-Based Quantum Diagonalization Methods for Modeling the Photochemistry of Diazirine and Diazo Compounds.

Journal of chemical theory and computation·2026
查看所有相关文章

相关实验视频

Updated: Jun 6, 2025

Design and Optimization Strategies of a High-Performance Vented Box
14:23

Design and Optimization Strategies of a High-Performance Vented Box

Published on: June 9, 2023

1.1K

在GPU集群上构建高性能Fock矩阵的先进技术.

Elise Palethorpe1, Ryan Stocks1, Giuseppe M J Barca2

  • 1School of Computing, Australian National University, Canberra, ACT 2601, Australia.

Journal of chemical theory and computation
|November 25, 2024
PubMed
概括
此摘要是机器生成的。

两个新的多GPU算法,opt-UM和opt-Brc,加速了用于电子结构计算的Fock矩阵构造. 这些优化的方法显著优于现有的GPU和CPU实现,提供了大量的加速度和更好的功率效率.

更多相关视频

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.1K
Microfluidic Imaging Flow Cytometry by Asymmetric-detection Time-stretch Optical Microscopy ATOM
07:19

Microfluidic Imaging Flow Cytometry by Asymmetric-detection Time-stretch Optical Microscopy ATOM

Published on: June 28, 2017

10.3K

相关实验视频

Last Updated: Jun 6, 2025

Design and Optimization Strategies of a High-Performance Vented Box
14:23

Design and Optimization Strategies of a High-Performance Vented Box

Published on: June 9, 2023

1.1K
Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry
12:11

Computation of Atmospheric Concentrations of Molecular Clusters from ab initio Thermochemistry

Published on: April 8, 2020

8.1K
Microfluidic Imaging Flow Cytometry by Asymmetric-detection Time-stretch Optical Microscopy ATOM
07:19

Microfluidic Imaging Flow Cytometry by Asymmetric-detection Time-stretch Optical Microscopy ATOM

Published on: June 28, 2017

10.3K

科学领域:

  • 计算化学的计算化学
  • 量子化学 是一个量子化学.
  • 高性能计算 高性能计算

背景情况:

  • 福克矩阵构造是电子结构计算中的关键步骤.
  • 现有的算法面临性能瓶,特别是在大型系统和先进的基础集.
  • 优化福克矩阵构造对于推进计算化学能力至关重要.

研究的目的:

  • 开发和介绍两种新的,优化的多GPU算法用于Fock矩阵构造.
  • 为了提高Hartree-Fock计算的计算效率和可扩展性.
  • 在现有的CPU和GPU实现中提供显著的加快速度.

主要方法:

  • 实现了两个优化的多GPU算法:opt-UM和opt-Brc.
  • 纳入先进技术:改进了整体选,稀疏性/对称性利用和线性缩放交换矩阵组装.
  • 在使用极端规模电子结构系统 (EXESS) 的NVIDIA A100 GPU上进行基准测试.
  • 测试各种系统类型 (线性,球形) 和基础集 (双ζ,三ζ).

主要成果:

  • 与TeraChem,QUICK,GPU4PySCF,LibIntX,ORCA和Q-Chem相比,这些算法表现出更高的性能.
  • 与TeraChem,QUICK和GPU4PySCF相比,观察到的平均加速度分别为1.4×,8.4×和9.4×.
  • 在单个节点上实现了高达42×和31×的加速度,而不是ORCA和Q-Chem.
  • 在四个GPU上选择Brc的并行效率超过91%.
  • 功率效率提高了多达18×.

结论:

  • 开发的opt-UM和opt-Brc算法代表了福克矩阵构造的重大进步.
  • 这些算法为电子结构计算提供了显著的性能提升和更好的可扩展性.
  • 优化的方法为更高效和更强大的计算化学研究铺平了道路.