论文标题
网格:Onecode和Fourapis
Grid: OneCode and FourAPIs
论文作者
论文摘要
我们讨论了针对晶格QCD的网格软件库的实质更新,使其能够端口到多个GPU架构,同时将CPU矢量化和SIMD执行在OpenMP线程中。支持的GPU环境包括供应商特定的NVIDIA CUDA和AMD髋关节环境以及基于标准的SYCL实现。这是通过内部抽象接口来执行的,从而为所有计划的Exascale架构以及美国能源部计划的所有计划提供了单一源跨平台性能便携性。
We discuss a substantial update to the Grid software library for Lattice QCD, enabling it to port to multiple GPU architectures while retaining CPU vectorisation and SIMD execution within OpenMP threads. The GPU environments supported include vendor specific Nvidia CUDA and AMD HIP environments and a (mostly) standards based SYCL implementation. This is performed by an internal abstraction interface giving single source cross-platform performance portability across all number of planned Exascale architectures, and all those planned by the US Department of Energy.