Paper Title
Automated Cache for Container Executables
Paper Authors
Paper Abstract
Linux container technologies such as Docker and Singularity offer encapsulated environments for easy execution of software. In high performance computing, this is especially important for evolving and complex software stacks with conflicting dependencies that must co-exist. Singularity Registry HPC ("shpc") was created to install containers in this environment as modules, so that executables typically hidden inside containers are presented to the user as commands, which significantly simplifies the user experience. A remaining challenge, however, is deriving the list of important executables in a container. In this work, we present new automation and methods that not only discover new containers in large community collections, but also derive container entries with their important executables. With this work we have added over 8,000 containers from the BioContainers community, which the software automation can maintain and update over time. All software is publicly available on GitHub and can benefit container registries and infrastructure providers by automatically generating container modules, lowering the barrier to entry and improving the user experience.