Abstract
In the era of exascale computing and big data, High Performance Computing (HPC) systems have been widely deployed as the infrastructure for big data analytics in order to leverage their parallel computing capabilities. As the I/O patterns in HPC systems become increasingly complicated and heterogeneous, breaking through the I/O bottleneck is an urgent challenge. In recent years, flash-based storage arrays and storage servers have been gradually deployed in HPC storage systems. However, conventional shared storage architectures, I/O software stacks, and storage networking designs were devised primarily for Hard Disk Drives (HDD); they induce severe overhead along the I/O path and prevent HPC storage systems from taking full advantage of the performance benefits of Non-Volatile Memory (NVM). As a result, these systems still suffer from high I/O access latency and limited concurrent I/O throughput and burst I/O bandwidth. To address these challenges, this paper proposes a region-shared, highly concurrent storage architecture based on NVM. We design an NVMe-over-Fabrics (NVMeoF) burst I/O buffer storage pool, NV-BSP, which implements key techniques including virtualized storage pool resource management and NVMeoF network storage communication over the Tianhe high-speed interconnection network. NV-BSP scales both horizontally and vertically, and effectively supports burst I/O acceleration for specific computing tasks as well as low-latency remote storage access. Based on a performance analysis model for mixed HPC and big data workloads, we further propose a Quality-of-Service (QoS) control strategy for mixed applications. Experimental results on a small-scale prototype system show that the read and write performance of NV-BSP scales well with the number of concurrent I/O handling threads, and that NV-BSP achieves significantly higher I/O bandwidth than the built-in MD-RAID in Linux. Compared with node-local I/O access, the read and write latencies of NVMeoF-based remote storage over the Tianhe interconnection network increase by only 59.25 μs and 54.03 μs, respectively. By disaggregating storage from computation, NV-BSP improves the flexibility of dynamic storage resource provisioning and overall system reliability while delivering performance comparable to a node-local storage pool.
Authors
LI Qiong; SONG Zhen-long; YUAN Yuan; XIE Xu-chao (School of Computer, National University of Defense Technology, Changsha 410073, China)
Source
Computer Engineering & Science (《计算机工程与科学》), 2020, Issue 10, pp. 1711-1719 (9 pages)
Indexed by: CSCD; Peking University Core Journals (北大核心)
Funding
National Key Research and Development Program of China (2018YFB0204301).