408 articles found
Multi-weighted tree based query optimization method for parallel relational database systems
1
Author: LI Jian-zhong (College of Computer Science and Technology, Heilongjiang University, Harbin 150080, P.R. China; College of Computer Science and Engineering, Harbin Institute of Technology, Harbin 150001, P.R. China) 《黑龙江大学自然科学学报》 CAS 2001, No. 3, pp. 35-43 (9 pages)
The author investigates the query optimization problem for parallel relational databases. A multi-weighted tree based query optimization method is proposed. The method consists of a multi-weighted tree based parallel query plan model, a cost model for parallel query plans, and a query optimizer. The parallel query plan model is the first one to model all basic relational operations, all three types of parallelism of query execution, processor and memory allocation to operations, memory allocation to the buffers between operations in pipelines, and data redistribution among processors. The cost model takes the waiting time of the operations in pipelined execution into consideration and is computable in a bottom-up fashion. The query optimizer addresses the query optimization problem in the context of Select-Project-Join queries, which are widely used in commercial DBMSs. Several heuristics for determining the processor allocation to operations are derived and used in the query optimizer. The query optimizer is aware of memory resources in order to generate good-quality plans. It includes heuristics for determining the memory allocation to operations and to buffers between operations in pipelines so that the memory resource is fully exploited. In addition, multiple algorithms for implementing join operations are considered in the query optimizer, which can make an optimal choice of join algorithm for each join operation in a query. The proposed query optimization method has been used in a prototype parallel database management system designed and implemented by the author.
Keywords: multi-weighted tree; parallel relational database; query; optimization
A solution of spatial query processing and query optimization for spatial databases
2
Authors: YUAN Jie, XIE Kun-qing, MA Xiu-jun, ZHANG Min, SUN Le-bin 《重庆邮电学院学报(自然科学版)》 2004, No. 5, pp. 165-172 (8 pages)
Recently, attention has been focused on spatial query languages, which are used to query spatial databases. This paper presents a design of a spatial query language that extends the standard relational database query language SQL. It recognizes the significantly different requirements of spatial data handling and overcomes the inherent problems of applying conventional database query languages. The design is based on an extended spatial data model, including spatial data types and the spatial operators on them. The processing and optimization of spatial queries are also discussed. Finally, an implementation of the design in a spatial query subsystem is given.
Keywords: spatial database; query language; spatial data model; spatial operation; optimization
A Distributed DBMS Based Dynamic Programming Method for Query Optimization
3
Authors: 孙纪舟, 李阳, 蒋志勇, 顾云苏, 何清法 《Journal of Donghua University(English Edition)》 EI CAS 2012, No. 1, pp. 55-58 (4 pages)
Dynamic programming (DP) is an effective query optimization approach for selecting an appropriate join order for multi-table joins in a relational database management system (RDBMS). This method was extended and made available in a distributed DBMS (D-DBMS). The structure of the optimal solution was first characterized according to the distribution status of tables and data, and then the recurrence relations between a problem and its sub-problems were recursively defined. DP in a D-DBMS has the same time complexity as in a centralized DBMS, while it is able to solve the much more sophisticated optimization problem of multi-table joins in a D-DBMS. The effectiveness of this optimization strategy has been demonstrated by experiments.
Keywords: distributed database; dynamic programming (DP); multi-table join; query optimization
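The join-order DP that the abstract above builds on can be illustrated with a minimal sketch of the textbook, centralized version. This is not the distributed recurrence from the paper, which the listing does not reproduce. The function name `best_join_order`, the cost model (sum of intermediate result sizes), and the sample cardinalities and selectivities are all assumptions made for illustration.

```python
from itertools import combinations


def best_join_order(cards, sel):
    """Cheapest left-deep join order by dynamic programming over table subsets.

    cards: {table: row count}                       (hypothetical statistics)
    sel:   {frozenset({a, b}): join selectivity}    (missing pair = cross product, 1.0)
    Cost of a plan = sum of the sizes of its intermediate results (an assumed model).
    """
    tables = sorted(cards)
    # best[subset] = (cost so far, estimated rows, join order)
    best = {frozenset([t]): (0.0, float(cards[t]), (t,)) for t in tables}
    for size in range(2, len(tables) + 1):
        for combo in combinations(tables, size):
            subset = frozenset(combo)
            for last in combo:  # try each table as the last one joined
                rest = subset - {last}
                cost, rows, order = best[rest]
                s = 1.0
                for t in rest:  # combine selectivities against tables already joined
                    s *= sel.get(frozenset({t, last}), 1.0)
                out_rows = rows * cards[last] * s
                cand = (cost + out_rows, out_rows, order + (last,))
                if subset not in best or cand[0] < best[subset][0]:
                    best[subset] = cand
    cost, _, order = best[frozenset(tables)]
    return cost, order


# Hypothetical statistics: A joins B, B joins C; A and C share no predicate.
cards = {"A": 1000, "B": 100, "C": 10}
sel = {frozenset({"A", "B"}): 0.01, frozenset({"B", "C"}): 0.1}
cost, order = best_join_order(cards, sel)
print(cost, order)
```

With these sample numbers the sketch joins B and C first, because that pair produces the smallest intermediate result, and only then joins A. A distributed variant would also have to account for where each table resides and for data transfer costs.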
Query Optimization for Deductive Databases
4
Authors: 周傲英, 施伯乐 《Journal of Computer Science & Technology》 SCIE EI CSCD 1995, No. 2, pp. 134-148 (15 pages)
A systematic, efficient compilation method for query evaluation of Deductive Databases (DeDB) is proposed in this paper. In order to eliminate redundancy and to minimize the potentially relevant facts, which are two key issues for the efficiency of a DeDB, the compilation process is decomposed into two phases. The first is the pre-compilation phase, which is responsible for the minimization of the potentially relevant facts. The second, which we refer to as the general compilation phase, is responsible for the elimination of redundancy. The rule/goal graph devised by J. D. Ullman is appropriately extended and used as a uniform formalism. Two general algorithms corresponding to the two phases respectively are described intuitively and formally.
Keywords: deductive database; query evaluation; query optimization
Dynamic Query Optimization Approach for Semantic Database Grid (Cited by: 2)
5
Authors: 郑骁庆, 陈华钧, 吴朝晖, 毛郁欣 《Journal of Computer Science & Technology》 SCIE EI CSCD 2006, No. 4, pp. 597-608 (12 pages)
Fundamentally, a semantic database grid is about bringing globally distributed databases together in order to coordinate resource sharing and problem solving in which information is given well-defined meaning, and DartGrid II is the implemented database grid system whose goal is to provide a semantic solution for integrating database resources on the Web. Although many algorithms have been proposed for optimizing query processing in order to minimize the costs and/or response time associated with obtaining the answer to a query in a distributed database system, the database grid query optimization problem is fundamentally different from traditional distributed query optimization. These differences are shown to be the consequences of the autonomy and heterogeneity of database nodes in a database grid. Therefore, more challenges arise for query optimization in a database grid than in a traditional distributed database. Following this observation, the design of a query optimizer in DartGrid II is presented, and a heuristic, dynamic, and parallel query optimization approach to processing queries in a database grid is proposed. A set of semantic tools supporting relational database integration and semantic-based information browsing has also been implemented to realize the above vision.
Keywords: database integration; query optimization; semantic database grid
Query Optimization Framework for Graph Database in Cloud Dew Environment
6
Authors: Tahir Alyas, Ali Alzahrani, Yazed Alsaawy, Khalid Alissa, Qaiser Abbas, Nadia Tabassum 《Computers, Materials & Continua》 SCIE EI 2023, No. 1, pp. 2317-2330 (14 pages)
The query optimizer uses cost-based optimization to create the execution plan with the least cost, which also consumes the least amount of resources. Query optimization for relational database systems is a combinatorial optimization problem, which renders exhaustive search impossible as query sizes rise. Increases in CPU performance have outpaced those in main memory and disk access speeds in recent decades, allowing data compression to be used as a strategy for improving database system performance. For performance enhancement, compression and query optimization are the two most important factors. Compression reduces the volume of data, whereas query optimization minimizes execution time. Compressing the database reduces memory requirements: data takes less time to load into memory, fewer buffer misses occur, and intermediate results are smaller. This paper performs query optimization on a graph database in a cloud dew environment, considering which option requires less time to execute a query. Together, compression and query optimization improve database performance. The research compares the performance of MySQL and Neo4j databases in terms of memory usage and execution time when running on cloud dew servers.
Keywords: query optimization; compression; cloud dew; decompression; graph database
Accomplishing Deterministic XML Query Optimization (Cited by: 1)
7
Author: Dun-Ren Che 《Journal of Computer Science & Technology》 SCIE EI CSCD 2005, No. 3, pp. 357-366 (10 pages)
As the popularity of XML (eXtensible Markup Language) keeps growing rapidly, the management of XML-compliant structured-document databases has become a very interesting and compelling research area. Query optimization for XML structured documents stands out as one of the most challenging research issues in this area because of the much enlarged optimization (search) space, which is a consequence of the intrinsic complexity of the underlying data model of XML data. We therefore propose to apply deterministic transformations on query expressions to prune the search space as aggressively as possible and quickly achieve a sufficiently improved alternative (if not the optimal one) for each incoming query expression. This idea is not just exciting but practically attainable. This paper first provides an overview of our optimization strategy and then focuses on the key implementation issues of our rule-based transformation system for XML query optimization in a database environment. The performance results we obtained from experimentation show that our approach is valid and effective.
Keywords: XML query optimization; XML query; query optimization; XML database; structured-document database
Processing and optimization of UMQL-based multimedia queries
8
Authors: 吴宗大, 曹忠升, 王元珍, 李桂玲 《Journal of Southeast University(English Edition)》 EI CAS 2009, No. 3, pp. 320-325 (6 pages)
Through the mapping from UMQL (unified multimedia query language) conditional expressions to UMQA (unified multimedia query algebra) query operations, a translation algorithm from a UMQL query to a UMQA query plan is put forward, which can generate an equivalent UMQA internal query plan for any UMQL query. Then, to reduce the execution costs of UMQA query plans effectively, equivalent UMQA translation formulae and general optimization strategies are studied, and an optimization algorithm for UMQA internal query plans is presented. This algorithm uses the equivalent UMQA translation formulae to optimize query plans and makes the optimized plans accord with the optimization strategies as much as possible. Finally, the logic implementation methods of UMQA plans, i.e., the logic implementation methods of UMQA operators, are discussed to obtain useful target data from a multimedia database. All of these algorithms are implemented in a UMQL prototype system. Application results show that these query processing techniques are feasible and applicable.
Keywords: multimedia database; multimedia query language; query optimization; unified multimedia query language
Tailored Partitioning for Healthcare Big Data: A Novel Technique for Efficient Data Management and Hash Retrieval in RDBMS Relational Architectures
9
Authors: Ehsan Soltanmohammadi, Neset Hikmet, Dilek Akgun 《Journal of Data Analysis and Information Processing》 2025, No. 1, pp. 46-65 (20 pages)
Efficient data management in healthcare is essential for providing timely and accurate patient care, yet traditional partitioning methods in relational databases often struggle with the high volume, heterogeneity, and regulatory complexity of healthcare data. This research introduces a tailored partitioning strategy leveraging the MD5 hashing algorithm to enhance data insertion, query performance, and load balancing in healthcare systems. By applying a consistent hash function to patient IDs, our approach achieves uniform distribution of records across partitions, optimizing retrieval paths and reducing access latency while ensuring data integrity and compliance. We evaluated the method through experiments focusing on partitioning efficiency, scalability, and fault tolerance. The partitioning efficiency analysis compared our MD5-based approach with standard round-robin methods, measuring insertion times, query latency, and data distribution balance. Scalability tests assessed system performance across increasing dataset sizes and varying partition counts, while fault tolerance experiments examined data integrity and retrieval performance under simulated partition failures. The experimental results demonstrate that the MD5-based partitioning strategy significantly reduces query retrieval times by optimizing data access patterns, achieving up to X% better performance compared to round-robin methods. It also scales effectively with larger datasets, maintaining low latency and ensuring robust resilience under failure scenarios. This novel approach offers a scalable, efficient, and fault-tolerant solution for healthcare systems, facilitating faster clinical decision-making and improved patient care in complex data environments.
Keywords: Healthcare Data Partitioning; Relational Database Management Systems (RDBMS); Big Data Management; Load Balance; Query Performance Improvement; Data Integrity and Fault Tolerance; Efficient Big Data in Healthcare; Dynamic Data Distribution; Healthcare Information Systems; Partitioning Algorithms; Performance Evaluation in Databases
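The idea of routing records by an MD5 hash of the patient ID, as described in the abstract above, might look roughly like the following sketch. The abstract does not give the authors' actual scheme, so the function name `partition_for`, the ID format, the partition count of 8, and the use of plain modulo hashing (rather than a consistent-hashing ring) are all assumptions.

```python
import hashlib
from collections import Counter


def partition_for(patient_id: str, num_partitions: int) -> int:
    """Map a patient ID to a partition number via MD5 (hypothetical scheme)."""
    digest = hashlib.md5(patient_id.encode("utf-8")).digest()
    # Use the first 8 bytes of the 16-byte digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % num_partitions


# Check how evenly 10,000 synthetic IDs spread over 8 partitions.
counts = Counter(partition_for(f"P{i:06d}", 8) for i in range(10_000))
for part in sorted(counts):
    print(part, counts[part])
```

The same ID always maps to the same partition, so a point lookup touches one partition only. With plain modulo hashing, changing the partition count remaps most keys, which a production design would need to handle separately.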
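Research on Key Technologies for Database System Performance Optimization: Based on Cache Design Concepts and Query Optimization Techniques
10
Authors: 李虎, 赵倩, 高磊 《科学与信息化》 2025, No. 2, pp. 70-72 (3 pages)
This article explores key technologies for database system performance optimization, in particular cache design concepts and query optimization techniques. Focusing on cache optimization, memory optimization, and efficient query algorithms, it analyzes in detail the implementation and optimization of sequential search, B-tree search, and hash search algorithms. The results show that optimizing cache configuration and memory management can effectively improve the data access and processing speed of a database system, and that carefully designed query algorithms can significantly improve query efficiency and system response time. The study provides a theoretical basis and technical outlook for the future direction of database technology.
Keywords: database system; performance optimization; cache optimization; query optimization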
PORLES: A Parallel Object Relational Database System
11
Authors: Sun Yong-qiang, Xu Shu-ting, Zhu Feng-hua, Lai Shu-hua (Department of Computer Science and Engineering, Shanghai Jiaotong University, Shanghai 200030, China) 《Wuhan University Journal of Natural Sciences》 CAS 2001, No. Z1, pp. 100-109 (10 pages)
We developed a parallel object relational DBMS named PORLES. It uses the BSP model as its parallel computing model and monoid calculus as the basis of its data model. In this paper, we introduce its data model, parallel query optimization, transaction processing system, and parallel access method in detail.
Keywords: parallel object relational database; BSP model; data model; query optimization
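An Operator Parallel Optimization Strategy for Distributed Databases (Cited by: 1)
12
Authors: 刘文洁, 吕靖超 《西北工业大学学报》 EI CAS CSCD PKU Core 2024, No. 3, pp. 453-459 (7 pages)
With the continuous development of network technology, data volumes have grown explosively, and traditional single-node databases are gradually being replaced by distributed databases. Distributed databases solve the problem of large-scale data storage by having nodes work cooperatively, but because of the added inter-node communication overhead, their query efficiency falls short of that of single-node databases. In a distributed architecture, the data on storage nodes serves only as multi-replica redundancy for recovery after system failures and is not exploited to improve query efficiency. To address this, an operator parallel optimization strategy for distributed databases is proposed: key physical operators are split, and the resulting sub-requests are distributed evenly across multiple storage-layer nodes and processed in parallel, thereby reducing query response time. The strategy has been applied in the distributed database CBase, and experiments show that it significantly shortens SQL query time and improves system resource utilization.
Keywords: distributed database; parallel query; query optimization; load balancing; data partitioning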
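The Development and Evolution of GPU Database Implementation Technologies
13
Authors: 刘鹏, 陈红, 张延松, 李翠平 《计算机学报》 EI CAS CSCD PKU Core 2024, No. 11, pp. 2691-2724 (34 pages)
Explosively growing data places higher demands on data storage and processing. As an important branch of databases on new hardware, GPU databases have unique advantages in large capacity and high-performance processing. As representatives of high-performance databases, GPU databases have attracted attention from academia and industry in recent years, and a number of representative research results and landmark products have emerged. GPU database technology has developed along two routes, GPU-accelerated and GPU in-memory, and prototype systems or products exist for both. Although the two routes differ in implementation, their most basic functional components and core technologies are similar: both include query compilation, query optimization, query execution, and storage management. In addition to PCIe, the current mainstream data transfer option, schemes such as NVLink, RDMA, and CXL offer more possibilities for data transfer between processors. Most GPU databases store data in a columnar storage model, while a few (such as PG-Strom) support both storage models. Applying compression on the columnar model reduces storage space and transfer latency. Compression and decompression on a GPU database should account for only a small share of total data processing time. Building and maintaining indexes on a GPU database should not incur large system overhead. JIT compilation, with its short compile time and high efficiency, is the mainstream approach to compilation in GPU databases. Operators have a marked effect on query performance; join, group-by aggregation, and OLAP operators are currently the three most studied types. In most current research, join and group-by aggregation operators are studied together, and join execution is also considered together with table join order. OLAP operators are another heavily studied class, and GPU databases continue to attract researchers' attention with respect to OLAP operators and models. GPU databases have three query processing models: row-at-a-time, column-at-a-time, and vectorized processing; vectorized and columnar processing are more widely used in real systems. With the development of GPU-accelerated database technology, a number of research results on query schemes and query engines for CPU-GPU co-processing models have also appeared. Current research on query optimization in GPU databases has three main parts: multi-table join ordering, query rewriting, and cost models. However, there is not yet a good solution for cost estimation models in GPU databases, and query optimization for GPU databases still has considerable room for research. Transactions have not been well studied in GPU databases; although standalone prototype systems exist, research has not yet made major progress. This paper summarizes existing research results on the key technologies of GPU databases, points out current problems and challenges, and offers an outlook on future research directions.
Keywords: GPU database; data compression; operator optimization; OLAP query; query processing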
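Research on an Adaptive Parallel Query Optimization Method for the Database of the Environmental Control and Life Support Ground Measurement and Control System
14
Authors: 杨京松, 刘孝马, 张震, 吴奥青, 彭卓 《航天医学与医学工程》 2024, No. 6, pp. 367-370 (4 pages)
Long-term ground-test condition monitoring of manned spacecraft environmental control and life support systems continuously accumulates massive amounts of measurement-point data, and storing these data efficiently and querying them quickly is a key problem that must be addressed. To address the low efficiency of the traditional SQL Server multi-table query logic based on the UNION operation when the number of sharded tables is large, the authors designed an adaptive paged parallel query algorithm that adaptively selects the optimal number of query pages according to the volume of data queried, thereby making full use of the system's multithreading for parallel queries. Tests on typical query scenarios show that, compared with the traditional query method, the proposed query optimization method markedly improves query efficiency and greatly reduces query time.
Keywords: environmental control and life support; database; query optimization; adaptive; parallel query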
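Query Cardinality Estimation Based on TreeLSTM (Cited by: 2)
15
Authors: 齐凯阳, 于炯, 何贞贞, 苏子航 《东北师大学报(自然科学版)》 CAS PKU Core 2024, No. 1, pp. 55-64 (10 pages)
To address the problem that traditional database management systems cannot learn the interactions among predicates well and cannot accurately estimate the cardinality of complex queries, a tree-structured long short-term memory neural network (Tree Long Short Term Memory, TreeLSTM) model is proposed to model queries, and the model is used to estimate the cardinality of new queries. The proposed model takes into account the conjunction and disjunction operations contained in query statements, builds subexpressions into a tree structure according to the operator types between predicates, and represents arbitrary logical expressions in a continuous vector space by combining subexpression vectors. The TreeLSTM model improves the performance and accuracy of cardinality estimation by capturing the sequential dependencies among query predicates. TreeLSTM was compared with a histogram-based method and with the learning-based MSCN and TreeRNN methods. Experimental results show that the estimation error of TreeLSTM is 60.41%, 33.33%, and 11.57% lower than that of the histogram, MSCN, and TreeRNN methods, respectively, significantly improving the performance of the cardinality estimator.
Keywords: cardinality estimation; database management system; query optimizer; neural network; long short-term memory network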
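Efficient Collaborative Query Processing Based on Declarative Inference
16
Authors: 邱志林, 寿黎但, 陈珂, 江大伟, 骆歆远, 陈刚 《软件学报》 EI CSCD PKU Core 2024, No. 12, pp. 5558-5581 (24 pages)
With continued progress in deep learning, there is growing interest in extending relational databases with collaborative query processing (CQP) techniques to handle advanced analytical queries involving both structured and unstructured data. State-of-the-art CQP methods use user-defined functions (UDFs) to implement deep neural network (NN) models for processing unstructured data, and relational operations for processing structured data. UDF-based methods simplify query writing and allow users to submit analytical queries as a single SQL statement, but in ad hoc data analysis they require users to manually select a suitable and efficient model according to the desired performance metrics, which is very challenging for users. To solve this problem, a collaborative query processing technique based on declarative inference functions (DIFs) is proposed, which builds a complete collaborative query processing framework by optimizing multiple query implementation paths, including model selection, execution mode, and device binding. Based on the cost model and optimization rules designed in this work, the query processor can compute the cost of different query plans and automatically select the optimal physical query plan. Experimental results on four datasets confirm the effectiveness and efficiency of the proposed DIF-based CQP method.
Keywords: database query optimization; declarative inference function; collaborative query processing; model selection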
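Design and Implementation of a Database Operator Cache for Select and Sort
17
Authors: 蔡万里, 王新硕, 胡卉芪, 蔡鹏, 周烜, 屠要峰 《计算机学报》 EI CAS CSCD PKU Core 2024, No. 9, pp. 2084-2103 (20 pages)
Caching is a common technique for improving query performance in databases. Existing database caching mainly follows two directions: query result caching and storage-layer block caching. Query result caching reuses the final or intermediate results (such as subqueries) of query execution, whereas storage-layer block caching caches the underlying data blocks that a query touches. This paper re-examines the use of caching in query optimization from another angle, "the amount of computation contained in the cache", and uses it as a basis for further classifying database caching approaches. During query execution, a database query is transformed into a set of operations (such as selection and sorting), and operators correspond to these operations. The data output by operators during query processing are intermediate results that embody part of the computation; we cache these data and put them to use. We call this way of caching partial computation operator caching, that is, caching the result of each operation after it executes. Because different queries may contain the same operators performing the same computation on similar data, operator caching has considerable potential for accelerating query execution. The novelty of this paper lies in proposing and studying, from the perspective of the computation contained in the cache, how operator caching can be applied in query optimization. Taking the Filter and Sort operators as examples, it proposes a semantic-tree-based matching algorithm for cache reuse that quickly matches result sets in the cache. In addition, for cases where reusing the cache may degrade query performance, it proposes using a cost-based optimizer to prevent such degradation. Finally, a prototype of the Filter and Sort operator cache is implemented on the open-source analytical database ClickHouse, and extensive experiments are carried out on the proposed scheme. The results show that, compared with block caching and materialized views, the proposed operator cache improves query response speed by up to 9 times and 1.5 times respectively with a local SSD deployment, and by up to 30 times and 2 times respectively in a cloud deployment.
Keywords: database; query execution; query optimization; operator cache; online analytical processing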
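A Survey of Machine Learning-Based Cardinality Estimation Techniques
18
Authors: 岳文静, 屈稳稳, 林宽, 王晓玲 《计算机研究与发展》 EI CSCD PKU Core 2024, No. 2, pp. 413-427 (15 pages)
Cardinality estimation is the foundation and core of the query optimizer in relational database systems. With the development of artificial intelligence, AI techniques have shown superior performance in data processing and in extracting relationships among data. In recent years, machine learning-based cardinality estimation has made significant progress and attracted wide attention from academia. This survey first summarizes the current state of machine learning-based cardinality estimation, then introduces the relevant concepts of cardinality estimation and its feature encoding techniques. It next establishes a taxonomy of cardinality estimation techniques and, on this basis, further divides machine learning-based techniques into three classes: query-driven, data-driven, and hybrid models. It then analyzes the modeling process, typical techniques, and model characteristics of each class, and analyzes and summarizes their application in SQL and NoSQL systems. Finally, it discusses the challenges facing machine learning-based cardinality estimation and future research directions.
Keywords: database; cardinality estimation; machine learning; query optimization; neural network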
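Evaluating Cardinality Estimation Capability for OLAP Queries
19
Authors: 简炜, 胡梓锐, 张蓉 《华东师范大学学报(自然科学版)》 CAS CSCD PKU Core 2024, No. 5, pp. 141-151 (11 pages)
Query optimization can significantly improve the efficiency with which online analytical processing (OLAP) database systems analyze massive educational data, providing fast and accurate data support for intelligent teaching systems. An optimizer mainly comprises three modules: cardinality estimation, plan space enumeration, and the cost model. Cardinality estimation determines the output of the cost model and guides the choice of query plan, so evaluating the optimizer's cardinality estimation module actively promotes the optimization of OLAP database systems. This paper designs and implements an effective workload generation tool that is primary-key driven and constructs diverse data distributions and data correlations. The tool comprises a data generation technique with user-defined correlations, a workload template generation technique based on finite state machines, and a parameter instantiation technique driven by target cardinalities. Experiments were conducted on three databases, OceanBase, TiDB, and PostgreSQL; the problems in the three optimizers were analyzed and recommendations are given.
Keywords: analytical database; query optimization; cardinality estimation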
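Multi-Source Heterogeneous Data Fusion and the Design of a High-Performance Graph Database Query Engine
20
Author: 李坤朋 《移动信息》 2024, No. 2, pp. 185-187 (3 pages)
With the rapid development of the information age, the integration and use of multi-source heterogeneous data has become increasingly important. This paper addresses the problems of multi-source heterogeneous data fusion and the design of a high-performance graph database query engine, and proposes a new approach. It first discusses the importance of data fusion and the challenges encountered during integration, especially when data types, formats, and sources are heterogeneous and diverse. It then introduces the basic principles of high-performance graph databases and designs an efficient query engine according to the requirements of data fusion. A multi-level data fusion model is designed, and corresponding query optimization algorithms are proposed to cope with the complexity and diversity of the data fusion process.
Keywords: multi-source heterogeneous data; data fusion; high-performance graph database; query engine; query optimization algorithm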