Connect public, paid and private patent data with Google Patents Public Datasets

Method for analyzing and processing non-structured data query operating language

Info

Publication number
CN102750354A
CN102750354A CN 201210190832 CN201210190832A CN102750354A CN 102750354 A CN102750354 A CN 102750354A CN 201210190832 CN201210190832 CN 201210190832 CN 201210190832 A CN201210190832 A CN 201210190832A CN 102750354 A CN102750354 A CN 102750354A
Authority
CN
Grant status
Application
Patent type
Prior art keywords
query
language
structured
data
method
Prior art date
Application number
CN 201210190832
Other languages
Chinese (zh)
Other versions
CN102750354B (en )
Inventor
丁贵广
卓安
王建民
黄向东
Original Assignee
清华大学
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Links

Abstract

The invention relates to a method for analyzing and processing a non-structured data query operating language, which belongs to the technical field of management of computer data. According to the method for analyzing and processing a non-structured data query operating language provided by the invention, a structured query language is defined specific to the query of non-structured data, and the language is easy to extend and can be fused with customized query functions like the query language grammar of the conventional relation database. The method comprises the following steps of starting a query module in a key value library, receiving a query language request of a user, analyzing a language and converting into an internal command; calling each functional module in the key value library to execute by using the query module according to the internal command; and returning a result to the user after the command is executed. According to the method, the query module is taken as a core, and a key value library on a bottom layer is accessed in a way of designing a similar SQL (Structured Query Language), so that the user can operate the key value library easily and manage non-structured data.

Description

一种非结构化数据查询操作语言的解析与处理方法 Analytical method of processing unstructured data and query languages

技术领域 FIELD

[0001] 本发明涉及一种非结构化数据查询操作语言的解析与处理方法,属于计算机数据管理技术领域。 [0001] The present invention relates to a method for analysis and processing unstructured data query language, belonging to the technical field of data management computer.

背景技术 Background technique

[0002] 随着互联网等新兴应用的日益丰富以及企业信息化建设的不断发展,出现了大量的非结构化数据。 [0002] With the increasingly rich and the continuous development of enterprise information construction of the emerging applications such as the Internet, there has been a lot of unstructured data. 由于非结构化数据数据类型丰富,结构复杂,没有明确的、统一定义的数据结构约束,加之其海量的数据规模,高度动态的数据特性,多样的应用场景,统一的联合访问需求,使得非结构化数据管理面临巨大挑战。 Since the rich unstructured data types, complex structure, there is no clear and uniform definition of data structure constraints, coupled with the massive size of its data, the data characteristics of highly dynamic, diverse application scenarios, the joint unified access requirements, making non-structural data management faces enormous challenges.

[0003] 传统的关系数据库在处理海量的非结构化数据上难以提出有效的解决方案。 [0003] Traditional relational database on a data processing unstructured mass is difficult to propose an effective solution. 传统数据库的数据模型都是模式优先的逻辑结构,而非结构化数据则是模式滞后的逻辑结构,这使得建立在关系代数基础上的数据管理方法在解决非结构化数据的问题上不再有效。 Traditional database data model is a logical priority mode structure, while unstructured data is a logical structure model lag, which makes the establishment of the algebraic relations on the basis of the data management methods are no longer effective in addressing the issue of unstructured data . 非结构化数据的海量特性也使得传统数据库在性能和扩展性上无能为力。 Mass characteristic makes unstructured data in traditional database inability performance and scalability.

[0004] 新兴的键值库以无模式的方式打破了传统数据库的模式优先逻辑,同时它以键值的方式保证了高速的读写。 [0004] key emerging library modelessly way to break the traditional mode of priority database logic, and it's the key way to ensure high-speed read and write. 现在流行并发展迅速的键值库有!Base、MangoDB> Dynamo和Cassandra等等。 Now the popular and fast-growing value library there! Base, MangoDB> Dynamo and Cassandra and so on. 他们以分布式集群方式保证了海量数据的存储与扩展性,本发明正是基于这样的键值库。 They cluster in a distributed manner to ensure that the massive data storage and scalability, the present invention is based on the key-value library.

[0005] 然而新兴的键值库并没有完善的查询方式和查询语言。 [0005] However, the emerging key repository and there is no perfect ways to search and query language. 如HBase提供了API访问, Cassandra提供了API与一种名为CQL的类SQL语言方式访问。 Such as HBase provides API access, Cassandra provides a SQL-like language called CQL way with API access. 然而他们由于自身数据库的限制,仅能对非结构化数据进行简单的查询与更新,没有提供复杂的分析函数,也没有考虑大容量数据的语言描述方式。 However, due to the limitations of their own database, unstructured data can only perform simple queries and updates, does not provide sophisticated analysis functions, also did not consider language to describe the way large amounts of data. CouchDB与SQLite两创始人联合在试图设计键值库的统一查询语言UnQL,然而目前也仅仅只有雏形,对于非结构化数据的多特征查询这一特点也没有有效考虑。 CouchDB and SQLite two co-founders of trying to design a unified query language UnQL value library, but currently only a prototype only, for multi-feature unstructured data query This feature does not consider valid.

[0006] 从最终用户和应用的角度,非结构化数据查询语言应该解决以下问题: [0006] From the perspective of end users and applications, unstructured data query language should address the following issues:

[0007] (I)支持面向键值库存储的非结构化数据查询; [0007] (I) support for unstructured data store key query;

[0008] 现有的非结构化数据多以存储在键值库中作为海量与高效读写的解决方案,而键值库往往没有提供易用的查询语言。 [0008] Multi-existing unstructured data is stored as a massive and highly efficient solutions in the key literacy library, and libraries often do not provide key-to-use query language.

[0009] (2)能有效解决不同非结构化数据的多种特征的统一查询; [0009] (2) can solve the various features of different uniform unstructured data inquiry;

[0010] 现有的CQL等语言只提供简单的查询功能,无法对非结构化数据进行特征检索。 [0010] Existing languages ​​such as CQL only provide simple query function, not unstructured data retrieval feature. 比如对图像数据进行直方图、颜色等特征检索,对音频进行MFCC特征检索等等。 Image data such as a histogram, color and other characteristics retrieval, retrieval of audio MFCC feature like.

[0011] (3)如何进行有效地数据查询与分析。 [0011] (3) how effectively data query and analysis.

[0012] 传统数据查询仅仅实现索引和简单的统计函数。 [0012] Traditional data query and index only achieve a simple statistical functions. 对于海量的非结构化数据而言,很多结果需要进行数据的分析得出,因此查询语言应该尽可能的支持更多的数据分析函数。 For the vast amounts of unstructured data, the results need to be analyzed a lot of data to draw, so the query language should support more data analysis functions as possible.

发明内容[0013] 本发明的目的是提出一种非结构化数据查询操作语言的解析与处理方法,针对非结构化数据管理领域存在的问题,用一种类似SQL语言的方式来访问底层的键值库,以达到让用户轻松操作键值库来管理非结构化数据的目的。 SUMMARY OF THE INVENTION [0013] The object of the present invention is to provide a method of analysis and processing unstructured data query language, for the presence of unstructured data management problems, in a way similar to the SQL language to access the underlying bond the value of the library, allowing users to easily operate in order to achieve the key aim library to manage unstructured data.

[0014] 本发明提出的非结构化数据管理查询语言的解析和处理方法,包括以下步骤: [0014] The parsing and processing method of the present invention proposed query language Unstructured data management, comprising the steps of:

[0015] (I)启动键值库中的查询模块,查询模块监听用户的查询语言请求; [0015] (I) start key value query library module, the query monitor module requests the user's query language;

[0016] (2)查询模块接收用户的的查询语言请求,对语言进行解析,解析步骤如下: [0017] (2-1)用户端采用查询语言驱动方式连接查询模块,建立用户端与查询模块之间的会话,并保存会话过程中的会话信息,访问查询模块,向查询模块发送查询语言; [0016] (2) a query module receives a user request query language, language parsing, analysis step as follows: [0017] (2-1) using the UE drivingly connected query language query module, a query module and the UE establish session between and save session information during the session, and access the query module sends a query language to query module;

[0018] (2-2)通过查询模块中的解析器,查询模块将用户端发送的查询语言请求转换为内部命令; [0018] (2-2) by querying the parser module, the module will query language query request sent by the client into an internal command;

[0019] (3)对上述内部命令进行判断,若该内部命令为指定本次会话的键值库表的命令,则查询模块保存该指定键值库表的名字,并在后续的命令中默认本次会话在该键值库表下执行;若查询语言中的任意位置具有一个相似关键字,则查询模块将该内部命令转交给键值库中的索引调用模块;若查询语言中的任意位置具有一个函数关键字,则查询模块将该内部命令转交给键值库中的函数调用模块; [0019] (3) of the inner command is determined, if the internal command for the specified database table key of this session command, then the query module stores the designated key database table name, and a default in subsequent command the session key performed at the database table; if anywhere in the query language having a similar keyword, the query module is transferred to the internal command key index calling module library; if anywhere in a query language has a function key, the query module inside the command module function calls forwarded to key library;

[0020] (4)键值库中的查询模块根据内部命令,调用键值库中的各功能模块执行内部命令,具体过程如下: [0020] (4) a query key library module according to an internal command to invoke the function modules execute key library internal command procedure is as follows:

[0021] (4-1)若内部命令为结构化查询命令,则采用键值库中的服务器执行命令; [0021] (4-1) When the internal command Structured Query command is used in the key database server executes the command;

[0022] (4-2)若内部命令为创建键值库索引命令,则采用键值库中的服务器执行命令; [0022] (4-2) If the internal command to create a database index key command key repository server is used to perform the command;

[0023] (4-3)若内部命令为创建非键值库索引命令,则构建一个索引实现库,并调用索引实现库执行命令; [0023] (4-3) If the internal command to create a database index of non-key command, you build a library that implements the index, the index achieved library and calling execute the command;

[0024] (4-4)若内部命令为运行数据函数分析命令,则构建一个数据函数分析模块,并调用数据函数分析模块执行命令,查询模块获取命令的执行状态和执行结果; [0024] (4-4) When the command to run inside the data analysis function command, the function to build a data analysis module, and invoke the function data analysis module execution command, a query execution state acquisition command module and an execution result;

[0025] (4-5)若内部命令为大数据传输,则使用独立的数据传输流等待与用户端连接,完成连接后,通过数据传输流进行文件传输;传输结束后,查询模块保存传输的文件,并保持用户端与查询模块之间的会话; [0025] (4-5) When the internal command large data transfers, using independent data transport stream is connected to the UE waits for, after the completion of the connection, file transfer through the data transmission stream; After transfer, the query module stores transmission document, and maintaining a session between a client and a query module;

[0026] (4-6)若内部命令是自定义创建索引、查询索引和建立函数,自定义创建索引和查询索引的执行命令,则通过一个关键字标明索引的创建参数和索引创建类型,完成索引的创建和查询;对于自定义建立函数的执行命令,查询模块根据查询语言中的函数关键字和函数的变长参数,从查询模块的配置文件中列出的函数支持类型中,选择相应的函数,完成函数的建立; [0026] (4-6) If the internal order is custom-created index, query and build an index function, create custom indexing and query index of execution command, by creating parameters and index marked a key index to create the type of finish index creation and query; for the custom build command execution function, the query module based on variable-length parameter query language function key and function, the function listed in the configuration file query module in the type of support, select the appropriate function, complete the establishment of the function;

[0027] (4-7)若内部命令为多种类型索引的联合查询,则查询模块对多种类型索引进行分拆,得到各个类型索引的查询子句,根据查询子句,读取查询模块的配置文件中不同索引查询的优先级,调整多个查询子句的查询顺序,进行查询; [0027] (4-7) When a plurality of types of internal commands indexed federated query, the query module index split plurality of types, each type of query to obtain an index of clause clause of the query, the query module reads index profile different priority of the query, the query sequence to adjust the plurality of query clauses, query;

[0028] (5)查询模块向用户端返回查询结果。 [0028] (5) The query module returns the query results to the client.

[0029] 本发明提出的非结构化数据管理查询语言的解析和处理方法,针对非结构化数据的查询,定义了结构化的查询语言,与传统关系数据库的查询语言语法类似,该语言易扩展并可融合自定义的查询函数。 [0029] The method for parsing and processing unstructured data management query language proposed by the invention, for the query unstructured data, defines a structured query language, and relational database query language syntax similar to the conventional, easy to expand the language and integration of custom query functions. 本发明方法的核心是查询模块,通过设计接口使查询模块与键值库松耦合,可以方便的将现有键值库的查询模块移植到其他键值库中;本发明方法提供了多种包括自定义在内的特征检索,因此可以直接管理多种非结构化数据;本发明方法可以支持大数据(如文件)的读写操作,提供支持数据分析等分布式函数的执行操作和可以配置的查询优先级设置等特点,保证高效的管理非结构化数据。 The core of the present invention is a method of querying module queries by designing the interface module that the key repository loosely coupled, can the existing key database query module easily be ported to other key database; The present invention provides a variety of methods including custom features, including the retrieval, it is possible to manage a variety of unstructured data directly; method of the present invention can support read-write data (e.g., file) operation, support the distributed data analysis function can be configured to perform operations and query priority setting and so on, to ensure efficient management of unstructured data.

具体实施方式 detailed description

[0030] 本发明提出的非结构化数据管理查询语言的解析和处理方法,包括以下步骤: [0030] The parsing and processing method of the present invention proposed query language Unstructured data management, comprising the steps of:

[0031] (I)启动键值库中的查询模块,查询模块监听用户的查询语言请求; [0031] (I) start key value query library module, the query monitor module requests the user's query language;

[0032] (2)查询模块接收用户的的查询语言请求,对语言进行解析,解析步骤如下: [0032] (2) a query module receives a user request query language, language parsing, the following analysis step:

[0033] (2-1)用户端采用查询语言驱动方式连接查询模块,建立用户端与查询模块之间的会话,并保存会话过程中的会话信息,访问查询模块,向查询模块发送查询语言; [0033] (2-1) using the client query language query module drivingly connected, to establish a session between a client and a query module, and stores the session information in the session, access the query module, send to the query module the query language;

[0034] (2-2)通过查询模块中的解析器,查询模块将用户端发送的查询语言请求转换为内部命令; [0034] (2-2) by querying the parser module, the module will query language query request sent by the client into an internal command;

[0035] (3)对上述内部命令进行判断,若该内部命令为指定本次会话的键值库表的命令,则查询模块保存该指定键值库表的名字,并在后续的命令中默认本次会话在该键值库表下执行;若查询语言中的任意位置具有一个相似(like)关键字,则查询模块将该内部命令转交给键值库中的索引调用模块;若查询语言中的任意位置具有一个函数(function)关键字,则查询模块将该内部命令转交给键值库中的函数调用模块; [0035] (3) of the inner command is determined, if the internal command for the specified database table key of this session command, then the query module stores the designated key database table name, and a default in subsequent command the session key performed at the database table; if anywhere in the query language having a similar (like) the keyword, the query module the commands transmitted to the internal key index calling module library; if query language anywhere has a function (function) key, the query module inside the command module function calls forwarded to key library;

[0036] (4)键值库中的查询模块根据内部命令,调用键值库中的各功能模块执行内部命令,具体过程如下: [0036] (4) a query key library module according to an internal command to invoke the function modules execute key library internal command procedure is as follows:

[0037] (4-1)若内部命令为结构化查询命令,如创建表、创建列族或在列族中添加删除数据,则采用键值库中的服务器执行命令; [0037] (4-1) When the internal command Structured Query commands, such as creating a table, create or add a column group delete data in the column group, is used in the key repository server execution command;

[0038] (4-2)若内部命令为创建键值库索引命令,则采用键值库中的服务器执行命令; [0038] (4-2) If the internal command to create a database index key command key repository server is used to perform the command;

[0039] (4-3)若内部命令为创建非键值库索引命令,如图片的高维索引、文本的全文索弓I,则构建一个索引实现库,并调用索引实现库执行命令; [0039] (4-3) If the internal command to create a database index of non-key commands, such as pictures of high-dimensional indexing, full text search bow I, is to build a library that implements the index, the index achieved library and calling execute the command;

[0040] (4-4)若内部命令为运行数据函数分析命令,则构建一个数据函数分析模块,并调用数据函数分析模块执行命令,查询模块获取命令的执行状态和执行结果; [0040] (4-4) When the command to run inside the data analysis function command, the function to build a data analysis module, and invoke the function data analysis module execution command, a query execution state acquisition command module and an execution result;

[0041] (4-5)若内部命令为大数据传输,则使用独立的数据传输流等待与用户端连接,完成连接后,通过数据传输流进行文件传输;传输结束后,查询模块保存传输的文件,并保持用户端与查询模块之间的会话; [0041] (4-5) When the internal command large data transfers, using independent data transport stream is connected to the UE waits for, after the completion of the connection, file transfer through the data transmission stream; After transfer, the query module stores transmission document, and maintaining a session between a client and a query module;

[0042] (4-6)若内部命令是自定义创建索引、查询索引和建立函数,本发明提出的查询语言通过半开放式关键字设置达到多种索引创建与查询、多种函数支持的效果;对于自定义创建索引和查询索引的执行命令,则通过一个关键字(例如with)标明索引的创建参数和索引创建类型,完成索引的创建和查询;对于自定义建立函数的执行命令,查询模块根据查询语言中的函数关键字和函数的变长参数,从查询模块的配置文件中列出的函数支持类型中,选择相应的函数,完成函数的建立; [0042] (4-6) If the internal order is custom-created index, query and build an index function, query language proposed by the invention by a semi-open keyword is set to achieve a variety of index creation and query, the effect of a variety of support functions ; to create custom indexing and query execution command index is created through a keyword (eg with) to create parameters indicate the type of index and index complete index creation and query; for the custom build command execution function, the query module the variable-length argument function key and the query language function, a function of the type listed in the support profile query module, select the appropriate function, to complete the establishment of the function;

[0043] (4-7)若内部命令为多种类型索引的联合查询,在较为复杂的查询语句中,会同时存在键值库默认索引查询(列值或者键值的过滤)、多个自定义索引查询的联合查询;则查询模块对多种类型索引进行分拆,得到各个类型索引的查询子句,根据查询子句,读取查询模块的配置文件中不同索引查询的优先级,调整多个查询子句的查询顺序,进行查询;[0044] (5)查询模块向用户端返回查询结果。 [0043] (4-7) When a plurality of types of internal commands indexed federated query in a more complex query, the database may exist a default index query key (column key-value or filtration), since a plurality of combined query index query definition; then the query module index split more types, each type of query to obtain an index of clause clause of the query, the query module reads the configuration file index different priority queries, multiple adjustments query clauses query sequence, the query; [0044] (5) the query module returning query results to the end user.

Claims (1)

1. 一种非结构化数据管理查询语言的解析和处理方法,其特征在于该方法包括以下步骤: (1)启动键值库中的查询模块,查询模块监听用户的查询语言请求; (2)查询模块接收用户的的查询语言请求,对语言进行解析,解析步骤如下: (2-1)用户端采用查询语言驱动方式连接查询模块,建立用户端与查询模块之间的会话,并保存会话过程中的会话信息,访问查询模块,向查询模块发送查询语言; (2-2)通过查询模块中的解析器,查询模块将用户端发送的查询语言请求转换为内部命令; (3)对上述内部命令进行判断,若该内部命令为指定本次会话的键值库表的命令,则查询模块保存该指定键值库表的名字,并在后续的命令中默认本次会话在该键值库表下执行;若查询语言中的任意位置具有一个相似关键字,则查询模块将该内部命令转交给键值库中的索引调用 A method for parsing and processing unstructured data management query language, characterized in that the method comprises the steps of: (1) Start key library query module, a query module monitor the user's query language request; (2) a query module receives a user request query language, language parsing, analysis step is as follows: (2-1) a client query language using a query module drivingly connected, to establish a session between a client and a query module and save the session the session information, access the query module, a query module sends a query language; (2-2) a query by the parser module, the module will query language query request sent by the client into an internal command; (3) on the inner command is determined, if the internal command for the specified database table key of this session command, then the query module stores the designated key database table name, and the default this session key in a database table in the subsequent command under execution; if the query language in any position has a similar keyword, the query module command transferred to the internal call key index library 块;若查询语言中的任意位置具有一个函数关键字,则查询模块将该内部命令转交给键值库中的函数调用模块; (4)键值库中的查询模块根据内部命令,调用键值库中的各功能模块执行内部命令,具体过程如下: (4-1)若内部命令为结构化查询命令,则采用键值库中的服务器执行命令; (4-2)若内部命令为创建键值库索引命令,则采用键值库中的服务器执行命令; (4-3)若内部命令为创建非键值库索引命令,则构建一个索引实现库,并调用索引实现库执彳了命令; (4-4)若内部命令为运行数据函数分析命令,则构建一个数据函数分析模块,并调用数据函数分析模块执行命令,查询模块获取命令的执行状态和执行结果; (4-5)若内部命令为大数据传输,则使用独立的数据传输流等待与用户端连接,完成连接后,通过数据传输流进行文件传输;传输结束后,查询 Block; if anywhere in the query language with a function key, then the query module function calls forwarded to the internal command key module library; (4) the key database query module according to an internal command, call key each library function module performs an internal command, the specific process is as follows: (4-1) If the internal command structured query command is used in the key repository server Run; (4-2) to create a bond if internal command library index value command, using the key repository server executes the command; (4-3) if internal commands to create a database index of non-key command, you build a library that implements the index, the index achieved library and call the command execution left foot; (4-4) If the internal command to run the data analysis function command, then build a data analysis function module and call data analysis function module execute the command, the query module obtains the execution status of the command execution and results; (4-5) If the internal large data transmission command, using independent data transport stream and the client waits for the connection, after the completion of the connection, file transfer through the data transmission stream; after transfer, the query 块保存传输的文件,并保持用户端与查询模块之间的会话; (4-6)若内部命令是自定义创建索引、查询索引和建立函数,自定义创建索引和查询索引的执行命令,则通过一个关键字标明索引的创建参数和索引创建类型,完成索引的创建和查询;对于自定义建立函数的执行命令,查询模块根据查询语言中的函数关键字和函数的变长参数,从查询模块的配置文件中列出的函数支持类型中,选择相应的函数,完成函数的建立; (4-7)若内部命令为多种类型索引的联合查询,则查询模块对多种类型索引进行分拆,得到各个类型索引的查询子句,根据查询子句,读取查询模块的配置文件中不同索引查询的优先级,调整多个查询子句的查询顺序,进行查询; (5)查询模块向用户端返回查询结果。 Save the file transfer block and held session between the client module and the query; (4-6) If the internal command is a custom created indexes, and establish the function query index, and the index creating custom query index execution command, then create a type parameter by creating a keyword index and an index indicating the complete index creation and query; for the custom build command execution function, the query module based on variable-length argument function key and function of the query language, from the query module functions supported types listed in the profile, select the appropriate function, to complete the establishment of the function; (4-7) when a plurality of types of internal commands indexed federated query, the query module to a plurality of types split index to give each type of index query clauses, in accordance with clause query, query module configuration file reading different indexes priority of the query, the query sequence to adjust the plurality of query clauses, query; (5) to a user query module end return query results.
CN 201210190832 2012-06-11 2012-06-11 Method for analyzing and processing non-structured data query operating language CN102750354B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN 201210190832 CN102750354B (en) 2012-06-11 2012-06-11 Method for analyzing and processing non-structured data query operating language

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN 201210190832 CN102750354B (en) 2012-06-11 2012-06-11 Method for analyzing and processing non-structured data query operating language

Publications (2)

Publication Number Publication Date
CN102750354A true true CN102750354A (en) 2012-10-24
CN102750354B CN102750354B (en) 2014-08-20

Family

ID=47030539

Family Applications (1)

Application Number Title Priority Date Filing Date
CN 201210190832 CN102750354B (en) 2012-06-11 2012-06-11 Method for analyzing and processing non-structured data query operating language

Country Status (1)

Country Link
CN (1) CN102750354B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425779A (en) * 2013-08-19 2013-12-04 曙光信息产业股份有限公司 Data processing method and data processing device

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7194483B1 (en) * 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US20080201290A1 (en) * 2007-02-16 2008-08-21 International Business Machines Corporation Computer-implemented methods, systems, and computer program products for enhanced batch mode processing of a relational database
CN102129469A (en) * 2011-03-23 2011-07-20 华中科技大学 Virtual experiment-oriented unstructured data accessing method
CN102298641A (en) * 2011-09-14 2011-12-28 清华大学 Files with the structured data storage method based on a unified key library

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7194483B1 (en) * 2001-05-07 2007-03-20 Intelligenxia, Inc. Method, system, and computer program product for concept-based multi-dimensional analysis of unstructured information
US20080201290A1 (en) * 2007-02-16 2008-08-21 International Business Machines Corporation Computer-implemented methods, systems, and computer program products for enhanced batch mode processing of a relational database
CN102129469A (en) * 2011-03-23 2011-07-20 华中科技大学 Virtual experiment-oriented unstructured data accessing method
CN102298641A (en) * 2011-09-14 2011-12-28 清华大学 Files with the structured data storage method based on a unified key library

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
田万鹏 等: "一种基于特征的非结构化数据演化管理建模框架", 《计算机研究与发展》, no. 47, 31 December 2010 (2010-12-31), pages 394 - 399 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103425779A (en) * 2013-08-19 2013-12-04 曙光信息产业股份有限公司 Data processing method and data processing device

Also Published As

Publication number Publication date Type
CN102750354B (en) 2014-08-20 grant

Similar Documents

Publication Publication Date Title
Cattell Scalable SQL and NoSQL data stores
Chong et al. An efficient SQL-based RDF querying scheme
Ives et al. An XML query engine for network-bound data
US20080172360A1 (en) Querying data and an associated ontology in a database management system
US20050021502A1 (en) Data federation methods and system
US20040002939A1 (en) Schemaless dataflow within an XML storage solution
US20140172914A1 (en) Graph query processing using plurality of engines
US20090138498A1 (en) Rdf store database design for faster triplet access
Alsubaiee et al. AsterixDB: A scalable, open source BDMS
US20070061318A1 (en) System and method of data source agnostic querying
US20130124545A1 (en) System and method implementing a text analysis repository
DeWitt et al. Split query processing in polybase
CN102799622A (en) Distributed structured query language (SQL) query method based on MapReduce expansion framework
Campinas et al. Introducing RDF graph summary with application to assisted SPARQL formulation
CN101021875A (en) Object-oriented data bank access method and system
Umbrich et al. Comparing data summaries for processing live queries over linked data
Ives et al. Efficient query processing for data integration
CN101216840A (en) Data enquiry method and data enquiry system
US20080275907A1 (en) Scalable algorithms for mapping-based xml transformation
WO2002021339A2 (en) Method and apparatus for xml data storage, query rewrites, visualization, mapping and references
CN1965316A (en) Index for accessing XML data
CN103678665A (en) Heterogeneous large data integration method and system based on data warehouses
US20140136473A1 (en) Partial merge
US8918388B1 (en) Custom data warehouse on top of mapreduce
CN102129469A (en) Virtual experiment-oriented unstructured data accessing method

Legal Events

Date Code Title Description
C06 Publication
C10 Entry into substantive examination
C14 Grant of patent or utility model