KR102127785B1

KR102127785B1 - Method and apparatus for providing efficient indexing and computer program included in computer readable medium therefor

Info

Publication number: KR102127785B1
Application number: KR1020200030277A
Authority: KR
Inventors: 이주현
Original assignee: 주식회사 티맥스티베로
Priority date: 2020-03-11
Filing date: 2020-03-11
Publication date: 2020-06-29
Also published as: KR20200029431A

Abstract

본 개시내용의 일 실시예에 따라 데이터베이스 관리 시스템(DBMS)에서 수행되는 방법이 제공된다. 상기 방법은 데이터베이스 관리 시스템에서의 모니터링 알고리즘에 따른 모니터링 결과 정보에 기초하여, 데이터 저장 위치를 포인팅하기 위한 데이터베이스 인덱스 구조를 생성하는 단계를 포함할 수 있다. 데이터베이스 인덱스 구조는, 제 1 레벨의 서브 인덱스 구조 및 상기 제 1 레벨 보다 하위 레벨인 제 2 레벨의 서브 인덱스 구조를 적어도 부분적으로 포함하며, 제 2 레벨의 서브 인덱스 구조는 복수의 컬럼들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 1 컬럼은 키 값들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 2 컬럼은, 데이터 블록의 로우에 대응되는 ROWID 값들을 포함하며, 그리고 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 3 컬럼은 상기 모니터링 결과 정보에 기초하여 설정되는 최적화 인덱스 값들을 포함할 수 있다.A method performed in a database management system (DBMS) is provided in accordance with one embodiment of the present disclosure. The method may include generating a database index structure for pointing the data storage location, based on the monitoring result information according to the monitoring algorithm in the database management system. The database index structure includes at least partially a sub index structure of the first level and a sub index structure of the second level that is lower than the first level, and the sub index structure of the second level includes a plurality of columns, The first column of the plurality of columns of the second-level sub-index structure includes key values, and the second column of the plurality of columns of the second-level sub-index structure has a ROWID value corresponding to a row of the data block. The third column of the plurality of columns of the second-level sub-index structure may include optimization index values that are set based on the monitoring result information.

Description

METHOD AND DEVICE TO PROVIDE EFFECTIVE INDEXING AND COMPUTER PROGRAM INCLUDED IN COMPUTER-readable media {METHOD AND APPARATUS FOR PROVIDING EFFICIENT INDEXING AND COMPUTER PROGRAM INCLUDED IN COMPUTER READABLE MEDIUM THEREFOR}

본 개시내용은 데이터베이스 인덱싱 기법에 관한 것으로서, 보다 구체적으로 DB 성능을 향상시키기 위한 인덱스 생성 기법에 관한 것이다.The present disclosure relates to a database indexing technique, and more particularly, to an index creation technique for improving DB performance.

기업의 비즈니스는 폭발적인 데이터의 증가와 다양한 환경 및 플랫폼의 등장으로 빠르게 확장되고 있다. 새로운 비즈니스 환경이 도래함에 따라서, 보다 더 효율적이고 유연한 데이터 서비스와 정보의 처리, 데이터 관리 기능이 필요하게 되었다. 이러한 변화에 맞춰서 기업 비지니스 구현의 기반이되는 고성능, 고가용성 및 확장성의 문제를 해결하기 위한 데이터베이스에 대한 연구가 계속되고 있다.Corporate business is rapidly expanding with the explosive growth of data and the emergence of various environments and platforms. With the advent of the new business environment, more efficient and flexible data services, information processing, and data management functions are needed. In response to these changes, research into databases to solve the problems of high performance, high availability, and scalability, which are the foundations of corporate business implementation, is continuing.

데이터베이스 관리 시스템(DBMS)에서 데이터들은 데이터 저장소에 저장될 수 있다. 관계형 데이터베이스 관리 시스템(RDBMS)에서는 이러한 데이터 저장소는 테이블로 지칭될 수 있다. 이러한 테이블은 하나 이상의 로우(row)들을 포함하고 하나 이상의 로우들 각각은 하나 이상의 컬럼들을 포함할 수 있다. In a database management system (DBMS), data can be stored in a data store. In a relational database management system (RDBMS), such a data store may be referred to as a table. Such a table may include one or more rows, and each of the one or more rows may include one or more columns.

데이터베이스가 수많은 양의 데이터를 포함하고 있는 경우, 사용자가 관심있어 하는 데이터를 리트리브하기 위한 쿼리를 수행하는데 있어서 상대적으로 긴 시간이 소요될 수 있다. 데이터베이스가 쿼리에 응답하는데 시간이 많이 소요되는 경우에는 데이터베이스의 성능에 있어서 악영향을 미칠 수 있다. 따라서, 해당 기술 분야에서는 쿼리에 응답하는데 처리되는 시간을 줄이기 위한, 즉, 데이터베이스 관리 시스템의 성능을 증대시키기 위한, 다양한 기법들이 연구되고 있다. When the database contains a large amount of data, it may take a relatively long time to perform a query to retrieve data of interest to the user. If the database takes a long time to respond to a query, it may adversely affect the performance of the database. Accordingly, various techniques have been studied in the related art to reduce processing time in response to a query, that is, to increase the performance of a database management system.

데이터베이스로부터의 데이터 리트리브 속도를 증대시키는데 있어서, 인덱싱 기법이 활용될 수 있다. 이러한 인덱스는 데이터베이스 분야에 있어서 테이블에 대한 동작 속도를 높여주는 데이터 구조를 의미할 수 있다. 인덱스를 사용하는 경우, 데이터 리트리브에 소요되는 시간을 줄여줄 뿐만 아니라 데이터 리트리브에 소모되는 자원의 양 또한 감소시킬 수 있다. In increasing the speed of data retrieval from the database, indexing techniques can be utilized. Such an index may refer to a data structure that increases an operation speed of a table in the database field. In the case of using an index, not only can the time required for data retrieval be reduced, but also the amount of resources consumed for data retrieval can be reduced.

일반적으로 인덱스 구조는 트리 구조로 이루어져 있다. 트리 구조는 그래프의 일종으로 노드(node)와 노드를 가리키는 포인터로 구성되어있다. 여러 노드가 한 노드를 가리킬 수 없지만 한 노드가 여러 노드를 가리킬 수는 있다. 방대한 양의 저장된 자료를 검색해야 하는 경우 자료를 일일히 비교하는 방식은 비효율적이다. 트리 구조를 이용하여 인덱싱을 하여 자료를 정렬된 상태로 보관하면 효율적으로 자료를 검색할 수 있다. 트래버스(traverse)는 인덱스를 탐색하는 일련의 과정을 의미한다. 트래버스는 루트 노드에서 시작하여 탐색 대상의 값과 구분 값을 비교하여 다음 포인터를 찾아나가는 과정으로 진행된다. 트래버스를 통해 탐색 대상의 값에 해당하는 인덱스를 발견하면 트래버스는 종료된다. In general, the index structure consists of a tree structure. The tree structure is a kind of graph, consisting of nodes and pointers to nodes. Multiple nodes cannot point to one node, but one node can point to multiple nodes. If you need to search a huge amount of stored data, comparing data one by one is inefficient. Indexing using a tree structure allows you to efficiently search for data by storing the data in sorted state. Traverse refers to a series of processes for searching an index. Traverse starts from the root node and proceeds by comparing the value of the search target and the distinguished value and finding the next pointer. When the index corresponding to the value of the search target is found through the traverse, the traverse ends.

인덱스는 크게 B+트리(Balanced Tree) 인덱스 및 비트맵(bitmap) 인덱스로 분류될 수 있다.The index may be largely classified into a B+tree (Balanced Tree) index and a bitmap index.

B+트리 인덱스는 루트(root) 블록, 브랜치(branch) 블록 및 리프(leaf) 블록으로의 계층형태의 인덱스 기법을 의미할 수 있다. B+트리 인덱스에서, 리프 블록이 데이터 블록의 해당 로우에 접근할 수 있는 ROWID를 가지고 있으며 그리고 브랜치 블록 및 루트 블록은 각 하위 단계에 접근할 수 있는 키값(예컨대, 블록의 주소값)을 가지고 있다. The B+tree index may refer to a hierarchical indexing technique into a root block, branch block, and leaf block. In the B+tree index, the leaf block has a ROWID that can access the corresponding row of the data block, and the branch block and the root block have a key value (eg, the address value of the block) that can access each lower level.

비트맵 인덱스는 컴퓨터에서 사용하는 최소 단위인 비트를 이용하여 컬럼값을 저장하고 이를 이용하여 ROWID를 자동으로 생성하는 인덱스 기법을 의미할 수 있다. The bitmap index may refer to an indexing technique that automatically stores a column value using a bit, which is the smallest unit used in a computer, and automatically generates a ROWID using it.

B+트리 인덱싱 기법 및 비트맵 인덱싱 기법 각각은 그 구조적 특성으로 인하여 장점 및 단점이 존재할 수 있다. 따라서, 방대한 데이터를 처리하는 상황에서 DBMS의 성능을 향상시키기 위한 효율적인 인덱싱 기법에 대한 당업계의 수요가 존재한다.Each of the B+ tree indexing technique and the bitmap indexing technique may have advantages and disadvantages due to its structural characteristics. Therefore, there is a need in the art for an efficient indexing technique to improve the performance of the DBMS in the situation of processing massive data.

(1) 미국 등록 특허 US5903888(1) US registered patent US5903888 (2) 미국 등록 특허 US6278992(2) US registered patent US6278992 (3) 일본 특허 공보 JP2004326404(3) Japanese Patent Publication JP2004326404

본 개시내용의 목적은, 전술한 당업계에서의 수요를 충족시키기 위해, DBMS에서의 쿼리 처리 시간을 축소시키기 위함이다.The purpose of the present disclosure is to reduce the query processing time in the DBMS in order to meet the demands in the art described above.

본 개시내용의 목적은, 전술한 당업계에서의 수요를 충족시키기 위하여, DBMS에서 효율적인 인덱싱 기법을 제공하기 위함이다.The purpose of the present disclosure is to provide an efficient indexing technique in DBMS in order to meet the needs in the art.

본 개시내용의 일 실시예에 따라, 인코딩된 명령들을 포함하는 컴퓨터-판독가능 저장 매체에 저장된 컴퓨터 프로그램으로서, 상기 컴퓨터 프로그램은 하나 이상의 프로세서들에 의해 실행되는 경우, 상기 하나 이상의 프로세서들로 하여금 데이터의 검색 성능을 향상시키기 위하여 이하의 동작을 수행하도록 하며, 상기 동작은: 데이터베이스 관리 시스템(DBMS)에서의 모니터링 알고리즘에 따른 모니터링 결과 정보에 기초하여, 데이터 저장 위치를 포인팅(pointing)하기 위한 데이터베이스 인덱스 구조를 생성하는 동작을 포함할 수 있다. 데이터베이스 인덱스 구조는, 제 1 레벨의 서브 인덱스 구조 및 상기 제 1 레벨 보다 하위 레벨인 제 2 레벨의 서브 인덱스 구조를 적어도 부분적으로 포함하며, 제 2 레벨의 서브 인덱스 구조는 복수의 컬럼(column)들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 1 컬럼은 키 값들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 2 컬럼은, 데이터 블록의 로우(row)에 대응되는 ROWID 값들을 포함하며, 그리고 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 3 컬럼은 상기 모니터링 결과 정보에 기초하여 설정되는 최적화(optimized) 인덱스 값들을 포함할 수 있다.A computer program stored in a computer-readable storage medium including encoded instructions, according to one embodiment of the present disclosure, wherein the computer program, when executed by one or more processors, causes the one or more processors to generate data. The following operations are performed to improve the search performance of the database, and the operations are: based on monitoring result information according to a monitoring algorithm in a database management system (DBMS), a database index for pointing a data storage location It may include the operation of creating a structure. The database index structure includes at least partially a sub index structure of the first level and a sub index structure of the second level that is lower than the first level, and the sub index structure of the second level includes a plurality of columns. The first column of the plurality of columns of the second-level sub-index structure includes key values, and the second column of the plurality of columns of the second-level sub-index structure includes a row of a data block. ), and the third column of the plurality of columns of the second-level sub-index structure may include optimized index values that are set based on the monitoring result information.

본 개시내용의 일 실시예에 따라, 데이터베이스 관리 시스템(DBMS)에서 수행되는 방법이 제공된다. 상기 방법은, 데이터베이스 관리 시스템에서의 모니터링 알고리즘에 따른 모니터링 결과 정보에 기초하여, 데이터 저장 위치를 포인팅하기 위한 데이터베이스 인덱스 구조를 생성하는 단계를 포함할 수 있다. 데이터베이스 인덱스 구조는, 제 1 레벨의 서브 인덱스 구조 및 상기 제 1 레벨 보다 하위 레벨인 제 2 레벨의 서브 인덱스 구조를 적어도 부분적으로 포함하며, 제 2 레벨의 서브 인덱스 구조는 복수의 컬럼들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 1 컬럼은 키 값들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 2 컬럼은, 데이터 블록의 로우에 대응되는 ROWID 값들을 포함하며, 그리고 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 3 컬럼은 상기 모니터링 결과 정보에 기초하여 설정되는 최적화 인덱스 값들을 포함할 수 있다.In accordance with one embodiment of the present disclosure, a method performed in a database management system (DBMS) is provided. The method may include generating a database index structure for pointing the data storage location based on the monitoring result information according to the monitoring algorithm in the database management system. The database index structure includes at least partially a sub index structure of the first level and a sub index structure of the second level that is lower than the first level, and the sub index structure of the second level includes a plurality of columns, The first column of the plurality of columns of the second-level sub-index structure includes key values, and the second column of the plurality of columns of the second-level sub-index structure has a ROWID value corresponding to a row of the data block. The third column of the plurality of columns of the second-level sub-index structure may include optimization index values that are set based on the monitoring result information.

본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치가 개시된다. 상기 데이터베이스 관리 장치는 데이터베이스 관리 시스템(DBMS)에서의 모니터링 알고리즘에 따른 모니터링 결과 정보에 기초하여, 데이터 저장 위치를 포인팅하기 위한 데이터베이스 인덱스 구조를 생성하는 프로세서; 및 상기 생성된 데이터베이스 인덱스 구조를 저장하는 저장부를 포함할 수 있다. 데이터베이스 인덱스 구조는, 제 1 레벨의 서브 인덱스 구조 및 상기 제 1 레벨 보다 하위 레벨인 제 2 레벨의 서브 인덱스 구조를 적어도 부분적으로 포함하며, 제 2 레벨의 서브 인덱스 구조는 복수의 컬럼들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 1 컬럼은 키 값들을 포함하며, 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 2 컬럼은, 데이터 블록의 로우에 대응되는 ROWID 값들을 포함하며, 그리고 제 2 레벨의 서브 인덱스 구조의 복수의 컬럼들 중 제 3 컬럼은 상기 모니터링 결과 정보에 기초하여 설정되는 최적화 인덱스 값들을 포함할 수 있다.Disclosed is a database management apparatus according to an embodiment of the present disclosure. The database management apparatus includes a processor that generates a database index structure for pointing a data storage location based on monitoring result information according to a monitoring algorithm in a database management system (DBMS); And a storage unit for storing the created database index structure. The database index structure includes at least partially a sub index structure of the first level and a sub index structure of the second level that is lower than the first level, and the sub index structure of the second level includes a plurality of columns, The first column of the plurality of columns of the second-level sub-index structure includes key values, and the second column of the plurality of columns of the second-level sub-index structure has a ROWID value corresponding to a row of the data block. The third column of the plurality of columns of the second-level sub-index structure may include optimization index values that are set based on the monitoring result information.

본 개시내용의 일 실시예에 따라서 DBMS에서의 쿼리 처리 시간이 축소될 수 있다.Query processing time in a DBMS may be reduced according to an embodiment of the present disclosure.

본 개시내용의 일 실시예에 따라서, DBMS에서 효율적인 인덱싱 기법이 제공될 수 있다.According to one embodiment of the present disclosure, an efficient indexing technique in a DBMS can be provided.

다양한 양상들이 이제 도면들을 참조로 기재되며, 여기서 유사한 참조 번호들은 총괄적으로 유사한 구성요소들을 지칭하는데 이용된다. 이하의 실시예에서, 설명 목적을 위해, 다수의 특정 세부사항들이 하나 이상의 양상들의 총체적 이해를 제공하기 위해 제시된다. 그러나, 그러한 양상(들)이 이러한 특정 세부사항들 없이 실시될 수 있음은 명백할 것이다. 다른 예시들에서, 공지의 구조들 및 장치들이 하나 이상의 양상들의 기재를 용이하게 하기 위해 블록도 형태로 도시된다.
도 1은 본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치 및 클라이언트에 대한 개략도를 도시한다.
도 2는 본 개시내용의 일 실시예에 따른 DBMS에 대한 블록 구성도를 도시한다.
도 3은 본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치의 블록 구성도를 도시한다.
도 4는 본 개시내용의 일 실시예에 따른 데이터베이스 인덱스 구조를 예시적으로 도시한다.
도 5는 본 개시내용의 일 실시예에 따라 파티션 단위로 상이한 데이터베이스 인덱스 구조를 예시적으로 도시한다.
도 6은 본 개시내용의 일 실시예에 따라, 효율적인 인덱싱을 수행하기 위한 순서도를 예시적으로 도시한다.
도 7은 본 개시내용의 일 실시예에 따라, 효율적인 인덱싱을 수행하기 위한 수단, 로직 및 모듈을 예시적으로 도시한다.
도 8은 본 발명의 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도를 도시한다.Various aspects are now described with reference to the figures, in which like reference numerals are used collectively to refer to similar elements. In the following examples, for illustrative purposes, a number of specific details are presented to provide a holistic understanding of one or more aspects. However, it will be apparent that such aspect(s) may be practiced without these specific details. In other instances, well-known structures and devices are shown in block diagram form in order to facilitate describing one or more aspects.
1 shows a schematic diagram of a database management apparatus and client according to an embodiment of the present disclosure.
2 is a block diagram of a DBMS according to an embodiment of the present disclosure.
3 is a block diagram of a database management apparatus according to an embodiment of the present disclosure.
4 exemplarily illustrates a database index structure according to an embodiment of the present disclosure.
5 exemplarily illustrates different database index structures in units of partitions according to an embodiment of the present disclosure.
6 exemplarily shows a flowchart for performing efficient indexing, according to one embodiment of the present disclosure.
7 exemplarily shows means, logic and modules for performing efficient indexing, according to one embodiment of the present disclosure.
8 shows a simplified and general schematic diagram of an exemplary computing environment in which embodiments of the invention may be implemented.

다양한 실시예들 및/또는 양상들이 이제 도면들을 참조하여 개시된다. 하기 설명에서는 설명을 목적으로, 하나 이상의 양상들의 전반적 이해를 돕기 위해 다수의 구체적인 세부사항들이 개시된다. 그러나, 이러한 양상(들)은 이러한 구체적인 세부사항들 없이도 실행될 수 있다는 점 또한 본 발명의 기술 분야에서 통상의 지식을 가진 자에게 인식될 수 있을 것이다. 이후의 기재 및 첨부된 도면들은 하나 이상의 양상들의 특정한 예시적인 양상들을 상세하게 기술한다. 하지만, 이러한 양상들은 예시적인 것이고 다양한 양상들의 원리들에서의 다양한 방법들 중 일부가 이용될 수 있으며, 기술되는 설명들은 그러한 양상들 및 그들의 균등물들을 모두 포함하고자 하는 의도이다.Various embodiments and/or aspects are now disclosed with reference to the drawings. In the following description, for purposes of explanation, a number of specific details are disclosed to assist in the overall understanding of one or more aspects. However, it will also be appreciated by those of ordinary skill in the art of the present invention that this aspect(s) can be practiced without these specific details. The following description and the annexed drawings set forth in detail certain illustrative aspects of the one or more aspects. However, these aspects are exemplary and some of the various methods in the principles of the various aspects may be used, and the descriptions described are intended to include all such aspects and their equivalents.

또한, 다양한 양상들 및 특징들이 다수의 디바이스들, 컴포넌트들 및/또는 모듈들 등을 포함할 수 있는 시스템에 의하여 제시될 것이다. 다양한 시스템들이, 추가적인 장치들, 컴포넌트들 및/또는 모듈들 등을 포함할 수 있다는 점 그리고/또는 도면들과 관련하여 논의된 장치들, 컴포넌트들, 모듈들 등 전부를 포함하지 않을 수도 있다는 점 또한 이해되고 인식되어야 한다. In addition, various aspects and features will be presented by a system that may include multiple devices, components and/or modules, and the like. Various systems may also include additional devices, components and/or modules, and/or may not include all of the devices, components, modules, etc. discussed in connection with the drawings. It must be understood and recognized.

본 명세서에서 사용되는 "실시예", "예", "양상", "예시" 등은 기술되는 임의의 양상 또는 설계가 다른 양상 또는 설계들보다 양호하다거나, 이점이 있는 것으로 해석되지 않을 수도 있다. 아래에서 사용되는 용어들 '컴포넌트', '모듈', '시스템', '인터페이스' 등은 일반적으로 컴퓨터 관련 엔티티(computer-related entity)를 의미하며, 예를 들어, 하드웨어, 하드웨어와 소프트웨어의 조합, 소프트웨어를 의미할 수 있다.As used herein, "an embodiment", "yes", "a good", "an example", etc., may not be construed as any aspect or design described being better or more advantageous than another aspect or designs. . The terms'component','module','system', and'interface' used in the following generally mean a computer-related entity, for example, a combination of hardware, hardware and software, It can mean software.

더불어, 용어 "또는"은 배타적 "또는"이 아니라 내포적 "또는"을 의미하는 것으로 의도된다. 즉, 달리 특정되지 않거나 문맥상 명확하지 않은 경우에, "X는 A 또는 B를 이용한다"는 자연적인 내포적 치환 중 하나를 의미하는 것으로 의도된다. 즉, X가 A를 이용하거나; X가 B를 이용하거나; 또는 X가 A 및 B 모두를 이용하는 경우, "X는 A 또는 B를 이용한다"가 이들 경우들 어느 것으로도 적용될 수 있다. 또한, 본 명세서에 사용된 "및/또는"이라는 용어는 열거된 관련 아이템들 중 하나 이상의 아이템의 가능한 모든 조합을 지칭하고 포함하는 것으로 이해되어야 한다. In addition, the term “or” is intended to mean an inclusive “or” rather than an exclusive “or”. That is, unless specified otherwise or unclear in context, “X uses A or B” is intended to mean one of the natural inclusive substitutions. That is, X uses A; X uses B; Or if X uses both A and B, "X uses A or B" can be applied in either of these cases. It should also be understood that the term “and/or” as used herein refers to and includes all possible combinations of one or more of the listed related items.

또한, "포함한다" 및/또는 "포함하는"이라는 용어는, 해당 특징 및/또는 구성요소가 존재함을 의미하지만, 하나 이상의 다른 특징, 구성요소 및/또는 이들의 그룹의 존재 또는 추가를 배제하지 않는 것으로 이해되어야 한다. 또한, 달리 특정되지 않거나 단수 형태를 지시하는 것으로 문맥상 명확하지 않은 경우에, 본 명세서와 청구범위에서 단수는 일반적으로 "하나 또는 그 이상"을 의미하는 것으로 해석되어야 한다.Also, the terms “comprises” and/or “comprising” mean that the feature and/or component is present, but excludes the presence or addition of one or more other features, elements and/or groups thereof. It should be understood as not. In addition, unless otherwise specified or contextually unclear as indicating a singular form, the singular in the specification and claims should generally be construed to mean "one or more."

본 명세서에서 사용되는 용어 "컴포넌트", "모듈", "시스템" 등은 컴퓨터-관련 엔티티, 하드웨어, 펌웨어, 소프트웨어, 소프트웨어 및 하드웨어의 조합, 또는 소프트웨어의 실행을 지칭한다. 예를 들어, 컴포넌트는 프로세서상에서 실행되는 처리과정, 프로세서, 객체, 실행 스레드, 프로그램, 및/또는 컴퓨터일 수 있지만, 이들로 제한되는 것은 아니다. 예를 들어, 컴퓨팅 장치에서 실행되는 애플리케이션 및 컴퓨팅 장치 모두 컴포넌트일 수 있다. 하나 이상의 컴포넌트는 프로세서 및/또는 실행 스레드 내에 상주할 수 있고, 일 컴포넌트는 하나의 컴퓨터 내에 로컬화될 수 있고, 또는 2개 이상의 컴퓨터들 사이에 분배될 수 있다. 또한, 이러한 컴포넌트들은 그 내부에 저장된 다양한 데이터 구조들을 갖는 다양한 컴퓨터 판독가능한 매체로부터 실행할 수 있다. 컴포넌트들은 예를 들어 하나 이상의 데이터 패킷들을 갖는 신호(예를 들면, 로컬 시스템, 분산 시스템에서 다른 컴포넌트와 상호작용하는 하나의 컴포넌트로부터 데이터 및/또는 신호를 통해 다른 시스템과 인터넷과 같은 네트워크를 통한 데이터)에 따라 로컬 및/또는 원격 처리들을 통해 통신할 수 있다.As used herein, the terms "component", "module", "system" and the like refer to computer-related entities, hardware, firmware, software, a combination of software and hardware, or execution of software. For example, a component can be, but is not limited to, a process running on a processor, a processor, an object, a thread of execution, a program, and/or a computer. For example, both the application and the computing device running on the computing device can be components. One or more components can reside within a processor and/or thread of execution, and one component can be localized within one computer, or distributed between two or more computers. In addition, these components can execute from various computer readable media having various data structures stored therein. Components are, for example, signals having one or more data packets (e.g., data from one component interacting with another component in a local system, distributed system, and/or data via a network such as the Internet with other systems via signals and/or signals). ) Through local and/or remote processes.

본 명세서에서의 컴퓨터 판독가능 매체는 컴퓨터 시스템에 의해서 판독될 수 있도록 프로그램 및 데이터가 저장되는 모든 종류의 저장 매체를 포함할 수 있다. 본 발명의 일 양상에 따르면, 이러한 컴퓨터 판독가능 매체는, 컴퓨터 판독가능 저장 매체 및 컴퓨터 판독가능 전송 매체를 포함할 수 있다. 컴퓨터 판독가능 저장 매체는, ROM(판독 전용 메모리), RAM(랜덤 액세스 메모리), CD(컴팩트 디스크)-ROM, DVD(디지털 비디오 디스크)-ROM, 자기 테이프, 플로피 디스크, 광 데이터 저장장치 등을 포함할 수 있다. 컴퓨터 판독가능 전송 매체는, 캐리어 웨이브(예컨대, 인터넷을 통한 전송)의 형태로 구현되는 것을 포함할 수 있다. 추가적으로, 이러한 매체는 네트워크로 연결된 시스템에 분산되어, 분산 방식으로 컴퓨터가 판독가능한 코드들 및/또는 명령들을 저장할 수도 있다.Computer-readable media as used herein may include any kind of storage media in which programs and data are stored so that they can be read by a computer system. According to one aspect of the present invention, such a computer-readable medium may include a computer-readable storage medium and a computer-readable transmission medium. Computer-readable storage media include ROM (Read Only Memory), RAM (Random Access Memory), CD (Compact Disk)-ROM, DVD (Digital Video Disk)-ROM, magnetic tape, floppy disk, and optical data storage devices. It can contain. Computer-readable transmission media may include those implemented in the form of carrier waves (eg, transmission over the Internet). Additionally, such media may be distributed over networked systems to store computer readable codes and/or instructions in a distributed fashion.

본 발명의 실시를 위한 구체적인 내용을 설명하기에 앞서, 본 발명의 기술적 요지와 직접적 관련이 없는 구성에 대해서는 본 발명의 기술적 요지를 흩뜨리지 않는 범위 내에서 생략하였음에 유의하여야 할 것이다. 또한, 본 명세서 및 청구범위에 사용된 용어 또는 단어는 발명자가 자신의 발명을 최선의 방법으로 설명하기 위해 적절한 용어의 개념을 정의할 수 있다는 원칙에 입각하여 본 발명의 기술적 사상에 부합하는 의미와 개념으로 해석되어야 할 것이다.It should be noted that, prior to explaining the specific contents for carrying out the present invention, a configuration not directly related to the technical subject matter of the present invention has been omitted without detracting from the technical subject matter of the present invention. In addition, the terms or words used in the specification and claims are based on the principle that the inventor can define the concept of appropriate terms in order to explain his or her invention in the best way. It should be interpreted as a concept.

본 개시내용의 일 실시예에 따른 인덱싱 기술의 기본 개념은 본 출원에서 그 전체가 참조로 통합되는 미국 특허 US9626398호(2017.04.18) 및 US2007-0174309(2007.07.26)호에서 구체적으로 논의된다. The basic concept of indexing technology according to an embodiment of the present disclosure is specifically discussed in US Pat. Nos. US9626398 (2017.04.18) and US2007-0174309 (2007.07.26), the entire contents of which are incorporated herein by reference.

전술한 바와 같이, 인덱스는 B+트리 인덱스 및 비트맵 인덱스로 나누어질 수 있다.As described above, the index can be divided into a B+tree index and a bitmap index.

B+트리 인덱스에서는 테이블의 컬럼 값과 집합 내에서의 정보의 ROWID를 정렬된 형태로 가지는 구조이기 때문에, 동일한 값에 대해서 물리적 주소가 서로 상이한 경우 동일한 값을 중복하여 가지고 있어야 함에 따라 저장 공간에 낭비가 발생할 수 있다. 즉, B+트리 인덱스는 실제 컬럼 값을 인덱스에서도 보관하고 있어야 한다는 점 때문에, 대용량 데이터를 관리 및 처리하는데에 있어서 단점이 존재할 수 있다. 또한, B+트리 인덱스가 효율적으로 동작하기 위해서는 컬럼값의 분포도(selectivity)가 양호하여야 할 뿐만 아니라, 특정한 조건들에서는 인덱스의 성능을 보장받지 못한다는 단점이 존재할 수 있다. 나아가, B+트리 인덱스는 동일한 테이블에 대한 액세스시에 두 개 이상의 인덱스들을 병렬적으로 사용하는 상황에서 많은 제약사항이 따를 수 있다. 실제 DBMS 업무 환경에서의 사용자의 요구 사항은 매우 다양할 수 있다. 이러한 다양한 형태의 사용자들의 요구사항을 만족시키기 위해서는 테이블의 모든 컬럼의 조합만큼의 B+트리 인덱스를 만들어야 하며, 이러한 경우 테이블의 크기보다 오히려 인덱스의 크기가 더 커지는 기현상을 초래할 수 있다. 분포도가 양호한 2개의 B+트리 인덱스들이 존재하는 경우 2개 이상의 인덱스들을 동시에 사용하는데 하나만 사용되는 등과 같이 2개의 인덱스를 효과적으로 사용하기 위한 제약사항들 또한 존재할 수 있다.In B+tree index, it is a structure that has the column values of the table and the ROWID of information in the set in an ordered form, so if the physical addresses for the same value are different, the same value must be duplicated, so storage space is wasted. Can occur. That is, since the B+ tree index must store the actual column value in the index, there may be a disadvantage in managing and processing large amounts of data. In addition, in order for the B+tree index to operate efficiently, the selectivity of the column values must be good, and there may be a disadvantage that the performance of the index is not guaranteed under certain conditions. Furthermore, a B+tree index may be subject to many restrictions in a situation in which two or more indexes are used in parallel when accessing the same table. In a real DBMS work environment, user requirements can be very diverse. In order to satisfy the requirements of various types of users, it is necessary to create a B+tree index equal to the combination of all columns in the table, and in this case, the index size may be larger than the table size. When there are two B+tree indexes with good distribution, there may also be constraints for effectively using the two indexes, such as only one used for simultaneously using two or more indexes.

B+트리 인덱스의 문제점들을 해결하기 위한 시도의 일환으로 비트맵 인덱스가 고려될 수 있다. 비트맵 인덱스는 컴퓨터에서 사용하는 최소 단위인 비트를 이용하여 컬럼값을 저장하고 이를 이용하여 ROWID를 자동으로 생성하는 인덱스 기법을 의미할 수 있다. 일반적으로 비트맵 인덱스는, 인덱스를 생성하고자 하는 컬럼의 값들을 찾기 위해 테이블 스캔을 수행한 후 컬럼값, 시작(start) ROWID, 종료(end) ROWID 및 비트맵을 갖는 인덱스 엔트리가 생성되고 나서, 생성된 비트맵들을 B+트리 구조에 삽입되기 용이한 형태(즉, 키값 및 시작 ROWID 순서)로 정렬한 다음에, 정렬된 인덱스 엔트리들이 B+트리 구조로 삽입되는 방식으로 생성될 수 있다. 하지만, 비트맵 인덱스의 경우에도, DML(Data Manipulation Language) 작업시 계속적으로 비트맵 엔트리가 증가하게 되어, 빈번한 DML 작업을 수행하다 보면 비트맵 인덱스의 크기가 테이블의 크기보다 더 커지는 결과를 초래할 수도 있다. 또한, 비트맵 인덱스는 Distinct 값들이 많은 경우에는 그 크기가 비효율적으로 증가될 수도 있다.The bitmap index may be considered as an attempt to solve the problems of the B+tree index. The bitmap index may refer to an indexing technique that automatically stores a column value using a bit, which is the smallest unit used in a computer, and automatically generates a ROWID using it. In general, the bitmap index, after performing a table scan to find the values of the column to create the index, after the index entry with the column value, start (ROW), end (END) ROWID and bitmap is created, After the generated bitmaps are arranged in a form that is easy to be inserted into the B+ tree structure (ie, the key value and the starting ROWID order), the sorted index entries can be generated in such a way that they are inserted into the B+ tree structure. However, even in the case of a bitmap index, the bitmap entry continuously increases during the Data Manipulation Language (DML) operation, which may result in the size of the bitmap index being larger than the size of the table when performing frequent DML operations. have. In addition, the bitmap index may be inefficiently increased in size when there are many Distinct values.

본 개시내용의 일 실시예에 따라, 비트맵 인덱스 및 B+트리 인덱스를 효율적인 방식으로 조합하는 기법이 제공될 수 있다. 상기 기법은: 리프 노드에 대한 빈도수에 기초하여 빈도수가 낮은 리프 노드에 대해서는 B+트리 방식으로 표현하고 그리고 빈도수가 높은 리프 노드에 대해서는 B+트리 방식에 비트맵 값을 추가하여 표현하는 것을 포함할 수 있다. 또한, 상기 기법은: Distinct 값이 사전결정된 수를 초과하는 경우 혹은 DML의 발생 횟수가 사전결정된 횟수를 초과하는 파티션에 대해서는 B+트리 방식으로 인덱스를 표현하고 그리고 Distinct 값이 사전결정된 수보다 적거나 DML의 발생이 사전결정된 횟수보다 적은 파티션에 대해서는 B+트리 방식에 비트맵 값을 추가하여 인덱스를 표현하는 기법을 포함할 수 있다. According to an embodiment of the present disclosure, a technique of combining a bitmap index and a B+tree index in an efficient manner may be provided. The above technique may include expressing a leaf node with a low frequency in a B+tree manner based on a frequency for a leaf node, and adding a bitmap value to a leaf node with a high frequency in a B+tree manner. . In addition, the above technique: When the Distinct value exceeds a predetermined number, or for a partition where the number of occurrences of DML exceeds a predetermined number, the index is expressed in a B+tree manner, and the Distinct value is less than a predetermined number or DML For partitions with less than a predetermined number of times, a technique of expressing an index by adding a bitmap value to the B+tree method may be included.

본 개시내용의 일 실시예에 따라, 인덱스의 크기를 최소화시킬 수 있을 뿐만 아니라 DBMS의 쿼리 처리 성능을 증대시킬 수 있다. 본 개시내용의 일 실시예에 따라서 DBMS에서의 쿼리 처리 시간이 축소될 수 있을 뿐만 아니라 DB 시스템에서의 상환 조건들에 맞는 최적화된 효율적인 인덱싱 기법이 제공될 수 있다.According to an embodiment of the present disclosure, it is possible to minimize the size of the index as well as increase the query processing performance of the DBMS. According to an embodiment of the present disclosure, not only the query processing time in the DBMS can be reduced, but also an optimized efficient indexing technique suitable for redemption conditions in the DB system can be provided.

도 1은 본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치(120) 및 클라이언트(110)에 대한 개략도를 도시한다.1 shows a schematic diagram of a database management device 120 and a client 110 according to an embodiment of the present disclosure.

도 1에서 도시되는 바와 같이, 데이터베이스 시스템은 클라이언트(110), 데이터베이스 관리 장치(120)를 포함할 수 있다. 본 명세서에서의 데이터베이스 관리 장치는 데이터베이스 서버와 상호 교환 가능하게 사용될 수 있다.As shown in FIG. 1, the database system may include a client 110 and a database management device 120. The database management device in the present specification may be used interchangeably with a database server.

도 1에서 도시되는 바와 같이, 클라이언트(110)는 네트워크를 통하여 통신하기 위한 매커니즘을 갖는 데이터베이스 시스템에서의 노드(들)를 의미할 수 있다. 예를 들어, 클라이언트(110)는 PC, 랩탑 컴퓨터, 워크스테이션, 단말 및/또는 네트워크 접속성을 갖는 임의의 전자 디바이스를 포함할 수 있다. 또한, 클라이언트(110)는 에이전트(Agent), API(Application Programming Interface) 및 플러그-인(Plug-in) 중 적어도 하나에 의해 구현되는 임의의 서버를 포함할 수도 있다. 예를 들어, 도 1에서의 클라이언트(110)는 데이터베이스 관리 장치(120)를 사용하는 사용자와 관련될 수 있다. 이러한 예시에서, 클라이언트(110)는 데이터베이스 관리 장치(120)로 DML 및 DDL(Data Definition Language) 등을 포함하는 다양한 형태의 쿼리들을 발행할 수 있다.As shown in FIG. 1, the client 110 may refer to node(s) in a database system having a mechanism for communicating over a network. For example, the client 110 can include a PC, laptop computer, workstation, terminal, and/or any electronic device with network connectivity. Also, the client 110 may include any server implemented by at least one of an agent, an application programming interface (API), and a plug-in. For example, the client 110 in FIG. 1 may be associated with a user using the database management device 120. In this example, the client 110 may issue various types of queries, including DML and DDL (Data Definition Language), to the database management device 120.

데이터베이스 관리 장치(120)는, 예를 들어, 마이크로프로세서, 메인프레임 컴퓨터, 디지털 싱글 프로세서, 휴대용 디바이스 및 디바이스 제어기 등과 같은 임의의 타입의 컴퓨터 시스템 또는 컴퓨터 디바이스를 포함할 수 있다. 예를 들어, 본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치(120)는 데이터베이스 서버를 포함할 수 있다. 이러한 데이터베이스 관리 장치(120)는 DBMS(Database Management System)(130) 및 영구 저장 매체(persistent storage)(140)를 포함할 수 있다. 도 1에서는 1개의 데이터베이스 관리 장치를 도시하고 있으나, 이보다 많은 데이터베이스 관리 장치들 또한 본 발명의 범위에 포함될 수 있다는 점이 당해 출원 분야에 있어서 통상의 지식을 가진 자에게 명백할 것이다. The database management apparatus 120 may include any type of computer system or computer device, such as, for example, a microprocessor, mainframe computer, digital single processor, portable device, and device controller. For example, the database management device 120 according to an embodiment of the present disclosure may include a database server. The database management device 120 may include a DBMS (Database Management System) 130 and a persistent storage medium (persistent storage) (140). Although FIG. 1 shows one database management device, it will be apparent to those skilled in the art that more database management devices may be included in the scope of the present invention.

도 1에서는 도시되지 않았지만, 데이터베이스 관리 장치(120)는 버퍼 캐시를 포함하는 하나 이상의 메모리를 포함할 수 있다. 또한, 도 1에서는 도시되지 않았지만, 데이터베이스 관리 장치(120)는 하나 이상의 프로세서를 포함할 수 있다. 따라서, DBMS(130)는 상기 메모리 상에서 상기 프로세서에 의하여 동작될 수 있다. Although not illustrated in FIG. 1, the database management device 120 may include one or more memories including a buffer cache. Also, although not illustrated in FIG. 1, the database management device 120 may include one or more processors. Therefore, the DBMS 130 can be operated by the processor on the memory.

여기서, 메모리는 동적 램(DRAM, dynamic random access memory), 정적 램(SRAM, static random access memory) 등의 랜덤 액세스 메모리(RAM)와 같은, 프로세서가 직접 접근하는 주된 저장 장치로서 전원이 꺼지면 저장된 정보가 순간적으로 지워지는 휘발성(volatile) 저장 장치를 의미할 수 있지만, 이들로 한정되는 것은 아니다. 이러한 메모리는 프로세서에 의하여 동작 될 수 있다. 메모리는 데이터 값을 포함하는 데이터 테이블(data table) 및/또는 인덱스 테이블을 임시로 저장할 수 있다. 상기 데이터 테이블 및 인덱스 테이블은 데이터 값을 포함할 수 있으며, 본 발명의 일 실시예에서 상기 테이블들의 데이터 값은 메모리로부터 영구 저장 매체에 기록될 수 있다. 추가적인 양상에서, 메모리는 버퍼 캐시를 포함하며, 상기 버퍼 캐시의 데이터 블록에는 데이터가 저장될 수 있다. 상기 데이터는 백그라운드 프로세스에 의하여 영구 저장 매체에 기록될 수 있다.Here, the memory is a main storage device directly accessed by a processor, such as dynamic random access memory (DRAM) or static random access memory (SRAM), and stored information when the power is turned off. Can mean a volatile storage device that is instantly erased, but is not limited to these. This memory can be operated by a processor. The memory may temporarily store a data table and/or index table including data values. The data table and the index table may include data values, and in one embodiment of the present invention, the data values of the tables may be recorded from memory to a permanent storage medium. In a further aspect, the memory includes a buffer cache, and data may be stored in a data block of the buffer cache. The data can be recorded on a permanent storage medium by a background process.

영구 저장 매체(140)는, 예를 들어 자기(magnetic) 디스크, 광학(optical) 디스크 및 광자기(magneto-optical) 저장 디바이스뿐만 아니라 플래시 메모리 및/또는 배터리-백업 메모리에 기초한 저장 디바이스와 같은, 임의의 데이터를 지속적으로 할 수 있는 비-휘발성(non-volatile) 저장 매체를 의미한다. 이러한 영구 저장 매체(140)는 다양한 통신 수단을 통하여 데이터베이스 관리 장치(120)의 프로세서 및 메모리와 통신할 수 있다. 추가적인 실시예에서, 이러한 영구 저장 매체(140)는 데이터베이스 관리 장치(120) 외부에 위치하여 데이터베이스 관리 장치(120)와 통신 가능할 수도 있다. 본 개시내용의 일 실시예에 따라 영구 저장 매체(140) 및 메모리는 저장부로 통칭될 수도 있다.The permanent storage medium 140 is, for example, a magnetic disk, an optical disk and a magneto-optical storage device, as well as a storage device based on flash memory and/or battery-backed memory, Refers to a non-volatile storage medium capable of continuously performing arbitrary data. The permanent storage medium 140 may communicate with the processor and memory of the database management device 120 through various communication means. In a further embodiment, the permanent storage medium 140 may be located outside the database management device 120 to communicate with the database management device 120. According to an embodiment of the present disclosure, the permanent storage medium 140 and the memory may be collectively referred to as a storage unit.

DBMS(130)는 데이터베이스 관리 장치(120)에서 필요한 데이터를 검색, 삽입, 수정 및/또는 삭제 그리고 인덱스를 생성하고 인덱스에 액세스하는 것 등과 같은 데이터베이스의 임의의 형태의 동작들을 수행하는 것을 허용하기 위한 프로그램으로서, 전술한 바와 같이, 데이터베이스 관리 장치(120)의 메모리에서 프로세서에 의하여 구현될 수 있다. DBMS 130 to allow the database management device 120 to perform any form of operation of the database, such as retrieving, inserting, modifying and/or deleting necessary data, and creating an index and accessing the index. As a program, as described above, it may be implemented by a processor in the memory of the database management apparatus 120.

클라이언트(110)와 데이터베이스 관리 장치(120) 또는 데이터베이스 관리 장치들은 네트워크(미도시)를 통하여 서로 통신할 수 있다. 본 발명의 일 실시예에 따른 네트워크는 공중전화 교환망(PSTN:Public Switiched Telephone Network), xDSL(x Digital Subscriber Line), RADSL(Rate Adaptive DSL), MDSL(Multi Rate DSL), VDSL(Very High Speed DSL), UADSL(Universal Asymmetric DSL), HDSL(High Bit Rate DSL) 및 근거리 통신망(LAN) 등과 같은 다양한 유선 통신 시스템들을 사용할 수 있다.The client 110 and the database management device 120 or the database management devices may communicate with each other through a network (not shown). The network according to an embodiment of the present invention is a public switched telephone network (PSTN), x Digital Subscriber Line (xDSL), Rate Adaptive DSL (RADSL), Multi Rate DSL (MDSL), Very High Speed DSL (VDSL) ), Universal Asymmetric DSL (UADSL), High Bit Rate DSL (HDSL), and various wired communication systems such as a local area network (LAN).

또한, 본 명세서에서 제시되는 네트워크는 CDMA(Code Division Multi Access), TDMA(Time Division Multi Access), FDMA(Frequency Division Multi Access), OFDMA(Orthogonal Frequency Division Multi Access), SC-FDMA(Single Carrier-FDMA) 및 다른 시스템들과 같은 다양한 무선 통신 시스템들을 사용할 수 있다. 추가적으로, 네트워크는 데이터베이스 링크(dblink)를 포함할 수 있으며, 이에 따라 복수의 데이터베이스 관리 장치들은 이러한 데이터베이스 링크를 통해 서로 통신하여 다른 데이터베이스 관리 장치로부터의 데이터를 가져올 수 있다. 본 명세서에서 설명된 기술들은 위에서 언급된 네트워크들뿐만 아니라, 다른 네트워크들에서도 사용될 수 있다. In addition, the network proposed in this specification is CDMA (Code Division Multi Access), TDMA (Time Division Multi Access), FDMA (Frequency Division Multi Access), OFDMA (Orthogonal Frequency Division Multi Access), SC-FDMA (Single Carrier-FDMA) ) And other systems. Additionally, the network may include a database link (dblink), so that multiple database management devices can communicate with each other through this database link to fetch data from other database management devices. The techniques described herein can be used in the networks mentioned above, as well as other networks.

도 2는 본 개시내용의 일 실시예에 따른 DBMS(130)에 대한 블록 구성도를 도시한다.2 shows a block diagram of a DBMS 130 according to one embodiment of the present disclosure.

도 2에서 도시되는 바와 같이, DBMS(130)는 쿼리 수신부(201), 모니터링 수행부(203), 쿼리 처리부(205), 통신부(207) 및 저장부(209)를 포함할 수 있다. 도 2에서 도시되는 컴포넌트들은 예시적인 것이며, 발명의 구현 양상에 따라서 일부 컴포넌트들이 생략될 수 있거나 또는 추가적인 컴포넌트들이 추가될 수도 있다. 상기 DBMS(130)내의 컴포넌트들은 하나 이상의 프로세서에 의해 실행되어 메모리 및/또는 영구 저장 매체(140)에 저장될 수 있다. 전술한 메모리 및 영구 저장 매체(140)는 저장부(209)로 통합될 수도 있다. 이러한 경우, 저장부(209)는 프로세서와 통신하여 데이터 저장/관리를 제어할 수 있다.As illustrated in FIG. 2, the DBMS 130 may include a query receiving unit 201, a monitoring performing unit 203, a query processing unit 205, a communication unit 207, and a storage unit 209. The components illustrated in FIG. 2 are exemplary, and some components may be omitted or additional components may be added depending on the implementation aspect of the invention. Components in the DBMS 130 may be executed by one or more processors and stored in a memory and/or permanent storage medium 140. The above-described memory and permanent storage medium 140 may be integrated into the storage unit 209. In this case, the storage unit 209 may communicate with the processor to control data storage/management.

쿼리 수신부(201)는 클라이언트(110)로부터 발행된 임의의 형태의 쿼리를 수신을 제어할 수 있다. 쿼리 수신부(201)는 수신된 쿼리에 대한 특정 동작을 수행할 수 있다. 본 개시내용에서의 쿼리는 데이터의 조회, 업데이트, 추가, 변경, 삭제 및 삽입 등을 포함하는 DML(Data Manipulation Language)을 포함할 수 있다. 예를 들어, 이러한 DML은 SELECT, INSERT, UPDATE, DELETE와 관련된 명령어 등을 포함할 수 있다. 또한, 본 개시내용에서의 쿼리는 스키마, 도메인, 테이블, 뷰, 인덱스를 정의하거나 변경 또는 제거할때 사용되는 DDL(Data Definition Language)을 포함할 수 있다. 이러한 DDL은 CREATE, ALTER, DROP, RENAME, TRUNCATE 등의 명령어를 포함할 수 있다. DDL의 일례로, 데이터베이스 인덱스 구조의 생성을 위한 쿼리가 포함될 수 있다.The query receiving unit 201 may control receiving any type of query issued from the client 110. The query receiving unit 201 may perform a specific operation on the received query. Queries in the present disclosure may include Data Manipulation Language (DML) including querying, updating, adding, changing, deleting, and inserting data. For example, such DML may include SELECT, INSERT, UPDATE, and DELETE related commands. Further, the query in the present disclosure may include a data definition language (DDL) used when defining, changing, or removing schemas, domains, tables, views, and indexes. The DDL may include commands such as CREATE, ALTER, DROP, RENAME, and TRUNCATE. As an example of DDL, a query for creating a database index structure may be included.

프로세서가 DBMS(130)를 통하여 쿼리를 파싱(parsing)하고, 변환(transformation)하고, 최적화(optimization)한 후, 해당 쿼리는 최적화된 쿼리 문(statement)의 형태로 실행될 수 있다. 추가적으로, 쿼리 수신부(201)가 쿼리를 수신하는 경우, 이에 대한 복수의 (후보) 수행 알고리즘들이 자동적으로 결정될 수 있거나 또는 클라이언트로부터 수신된 힌트(hint)에 기초하여 이에 대한 복수의 (후보) 수행 알고리즘들이 결정될 수도 있다. 데이터 조회와 관련된 예시로, 인덱스를 통하여 데이터 조회를 수행할지 혹은 전체(full) 테이블 스캔을 통하여 데이터 조회를 수행할지에 관한 수행 알고리즘이 결정될 수 있다.After the processor parses, transforms, and optimizes the query through the DBMS 130, the query can be executed in the form of an optimized query statement. Additionally, when the query receiving unit 201 receives a query, a plurality of (candidate) execution algorithms for it may be automatically determined, or a plurality of (candidate) execution algorithms based on a hint received from the client May be decided. As an example related to data inquiry, an execution algorithm on whether to perform data inquiry through an index or full table scan may be determined.

본 발명의 일 양상에서, 데이터베이스 인덱스 구조의 생성과 관련된 쿼리는 예를 들어, CREATE INDEX, CREATE INDEX WITH DROP_EXISTING, ALTER INDEX REBUILD, ALTER TABLE ADD CONSTRAINT(PRIMARY KEY 또는 UNIQUE), ALTER TABLE DROP CONSTRAINT(PRIMARY KEY 또는 UNIQUE), 및 DROP INDEX MOVE TO 등을 포함할 수 있지만, 이들로 제한되지는 않는다.In one aspect of the present invention, queries related to the creation of a database index structure include, for example, CREATE INDEX, CREATE INDEX WITH DROP_EXISTING, ALTER INDEX REBUILD, ALTER TABLE ADD CONSTRAINT (PRIMARY KEY or UNIQUE), ALTER TABLE DROP CONSTRAINT (PRIMARY KEY) Or UNIQUE), and DROP INDEX MOVE TO, and the like.

모니터링 수행부(203)는 데이터베이스 인덱스 구조를 생성하는데 기초가 될 수 있는 모니터링 동작을 수행할 수 있다. 본 명세서에서의 모니터링 동작은, 인덱스를 빌드하기 위하여, 특정 키 값에 액세스되는 횟수/빈도 정보, 특정 데이터에 액세스되는 횟수/빈도 정보, 특정 파티션에 대한 distinct 값 정보, 특정 파티션에 대한 DML 발생 횟수/빈도 정보, 및 특정 파티션에 포함된 데이터의 마지막 액세스 시점 정보 중 적어도 하나의 정보를 획득하는 과정을 의미할 수 있다. 횟수/빈도 정보는 카운팅 값을 의미할 수 있거나 혹은 시간당 횟수 값을 의미할 수 있다. 시점 정보는 실제 시간 정보를 의미할 수 있거나 혹은 데이터베이스 관리 장치(120)가 자체적으로 관리하는 시간 정보(예컨대, TSN(Tibero Sequence Number))를 의미할 수도 있다.The monitoring execution unit 203 may perform a monitoring operation that can be a basis for creating a database index structure. The monitoring operation in the present specification includes the number of times/frequency information accessed for a specific key value, the number/frequency information accessed for specific data, distinct value information for a specific partition, and the number of DML occurrences for a specific partition in order to build an index. It may mean a process of acquiring at least one of /frequency information and last access point information of data included in a specific partition. The count/frequency information may mean a counting value or a count value per hour. The time point information may mean actual time information or time information (for example, a TSN (Tibero Sequence Number)) that the database management device 120 manages itself.

모니터링 수행부(203)는 모니터링 수행에 따른 모니터링 결과 정보를 생성할 수 있다. 모니터링 수행 결과 정보는, 데이터베이스 인덱스 구조가 포함하는 복수의 서브 인덱스 구조들 특정 서브 인덱스 구조(예컨대, 하위 서브 인덱스 구조)가 제 1 타입의 인덱싱 방식 또는 상기 제 1 타입의 인덱싱 방식과 상이한 제 2 타입의 인덱싱 방식 중 어느 인덱싱 방식을 사용할 것인지에 대한 표시를 포함할 수 있다. 본 개시내용의 일 실시예에서, 특정 타입의 인덱싱 방식은 임의의 인덱싱 방식을 포함할 수 있다. 예를 들어, 제 1 타입의 인덱싱 방식은 비트맵 인덱싱 방식을 포함할 수 있다. 제 2 타입의 인덱싱 방식은 B+트리 인덱싱 방식을 포함할 수 있다. 또한, 이러한 모니터링 결과 정보는 ROWID 단위 및/또는 데이터베이스 테이블이 분할된 형태인 파티션 단위로 어떤 인덱싱 방식을 사용할 것인지를 표시할 수 있다.The monitoring execution unit 203 may generate monitoring result information according to monitoring execution. As a result of monitoring, a plurality of sub-index structures included in the database index structure may include a specific sub-index structure (eg, a sub-sub index structure) that is different from the indexing method of the first type or the indexing method of the first type. It may include an indication of which indexing method to use among the indexing methods. In one embodiment of the present disclosure, a particular type of indexing scheme may include any indexing scheme. For example, the first type of indexing scheme may include a bitmap indexing scheme. The second type of indexing scheme may include a B+tree indexing scheme. In addition, the monitoring result information may indicate which indexing method to use in the ROWID unit and/or the partition unit in which the database table is partitioned.

전술한 예시들에 따라서, 예를 들어, 모니터링 결과 정보에는 리프 노드의 특정 로우는 B+트리 인덱싱 방식을 사용하고 특정 로우와는 상이한 다른 로우는 비트맵 인덱싱 방식을 사용한다는 표시가 포함될 수 있다. 다른 예시로, 모니터링 결과 정보에는 데이터베이스 테이블의 특정 파티션은 B+트리 인덱싱 방식을 사용하고 상기 특정 파티션과 상이한 다른 파티션은 비트맵 인덱싱 방식을 사용한다는 표시가 포함될 수 있다.According to the above-described examples, for example, the monitoring result information may include an indication that a specific row of a leaf node uses a B+tree indexing scheme and another row different from a particular row uses a bitmap indexing scheme. As another example, the monitoring result information may include an indication that a specific partition of the database table uses a B+tree indexing method, and another partition different from the specific partition uses a bitmap indexing method.

본 개시내용의 일 실시예에 따라서, 모니터링 수행부(203)는 사전결정된 모니터링 알고리즘에 기초하여, 모니터링 동작을 수행할 수 있다. 예를 들어, 제 1 모니터링 알고리즘에 따라서, 모니터링 수행부(203)는 데이터베이스 인덱스 구조가 참조하는 기초(base) 데이터 구조의 컬럼 값에 액세스되는 횟수 정보를 모니터링함으로써, 모니터링 결과 정보를 생성할 수 있다. 이러한 경우, 예를 들어, 모니터링 결과 정보는, 로우 단위로 제 1 타입의 인덱싱 방식 또는 제 2 타입의 인덱싱 방식 중 어느 인덱싱 방식을 사용할 것인지에 대한 표시를 포함할 수 있다. 이러한 경우, 컬럼 값으로 액세스되는 횟수 정보가 사전결정된 액세스 임계 횟수값 보다 큰 경우 상기 모니터링 결과 정보는 비트맵 인덱싱 방식을 사용할 것을 표시하고, 컬럼 값으로 액세스되는 횟수 정보가 사전결정된 액세스 임계 횟수값 이하인 경우 상기 모니터링 결과 정보는 B+트리 인덱싱 방식을 사용할 것을 표시할 수 있다.According to an embodiment of the present disclosure, the monitoring performing unit 203 may perform a monitoring operation based on a predetermined monitoring algorithm. For example, according to the first monitoring algorithm, the monitoring execution unit 203 may generate monitoring result information by monitoring the number of times the column value of the base data structure referenced by the database index structure is accessed. . In this case, for example, the monitoring result information may include an indication of which indexing method to use, the first type of indexing method or the second type of indexing method on a row basis. In this case, if the number of times the column value is accessed is greater than the predetermined number of access thresholds, the monitoring result information indicates that the bitmap indexing method is used, and the number of times the column value is accessed is less than the predetermined number of access thresholds. In this case, the monitoring result information may indicate that a B+tree indexing method is used.

본 개시내용의 일 실시예에 따라서, 모니터링 수행부(203)에 의해 생성되는 모니터링 결과 정보는, 데이터베이스 인덱스 구조가 참조하는 기초 데이터 구조의 파티션 특성 정보에 적어도 부분적으로 기초하여 파티션 단위로 생성될 수 있다. 예를 들어, 기초 데이터 구조는 데이터베이스 인덱스 구조가 인덱싱하는 데이터 테이블을 포함할 수 있다. 파티션 특성 정보는, 파티션의 Distinct 값의 크기 정보 및 파티션에 대한 DML 발생 횟수 정보 중 적어도 하나에 기초하여 결정될 수 있다. 예를 들어, 파티션의 상기 Distinct 값의 크기 정보가 사전결정된 크기 임계값보다 작거나 또는 상기 파티션에 대한 DML 발생 횟수 정보가 사전결정된 DML 발생 횟수 임계값 보다 작은 경우, 상기 모니터링 결과 정보는 상기 파티션에 대해서 비트맵 인덱싱 방식을 사용할 것을 표시하며, 파티션에 대한 상기 DML 발생 횟수 정보가 사전결정된 DML 발생 횟수 임계값 이상인 경우, 상기 모니터링 결과 정보는 상기 파티션에 대해서 B+트리 인덱싱 방식을 사용할 것을 표시할 수 있다.According to an embodiment of the present disclosure, monitoring result information generated by the monitoring performing unit 203 may be generated in a partition unit based at least in part on partition property information of a basic data structure referenced by a database index structure. have. For example, the basic data structure may include data tables indexed by the database index structure. The partition characteristic information may be determined based on at least one of the size information of the partition's Distinct value and the number of DML occurrences for the partition. For example, if the size information of the Distinct value of a partition is less than a predetermined size threshold value, or if the number of DML occurrences for the partition is less than a predetermined DML occurrence number threshold, the monitoring result information is transmitted to the partition. The bitmap indexing method is used, and when the DML occurrence count information for a partition is equal to or greater than a predetermined DML occurrence count threshold, the monitoring result information may indicate that the B+tree indexing method is used for the partition. .

본 개시내용의 일 실시예에 따라서, 모니터링 수행부(203)에서 생성하는 모니터링 결과 정보는, 파티션에 포함된 데이터의 마지막 액세스 시점 정보에 적어도 부분적으로 기초하여 결정될 수 있다. 이러한 경우, 파티션에 포함된 데이터의 상기 마지막 액세스 시점 정보가 사전결정된 임계 시점 정보 보다 오래된 경우, 상기 모니터링 결과 정보는 상기 파티션에 대해서 비트맵 인덱싱 방식을 사용할 것을 표시할 수 있다. 또한, 파티션에 포함된 데이터의 상기 마지막 액세스 시점 정보가 사전결정된 임계 시점 정보 보다 오래되지 않은 경우, 상기 모니터링 결과 정보는 상기 파티션에 대해서 B+트리 인덱싱 방식을 사용할 것을 표시할 수 있다.According to an embodiment of the present disclosure, monitoring result information generated by the monitoring performing unit 203 may be determined based at least in part on the last access point information of data included in the partition. In this case, when the last access point information of data included in a partition is older than a predetermined threshold time point information, the monitoring result information may indicate that a bitmap indexing method is used for the partition. In addition, when the last access point information of data included in a partition is not older than a predetermined threshold time point information, the monitoring result information may indicate that a B+tree indexing method is used for the partition.

모니터링 수행부(203)의 모니터링 동작에 의하여, 각 로우 단위로 그리고/또는 각 파티션 단위로 상이한 타입의 인덱싱 방식이 적용될 수 있기 때문에, 전체적인 인덱싱 성능이 최적화될 수 있다. Since a different type of indexing method can be applied to each row unit and/or each partition unit by the monitoring operation of the monitoring execution unit 203, the overall indexing performance can be optimized.

일례로, 기존의 비트맵 인덱스의 경우 distinct 값이 큰 경우에는 비효율적으로 크기가 커질 수 있다. 따라서, 본 개시내용의 일 실시예에 따른 기법에 의하여, 특정한 값들이 전체 데이터의 대부분을 차지하고 그리고 나머지 값들은 개수가 적은 이른바 Long Tail Data에 대해서는 액세스 빈도수가 높은 값들에 대해서는 비트맵 인덱싱을 통하여 해당 자료를 표현하는 Row Piece가 줄어들게 된다. 나아가 본 개시내용의 일 실시예에 따른 기법에 의하여, 빈도수가 낮은 값들에 대해서는 B+트리 인덱싱 방식을 사용함에 따라 불필요한 공간 낭비가 줄어들 수 있다. 즉, 본 개시내용의 일 실시예에 따른 기법에 의하여, 인덱스 구조의 크기를 줄임과 동시에 인덱스 속도가 빨라질 수 있다.For example, in the case of the existing bitmap index, if the distinct value is large, the size may be inefficiently increased. Therefore, by the technique according to an embodiment of the present disclosure, specific values occupy most of the entire data, and the remaining values are small numbers, so for the long tail data, corresponding values are accessed through bitmap indexing. The row pieces representing the data are reduced. Furthermore, by using a technique according to an embodiment of the present disclosure, unnecessary space waste can be reduced by using a B+tree indexing method for values with a low frequency. That is, by the technique according to an embodiment of the present disclosure, the index speed may be increased while reducing the size of the index structure.

다른 예시로, DML이 자주 발생하는 경우 B+트리 인덱스가 유리하며, Select 쿼리 등에 대해서는 비트맵 인덱스가 유리할 수 있다. 예를 들어, 과거 데이터가 포함된 파티션에는 비트맵 인덱스를 사용하고 업데이트, 삽입 등이 발생하는 현재 데이터가 포함된 파티션에 대해서는 B+트리 인덱스를 사용함으로써, 파티션 특성에 따라 최적화된 인덱싱 방식을 통하여 쿼리 응답 성능이 향상될 수 있다.As another example, when DML occurs frequently, a B+tree index is advantageous, and a bitmap index may be advantageous for select queries and the like. For example, a bitmap index is used for a partition that contains historical data, and a B+tree index is used for a partition that contains current data that is updated, inserted, etc. Response performance can be improved.

본 개시내용의 일 실시예에 따라, 모니터링 수행부(203)에 의해 생성된 모니터링 결과 정보는 저장부(209)로 저장되어 관리될 수 있다. 이러한 경우, 쿼리 처리부(205)는 저장부(209)에 저장된 모니터링 결과 정보를 리트리브(retrieve)하여 모니터링 결과 정보에 따라 데이터베이스 인덱스 구조를 생성할 수 있다. According to an embodiment of the present disclosure, monitoring result information generated by the monitoring performing unit 203 may be stored and managed in the storage unit 209. In this case, the query processing unit 205 may retrieve the monitoring result information stored in the storage unit 209 to generate a database index structure according to the monitoring result information.

쿼리 처리부(205)는 해당 쿼리에 대한 임의의 연산(operation)을 수행할 수 있다. 쿼리 처리부(205)는, 쿼리 수신부(201)에 의해 수신된 쿼리에 대한 하나 이상의 수행 알고리즘을 생성할 수 있다. 쿼리 처리부(205)는 쿼리를 실행하기 위한 수행 알고리즘을 결정하고, 결정된 수행 알고리즘에 따라서 쿼리를 처리하기 위한 연산을 수행할 수 있다. The query processing unit 205 may perform an arbitrary operation on the corresponding query. The query processing unit 205 may generate one or more execution algorithms for the query received by the query receiving unit 201. The query processing unit 205 may determine an execution algorithm for executing a query, and perform an operation for processing a query according to the determined execution algorithm.

쿼리 처리부(205)는 인덱스 구조 정의/생성에 관한 쿼리를 수신하는 경우, 모니터링 수행부(203)의 모니터링 결과 정보에 기초하여, 데이터베이스 인덱스 구조를 생성할 수 있다. 생성되는 데이터베이스 인덱스 구조는, 복수의 서브 인덱스 구조들을 계층적으로 구비하는 형태가 될 수 있다. When receiving a query related to index structure definition/creation, the query processing unit 205 may generate a database index structure based on the monitoring result information of the monitoring execution unit 203. The generated database index structure may have a form having a plurality of sub-index structures hierarchically.

예를 들어, 데이터베이스 인덱스 구조는 제 1 레벨의 서브 인덱스 구조 및 상기 제 1 레벨 보다 하위 레벨인 제 2 레벨의 서브 인덱스 구조를 포함할 수 있다. 이러한 경우, 제 1 레벨의 서브 인덱스 구조는 루트 노드와 대응될 수 있으며 그리고 제 2 레벨의 서브 인덱스 구조는 리프 노드와 대응될 수 있다 또한, 데이터베이스 인덱스 구조는 제 1 레벨의 서브 인덱스 구조, 제 1 레벨의 하위 레벨인 제 2 레벨의 서브 인덱스 구조 및 제 2 레벨의 하위 레벨인 제 3 레벨의 서브 인덱스 구조를 포함할 수 있다. 이러한 경우, 제 1 레벨의 서브 인덱스 구조는 루트 노드와 대응될 수 있으며, 제 2 레벨의 서브 인덱스 구조는 브랜치 노드와 대응될 수 있으며 그리고 제 3 레벨의 서브 인덱스 구조는 리프 노드와 대응될 수 있다. For example, the database index structure may include a first level sub-index structure and a second level sub-index structure that is lower than the first level. In this case, the sub-index structure of the first level may correspond to the root node, and the sub-index structure of the second level may correspond to the leaf node. In addition, the database index structure may include the sub-index structure of the first level, the first A sub-level structure of the second level, which is a lower level of the level, and a sub-index structure of the third level, which is a lower level of the second level. In this case, the sub-index structure of the first level may correspond to the root node, the sub-index structure of the second level may correspond to the branch node, and the sub-index structure of the third level may correspond to the leaf node. .

본 개시내용의 일 실시예에 따라서, 복수의 서브 인덱스 구조들 중 하나 이상의 서브 인덱스 구조(예컨대, 루트 노드 및/또는 브랜치 노드)는 자신의 하위 레벨의 서브 인덱스 구조의 시작 ROWID 값 및 상기 하위 레벨의 서브 인덱스 구조에 접근하기 위한 주소값을 포함할 수 있다. 이에 따라, 상위 노드는 하위 노드(들)을 포인팅할 수 있어서, 인덱스 구조에서의 상위 레벨로부터 하위 레벨로의 대한 트래버스가 이루어질 수 있다. 프로세서는 트래버스 수행시에 각 노드의 조건을 확인하고 상기 조건에 기초하여 연산을 수행할 수 있다. 상기 조건은 탐색 목표의 인덱스 번호가 노드에 포함하는 인덱스 번호의 범위에 포함되는지 여부일 수 있다.According to an embodiment of the present disclosure, one or more sub-index structures (eg, a root node and/or a branch node) of a plurality of sub-index structures may include a starting ROWID value of the sub-index structure of the sub-level and its sub-level. It may include an address value for accessing the sub-index structure of. Accordingly, the upper node can point to the lower node(s), so that traverse from the upper level to the lower level in the index structure can be made. The processor may check the condition of each node when performing traverse and perform an operation based on the condition. The condition may be whether the index number of the search target is included in the range of index numbers included in the node.

본 개시내용의 일 실시예에 따라서, 복수의 서브 인덱스 구조들 중 하나 이상의 서브 인덱스 구조(예컨대, 리프 노드)는 복수의 컬럼들을 포함할 수 있다. 복수의 컬럼들 중 제 1 컬럼은 키 값들을 포함하며, 복수의 컬럼들 중 제 2 컬럼은, 데이터 블록의 로우에 대응되는 ROWID 값들을 포함하며, 그리고 복수의 컬럼들 중 제 3 컬럼은 상기 모니터링 결과 정보에 기초하여 설정되는 최적화(optimized) 인덱스 값들을 포함할 수 있다. According to an embodiment of the present disclosure, one or more sub-index structures (eg, leaf nodes) of the plurality of sub-index structures may include a plurality of columns. The first column of the plurality of columns includes key values, the second column of the plurality of columns includes ROWID values corresponding to the row of the data block, and the third column of the plurality of columns monitors the Optimized index values that are set based on the result information may be included.

본 개시내용의 일 실시예에 따른 "최적화 인덱스 값"은 어떤 타입의 인덱싱 방식을 사용하는지에 대한 표시를 포함할 수 있다. 예를 들어, 최적화 인덱스 값에 기초하여 해당 컬럼 값이 비트맵 인덱스를 사용하는지 혹은 B+트리 인덱스를 사용하는지 여부가 판단될 수 있다. 예를 들어, B+트리 인덱싱 방식을 사용하는 것으로 결정된 경우 최적화 인덱스 값은 NULL 값을 포함할 수 있다. 또한, 비트맵 인덱싱 방식을 사용하는 것으로 결정된 경우 최적화 인덱스 값은 해당 비트맵 값을 포함할 수 있다. 해당 비트맵 값에 따라, 비트맵 인덱싱이 수행될 수 있다. 즉, 본 개시내용의 일 실시예에 따른 최적화 인덱스 값은, 모니터링 결과 정보가 비트맵 타입의 인덱싱 방식을 사용할 것을 나타내는 경우, 해당 로우에 대하여 비트맵 값을 포함할 것이고, 그리고 모니터링 결과 정보가 B+트리 타입의 인덱싱 방식을 사용할 것을 나타내는 경우, 해당 로우에 대하여 NULL 값을 포함할 것이다.The "optimization index value" according to an embodiment of the present disclosure may include an indication of what type of indexing method is used. For example, it may be determined whether the corresponding column value uses a bitmap index or a B+tree index based on the optimization index value. For example, when it is determined to use the B+tree indexing method, the optimization index value may include a NULL value. In addition, when it is determined to use the bitmap indexing method, the optimization index value may include a corresponding bitmap value. Bitmap indexing may be performed according to the corresponding bitmap value. That is, the optimization index value according to an embodiment of the present disclosure, when the monitoring result information indicates that a bitmap type indexing method is used, will include the bitmap value for the corresponding row, and the monitoring result information is B+ When indicating that a tree type indexing method is used, a NULL value will be included for a corresponding row.

본 개시내용의 일 실시예에 따른 복수의 컬럼들 중 제 4 컬럼은 종료(end) ROWID 값들을 포함할 수도 있다. 이러한 경우, 상기 종료 ROWID 값들은 상기 모니터링 결과 정보가 B+트리의 인덱싱 방식을 사용할 것을 나타내는 경우, 해당 로우에 대하여 NULL 값을 포함할 수 있다. 종료 ROWID 값은 제 2 컬럼에 포함된 ROWID 값을 시작으로 하는 해당 서브 인덱스 구조의 ROWID 범위를 나타내기 위한 마지막 ROWID 값을 의미할 수 있다. 이러한 경우, 제 3 컬럼에 포함된 비트맵 값은, 제 2 컬럼에 포함된 ROWID 값(즉, 시작 ROWID)을 기준 지점으로 하여 상기 기준 지점과 다른 ROWID 값들의 위치 간의 상대적인 거리를 표시하는 비트(bit) 값을 의미할 수 있다.The fourth column of the plurality of columns according to an embodiment of the present disclosure may include end ROWID values. In this case, when the monitoring result information indicates that the indexing method of the B+ tree is to be used, the ending ROWID values may include a NULL value for the corresponding row. The ending ROWID value may mean the last ROWID value for indicating a range of ROWIDs of a corresponding sub-index structure starting with the ROWID value included in the second column. In this case, the bitmap value included in the third column is a bit indicating a relative distance between the reference point and the positions of other ROWID values using the ROWID value (that is, the starting ROWID) included in the second column as a reference point ( bit) value.

통신부(207)는, 다른 데이터베이스 관리 장치 또는 클라이언트(110)와의 통신 기능을 제공할 수 있다. 예를 들어, 통신부(207)는 수신된 쿼리에 대한 처리 결과를 클라이언트(110)로 전송할 것을 결정할 수 있다. 또한, 통신부(207)는 앞서 설명된 임의의 네트워크 및/또는 데이터베이스 링크를 사용하여, 다른 데이터베이스 관리 장치 또는 클라이언트(110)와 통신하는 것을 제어할 수 있다. 또한, 통신부(207)는 클라이언트(110)로부터의 데이터 저장, 조회 및 인덱스 정의/생성/빌드, 조회 요청 등을 수신하는 것을 제어할 수도 있다. 또한, 통신부(207)는 데이터 저장, 조회 및 인덱스 빌드, 조회 요청에 대한 결과 정보를 클라이언트(110) 혹은 내부 컴포넌트들로 전달할 수도 있다. The communication unit 207 may provide a communication function with another database management device or the client 110. For example, the communication unit 207 may determine to transmit the processing result for the received query to the client 110. In addition, the communication unit 207 may control communication with other database management devices or clients 110 using any network and/or database link described above. In addition, the communication unit 207 may control receiving data storage, inquiry and index definition/creation/build, inquiry request, etc. from the client 110. In addition, the communication unit 207 may transmit data, result of query and index build, and query request information to the client 110 or internal components.

저장부(209)는 데이터베이스 관리 장치(120)의 테스크 수행과 관련하여 저장되는 임의의 데이터를 저장할 수 있다. 저장부(209)는 DBMS(130) 및/또는 영구저장매체(140)에 포함될 수 있다. 추가적으로, 저장부(209)는 데이터베이스 관리 장치(120) 상에서의 테이블(예컨대, 인덱스 테이블) 등을 생성할 수도 있다. 예를 들어, 이러한 테이블들의 생성은, 제어 모듈(미도시)과 같은 별도의 컴포넌트에 의해 수행될 수도 있다. 또한, 저장부(209)는 데이터의 저장(업데이트 포함)과 관련된 요청을 처리 및 관리할 수 있다. 이러한 저장부(209)는 데이터 및 인덱스 테이블 등을 저장할 것을 결정할 수 있다. 또한, 저장부(209)는 데이터 및/또는 인덱스 테이블에 대한 저장 위치를 결정할 수 있다. 예를 들어, 저장부(209)은 데이터에 대하여 데이터 테이블 상에서의 저장 위치를 결정할 수 있다. 다른 예시로, 저장부(209)는 데이터에 대하여 영구 저장 매체(140) 상의 저장 위치를 결정할 수 있다. 저장부(209)는 생성된 데이터베이스 인덱스 구조를 저장할 수 있다. 본 개시내용의 추가적인 실시예에서, 저장부(209)는 메모리 및/또는 영구 저장 매체(140)를 포함할 수도 있다.The storage unit 209 may store any data stored in connection with the task execution of the database management device 120. The storage unit 209 may be included in the DBMS 130 and/or the permanent storage medium 140. Additionally, the storage unit 209 may generate a table (eg, an index table) or the like on the database management device 120. For example, the creation of these tables may be performed by a separate component, such as a control module (not shown). Further, the storage unit 209 may process and manage requests related to storage (including updates) of data. The storage unit 209 may determine to store data, index tables, and the like. Also, the storage unit 209 may determine a storage location for data and/or index tables. For example, the storage unit 209 may determine a storage location on the data table for data. As another example, the storage unit 209 may determine a storage location on the permanent storage medium 140 for data. The storage unit 209 may store the created database index structure. In additional embodiments of the present disclosure, storage 209 may include memory and/or permanent storage media 140.

도 3은 본 개시내용의 일 실시예에 따른 데이터베이스 관리 장치의 블록 구성도를 도시한다.3 is a block diagram of a database management apparatus according to an embodiment of the present disclosure.

도 3에서 도시되는 바와 같이, 데이터베이스 관리 장치(120)는 프로세서(310) 및 저장부(320)를 포함할 수 있다. 프로세서(310)는 도 2에서 설명된 DBMS(130)에 의해 수행되는 동작들을 제어할 수 있다. 프로세서(310)는 DBMS(130)를 실행하여, 도 2에서 설명된 동작들을 수행할 수 있다. 예를 들어, 프로세서(310)는 모니터링 동작 및 데이터베이스 인덱스 구조(330)의 생성 동작을 수행할 수 있다. 다른 예시로, 프로세서(310)는 생성된 데이터베이스 인덱스 구조(330)를 저장부(320)에 저장할 수 있다. 프로세서(310)의 동작들은 도 2에서 설명된 DBMS(130)의 동작들과 대응되기 때문에, 도 3과 관련된 설명에서는 이에 대한 기재를 생략하기로 한다. As illustrated in FIG. 3, the database management device 120 may include a processor 310 and a storage unit 320. The processor 310 may control operations performed by the DBMS 130 described in FIG. 2. The processor 310 may execute the operations described in FIG. 2 by executing the DBMS 130. For example, the processor 310 may perform a monitoring operation and a creation operation of the database index structure 330. As another example, the processor 310 may store the generated database index structure 330 in the storage unit 320. Since the operations of the processor 310 correspond to the operations of the DBMS 130 described in FIG. 2, descriptions thereof will be omitted in the description related to FIG. 3.

도 3에서 도시되는 저장부(320)는 앞서 기재된 메모리 및/또는 영구 저장 매체와 대응될 수 있다. 저장부(320)는 데이터베이스 인덱스 구조(330)를 저장할 수 있다. 도 2와 관련하여 설명된 바와 같이, 데이터베이스 인덱스 구조(330)는 하나 이상의 서브 인덱스 구조들(340, 350, 360a 및 360b)을 포함할 수 있다. 이러한 서브 인덱스 구조들(340, 350, 360a 및 360b)은 루트 노드(340), 브랜치 노드(350) 및/또는 리프 노드(360a 및 360b)와 대응될 수 있다. 도 3에서 도시되는 저장부(320)에 저장되는 데이터베이스 인덱스 구조(330)는 하나의 루트 노드, 하나의 브랜치 노드 및 두 개의 리프 노드들만을 포함하고 있으나, 이는 설명의 편의를 위한 예시일 뿐, 본 개시내용의 사상을 벗어나지 않는 범위에서 임의의 개수 및 임의의 형태의 노드들이 데이터베이스 인덱스 구조(330)를 형성할 수 있다.The storage unit 320 illustrated in FIG. 3 may correspond to the memory and/or permanent storage medium described above. The storage unit 320 may store the database index structure 330. 2, the database index structure 330 may include one or more sub-index structures 340, 350, 360a and 360b. The sub-index structures 340, 350, 360a, and 360b may correspond to the root node 340, branch node 350, and/or leaf nodes 360a and 360b. The database index structure 330 stored in the storage unit 320 shown in FIG. 3 includes only one root node, one branch node, and two leaf nodes, but this is only an example for convenience of description, Any number and any type of nodes can form the database index structure 330 without departing from the spirit of the present disclosure.

본 개시내용의 일 실시예에 있어서, 루트 노드(340) 및 브랜치 노드(350)는 자신 보다 하위에 있는 하위 레벨의 서브 인덱스 구조의 ROWID 값 및 상기 하위 레벨의 서브 인덱스 구조에 접근하기 위한 주소값을 포함할 수 있다. 예를 들어, 프로세서(310)가 루트 노드(340)로부터 트래버스를 하는 경우, 루트 노드(340)에 포함된 브랜치 노드(350)로의 분기값(즉, 어느 브랜치 노드(350) 방향으로 트래버스를 할지 결정하는 값) 및 브랜치 노드(350)의 주소(DBA: Data Block Address)에 기초하여, 프로세서(310)는 트래버스할 브랜치 노드(350)를 결정할 수 있다. 브랜치 노드(350) 또한 동일한 방식으로 리프 노드(360a, 360b)들 중 트래버스할 리프 노드를 결정할 수 있다. In one embodiment of the present disclosure, the root node 340 and the branch node 350 have a ROWID value of a lower-level sub-index structure lower than their own, and an address value for accessing the lower-level sub-index structure. It may include. For example, when the processor 310 traverses from the root node 340, a branch value to the branch node 350 included in the root node 340 (that is, which branch node 350 is to be traversed) Based on the value to be determined) and the address of the branch node 350 (DBA: Data Block Address), the processor 310 may determine the branch node 350 to traverse. The branch node 350 may also determine leaf nodes to traverse among the leaf nodes 360a and 360b in the same manner.

도 4는 본 개시내용의 일 실시예에 따른 데이터베이스 인덱스 구조를 예시적으로 도시한다.4 exemplarily illustrates a database index structure according to an embodiment of the present disclosure.

본 개시내용의 일 실시예에 따라, 비트맵 인덱스 및 B+트리 인덱스를 효율적인 방식으로 조합하는 최적의 인덱싱 기법이 제공될 수 있다. 이러한 인덱싱 기법은: 리프 노드에 대한 빈도수에 기초하여 빈도수가 낮은 리프 노드에 대해서는 B+트리 방식으로 표현하고 그리고 빈도수가 높은 리프 노드에 대해서는 B+트리 방식에 비트맵 값을 추가하여 표현하는 것을 포함할 수 있다. 또한, 상기 인덱싱 기법은: Distinct 값이 사전결정된 수를 초과하는 경우 혹은 DML의 발생 횟수가 사전결정된 횟수를 초과하는 파티션에 대해서는 B+트리 방식으로 인덱스를 표현하고 그리고 Distinct 값이 사전결정된 수보다 적거나 DML의 발생이 사전결정된 횟수보다 적은 파티션에 대해서는 B+트리 방식에 비트맵 값을 추가하여 인덱스를 표현하는 기법을 포함할 수 있다. According to an embodiment of the present disclosure, an optimal indexing technique for combining a bitmap index and a B+tree index in an efficient manner may be provided. Such an indexing technique may include: expressing a leaf node with a low frequency based on a frequency for a leaf node in a B+tree manner, and adding a bitmap value to a leaf node with a high frequency in a B+tree manner. have. In addition, the indexing technique: When the Distinct value exceeds a predetermined number, or for a partition in which the number of occurrences of DML exceeds a predetermined number, the index is expressed in a B+tree manner, and the Distinct value is less than or equal to the predetermined number. For partitions in which the occurrence of DML is less than a predetermined number of times, a technique of expressing an index by adding a bitmap value to the B+tree method may be included.

도 4에서 도시되는 바와 같이, 각각의 리프 노드(360a 및 360b)는 서브 인덱스 구조(400)를 가질 수 있다. 이러한 리프 노드(360a 및 360b) 각각의 서브 인덱스 구조(400)는 각각 적어도 3개의 컬럼들(410, 420 및 430)을 포함할 수 있다. 제 1 컬럼(410)은 인덱스 키 값들을 포함할 수 있다. 인덱스 키 값은 기본(base) 테이블에서의 컬럼 값, 컬럼 값에 대한 임의의 해시 값, 및/또는 컬럼 값에 대한 임의의 변환 값을 포함할 수 있다. 키 값들은 제 1 컬럼(410) 내에서 고유한 값들을 가질 수 있거나 혹은 중복되는 값들을 가질 수도 있다. 도 4에서 도시되는 바와 같이, 제 1 컬럼(410)은 1, 1, 2, 3, 4, 5 의 키 값들을 가질 수 있다. As shown in FIG. 4, each leaf node 360a and 360b may have a sub index structure 400. Each sub-index structure 400 of each leaf node 360a and 360b may include at least three columns 410, 420, and 430, respectively. The first column 410 may include index key values. The index key value may include a column value in a base table, an arbitrary hash value for the column value, and/or any conversion value for the column value. The key values may have unique values within the first column 410 or may have duplicate values. As illustrated in FIG. 4, the first column 410 may have key values of 1, 1, 2, 3, 4, and 5.

제 2 컬럼(420)은, 데이터 블록의 로우에 대응되는 ROWID 값들을 포함할 수 있다. ROWID는 실제 데이터가 저장된 위치와 대응될 수 있다. 이러한 ROWID를 통하여, 실제 데이터가 저장된 위치를 찾아갈 수 있기 때문에, 테이블 전체(full) 스캔에 비하여 효율적인 데이터 검색이 가능해진다. 본 개시내용의 일 실시예에 따르면, 도 4에서는 도시되지 않았지만, 제 4 컬럼이 서브 인덱스 구조(400)에 포함될 수도 있다. 제 4 컬럼은 종료(end) ROWID 값들을 포함할 수 있다. 이러한 경우, 상기 종료 ROWID 값들은 모니터링 결과 정보가 B+트리의 인덱싱 방식을 사용할 것을 나타내는 경우, 해당 로우에 대하여 혹은 해당 키 값에 대하여 NULL 값을 포함할 수 있다. 종료 ROWID 값은 제 2 컬럼(420)에 포함된 ROWID 값을 시작으로 하는 해당 서브 인덱스 구조(400)의 ROWID 범위를 나타내기 위한 마지막 ROWID 값을 의미할 수 있다. 이러한 경우, 제 3 컬럼(430)에 포함된 비트맵 값은, 제 2 컬럼(420)에 포함된 ROWID 값(즉, 시작 ROWID)을 기준 지점으로 하여 기준 지점과 다른 ROWID 값들의 위치 간의 상대적인 거리를 표시하는 비트(bit) 값을 의미할 수 있다.The second column 420 may include ROWID values corresponding to rows of the data block. ROWID may correspond to a location where actual data is stored. Through this ROWID, since it is possible to find a location where actual data is stored, efficient data search is possible compared to a full scan of the table. According to an embodiment of the present disclosure, although not illustrated in FIG. 4, a fourth column may be included in the sub index structure 400. The fourth column may include end ROWID values. In this case, when the monitoring result information indicates that the indexing method of the B+ tree is to be used, the ending ROWID values may include a NULL value for a corresponding row or a corresponding key value. The ending ROWID value may refer to the last ROWID value for indicating the range of ROWIDs of the corresponding sub-index structure 400 starting with the ROWID value included in the second column 420. In this case, the bitmap value included in the third column 430 is a relative distance between the reference point and the positions of other ROWID values using the ROWID value (that is, the starting ROWID) included in the second column 420 as a reference point. It may mean a bit value indicating.

제 3 컬럼(430)은 전술한 바와 같이, 모니터링 결과 정보에 기초하여 설정되는 최적화(optimized) 인덱스 값들을 포함할 수 있다. 즉, 제 3 컬럼(430)에 포함된 값들은, 모니터링 결과 정보가 특정 로우에 대해서 혹은 특정 키 값에 대하여 B+트리 인덱스를 사용한다고 표시하는 경우, NULL 값을 가질 수 있다. 제 3 컬럼(430)에 포함된 값들은, 모니터링 결과 정보가, 특정 로우에 대해서 혹은 특정 키 값에 대하여, 비트맵 인덱스를 사용한다고 표시하는 경우, 비트맵 값(즉, 비트 값)을 가질 수 있다. As described above, the third column 430 may include optimized index values that are set based on monitoring result information. That is, the values included in the third column 430 may have a NULL value when the monitoring result information indicates that the B+tree index is used for a specific row or a specific key value. The values included in the third column 430 may have a bitmap value (that is, a bit value) when the monitoring result information indicates that a bitmap index is used for a specific row or a specific key value. have.

예를 들어, 컬럼 값으로 액세스되는 횟수 정보 혹은 키 값에 대한 액세스 횟수 정보가 사전결정된 액세스 임계 횟수값(예컨대, 분당 1회 혹은 총 100회 등) 보다 큰 경우 비트맵 인덱싱 방식을 사용할 것으로 결정되며, 그리고 컬럼 값으로 액세스되는 횟수 정보 혹은 키 값에 대한 액세스 횟수 정보가 사전결정된 액세스 임계 횟수값 이하인 경우 B+트리 인덱싱 방식을 사용할 것으로 결정될 수 있다. For example, if the number of times of access to the column value or the number of times of access to the key value is greater than a predetermined number of access thresholds (for example, once per minute or a total of 100 times, etc.), the bitmap indexing method is determined. In addition, when the number of times of access to the column value or the number of times of access to the key value is equal to or less than the predetermined access threshold number, it may be determined to use the B+tree indexing method.

도 4에서 도시되는 바와 같이, 데이터 액세스의 빈도수가 낮은 키 값인 1, 3, 4에 대해서는 키 값, ROWID 및 NULL 값을 갖는 데이터 구조를 갖게 되며, 데이터의 액세스의 빈도수가 높은 키 값인 2, 5에 대해서는 키 값, ROWID 및 비트맵 값을 갖는 데이터 구조를 가질 수 있다. 도 4에서 도시되는 예시에서는 키 값 단위로 비트맵 인덱스를 사용할지 혹은 B+트리 인덱스를 사용할지가 결정될 수 있는 것으로 도시되었지만, 다른 예시로, ROWID 단위로 비트맵값을 가질지 아니면 NULL 값을 가질지가 모니터링 결과 정보에 의해 설정될 수도 있다. As illustrated in FIG. 4, data structures having key values, ROWID and NULL values for key values 1, 3, and 4, which are low-frequency data accesses, have key values 2, 5, which are high-frequency data accesses. For can have a data structure having a key value, ROWID and bitmap value. In the example shown in FIG. 4, it is shown that it is possible to determine whether to use a bitmap index or a B+tree index as a key value unit. However, as another example, whether to have a bitmap value or a NULL value in ROWID units is monitored. It can also be set by the result information.

본 개시내용의 추가적인 실시예에서, 키 값들 혹은 로우 단위로 결정된 인덱스 구조들과 키 값들 혹은 로우들 간의 맵핑 관계는 메타 데이터로서 메타 테이블에 저장될 수도 있다. 메타 테이블을 형성함에 따라, 인덱스 구조를 생성할때 마다 모니터링 프로세스를 별도로 수행하지 않고, 메타 테이블에 저장된 맵핑 관계를 확인함으로써 보다 효율적인 인덱스 빌드가 이루어질 수 있다.In a further embodiment of the present disclosure, mapping relationships between key values or rows determined by key values or rows and key values or rows may be stored in the meta table as metadata. As the meta table is formed, a more efficient index build can be made by checking the mapping relationship stored in the meta table without separately performing a monitoring process each time an index structure is generated.

전술한 바와 같이, 본 개시내용의 일 실시예에 따른 서브 인덱스 구조(예컨대, 리프 노드의 데이터 구조)는 비트맵 인덱스 구조와 B+트리 인덱스 구조를 하나의 인덱스로 사용할 수 있는 글로벌 인덱스를 제공할 수 있다. 즉, 본 개시내용에 따른 인덱싱 방식은, 하나의 인덱스 구조에 대해서, 액세스 빈도수가 높은 데이터에 대해서는 비트맵 인덱스를 사용하고 액세스 빈도수가 낮은 데이터에 대해서는 B+트리 인덱스를 사용하기 때문에, 불필요한 공간을 줄임으로써 인덱스의 크기를 최소화할 수 있을 뿐만 아니라 인덱스의 속도 또한 향상될 수 있다. As described above, a sub-index structure (eg, a data structure of a leaf node) according to an embodiment of the present disclosure can provide a global index that can use a bitmap index structure and a B+tree index structure as one index. have. That is, the indexing method according to the present disclosure reduces unnecessary space for one index structure because a bitmap index is used for data having a high access frequency and a B+tree index is used for data having a low access frequency. As a result, not only can the size of the index be minimized, but the speed of the index can be improved.

도 5는 본 개시내용의 일 실시예에 따라 파티션 단위로 상이한 데이터베이스 인덱스 구조를 예시적으로 도시한다.5 exemplarily illustrates different database index structures in units of partitions according to an embodiment of the present disclosure.

도 5에서 도시되는 바와 같이, 파티션 1 내지 10(510a, 520a, 530a, 540a)은 각각 파티션 단위로 상이한 인덱스 구조(510b, 520b, 530b, 540b)를 가질 수 있다. 도 5에서의 예시에 따르면, 파티션 1, 2 및 9(510a, 520a, 530a)는 로컬 인덱스로서 B+트리 인덱스(510b, 520b, 530b)를 가질 수 있다. 파티션 10(540b)은 로컬 인덱스로서 비트맵 인덱스(540b)를 가질 수 있다. As illustrated in FIG. 5, partitions 1 to 10 (510a, 520a, 530a, and 540a) may have different index structures 510b, 520b, 530b, and 540b in units of partitions, respectively. According to the example in FIG. 5, partitions 1, 2, and 9 (510a, 520a, 530a) may have B+tree indexes 510b, 520b, and 530b as local indexes. Partition 10 (540b) may have a bitmap index (540b) as a local index.

예를 들어, 파티션의 Distinct 값의 크기 정보가 사전결정된 크기 임계값(예컨대, 2, 5, 10, 100 등)보다 작거나 또는 상기 파티션에 대한 DML 발생 횟수 정보가 사전결정된 DML 발생 횟수 임계값(예컨대, 시간당 1회 혹은 총 10회 등) 보다 작은 경우, 상기 파티션에 대해서는 비트맵 인덱싱 방식이 사용될 것으로 결정될 수 있다. 일례로, 사전결정된 크기 임계값은 사용자에 의해 사전에 설정될 수 있다. 다른 예시로, 사전결정된 크기 임계값은 모니터링 수행부(203)에 의한 모니터링 결과 정보에 기초하여 가변적으로 그리고 동적으로 결정될 수 있다. 또한, 특정 파티션에 대한 DML 발생 횟수 정보가 사전결정된 DML 발생 횟수 임계값 이상인 경우, 상기 파티션에 대해서는 B+트리 인덱싱 방식이 사용될 것으로 결정될 수 있다. For example, the size information of the partition's Distinct value is smaller than a predetermined size threshold (for example, 2, 5, 10, 100, etc.) or the DML occurrence number information for the partition is a predetermined DML occurrence count threshold ( For example, if it is smaller than once per hour or 10 times in total, it may be determined that the bitmap indexing scheme is used for the partition. In one example, the predetermined size threshold may be preset by the user. As another example, the predetermined size threshold may be variably and dynamically determined based on monitoring result information by the monitoring performing unit 203. Further, when the DML occurrence count information for a specific partition is equal to or greater than a predetermined DML occurrence count threshold, it may be determined that the B+tree indexing scheme is used for the partition.

또한, 특정 파티션에 포함된 데이터의 마지막 액세스 시점 정보가 사전결정된 임계 시점 정보(예컨대, 24시간, 7일, 30일 등) 보다 오래된 경우, 상기 파티션에 대해서는 비트맵 인덱싱 방식이 사용될 것으로 결정될 수 있다. 또한, 파티션에 포함된 데이터의 마지막 액세스 시점 정보가 사전결정된 임계 시점 정보 보다 오래되지 않은 경우, 상기 파티션에 대해서는 B+트리 인덱싱 방식이 사용될 것으로 결정될 수 있다.In addition, when the last access point information of data included in a specific partition is older than predetermined threshold time information (eg, 24 hours, 7 days, 30 days, etc.), it may be determined that a bitmap indexing method is used for the partition. . In addition, when the last access point information of the data included in the partition is not older than the predetermined threshold time point information, it may be determined that the B+tree indexing method is used for the partition.

각 파티션이 가지는 서브 인덱스 구조는 도 4와 관련하여 설명된 서브 인덱스 구조(400)와 동일하기 때문에, 도 5에서는 해당 설명을 생략하기로 한다. Since the sub-index structure of each partition is the same as the sub-index structure 400 described with reference to FIG. 4, the description will be omitted in FIG. 5.

본 개시내용의 추가적인 실시예에서, 파티션들(510a, 520a, 530a, 540a)과 파티션 단위로 결정된 인덱스 구조들(510b, 520b, 530b, 540b) 간의 맵핑 관계는 메타 데이터로서 메타 테이블에 저장될 수도 있다. 메타 테이블을 형성함에 따라, 인덱스 구조를 생성할때 마다 모니터링 프로세스를 별도로 수행하지 않고, 메타 테이블에 저장된 맵핑 관계를 확인함으로써 보다 효율적인 인덱스 빌드가 이루어질 수 있다.In a further embodiment of the present disclosure, the mapping relationship between the partitions 510a, 520a, 530a, 540a and the index structures 510b, 520b, 530b, 540b determined in units of partitions may be stored in the meta table as metadata. have. As the meta table is formed, a more efficient index build can be made by checking the mapping relationship stored in the meta table without separately performing a monitoring process each time an index structure is generated.

본 개시내용의 일 실시예에 따른 파티션 단위의 인덱싱 구분 기법에 의해, 파티션의 특성에 따라 파티션 별로 비트맵 인덱스 구조 또는 B+트리 인덱스 구조를 사용할 수 있는 로컬 인덱스를 제공할 수 있다. 즉, 파티션의 Distinct 값이 적거나 DML이 거의 발생하지 않는 파티션에 대해서는 비트맵 인덱스를 사용하고, DML이 자주 발생하는 파티션에 대해서는 B+트리 인덱스를 사용함에 따라, 효율적인 공간 관리가 가능해지고 그리고 쿼리 성능이 향상될 수 있다.According to a partitioning indexing scheme according to an embodiment of the present disclosure, a local index that can use a bitmap index structure or a B+tree index structure for each partition may be provided according to partition characteristics. In other words, the bitmap index is used for partitions with little or no DML in the partition, and the B+tree index is used for partitions with frequent DML, enabling efficient space management and query performance. This can be improved.

도 6은 본 개시내용의 일 실시예에 따라, 효율적인 인덱싱을 수행하기 위한 순서도를 예시적으로 도시한다. 도 6에서 도시되는 단계들은 예를 들어, 데이터베이스 관리 장치(120)에 의해 수행될 수 있다. 또한, 도 6에서 도시되는 단계들은 예를 들어, 하나 이상의 프로세서(310)에 의해 수행될 수 있다. 도 6에서 도시되는 단계들은 예시적인 것이며, 추가적인 단계들이 존재하거나 혹은 도시되는 단계들 중 일부가 생략될 수도 있다.6 exemplarily shows a flowchart for performing efficient indexing, according to one embodiment of the present disclosure. The steps illustrated in FIG. 6 may be performed, for example, by the database management device 120. In addition, the steps shown in FIG. 6 may be performed, for example, by one or more processors 310. The steps shown in FIG. 6 are exemplary and additional steps may be present or some of the steps shown may be omitted.

도 6에서 도시되는 바와 같이, 데이터베이스 관리 장치(120)는 사용자로부터 발행된 인덱스 빌드를 요청하는 쿼리를 수신할 수 있다(610). 예를 들어, 인덱스 빌드를 요청하는 쿼리는 생성된 데이터 테이블 구조에 대한 검색을 용이하게 하기 위하여 인덱스 형태의 데이터 구조를 생성하고자 한다는 표시를 포함할 수 있다. As shown in FIG. 6, the database management device 120 may receive a query requesting an index build issued from a user (610). For example, a query requesting an index build may include an indication that an index type data structure is to be generated to facilitate searching for the generated data table structure.

인덱스 빌드를 요청하는 것에 응답하여, 데이터베이스 관리 장치(120)는 사전결정된 모니터링 알고리즘에 기초하여 인덱스 빌드를 위한 모니터링 동작을 수행할 수 있다(630). 모니터링 동작에 의한 결과로서, 모니터링 결과 정보가 생성될 수 있다. 모니터링 동작을 수행하는데 있어서 임의의 방식의 데이터베이스 모니터링 방식이 사용될 수 있다. 본 명세서에서의 모니터링 동작은, 인덱스를 빌드하기 위하여, 특정 키 값에 액세스되는 횟수/빈도 정보, 특정 데이터에 액세스되는 횟수/빈도 정보, 특정 파티션에 대한 distinct 값 정보, 특정 파티션에 대한 DML 발생 횟수/빈도 정보, 및 특정 파티션에 포함된 데이터의 마지막 액세스 시점 정보 중 적어도 하나의 정보를 획득하는 임의의 형태의 과정을 의미할 수 있다. 예를 들어, 횟수/빈도 정보는 카운팅 값을 의미할 수 있거나 혹은 시간당 횟수 값을 의미할 수 있지만, 이들은 예시들에 불과하며 비교가능한 형태의 정보이면 모니터링 결과 정보에 포함될 수 있다.In response to requesting an index build, the database management device 120 may perform a monitoring operation for the index build based on a predetermined monitoring algorithm (630). As a result of the monitoring operation, monitoring result information may be generated. Any method of database monitoring may be used to perform the monitoring operation. The monitoring operation in the present specification includes the number of times/frequency information accessed for a specific key value, the number/frequency information accessed for specific data, distinct value information for a specific partition, and the number of DML occurrences for a specific partition in order to build an index. It may mean any type of process of acquiring at least one of /frequency information and information on the last access time of data included in a specific partition. For example, the count/frequency information may mean a counting value or a count value per hour, but these are only examples, and if the information is in a comparable form, it may be included in monitoring result information.

데이터베이스 관리 장치(120)는 모니터링 동작에 의해 생성된 모니터링 결과 정보에 기초하여 데이터베이스 인덱스 구조 생성할 수 있다(630). 데이터베이스 인덱스 구조는 모니터링 결과 정보가 포함하는 표시에 따라 가변적으로 설정될 수 있다. 또한, 데이터베이스 관리 장치(120)는 생성된 데이터베이스 인덱스 구조와 기초 데이터 테이블을 서로 연결시킬 수 있다. 데이터베이스 인덱스 구조에 대한 구체적인 설명은 도 2 내지 5와 관련하여 사전에 설명되었기 때문에, 도 6에 대한 설명에서는 생략하기로 한다.The database management device 120 may create a database index structure based on the monitoring result information generated by the monitoring operation (630). The database index structure may be variably set according to an indication included in monitoring result information. In addition, the database management apparatus 120 may connect the generated database index structure and the underlying data table to each other. Since the detailed description of the database index structure has been previously described with reference to FIGS. 2 to 5, the description of FIG. 6 will be omitted.

데이터베이스 관리 장치(120)는 생성된 데이터베이스 인덱스 구조를 임의의 형태의 저장부에 저장할 수 있다(650). 저장부는 일시적/임시적 저장 매체 및/또는 영구 저장 매체를 포함할 수 있다. 예를 들어, 데이터베이스 관리 장치(120)는 생성된 데이터베이스 인덱스 구조를 버퍼 캐시에 저장할 수 있다. 이러한 경우, 사전결정된 주기로 혹은 사용자 설정에 의해 백그라운드 프로세스에 의해 버퍼 캐시에 저장된 데이터베이스 인덱스 구조가 영구 저장 매체로 기록될 수 있다. 또한, 별도의 저장 서버에 상기 생성된 데이터베이스 인덱스 구조가 저장될 수 있으며, 별도의 저장 서버는 저장된 임의의 형태의 데이터들을 그룹 단위로 모아서 영구 저장 매체에 기록할 수도 있다.The database management apparatus 120 may store the generated database index structure in a storage unit of any type (650). The storage unit may include temporary/temporary storage media and/or permanent storage media. For example, the database management device 120 may store the created database index structure in a buffer cache. In this case, the database index structure stored in the buffer cache by a background process at a predetermined cycle or by user setting may be recorded as a permanent storage medium. In addition, the generated database index structure may be stored in a separate storage server, and the separate storage server may collect and store arbitrary types of data in group units in a permanent storage medium.

도 7은 본 개시내용의 일 실시예에 따른 다양한 예시적 논리적 블록들, 구성들, 모듈들, 수단들 및 로직들, 및 알고리즘 단계들이 도시된다.7 shows various exemplary logical blocks, configurations, modules, means and logics, and algorithm steps in accordance with one embodiment of the present disclosure.

당업자들은 추가적으로 여기서 개시된 실시예들과 관련되어 설명된 다양한 예시적 논리적 블록들, 구성들, 모듈들, 회로들, 수단들, 로직들, 및 알고리즘 단계들이 전자 하드웨어, 컴퓨터 소프트웨어, 또는 양쪽 모두의 조합들로 구현될 수 있음을 인식해야 한다. 하드웨어 및 소프트웨어의 상호교환성을 명백하게 예시하기 위해, 다양한 예시적 컴포넌트들, 블록들, 구성들, 수단들, 로직들, 모듈들, 회로들, 및 단계들은 그들의 기능성 측면에서 일반적으로 위에서 설명되었다. 그러한 기능성이 하드웨어로 또는 소프트웨어로서 구현되는지 여부는 전반적인 시스템에 부과된 특정 어플리케이션(application) 및 설계 제한들에 달려 있다. 숙련된 기술자들은 각각의 특정 어플리케이션들을 위해 다양한 방법들로 설명된 기능성을 구현할 수 있으나, 그러한 구현의 결정들이 본 개시내용의 영역을 벗어나게 하는 것으로 해석되어서는 안된다.Those skilled in the art further include various exemplary logical blocks, configurations, modules, circuits, means, logics, and algorithm steps described in connection with the embodiments disclosed herein, electronic hardware, computer software, or a combination of both. It should be recognized that can be implemented as To clearly illustrate the interchangeability of hardware and software, various illustrative components, blocks, configurations, means, logics, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented in hardware or as software depends on the specific application and design limitations imposed on the overall system. Skilled artisans may implement the described functionality in varying ways for each particular application, but such implementation decisions should not be interpreted as causing a departure from the scope of the present disclosure.

도 7에서 도시되는 수단, 로직, 및 모듈은 각각 도 6에서 설명되는 단계들과 대응될 수 있다.The means, logic, and module illustrated in FIG. 7 may correspond to the steps described in FIG. 6, respectively.

도 7에서 도시되는 바와 같이, 데이터베이스 관리 장치(120)는, 인덱스 빌드를 요청하는 쿼리를 수신하기 위한 수단(710), 모니터링 알고리즘에 기초하여 인덱스 빌드를 위한 모니터링 동작을 수행하기 위한 수단(730), 모니터링 결과 정보에 기초하여 데이터베이스 인덱스 구조를 생성하기 위한 수단(750) 및 생성된 데이터베이스 인덱스 구조를 저장하기 위한 수단(770)을 포함할 수 있다. As shown in FIG. 7, the database management apparatus 120 includes means 710 for receiving a query requesting an index build, and means 730 for performing a monitoring operation for an index build based on a monitoring algorithm. , Means 750 for creating a database index structure based on the monitoring result information and means 770 for storing the created database index structure.

또한, 데이터베이스 관리 장치(120)는, 인덱스 빌드를 요청하는 쿼리를 수신하기 위한 로직(810), 모니터링 알고리즘에 기초하여 인덱스 빌드를 위한 모니터링 동작을 수행하기 위한 로직(830), 모니터링 결과 정보에 기초하여 데이터베이스 인덱스 구조를 생성하기 위한 로직(850) 및 생성된 데이터베이스 인덱스 구조를 저장하기 위한 로직(870)을 포함할 수 있다. In addition, the database management apparatus 120 is based on logic 810 for receiving a query requesting an index build, logic 830 for performing a monitoring operation for an index build based on a monitoring algorithm, and monitoring result information By doing so, it may include logic 850 for creating a database index structure and logic 870 for storing the generated database index structure.

또한, 데이터베이스 관리 장치(120)는, 인덱스 빌드를 요청하는 쿼리를 수신하기 위한 모듈(910), 모니터링 알고리즘에 기초하여 인덱스 빌드를 위한 모니터링 동작을 수행하기 위한 모듈(930), 모니터링 결과 정보에 기초하여 데이터베이스 인덱스 구조를 생성하기 위한 모듈(950) 및 생성된 데이터베이스 인덱스 구조를 저장하기 위한 모듈(970)을 포함할 수 있다. In addition, the database management device 120 is based on a module 910 for receiving a query requesting an index build, a module 930 for performing a monitoring operation for an index build based on a monitoring algorithm, and monitoring result information To include a module 950 for creating a database index structure and a module 970 for storing the created database index structure.

도 8은 본 개시내용의 실시예들이 구현될 수 있는 예시적인 컴퓨팅 환경에 대한 간략하고 일반적인 개략도를 도시한다.8 shows a simplified and general schematic diagram of an exemplary computing environment in which embodiments of the present disclosure can be implemented.

본 개시내용이 일반적으로 하나 이상의 데이터베이스 관리 장치 상에서 실행될 수 있는 컴퓨터 실행가능 명령어 혹은 로직과 관련하여 전술되었지만, 당업자라면 본 개시내용이 기타 프로그램 모듈들과 결합되어 및/또는 하드웨어와 소프트웨어의 조합으로서 구현될 수 있다는 것을 잘 알 것이다.Although the present disclosure has been described above in general with respect to computer-executable instructions or logic that may be executed on one or more database management devices, those skilled in the art may implement the present disclosure in combination with other program modules and/or as a combination of hardware and software. You know it can be.

일반적으로, 프로그램 모듈은 특정의 태스크를 수행하거나 특정의 추상 데이터 유형을 구현하는 루틴, 프로시져, 프로그램, 컴포넌트, 데이터 구조, 기타 등등을 포함한다. 또한, 당업자라면 본 발명의 방법 및 동작이 단일-프로세서 또는 멀티프로세서 컴퓨터 시스템, 미니컴퓨터, 메인프레임 컴퓨터는 물론 퍼스널 컴퓨터, 핸드헬드 컴퓨팅 장치, 마이크로프로세서-기반 또는 프로그램가능 가전 제품, 기타 등등(이들 각각은 하나 이상의 연관된 장치(예컨대, 데이터베이스 관리 장치(120))와 연결되어 동작할 수 있음)을 비롯한 다른 컴퓨터 시스템 구성으로 실시될 수 있다는 것을 잘 알 것이다.Generally, program modules include routines, procedures, programs, components, data structures, etc. that perform particular tasks or implement particular abstract data types. In addition, those of ordinary skill in the art may find that the methods and operations of the present invention include single-processor or multiprocessor computer systems, minicomputers, mainframe computers as well as personal computers, handheld computing devices, microprocessor-based or programmable consumer electronics, and the like. It will be appreciated that each may be implemented with other computer system configurations, including one or more associated devices (eg, capable of operating in conjunction with database management device 120).

본 발명의 설명된 실시예들은 또한 어떤 태스크들이 통신 네트워크를 통해 연결되어 있는 원격 처리 장치들에 의해 수행되는 분산 컴퓨팅 환경에서 실시될 수 있다. 분산 컴퓨팅 환경에서, 프로그램 모듈은 로컬 및 원격 메모리 저장 장치 둘다에 위치할 수 있다.The described embodiments of the invention may also be practiced in distributed computing environments where certain tasks are performed by remote processing devices that are linked through a communications network. In a distributed computing environment, program modules may be located in both local and remote memory storage devices.

컴퓨터(예컨대, 데이터베이스 관리 장치(120))는 통상적으로 다양한 컴퓨터 판독가능 매체를 포함한다. 컴퓨터에 의해 액세스 가능한 매체는 그 어떤 것이든지 컴퓨터 판독가능 매체가 될 수 있고, 이러한 컴퓨터 판독가능 매체는 휘발성 및 비휘발성 매체, 일시적(transitory) 및 비일시적(non-transitory) 매체, 이동식 및 비-이동식 매체를 포함한다. 제한이 아닌 예로서, 컴퓨터 판독가능 매체는 컴퓨터 판독가능 저장 매체 및 컴퓨터 판독가능 전송 매체를 포함할 수 있다. 컴퓨터 판독가능 저장 매체는 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터와 같은 정보를 저장하는 임의의 방법 또는 기술로 구현되는 휘발성 및 비휘발성 매체, 일시적 및 비-일시적 매체, 이동식 및 비이동식 매체를 포함한다. 컴퓨터 판독가능 저장 매체는 RAM, ROM, EEPROM, 플래시 메모리 또는 기타 메모리 기술, CD-ROM, DVD(digital video disk) 또는 기타 광 디스크 저장 장치, 자기 카세트, 자기 테이프, 자기 디스크 저장 장치 또는 기타 자기 저장 장치, 또는 컴퓨터에 의해 액세스될 수 있고 원하는 정보를 저장하는 데 사용될 수 있는 임의의 기타 매체를 포함하지만, 이에 한정되지 않는다.Computers (eg, database management device 120) typically include a variety of computer readable media. Any computer-accessible medium can be any computer-readable medium, such computer-readable media being volatile and non-volatile media, transitory and non-transitory media, removable and non- Includes removable media. By way of example, and not limitation, computer readable media may include computer readable storage media and computer readable transmission media. Computer-readable storage media are volatile and non-volatile media, temporary and non-transitory media, removable and non-removable, implemented in any method or technology for storing information such as computer readable instructions, data structures, program modules or other data Includes media. Computer-readable storage media may include RAM, ROM, EEPROM, flash memory or other memory technology, CD-ROM, digital video disk (DVD) or other optical disk storage device, magnetic cassette, magnetic tape, magnetic disk storage device or other magnetic storage Devices, or any other medium that can be accessed by a computer and used to store desired information.

컴퓨터 판독가능 전송 매체는 통상적으로 반송파(carrier wave) 또는 기타 전송 메커니즘(transport mechanism)과 같은 피변조 데이터 신호(modulated data signal)에 컴퓨터 판독가능 명령어, 데이터 구조, 프로그램 모듈 또는 기타 데이터등을 구현하고 모든 정보 전달 매체를 포함한다. 피변조 데이터 신호라는 용어는 신호 내에 정보를 인코딩하도록 그 신호의 특성들 중 하나 이상을 설정 또는 변경시킨 신호를 의미한다. 제한이 아닌 예로서, 컴퓨터 판독가능 전송 매체는 유선 네트워크 또는 직접 배선 접속(direct-wired connection)과 같은 유선 매체, 그리고 음향, RF, 적외선, 기타 무선 매체와 같은 무선 매체를 포함한다.Computer readable transmission media typically embodies computer readable instructions, data structures, program modules or other data in a modulated data signal, such as a carrier wave or other transport mechanism, and Includes all information delivery media. The term modulated data signal means a signal in which one or more of the characteristics of the signal are set or changed to encode information in the signal. By way of example, and not limitation, computer readable transmission media includes wired media such as a wired network or direct-wired connection, and wireless media such as acoustic, RF, infrared, and other wireless media.

컴퓨터(1102)를 포함하는 본 발명의 여러가지 측면들을 구현하는 예시적인 환경(1100)이 나타내어져 있으며, 컴퓨터(1102)는 처리 장치(혹은 프로세서)(1104), 시스템 메모리(1106) 및 시스템 버스(1108)를 포함한다. 시스템 버스(1108)는 시스템 메모리(1106)(이에 한정되지 않음)를 비롯한 시스템 컴포넌트들을 처리 장치(1104)에 연결시킨다. 처리 장치(1104)는 다양한 상용 프로세서들 중 임의의 프로세서일 수 있다. 듀얼 프로세서 및 기타 멀티프로세서 아키텍처도 역시 처리 장치(1104)로서 이용될 수 있다.An exemplary environment 1100 is shown that implements various aspects of the invention, including a computer 1102, which includes a processing device (or processor) 1104, system memory 1106, and system bus ( 1108). System bus 1108 connects system components, including, but not limited to, system memory 1106 to processing device 1104. The processing device 1104 can be any of a variety of commercial processors. Dual processor and other multiprocessor architectures may also be used as the processing unit 1104.

시스템 버스(1108)는 메모리 버스, 주변장치 버스, 및 다양한 상용 버스 아키텍처 중 임의의 것을 사용하는 로컬 버스에 추가적으로 상호 연결될 수 있는 몇가지 유형의 버스 구조 중 임의의 것일 수 있다. 시스템 메모리(1106)는 판독 전용 메모리(ROM)(1110) 및 랜덤 액세스 메모리(RAM)(1112)를 포함한다. 기본 입/출력 시스템(BIOS)은 ROM, EPROM, EEPROM 등의 비휘발성 메모리(1110)에 저장되며, 이 BIOS는 시동 중과 같은 때에 컴퓨터(1102) 내의 구성요소들 간에 정보를 전송하는 일을 돕는 기본적인 루틴을 포함한다. RAM(1112)은 또한 데이터를 캐싱하기 위한 정적 RAM 등의 고속 RAM을 포함할 수 있다.The system bus 1108 can be any of several types of bus structures that can be further interconnected to a memory bus, peripheral bus, and local bus using any of a variety of commercial bus architectures. System memory 1106 includes read-only memory (ROM) 1110 and random access memory (RAM) 1112. A basic input/output system (BIOS) is stored in a non-volatile memory 1110 such as ROM, EPROM, EEPROM, etc. This BIOS is a basic to help transfer information between components in the computer 1102 at the same time as during startup. Contains routines. The RAM 1112 may also include high-speed RAM, such as static RAM for caching data.

컴퓨터(1102)는 또한 내장형 하드 디스크 드라이브(HDD)(1114)(예를 들어, EIDE, SATA)(이 내장형 하드 디스크 드라이브(1114)는 또한 적당한 섀시(도시 생략) 내에서 외장형 용도로 구성될 수 있음), 자기 플로피 디스크 드라이브(FDD)(1116)(예를 들어, 이동식 디스켓(1118)으로부터 판독을 하거나 그에 기록을 하기 위한 것임), 및 광 디스크 드라이브(1120)(예를 들어, CD-ROM 디스크(1122)를 판독하거나 DVD 등의 기타 고용량 광 매체로부터 판독을 하거나 그에 기록을 하기 위한 것임)를 포함한다. 하드 디스크 드라이브(1114), 자기 디스크 드라이브(1116) 및 광 디스크 드라이브(1120)는 각각 하드 디스크 드라이브 인터페이스(1124), 자기 디스크 드라이브 인터페이스(1126) 및 광 드라이브 인터페이스(1128)에 의해 시스템 버스(1108)에 연결될 수 있다. 외장형 드라이브 구현을 위한 인터페이스(1124)는 USB(Universal Serial Bus) 및 IEEE 1394 인터페이스 기술 중 적어도 하나 또는 그 둘다를 포함한다.Computer 1102 is also an internal hard disk drive (HDD) 1114 (eg, EIDE, SATA) (this internal hard disk drive 1114 can also be configured for external use within a suitable chassis (not shown)) Magnetic floppy disk drive (FDD) 1116 (e.g., for reading from or writing to removable diskette 1118), and optical disk drive 1120 (e.g., CD-ROM) Disc 1122 for reading or reading from or writing to other high-capacity optical media such as DVD). The hard disk drive 1114, the magnetic disk drive 1116, and the optical disk drive 1120 are the system bus 1108 by the hard disk drive interface 1124, the magnetic disk drive interface 1126, and the optical drive interface 1128, respectively. ). The interface 1124 for implementing an external drive includes at least one of USB (Universal Serial Bus) and IEEE 1394 interface technology, or both.

이들 드라이브 및 그와 연관된 컴퓨터 판독가능 매체는 데이터, 데이터 구조, 컴퓨터 실행가능 명령어, 기타 등등의 비휘발성 저장을 제공한다. 컴퓨터(1102)의 경우, 드라이브 및 매체는 임의의 데이터를 적당한 디지털 형식으로 저장하는 것에 대응한다. 상기에서의 컴퓨터 판독가능 매체에 대한 설명이 HDD, 이동식 자기 디스크, 및 CD 또는 DVD 등의 이동식 광학 매체를 언급하고 있지만, 당업자라면 집 드라이브(zip drive), 자기 카세트, 플래쉬 메모리 카드, 카트리지, 기타 등등의 컴퓨터에 의해 판독가능한 다른 유형의 매체도 역시 예시적인 운영 환경에서 사용될 수 있으며 또 임의의 이러한 매체가 본 발명의 방법들을 수행하기 위한 컴퓨터 실행가능 명령어를 포함할 수 있다는 것을 잘 알 것이다.These drives and their associated computer-readable media provide non-volatile storage of data, data structures, computer-executable instructions, and the like. In the case of computer 1102, drives and media correspond to storing any data in a suitable digital format. Although the above description of computer readable media refers to HDDs, removable magnetic disks, and removable optical media such as CDs or DVDs, those of ordinary skill in the art can use zip drives, magnetic cassettes, flash memory cards, cartridges, etc. It will be appreciated that other types of media readable by a computer, etc., may also be used in the exemplary operating environment and that any such media may include computer-executable instructions for performing the methods of the present invention.

운영 체제(1130), 하나 이상의 애플리케이션 프로그램(1132), 기타 프로그램 모듈(1134) 및 프로그램 데이터(1136)을 비롯한 다수의 프로그램 모듈이 드라이브 및 RAM(1112)에 저장될 수 있다. 운영 체제, 애플리케이션, 모듈 및/또는 데이터의 전부 또는 그 일부분이 또한 RAM(1112)에 캐싱될 수 있다. 본 발명이 여러가지 상업적으로 이용가능한 운영 체제 또는 운영 체제들의 조합에서 구현될 수 있다는 것을 잘 알 것이다.A number of program modules may be stored in the drive and RAM 1112, including the operating system 1130, one or more application programs 1132, other program modules 1134, and program data 1136. All or a portion of the operating system, applications, modules and/or data may also be cached in RAM 1112. It will be appreciated that the present invention can be implemented in various commercially available operating systems or combinations of operating systems.

사용자는 하나 이상의 유선/무선 입력 장치, 예를 들어, 키보드(1138) 및 마우스(1140) 등의 포인팅 장치를 통해 컴퓨터(1102)에 명령 및 정보를 입력할 수 있다. 기타 입력 장치(도시 생략)로는 마이크, IR 리모콘, 조이스틱, 게임 패드, 스타일러스 펜, 터치 스크린, 기타 등등이 있을 수 있다. 이들 및 기타 입력 장치가 종종 시스템 버스(1108)에 연결되어 있는 입력 장치 인터페이스(1142)를 통해 처리 장치(1104)에 연결되지만, 병렬 포트, IEEE 1394 직렬 포트, 게임 포트, USB 포트, IR 인터페이스, 기타 등등의 기타 인터페이스에 의해 연결될 수 있다.The user may input commands and information to the computer 1102 through one or more wired/wireless input devices, for example, pointing devices such as a keyboard 1138 and a mouse 1140. Other input devices (not shown) may include a microphone, IR remote control, joystick, game pad, stylus pen, touch screen, etc. These and other input devices are often connected to the processing unit 1104 through an input device interface 1142 connected to the system bus 1108, but the parallel port, IEEE 1394 serial port, game port, USB port, IR interface, And other interfaces.

모니터(1144) 또는 다른 유형의 디스플레이 장치도 역시 비디오 어댑터(1146) 등의 인터페이스를 통해 시스템 버스(1108)에 연결된다. 모니터(1144)에 부가하여, 컴퓨터는 일반적으로 스피커, 프린터, 기타 등등의 기타 주변 출력 장치(도시 생략)를 포함한다.The monitor 1144 or other type of display device is also connected to the system bus 1108 through an interface such as a video adapter 1146. In addition to the monitor 1144, the computer generally includes other peripheral output devices (not shown), such as speakers, printers, and the like.

컴퓨터(1102)는 유선 및/또는 무선 통신을 통한 원격 컴퓨터(들)(1148) 등의 하나 이상의 원격 컴퓨터로의 논리적 연결을 사용하여 네트워크화된 환경에서 동작할 수 있다. 원격 컴퓨터(들)(1148)는 워크스테이션, 서버 컴퓨터, 라우터, 퍼스널 컴퓨터, 휴대용 컴퓨터, 마이크로프로세서-기반 오락 기기, 피어 장치 또는 기타 통상의 네트워크 노드일 수 있으며, 일반적으로 컴퓨터(1102)에 대해 기술된 구성요소들 중 다수 또는 그 전부를 포함하지만, 간략함을 위해, 메모리 저장 장치(1150)만이 도시되어 있다. 도시되어 있는 논리적 연결은 근거리 통신망(LAN)(1152) 및/또는 더 큰 네트워크, 예를 들어, 원거리 통신망(WAN)(1154)에의 유선/무선 연결을 포함한다. 이러한 LAN 및 WAN 네트워킹 환경은 사무실 및 회사에서 일반적인 것이며, 인트라넷 등의 전사적 컴퓨터 네트워크(enterprise-wide computer network)를 용이하게 해주며, 이들 모두는 전세계 컴퓨터 네트워크, 예를 들어, 인터넷에 연결될 수 있다.The computer 1102 can operate in a networked environment using logical connections to one or more remote computers, such as remote computer(s) 1148 via wired and/or wireless communication. The remote computer(s) 1148 may be a workstation, server computer, router, personal computer, portable computer, microprocessor-based entertainment device, peer device, or other conventional network node, generally for computer 1102. It includes many or all of the described components, but for simplicity, only the memory storage device 1150 is shown. The illustrated logical connections include wired/wireless connections to a local area network (LAN) 1152 and/or a larger network, such as a wide area network (WAN) 1154. Such LAN and WAN networking environments are common in offices and companies, and facilitate enterprise-wide computer networks such as intranets, all of which can be connected to computer networks around the world, such as the Internet.

LAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 유선 및/또는 무선 통신 네트워크 인터페이스 또는 어댑터(1156)를 통해 로컬 네트워크(1152)에 연결된다. 어댑터(1156)는 LAN(1152)에의 유선 또는 무선 통신을 용이하게 해줄 수 있으며, 이 LAN(1152)은 또한 무선 어댑터(1156)와 통신하기 위해 그에 설치되어 있는 무선 액세스 포인트를 포함하고 있다. WAN 네트워킹 환경에서 사용될 때, 컴퓨터(1102)는 모뎀(1158)을 포함할 수 있거나, WAN(1154) 상의 통신 서버에 연결되거나, 또는 인터넷을 통하는 등, WAN(1154)을 통해 통신을 설정하는 기타 수단을 갖는다. 내장형 또는 외장형 및 유선 또는 무선 장치일 수 있는 모뎀(1158)은 직렬 포트 인터페이스(1142)를 통해 시스템 버스(1108)에 연결된다. 네트워크화된 환경에서, 컴퓨터(1102)에 대해 설명된 프로그램 모듈들 또는 그의 일부분이 원격 메모리/저장 장치(1150)에 저장될 수 있다. 도시된 네트워크 연결이 예시적인 것이며 컴퓨터들 사이에 통신 링크를 설정하는 기타 수단이 사용될 수 있다는 것을 잘 알 것이다.When used in a LAN networking environment, the computer 1102 is connected to the local network 1152 via a wired and/or wireless communication network interface or adapter 1156. The adapter 1156 may facilitate wired or wireless communication to the LAN 1152, which also includes a wireless access point installed therein to communicate with the wireless adapter 1156. When used in a WAN networking environment, the computer 1102 may include a modem 1158, connect to a communication server on the WAN 1154, or otherwise establish communication over the WAN 1154, such as through the Internet. Have a means The modem 1158, which may be an internal or external and wired or wireless device, is connected to the system bus 1108 through a serial port interface 1142. In a networked environment, program modules described for the computer 1102 or portions thereof may be stored in the remote memory/storage device 1150. It will be appreciated that the illustrated network connection is exemplary and other means of establishing a communication link between computers can be used.

컴퓨터(1102)는 무선 통신으로 배치되어 동작하는 임의의 무선 장치 또는 개체, 예를 들어, 프린터, 스캐너, 데스크톱 및/또는 휴대용 컴퓨터, PDA(portable data assistant), 통신 위성, 무선 검출가능 태그와 연관된 임의의 장비 또는 장소, 및 전화와 통신을 하는 동작을 한다. 이것은 적어도 Wi-Fi 및 블루투스 무선 기술을 포함한다. 따라서, 통신은 종래의 네트워크에서와 같이 미리 정의된 구조이거나 단순하게 적어도 2개의 장치 사이의 애드혹 통신(ad hoc communication)일 수 있다.The computer 1102 is associated with any wireless device or entity that is deployed and operates in wireless communication, such as a printer, scanner, desktop and/or portable computer, a portable data assistant (PDA), communication satellite, or wireless detectable tag. It operates to communicate with any equipment or place and telephone. This includes at least Wi-Fi and Bluetooth wireless technology. Accordingly, the communication may be a predefined structure as in a conventional network or simply ad hoc communication between at least two devices.

Wi-Fi(Wireless Fidelity)는 유선 없이도 인터넷 등으로의 연결을 가능하게 해준다. Wi-Fi는 이러한 장치, 예를 들어, 컴퓨터가 실내에서 및 실외에서, 즉 기지국의 통화권 내의 아무 곳에서나 데이터를 전송 및 수신할 수 있게 해주는 셀 전화와 같은 무선 기술이다. Wi-Fi 네트워크는 안전하고 신뢰성있으며 고속인 무선 연결을 제공하기 위해 IEEE 802.11(a,b,g, 기타)이라고 하는 무선 기술을 사용한다. 컴퓨터를 서로에, 인터넷에 및 유선 네트워크(IEEE 802.3 또는 이더넷을 사용함)에 연결시키기 위해 Wi-Fi가 사용될 수 있다. Wi-Fi 네트워크는 비인가 2.4 및 5 GHz 무선 대역에서, 예를 들어, 11Mbps(802.11a) 또는 54 Mbps(802.11b) 데이터 레이트로 동작하거나, 양 대역(듀얼 대역)을 포함하는 제품에서 동작할 수 있다.Wi-Fi (Wireless Fidelity) enables a connection to the Internet or the like without a wired connection. Wi-Fi is a wireless technology such as a cell phone that allows a device, for example, a computer to send and receive data indoors and outdoors, ie anywhere within the base station's coverage area. Wi-Fi networks use a wireless technology called IEEE 802.11 (a,b,g, etc.) to provide a secure, reliable and high-speed wireless connection. Wi-Fi can be used to connect computers to each other, to the Internet, and to a wired network (using IEEE 802.3 or Ethernet). Wi-Fi networks can operate in unlicensed 2.4 and 5 GHz radio bands, for example, at 11 Mbps (802.11a) or 54 Mbps (802.11b) data rates, or in products that include both bands (dual band). have.

본 발명의 기술 분야에서 통상의 지식을 가진 자는 여기에 개시된 실시예들과 관련하여 설명된 다양한 예시적인 논리 블록들, 모듈들, 프로세서들, 수단들, 회로들 및 알고리즘 단계들이 전자 하드웨어, (편의를 위해, 여기에서 "소프트웨어"로 지칭되는) 다양한 형태들의 프로그램 또는 설계 코드 또는 이들 모두의 결합에 의해 구현될 수 있다는 것을 이해할 것이다. 하드웨어 및 소프트웨어의 이러한 상호 호환성을 명확하게 설명하기 위해, 다양한 예시적인 컴포넌트들, 블록들, 모듈들, 회로들 및 단계들이 이들의 기능과 관련하여 위에서 일반적으로 설명되었다. 이러한 기능이 하드웨어 또는 소프트웨어로서 구현되는지 여부는 특정한 애플리케이션 및 전체 시스템에 대하여 부과되는 설계 제약들에 따라 좌우된다. 본 발명의 기술 분야에서 통상의 지식을 가진 자는 각각의 특정한 애플리케이션에 대하여 다양한 방식들로 설명된 기능을 구현할 수 있으나, 이러한 구현 결정들은 본 발명의 범위를 벗어나는 것으로 해석되어서는 안 될 것이다.A person of ordinary skill in the art of the present invention can provide a variety of exemplary logical blocks, modules, processors, means, circuits and algorithm steps described in connection with the embodiments disclosed herein, electronic hardware, (convenience For this, it will be understood that it may be implemented by various forms of program or design code (referred to herein as “software”) or a combination of both. To clearly illustrate this interchangeability of hardware and software, various illustrative components, blocks, modules, circuits, and steps have been described above generally in terms of their functionality. Whether such functionality is implemented as hardware or software depends on the particular application and design constraints imposed on the overall system. Those skilled in the art of the present invention may implement the functions described in various ways for each specific application, but such implementation decisions should not be interpreted as being outside the scope of the present invention.

여기서 제시된 다양한 실시예들은 방법, 장치, 또는 표준 프로그래밍 및/또는 엔지니어링 기술을 사용한 제조 물품(article)으로 구현될 수 있다. 용어 "제조 물품"은 임의의 컴퓨터-판독가능 장치로부터 액세스 가능한 컴퓨터 프로그램, 캐리어, 또는 매체(media)를 포함한다. 예를 들어, 컴퓨터-판독가능 매체는 자기 저장 장치(예를 들면, 하드 디스크, 플로피 디스크, 자기 스트립, 등), 광학 디스크(예를 들면, CD, DVD, 등), 스마트 카드, 및 플래쉬 메모리 장치(예를 들면, EEPROM, 카드, 스틱, 키 드라이브, 등)를 포함하지만, 이들로 제한되는 것은 아니다. 또한, 여기서 제시되는 다양한 저장 매체는 정보를 저장하기 위한 하나 이상의 장치 및/또는 다른 기계-판독가능한 매체를 포함한다. 용어 "기계-판독가능 매체"는 명령(들) 및/또는 데이터를 저장, 보유, 및/또는 전달할 수 있는 무선 채널 및 다양한 다른 매체를 포함하지만, 이들로 제한되는 것은 아니다. Various embodiments presented herein can be implemented as a method, apparatus, or article of manufacture using standard programming and/or engineering techniques. The term “manufactured article” includes a computer program, carrier, or media accessible from any computer-readable device. For example, computer-readable media include magnetic storage devices (eg, hard disks, floppy disks, magnetic strips, etc.), optical disks (eg, CDs, DVDs, etc.), smart cards, and flash memory Devices (eg, EEPROMs, cards, sticks, key drives, etc.), but are not limited to these. Also, various storage media presented herein include one or more devices and/or other machine-readable media for storing information. The term “machine-readable medium” includes, but is not limited to, wireless channels and various other media capable of storing, retaining, and/or transferring command(s) and/or data.

제시된 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조는 예시적인 접근들의 일례임을 이해하도록 한다. 설계 우선순위들에 기반하여, 본 발명의 범위 내에서 프로세스들에 있는 단계들의 특정한 순서 또는 계층 구조가 재배열될 수 있다는 것을 이해하도록 한다. 첨부된 방법 청구항들은 샘플 순서로 다양한 단계들의 엘리먼트들을 제공하지만 제시된 특정한 순서 또는 계층 구조에 한정되는 것을 의미하지는 않는다.It is understood that the specific order or hierarchy of steps in the processes presented is an example of exemplary approaches. Based on design priorities, it is understood that within the scope of the present invention, a specific order or hierarchy of steps in the processes may be rearranged. The accompanying method claims provide elements of the various steps in a sample order, but are not meant to be limited to the specific order or hierarchy presented.

Claims

A computer program stored on a computer-readable storage medium including encoded instructions, wherein the computer program, when executed by one or more processors, causes the one or more processors to improve data retrieval performance. And the operation is:
Generating a database index structure for pointing a data storage location based on monitoring result information according to a monitoring algorithm in a database management system (DBMS);
It includes,
The database index structure includes at least partially a sub-index structure of a first level and a sub-index structure of a second level lower than the first level,
The sub-index structure of the second level includes a plurality of columns,
The first column of the plurality of columns of the second-level sub-index structure includes key values,
The second column of the plurality of columns of the second-level sub-index structure includes ROWID values corresponding to a row of a data block, and
The third column among the plurality of columns of the second-level sub-index structure is determined based on at least one of monitoring result information according to a monitoring algorithm generated in a row unit or monitoring result information according to a monitoring algorithm generated in a partition unit. Includes optimized index values,
The optimization index value included in the third column,
When the monitoring result information indicates that a second type of indexing method is used, a NULL value is included,
Monitoring result information according to the monitoring algorithm generated in the partition unit,
It is generated on a partition basis based at least in part on partition property information of a basic data structure referenced by the database index structure.
A computer program stored on a computer-readable storage medium.

According to claim 1,
Optimization index values included in the third column,
The second level sub-index structure includes an indication of which indexing method to use, which is a first type of indexing method or a second type of indexing method different from the first type of indexing method,
A computer program stored on a computer-readable storage medium.

According to claim 1,
The monitoring result information,
The second level sub-index structure includes an indication of which indexing method to use, which is a first type of indexing method or a second type of indexing method different from the first type of indexing method,
A computer program stored on a computer-readable storage medium.

The method of claim 3,
The optimization index value included in the third column,
When the monitoring result information indicates that the first type of indexing method is to be used, including a bitmap value,
A computer program stored on a computer-readable storage medium.

The method of claim 4,
When a fourth column among a plurality of columns of the second-level sub-index structure includes end ROWID values, and the end ROWID value indicates that the monitoring result information uses the second type of indexing, Containing a NULL value,
A computer program stored on a computer-readable storage medium.

The method of claim 4,
The bitmap value is
A bit value indicating a relative distance between the reference point and the positions of other ROWID values in the sub-index structure of the second level using the ROWID value as a reference point,
A computer program stored on a computer-readable storage medium.

The method of claim 6,
The sub-index structure of the first level includes at least one of a root node and a branch node,
The second-level sub-index structure includes leaf nodes, and
The sub-index structure of the first level includes a starting ROWID value of the sub-level sub-index structure and an address value for accessing the sub-level sub-index structure,
A computer program stored on a computer-readable storage medium.

The method of claim 7,
Monitoring result information according to the monitoring algorithm,
It is generated based at least in part on the number of times the column value of the base data structure referenced by the database index structure is accessed, and the first type of indexing method or the second in a row unit or a key value unit. Including an indication of which of the types of indexing methods to use,
A computer program stored on a computer-readable storage medium.

The method of claim 8,
When the number of times of access to the column value or the number of times of access to the key value is greater than the predetermined number of access thresholds, the monitoring result information indicates that the indexing method of the first type is used,
When the number of times information accessed by the column value or the number of times of access to the key value is equal to or less than a predetermined access threshold number value, the monitoring result information indicates that the indexing method of the second type is used,
A computer program stored on a computer-readable storage medium.

According to claim 1,
The partition characteristic information of the basic data structure,
Determined based on at least one of the size information of the partition's Distinct value and the number of occurrences of DML (Data Manipulation Language) for the partition,
A computer program stored on a computer-readable storage medium.

The method of claim 10,
If the size information of the Distinct value of the partition is smaller than a predetermined size threshold or the DML occurrence count information for the partition is smaller than a predetermined DML occurrence count threshold, the monitoring result information is first for the partition. Indicates to use the type of indexing method, and
When the DML occurrence count information for the partition is equal to or greater than a predetermined DML occurrence count threshold, the monitoring result information indicates that a second type of indexing method is used for the partition,
A computer program stored on a computer-readable storage medium.

According to claim 1,
Monitoring result information according to the monitoring algorithm,
It is determined based at least in part on the last access point information of the data included in the partition,
If the last access point information of the data included in the partition is older than a predetermined threshold time point information, the monitoring result information indicates that the indexing method of the first type is used for the partition, and
When the last access point information of the data included in the partition is not older than the predetermined critical point time information, the monitoring result information indicates that a second type of indexing method is used for the partition,
A computer program stored on a computer-readable storage medium.

As a method performed in a database management system (DBMS),
Generating a database index structure for pointing the data storage location, based on the monitoring result information according to the monitoring algorithm in the database management system;
It includes,
The database index structure includes at least partially a sub-index structure of a first level and a sub-index structure of a second level lower than the first level,
The sub-index structure of the second level includes a plurality of columns,
The first column of the plurality of columns of the second-level sub-index structure includes key values,
The second column of the plurality of columns of the second-level sub-index structure includes ROWID values corresponding to the row of the data block, and
The third column among the plurality of columns of the second-level sub-index structure is determined based on at least one of monitoring result information according to a monitoring algorithm generated in a row unit or monitoring result information according to a monitoring algorithm generated in a partition unit. Includes optimized index values,
The optimization index value included in the third column,
When the monitoring result information indicates that a second type of indexing method is used, a NULL value is included,
Monitoring result information according to the monitoring algorithm generated in the partition unit,
Is generated on a partition basis based at least in part on partition characteristic information of a basic data structure referenced by the database index structure,
Way.

As a database management device,
A processor generating a database index structure for pointing the data storage location based on the monitoring result information according to the monitoring algorithm in the database management system;
A storage unit for storing the created database index structure;
It includes,
The database index structure includes at least partially a sub-index structure of a first level and a sub-index structure of a second level lower than the first level,
The sub-index structure of the second level includes a plurality of columns,
The first column of the plurality of columns of the second-level sub-index structure includes key values,
The second column of the plurality of columns of the second-level sub-index structure includes ROWID values corresponding to the row of the data block, and
The third column among the plurality of columns of the second-level sub-index structure is determined based on at least one of monitoring result information according to a monitoring algorithm generated in a row unit or monitoring result information according to a monitoring algorithm generated in a partition unit. Includes optimized index values,
The optimization index value included in the third column,
When the monitoring result information indicates that a second type of indexing method is used, a NULL value is included,
Monitoring result information according to the monitoring algorithm generated in the partition unit,
Is generated in a partition unit based at least in part on partition characteristic information of a basic data structure referenced by the database index structure,
Database management device