CN103020299A - Storage method and device for inverted indexes and appended data in full-text search - Google Patents
Storage method and device for inverted indexes and appended data in full-text search Download PDFInfo
- Publication number
- CN103020299A CN103020299A CN2012105919899A CN201210591989A CN103020299A CN 103020299 A CN103020299 A CN 103020299A CN 2012105919899 A CN2012105919899 A CN 2012105919899A CN 201210591989 A CN201210591989 A CN 201210591989A CN 103020299 A CN103020299 A CN 103020299A
- Authority
- CN
- China
- Prior art keywords
- data
- indexing units
- tree
- units data
- indexing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 14
- 238000003860 storage Methods 0.000 title claims description 27
- 238000004321 preservation Methods 0.000 claims description 9
- 230000005055 memory storage Effects 0.000 claims description 8
- 230000007812 deficiency Effects 0.000 claims description 2
- 230000001502 supplementing effect Effects 0.000 claims description 2
- 230000007246 mechanism Effects 0.000 abstract description 6
- 230000008901 benefit Effects 0.000 abstract description 4
- 230000000153 supplemental effect Effects 0.000 description 10
- 230000004048 modification Effects 0.000 description 5
- 238000012986 modification Methods 0.000 description 5
- 238000013500 data storage Methods 0.000 description 4
- 239000002699 waste material Substances 0.000 description 4
- 241001269238 Data Species 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 238000005516 engineering process Methods 0.000 description 2
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000012217 deletion Methods 0.000 description 1
- 230000037430 deletion Effects 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 239000012634 fragment Substances 0.000 description 1
- 230000014759 maintenance of location Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 230000008520 organization Effects 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Images
Landscapes
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (6)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210591989.9A CN103020299B (en) | 2012-12-29 | 2012-12-29 | The store method of inverted index and supplemental data thereof and memory storage in full-text search |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201210591989.9A CN103020299B (en) | 2012-12-29 | 2012-12-29 | The store method of inverted index and supplemental data thereof and memory storage in full-text search |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103020299A true CN103020299A (en) | 2013-04-03 |
CN103020299B CN103020299B (en) | 2016-01-13 |
Family
ID=47968902
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201210591989.9A Active CN103020299B (en) | 2012-12-29 | 2012-12-29 | The store method of inverted index and supplemental data thereof and memory storage in full-text search |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN103020299B (en) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015078273A1 (en) * | 2013-11-29 | 2015-06-04 | Tencent Technology (Shenzhen) Company Limited | Method and apparatus for search |
CN106227677A (en) * | 2016-07-20 | 2016-12-14 | 浪潮电子信息产业股份有限公司 | Method for managing variable-length cache metadata |
CN106776746A (en) * | 2016-11-14 | 2017-05-31 | 天津南大通用数据技术股份有限公司 | A kind of creation method and device of full-text index data |
CN107491523A (en) * | 2017-08-17 | 2017-12-19 | 三星(中国)半导体有限公司 | The method and device of data storage object |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1536509A (en) * | 2003-04-11 | 2004-10-13 | �Ҵ���˾ | Inverted index storage method, inverted index mechanism and on-line updating method |
US20080133574A1 (en) * | 2006-11-27 | 2008-06-05 | Taiga Fukushima | Method, program and device for retrieving symbol strings, and method, program and device for generating trie thereof |
CN101226553A (en) * | 2008-02-03 | 2008-07-23 | 中兴通讯股份有限公司 | Method and device for storing length-various field of embedded database |
US20090037456A1 (en) * | 2007-07-31 | 2009-02-05 | Kirshenbaum Evan R | Providing an index for a data store |
CN101944108A (en) * | 2010-09-07 | 2011-01-12 | 深圳市彩讯科技有限公司 | Index file and establishing method thereof |
CN102682086A (en) * | 2012-04-23 | 2012-09-19 | 华为技术有限公司 | Data segmentation method and data segmentation equipment |
-
2012
- 2012-12-29 CN CN201210591989.9A patent/CN103020299B/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1536509A (en) * | 2003-04-11 | 2004-10-13 | �Ҵ���˾ | Inverted index storage method, inverted index mechanism and on-line updating method |
US20080133574A1 (en) * | 2006-11-27 | 2008-06-05 | Taiga Fukushima | Method, program and device for retrieving symbol strings, and method, program and device for generating trie thereof |
US20090037456A1 (en) * | 2007-07-31 | 2009-02-05 | Kirshenbaum Evan R | Providing an index for a data store |
CN101226553A (en) * | 2008-02-03 | 2008-07-23 | 中兴通讯股份有限公司 | Method and device for storing length-various field of embedded database |
CN101944108A (en) * | 2010-09-07 | 2011-01-12 | 深圳市彩讯科技有限公司 | Index file and establishing method thereof |
CN102682086A (en) * | 2012-04-23 | 2012-09-19 | 华为技术有限公司 | Data segmentation method and data segmentation equipment |
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2015078273A1 (en) * | 2013-11-29 | 2015-06-04 | Tencent Technology (Shenzhen) Company Limited | Method and apparatus for search |
US10452691B2 (en) | 2013-11-29 | 2019-10-22 | Tencent Technology (Shenzhen) Company Limited | Method and apparatus for generating search results using inverted index |
CN106227677A (en) * | 2016-07-20 | 2016-12-14 | 浪潮电子信息产业股份有限公司 | Method for managing variable-length cache metadata |
CN106227677B (en) * | 2016-07-20 | 2018-11-20 | 浪潮电子信息产业股份有限公司 | Method for managing variable-length cache metadata |
CN106776746A (en) * | 2016-11-14 | 2017-05-31 | 天津南大通用数据技术股份有限公司 | A kind of creation method and device of full-text index data |
CN107491523A (en) * | 2017-08-17 | 2017-12-19 | 三星(中国)半导体有限公司 | The method and device of data storage object |
CN107491523B (en) * | 2017-08-17 | 2020-05-05 | 三星(中国)半导体有限公司 | Method and apparatus for storing data objects |
Also Published As
Publication number | Publication date |
---|---|
CN103020299B (en) | 2016-01-13 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102663090B (en) | Method and device for inquiry metadata | |
Tsirogiannis et al. | Query processing techniques for solid state drives | |
US10496621B2 (en) | Columnar storage of a database index | |
CN110188108B (en) | Data storage method, device, system, computer equipment and storage medium | |
US9047301B2 (en) | Method for optimizing the memory usage and performance of data deduplication storage systems | |
CN101963982B (en) | Method for managing metadata of redundancy deletion and storage system based on location sensitive Hash | |
Ahn et al. | ForestDB: A fast key-value storage system for variable-length string keys | |
CN104809182B (en) | Based on the web crawlers URL De-weight method that dynamically can divide Bloom Filter | |
CN103488709A (en) | Method and system for building indexes and method and system for retrieving indexes | |
CN101866358A (en) | A multi-dimensional interval query method and system | |
CN105117417A (en) | Read-optimized memory database Trie tree index method | |
CN104484471B (en) | A kind of implementation method of high-performance data storage engines | |
CN105631003A (en) | Intelligent index establishing, inquiring and maintaining method supporting mass data classification and counting | |
US12339823B2 (en) | Data storage device and storage control method based on log-structured merge tree | |
CN114281989B (en) | Data deduplication method and device based on text similarity, storage medium and server | |
US9189408B1 (en) | System and method of offline annotation of future accesses for improving performance of backup storage system | |
CN102542057B (en) | High dimension data index structure design method based on solid state hard disk | |
CN103020299B (en) | The store method of inverted index and supplemental data thereof and memory storage in full-text search | |
US7783589B2 (en) | Inverted index processing | |
CN116382588A (en) | LSM-Tree storage engine read amplification problem optimization method based on learning index | |
CN110134661A (en) | A Facet-Oriented Storage and Query Method for Academic Big Data | |
CN110515897B (en) | Method and system for optimizing reading performance of LSM storage system | |
He et al. | Read as Needed: Building {WiSER}, a {Flash-Optimized} Search Engine | |
Zhang et al. | Improved deduplication through parallel binning | |
CN108664664A (en) | A kind of magnanimity educational documentation associated storage method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: TIANJIN NANDA CONVENTIONAL DATA TECHNOLOGY CO., LT Effective date: 20130807 Owner name: STATE COMPUTER NETWORK AND INFORMATION SAFETY MANA Free format text: FORMER OWNER: TIANJIN NANDA GENERAL DATA TECHNOLOGY CO., LTD. Effective date: 20130807 |
|
C41 | Transfer of patent application or patent right or utility model | ||
C53 | Correction of patent of invention or patent application | ||
CB03 | Change of inventor or designer information |
Inventor after: Fan Zhenyong Inventor after: Wu Zhen Inventor after: Zhang Xue Inventor after: Cui Weili Inventor after: Wu Xin Inventor after: Zhao Wei Inventor before: Zhang Xue Inventor before: Fan Zhenyong Inventor before: Cui Weili Inventor before: Wu Xin Inventor before: Zhao Wei |
|
COR | Change of bibliographic data |
Free format text: CORRECT: INVENTOR; FROM: ZHANG XUE FAN ZHENYONG CUI WEILI WU XIN ZHAO WEI TO: FAN ZHENYONG WU ZHEN ZHANG XUE CUI WEILI WU XIN ZHAO WEI Free format text: CORRECT: ADDRESS; FROM: 300384 BINHAI NEW DISTRICT, TIANJIN TO: 100029 CHAOYANG, BEIJING |
|
TA01 | Transfer of patent application right |
Effective date of registration: 20130807 Address after: 100029 Beijing city Chaoyang District Yumin Road No. 3 Applicant after: State Computer Network and Information Safety Management Center Applicant after: Tianjin NanKai University General Data Technologies Co., Ltd. Address before: Haitai 300384 in Tianjin Binhai high tech Zone Huayuan Industrial Zone Development six road No. 6 Haitai green industry base J Applicant before: Tianjin Nanda General Data Technology Co., Ltd. |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant |