CN103186611B - 一种压缩、解压及查询文档的方法、装置 - Google Patents
一种压缩、解压及查询文档的方法、装置 Download PDFInfo
- Publication number
- CN103186611B CN103186611B CN201110456661.1A CN201110456661A CN103186611B CN 103186611 B CN103186611 B CN 103186611B CN 201110456661 A CN201110456661 A CN 201110456661A CN 103186611 B CN103186611 B CN 103186611B
- Authority
- CN
- China
- Prior art keywords
- node
- path code
- path
- data content
- compression
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000007906 compression Methods 0.000 title claims abstract description 157
- 230000006835 compression Effects 0.000 title claims abstract description 152
- 238000000034 method Methods 0.000 title claims abstract description 144
- 238000006243 chemical reaction Methods 0.000 claims description 61
- 230000006837 decompression Effects 0.000 claims description 26
- 238000003860 storage Methods 0.000 claims description 7
- 238000005516 engineering process Methods 0.000 abstract description 4
- 238000004590 computer program Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 7
- 230000006870 function Effects 0.000 description 3
- 230000004048 modification Effects 0.000 description 3
- 238000012986 modification Methods 0.000 description 3
- 238000009826 distribution Methods 0.000 description 2
- 241000931705 Cicada Species 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000012512 characterization method Methods 0.000 description 1
- 238000013144 data compression Methods 0.000 description 1
- 230000004807 localization Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 239000003550 marker Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000005457 optimization Methods 0.000 description 1
- 238000000926 separation method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/10—File systems; File servers
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/70—Type of the data to be coded, other than image and sound
- H03M7/707—Structured documents, e.g. XML
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/30—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F16/33—Querying
- G06F16/3331—Query processing
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Data Mining & Analysis (AREA)
- Databases & Information Systems (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Document Processing Apparatus (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
Abstract
Description
Claims (16)
Priority Applications (6)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110456661.1A CN103186611B (zh) | 2011-12-30 | 2011-12-30 | 一种压缩、解压及查询文档的方法、装置 |
US14/119,172 US8768900B2 (en) | 2011-12-30 | 2012-12-31 | Method and device for compressing, decompressing and querying document |
KR1020137030777A KR101499441B1 (ko) | 2011-12-30 | 2012-12-31 | 문서를 압축, 역압축 및 조회하는 방법 및 장치 |
JP2014519409A JP5800441B2 (ja) | 2011-12-30 | 2012-12-31 | 文書の圧縮、解凍及び照会のための方法及び装置 |
EP12863927.5A EP2697728A4 (en) | 2011-12-30 | 2012-12-31 | METHOD AND APPARATUS FOR COMPRESSION, DECOMPRESSION AND DOCUMENT INTERROGATION |
PCT/CN2012/088009 WO2013097802A1 (en) | 2011-12-30 | 2012-12-31 | Method and device for compressing, decompressing and querying document |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201110456661.1A CN103186611B (zh) | 2011-12-30 | 2011-12-30 | 一种压缩、解压及查询文档的方法、装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103186611A CN103186611A (zh) | 2013-07-03 |
CN103186611B true CN103186611B (zh) | 2016-03-30 |
Family
ID=48677780
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201110456661.1A Expired - Fee Related CN103186611B (zh) | 2011-12-30 | 2011-12-30 | 一种压缩、解压及查询文档的方法、装置 |
Country Status (6)
Country | Link |
---|---|
US (1) | US8768900B2 (zh) |
EP (1) | EP2697728A4 (zh) |
JP (1) | JP5800441B2 (zh) |
KR (1) | KR101499441B1 (zh) |
CN (1) | CN103186611B (zh) |
WO (1) | WO2013097802A1 (zh) |
Families Citing this family (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP2605481A1 (de) * | 2011-12-13 | 2013-06-19 | Siemens Aktiengesellschaft | Verfahren und Vorrichtung zum Filtern von Netzwerkverkehr |
US9104730B2 (en) * | 2012-06-11 | 2015-08-11 | International Business Machines Corporation | Indexing and retrieval of structured documents |
CN105095237B (zh) | 2014-04-30 | 2018-07-17 | 国际商业机器公司 | 用于生成非关系数据库的模式的方法和设备 |
CN106372042B (zh) * | 2016-08-31 | 2019-09-24 | 北京奇艺世纪科技有限公司 | 一种文档内容获取方法和装置 |
CN107609072B (zh) * | 2017-09-01 | 2020-11-20 | 联想(北京)有限公司 | 一种数据处理方法及装置 |
CN109241498B (zh) * | 2018-06-26 | 2023-08-15 | 中国建设银行股份有限公司 | Xml文件处理方法、设备和存储介质 |
CN112329281A (zh) * | 2019-07-31 | 2021-02-05 | 比亚迪股份有限公司 | 文件查错方法、装置、电子设备及存储介质 |
Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102073663A (zh) * | 2009-11-24 | 2011-05-25 | 北大方正集团有限公司 | 一种快速处理xml压缩数据的方法及其装置 |
CN102214170A (zh) * | 2010-04-06 | 2011-10-12 | 北京大学 | 一种xml数据压缩和解压缩方法及系统 |
Family Cites Families (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2008084341A (ja) * | 1999-06-21 | 2008-04-10 | Fujitsu Ltd | 構造化文書の圧縮方法および圧縮装置並びに構造化文書圧縮プログラムを記録したコンピュータ読取可能な記録媒体 |
US6883137B1 (en) * | 2000-04-17 | 2005-04-19 | International Business Machines Corporation | System and method for schema-driven compression of extensible mark-up language (XML) documents |
JP3832807B2 (ja) * | 2001-06-28 | 2006-10-11 | インターナショナル・ビジネス・マシーンズ・コーポレーション | データ処理方法及びその手法を用いたエンコーダ、デコーダ並びにxmlパーサ |
US7415665B2 (en) * | 2003-01-15 | 2008-08-19 | At&T Delaware Intellectual Property, Inc. | Methods and systems for compressing markup language files |
KR20040070894A (ko) * | 2003-02-05 | 2004-08-11 | 삼성전자주식회사 | Xml 데이터의 압축 방법 및 압축된 xml 데이터의복원 방법 |
KR100803285B1 (ko) * | 2003-10-21 | 2008-02-13 | 한국과학기술원 | 역 산술 부호화와 타입 추론 엔진을 이용한 질의 가능 엑스-엠-엘 압축 방법 |
CN1314208C (zh) | 2003-11-28 | 2007-05-02 | 北京大学 | 可扩展标记语言数据流压缩器及其压缩方法 |
US20050144556A1 (en) * | 2003-12-31 | 2005-06-30 | Petersen Peter H. | XML schema token extension for XML document compression |
US7630997B2 (en) * | 2005-03-23 | 2009-12-08 | Microsoft Corporation | Systems and methods for efficiently compressing and decompressing markup language |
US7593949B2 (en) * | 2006-01-09 | 2009-09-22 | Microsoft Corporation | Compression of structured documents |
JP2009543243A (ja) * | 2006-07-12 | 2009-12-03 | エクスプウェイ | 構造化文書の圧縮のための方法と装置 |
JP2010287052A (ja) * | 2009-06-11 | 2010-12-24 | Fujitsu Ltd | 検索システムおよび記憶媒体 |
-
2011
- 2011-12-30 CN CN201110456661.1A patent/CN103186611B/zh not_active Expired - Fee Related
-
2012
- 2012-12-31 US US14/119,172 patent/US8768900B2/en active Active
- 2012-12-31 JP JP2014519409A patent/JP5800441B2/ja active Active
- 2012-12-31 EP EP12863927.5A patent/EP2697728A4/en not_active Ceased
- 2012-12-31 KR KR1020137030777A patent/KR101499441B1/ko active IP Right Grant
- 2012-12-31 WO PCT/CN2012/088009 patent/WO2013097802A1/en active Application Filing
Patent Citations (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102073663A (zh) * | 2009-11-24 | 2011-05-25 | 北大方正集团有限公司 | 一种快速处理xml压缩数据的方法及其装置 |
CN102214170A (zh) * | 2010-04-06 | 2011-10-12 | 北京大学 | 一种xml数据压缩和解压缩方法及系统 |
Also Published As
Publication number | Publication date |
---|---|
EP2697728A4 (en) | 2014-04-09 |
CN103186611A (zh) | 2013-07-03 |
WO2013097802A1 (en) | 2013-07-04 |
US8768900B2 (en) | 2014-07-01 |
EP2697728A1 (en) | 2014-02-19 |
KR101499441B1 (ko) | 2015-03-06 |
KR20140056172A (ko) | 2014-05-09 |
US20140089277A1 (en) | 2014-03-27 |
JP5800441B2 (ja) | 2015-10-28 |
JP2014521159A (ja) | 2014-08-25 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN103186611B (zh) | 一种压缩、解压及查询文档的方法、装置 | |
CN102033954B (zh) | 关系数据库中可扩展标记语言文档全文检索查询索引方法 | |
Olteanu et al. | XPath: looking forward | |
CN101719156B (zh) | 一种在关系型数据库中无缝集成纯xml查询引擎的系统 | |
EP1580671A2 (en) | Data mapping with nested tables | |
CN102650992A (zh) | 用于二进制xml数据的生成及其节点定位的方法和装置 | |
US20090222419A1 (en) | Succinct index structure for xml | |
CN111831626A (zh) | 数据库逻辑关系的图结构生成方法、数据查询方法及装置 | |
CN101216824B (zh) | 一种将树型结构数据库发布为分布式xml数据库的方法 | |
CN101833588B (zh) | 一种xml文档索引结构 | |
Reggiori et al. | Indexing and retrieving Semantic Web resources: the RDFStore model | |
Hsu et al. | UCIS-X: an updatable compact indexing scheme for efficient extensible markup language document updating and query evaluation | |
KR20110071651A (ko) | 무선 방송 스트림에서 xml 질의 처리 방법 | |
CN110321456B (zh) | 一种海量不确定xml近似查询方法 | |
US11074401B2 (en) | Merging delta object notation documents | |
CN104679775A (zh) | 一种基于Huffman表的数据处理方法 | |
CN102831151B (zh) | 电子文档的生成方法和装置 | |
Arora et al. | Iterative method for recreating a binary tree from its traversals | |
Amin et al. | Labeling schemes to support dynamic updates on XML trees: A technical review | |
CN102867054A (zh) | 一种xml关键字查询方法 | |
Wei et al. | A new and effective approach to GML documents compression | |
Deng et al. | LAF: a new XML encoding and indexing strategy for keyword‐based XML search | |
Zhang et al. | An approach of domain ontology construction based on resource model and Jena | |
이준희 | Space-efficient Representation of Semi-structured Document Formats Utilizing Succinct Data Structures | |
JP5374456B2 (ja) | 文書検索装置の動作方法およびこれをコンピュータに実行させるためのコンピュータプログラム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
ASS | Succession or assignment of patent right |
Owner name: FOUNDER INFORMATION INDUSTRY HOLDING CO., LTD. BEI Free format text: FORMER OWNER: BEIJING FOUNDER APABI TECHNOLOGY CO., LTD. BEIJING UNIV. Effective date: 20130904 |
|
C41 | Transfer of patent application or patent right or utility model | ||
TA01 | Transfer of patent application right |
Effective date of registration: 20130904 Address after: 100871 Beijing, Haidian District into the house road, founder of the building on the 9 floor, No. 298 Applicant after: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Applicant after: FOUNDER INFORMATION INDUSTRY HOLDINGS Co.,Ltd. Applicant after: FOUNDER APABI TECHNOLOGY Ltd. Applicant after: Peking University Address before: 100871 Beijing, Haidian District into the house road, founder of the building on the 9 floor, No. 298 Applicant before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Applicant before: FOUNDER APABI TECHNOLOGY Ltd. Applicant before: Peking University |
|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder | ||
CP01 | Change in the name or title of a patent holder |
Address after: 100871, Beijing, Haidian District Cheng Fu Road 298, founder building, 9 floor Patentee after: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Patentee after: PKU FOUNDER INFORMATION INDUSTRY GROUP CO.,LTD. Patentee after: FOUNDER APABI TECHNOLOGY Ltd. Patentee after: Peking University Address before: 100871, Beijing, Haidian District Cheng Fu Road 298, founder building, 9 floor Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Patentee before: FOUNDER INFORMATION INDUSTRY HOLDINGS Co.,Ltd. Patentee before: FOUNDER APABI TECHNOLOGY Ltd. Patentee before: Peking University |
|
TR01 | Transfer of patent right | ||
TR01 | Transfer of patent right |
Effective date of registration: 20220915 Address after: 3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031 Patentee after: New founder holdings development Co.,Ltd. Patentee after: FOUNDER APABI TECHNOLOGY Ltd. Patentee after: Peking University Address before: 100871, Beijing, Haidian District Cheng Fu Road 298, founder building, 9 floor Patentee before: PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd. Patentee before: PKU FOUNDER INFORMATION INDUSTRY GROUP CO.,LTD. Patentee before: FOUNDER APABI TECHNOLOGY Ltd. Patentee before: Peking University |
|
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20160330 |