CN1139883C - Intelligent compression method for file of computer - Google Patents

Intelligent compression method for file of computer Download PDF

Info

Publication number
CN1139883C
CN1139883C CNB011241586A CN01124158A CN1139883C CN 1139883 C CN1139883 C CN 1139883C CN B011241586 A CNB011241586 A CN B011241586A CN 01124158 A CN01124158 A CN 01124158A CN 1139883 C CN1139883 C CN 1139883C
Authority
CN
China
Prior art keywords
file
document
compression
type
compressed
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNB011241586A
Other languages
Chinese (zh)
Other versions
CN1405705A (en
Inventor
王金波
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Jiuzhou Computer Network Co., Ltd., Beijing
Original Assignee
JIUZHOU COMPUTER NETWORK CO Ltd BEIJING
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by JIUZHOU COMPUTER NETWORK CO Ltd BEIJING filed Critical JIUZHOU COMPUTER NETWORK CO Ltd BEIJING
Priority to CNB011241586A priority Critical patent/CN1139883C/en
Publication of CN1405705A publication Critical patent/CN1405705A/en
Application granted granted Critical
Publication of CN1139883C publication Critical patent/CN1139883C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Abstract

The present invention relates to an intelligent compression method for compressing various computer file in high efficiency. According to the method, distinct types of data messages (such as characters, images, sound, etc.) in computer files can be automatically identified by computers and can also be compressed in high efficiency by automatically adopting suitable nondestructive or destructive encoding. The present invention also comprises a corresponding decompression method.

Description

The Intelligent Compression of computer documents and decompression method
Technical field under the present invention is a computing machine.
Leave the various primary data information (pdi)s in the computing machine in as computer documents, as Word message, picture information and acoustic information etc., normally uncompressed.But when you when their packings are taken away or are transmitted by Internet and telephone wire, you often need they are suitably compressed.
The compression of computer documents has two kinds of operator schemes at present.A kind of is that information type by the artificial cognition file is (as Text, Image, Speech etc.), and respectively move corresponding compressed software it is compressed (as using the ARJ compressed text file, use Lead View compressed image file, using Real Audio compressed voice file etc.).You can obtain the desired compression effect in this way, but when comprising that a plurality of different kinds of information unit maybe needs to compress a plurality of file in the file, and this squeeze operation will expensive handling time, and needs to buy many kinds of compressed softwares.
The another kind of squeeze operation pattern of computer documents is at present: no matter the file of what type all adopts lossless compression-encoding without exception.For example in modem, adopt V.42bis chip, institute's information transmitted is carried out the lossless coding compression without exception, or various files are carried out the lossless coding compression without exception with WinZip software.This compact model can be avoided expensive handling time, but can only obtain lower compressibility.This compress mode more is applicable to character information, and not too is applicable to out of Memory such as image, sound.
Purpose of the present invention is the compression method that a kind of intelligence is provided for the compression of computer documents.Use this method, both can compress any one or a sets of computer file automatically, can make the different information types (as Word message, picture information, acoustic information etc.) in the file can both obtain higher compressibility again by computing machine.
Fig. 1 is the FB(flow block) of compression side.
Fig. 2 is the FB(flow block) of decompression side.
The compression method of this intelligence, the mode that available computers software, computer hardware or computer software combine with hardware realizes.Its FB(flow block) as depicted in figs. 1 and 2.It is described respectively below.
In compression side, FB(flow block) as shown in Figure 1.At the initiating terminal of flow process, one group of file to be compressed is selected by the user.Other step among Fig. 1 is automatically performed according to method of operating of the present invention by computer system:
A. from above-mentioned one group of file to be compressed, choose arbitrary file, verify then whether this document belongs to the identified documentation type of default.
For this reason, some file types of setting like this need be arranged in the system: the first, the file layout of these file types is known, and these file types can be used the harmless or lossy compression method coding more more suitable than the general lossless compression-encoding of default; The second, these file types are made of two tabulations: one is the file extent list of file names, and the file extension of above-mentioned each file type is listed in this tabulation; Another is the document control information list, and some the document control information corresponding to above-mentioned each file extension are listed in this tabulation.
In order to verify whether selected file belongs to the identified documentation type of default, and system will verify at first whether the extension name of this document is listed in the above-mentioned file extent list of file names.If, system will then verify this document whether with its document control information list in corresponding control information just conform.Above-mentioned two checkings all are sure, and then this document belongs to the identified documentation type of default; It negates that then this document does not belong to the identified documentation type of default that any checking is arranged.
For example, the Bitmap file is a kind of image file type.Its appointed file extension name is bmp.Its control information corresponding is: the 1-2 byte of this document is BM, and 2-5 byte indicated the actual image data length of this document, and 10-13 byte, expression this document pictorial data reference position, or the like.
If when chosen will the compression of file that is nominally Picture.bmp is arranged, system will check at first whether bmp is listed in the file extent list of file names.If, next system will verify whether the first two byte of this document just in time is BM, and the POS INT that indicates from 10-13 byte of this document of the pictorial data of this document, whether its physical length just in time conforms to 2-5 indicated length of byte of this document.If above-mentioned checking all is sure, then this Picture.bmp file belongs to the identified documentation type of default, otherwise this Picture.bmp file does not belong to the identified documentation type of default.
If B. this document does not belong to the identified documentation type of default, then system will adopt default general lossless compression-encoding compression this document automatically.
If C. this document belongs to the identified documentation type of default, then system will determine whether this document is the simple files that only comprises a kind of data message type, or this document is the composite file that comprises not only a kind of data message type.
D. for a simple files, according to the form of this document, system can automatically discern the data message type that this document comprises, and adopt a kind of suitable compressed encoding automatically.For example, automatically adopt G 723 encoding compression speech datas, adopt color image data of JPEG encoding compression automatically, adopt JBIG compression soot-and-whitewash data or the like automatically.
E. for a composite file, for example a RTF file or a literature kit, according to the file layout of this document, system will automatically be split as a plurality of message units to this composite file, and each unit only comprises a kind of data message type.Thereby system can adopt the mode similar to above-mentioned D, automatically compresses the data message in each message unit.
In the present invention, for reducing the complicacy of operation, system also can adopt composite file of general lossless compression-encoding compression of systemic presupposition automatically, and does not adopt the step that composite file is split.This situation is not represented in Fig. 1.
F. in order to preserve data message and the corresponding control information that these files had been compressed, system need define the literature kit form of self.This form can be formulated and name by the system design people, and for example called after ICF form (Intelligent CompressionFormat) has file extension .icf.Last step of Fig. 1 is about to each file that has compressed and forms the icf file.
Do not finish if file G. to be compressed also all compresses, then system repeats the operation of above-mentioned A to F, all compresses up to file to be compressed to finish.
The present invention also is related to a decompression method of compressed file, as shown in Figure 2.At the initiating terminal of Fig. 2, have the decompressing files for the treatment of of icf extension name, select by the user.Remaining action is all finished automatically by computer system.Operate as follows:
Press the file from above-mentioned quasi-solution with extension name icf and to choose arbitrary file, according to the control information of this document, really whether checking this document an icf file then.
If this document is not a real icf file,, remove to choose another file again just stop the press operation of separating to this document.
If this document is an icf file really, system will determine further whether this document is the simple files that only comprises a kind of compressed data message type, or comprise the composite file of multiple compressed data message type.
If this document is a compressed simple files, system will discern the data message type in this document automatically, and use and the corresponding decompress(ion) coding of compressed encoding, the data message in decompress(ion) this document.
If this document is a compressed composite file, and this composite file does not compress with lossless coding fully, system will automatically be split as a plurality of message units to this document, each unit only comprises a kind of compressed data information type, and by each such unit of above-mentioned similar mode difference decompress(ion).
If a composite file compressed with a kind of lossless coding originally entirely, this composite file will adopt corresponding lossless coding to carry out decompress(ion).This situation is not shown among Fig. 2.
The final step of Fig. 2 is to form a decompressing files.If above-mentioned file with icf extension name is not also finished by whole decompress(ion)s, system will repeat top operation automatically, and all being extracted up to the above-mentioned file with icf extension name for the treatment of decompress(ion) finishes.
Based on method provided by the invention, developed a kind of new type of compression software, this software makes that the compression of various computer documentss is not only easy but also efficient, is much better than existing various computer documents tool of compression.Method provided by the invention also can be used in the various application systems, as is used for e.mail, FTP, modem, E-Government, ecommerce etc.

Claims (9)

1, compress the method for dissimilar computer documentss by computer system automatically, comprise following operation:
(1), whether belong to the file type of default by following operation demonstration this document for a file to be compressed:
Some file types that such setting is arranged in A, the system: the first, the file layout of these file types is known, and these file types can be used the harmless or lossy compression method coding more more suitable than the general lossless compression-encoding of default; The second, these file types can be represented with two tabulations: one is the file extent list of file names, and one is corresponding file control information tabulation;
B, whether belong to the file type of default in order to verify selection spare, system will verify at first whether the file extension of this document is included in the above-mentioned file extent list of file names, if, the document control information of verifying this document again whether with above-mentioned document control information list in corresponding informance just conform;
(2) according to above-mentioned checking result, adopt following squeeze operation respectively:
If A this document does not belong to above-mentioned specific file type, then system adopt default general lossless compression-encoding to compression;
If B this document belongs to above-mentioned specific file type, and this document only comprises a kind of data message type, then according to the data message type of this document, adopt suitable harmless or lossy coding to compression;
If C this document belongs to above-mentioned specific file type, and this document comprises the several data information type, then at first this document is split as a plurality of message units, each unit only comprises a kind of data message type, then according to the data message type of each message unit, adopt respectively suitable harmless or lossy coding to compression.
2, the method for claim 1, the use in computer software, hardware or soft, hardware combine, or the use in having other device of corresponding computer function.
3, the method for claim 1, as independently Information Compression instrument use, or the use in any application system.
4, compress the method for dissimilar computer documentss by computer system automatically, comprise following steps:
(1), whether belong to the file type of default by following operation demonstration this document for a file to be compressed:
Some file types that such setting is arranged in A, the system: the first, the file layout of these file types is known, these file types are the simple files that only comprise a kind of information type, and these file types can be used the harmless or lossy compression method coding more more suitable than the general lossless compression-encoding of default; The second, these file types can be represented with two tabulations: one is the file extent list of file names, and one is corresponding file control information tabulation;
B, whether belong to the file type of default in order to verify selection spare, system will verify at first whether the file extension of this document is included in the above-mentioned file extent list of file names, if, the document control information of verifying this document again whether with above-mentioned document control information list in corresponding informance just conform;
(2) according to above-mentioned checking result, adopt following squeeze operation respectively:
If A this document does not belong to above-mentioned specific file type, then system adopt default general lossless coding to compression;
If B this document belongs to above-mentioned specific file type, then according to the data message type of this document, adopt suitable harmless or lossy coding to compression;
5, the method for claim 4, the use in computer software, hardware or soft, hardware combine, or the use in having other device of corresponding computer function.
6, the method for claim 4, as independently Information Compression instrument use, or the use in any application system.
7, the method that decompresses automatically for the computer documents that has compressed by computer system, when by aforesaid right requirement 1 or
When the computer documents that the method for claim 4 is compressed had certain file extension, its operation steps comprised:
(1) for a decompressing files for the treatment of with above-mentioned extension name, according to the control information of this document, whether checking this document really conforms to this extension name;
(2) if this document really conforms to this extension name, according to the file layout of this document, whether decision this document is the simple files that only comprises a kind of compressed data message type, or comprises the composite file of multiple compressed data message type;
(3), then use and the corresponding decompress(ion) coding of compressed encoding, decompress(ion) this document if this document is a compressed simple files;
(4) if this document is a compressed composite file, then this document is split as a plurality of message units, each unit only comprises a kind of compressed data message type, and use and the compressed encoding coding that decompresses accordingly, each message unit of decompress(ion).
8, the method for claim 7, the use in computer software, hardware or soft, hardware combine, or the use in having other device of corresponding computer function;
9, the method for claim 7, as independently Information Compression instrument use, or the use in any application system.
CNB011241586A 2001-08-20 2001-08-20 Intelligent compression method for file of computer Expired - Fee Related CN1139883C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB011241586A CN1139883C (en) 2001-08-20 2001-08-20 Intelligent compression method for file of computer

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB011241586A CN1139883C (en) 2001-08-20 2001-08-20 Intelligent compression method for file of computer

Publications (2)

Publication Number Publication Date
CN1405705A CN1405705A (en) 2003-03-26
CN1139883C true CN1139883C (en) 2004-02-25

Family

ID=4665545

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB011241586A Expired - Fee Related CN1139883C (en) 2001-08-20 2001-08-20 Intelligent compression method for file of computer

Country Status (1)

Country Link
CN (1) CN1139883C (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9912624B2 (en) 2015-09-25 2018-03-06 International Business Machines Corporation Lossy text source coding by word length
CN110286917A (en) * 2019-05-21 2019-09-27 深圳壹账通智能科技有限公司 File packing method, device, equipment and storage medium

Families Citing this family (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1584875B (en) * 2004-06-01 2011-08-10 北京九州软件有限公司 Ergodic compressing and decompressing method for batched computer document
CN100343851C (en) * 2004-11-03 2007-10-17 北京神舟航天软件技术有限公司 Database compression and decompression method
CN1318974C (en) * 2005-08-05 2007-05-30 北京九州汇宝软件有限公司<Del/> Method for compression and search of database backup data
WO2011079796A1 (en) * 2009-12-30 2011-07-07 北京飞天诚信科技有限公司 Method for compressing.net document
CN102054038B (en) * 2010-12-30 2014-05-28 东莞宇龙通信科技有限公司 File decompression method and device as well as mobile terminal
US8909657B2 (en) 2011-01-14 2014-12-09 Apple Inc. Content based file chunking
CN102147818B (en) * 2011-05-17 2013-09-25 上海华岭集成电路技术股份有限公司 Test file compression method
CN102693325B (en) * 2012-06-12 2014-06-11 腾讯科技(深圳)有限公司 File storing method and device
CN103902567A (en) * 2012-12-26 2014-07-02 联想(北京)有限公司 Data processing method, device and system
CN104125458B (en) * 2013-04-27 2017-08-08 展讯通信(上海)有限公司 Internal storage data lossless compression method and device
CN104868922B (en) * 2014-02-24 2018-05-29 华为技术有限公司 Data compression method and apparatus
CN103873860B (en) * 2014-03-18 2017-12-22 深信服网络科技(深圳)有限公司 Document transmission method and device
CN104978319A (en) * 2014-04-02 2015-10-14 东华软件股份公司 Method and equipment used for classified transmission of files
CN106470037A (en) * 2015-08-21 2017-03-01 博雅网络游戏开发(深圳)有限公司 Intelligent compression method and system

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9912624B2 (en) 2015-09-25 2018-03-06 International Business Machines Corporation Lossy text source coding by word length
US10361980B2 (en) 2015-09-25 2019-07-23 International Business Machines Corporation Lossy text source coding by word length
CN110286917A (en) * 2019-05-21 2019-09-27 深圳壹账通智能科技有限公司 File packing method, device, equipment and storage medium
WO2020233057A1 (en) * 2019-05-21 2020-11-26 深圳壹账通智能科技有限公司 File packaging method, apparatus and device, and storage medium

Also Published As

Publication number Publication date
CN1405705A (en) 2003-03-26

Similar Documents

Publication Publication Date Title
CN1139883C (en) Intelligent compression method for file of computer
US7492290B1 (en) Alternative encoding for LZSS output
JP3299912B2 (en) Method and apparatus for compressing and decompressing fonts
EP0729237A2 (en) Adaptive multiple dictionary data compression
US20050219075A1 (en) In-place differential compression
KR940701621A (en) Adaptive Block Size Image Compression Method and System
BR0210786A (en) an equipment and method for encoding digital image data in a lossless manner
KR960032912A (en) Data compression apparatus and method for indexed color image data
CN1584875B (en) Ergodic compressing and decompressing method for batched computer document
JP2007537642A (en) Method and apparatus for compression and decompression of structured block unit of XML data
KR960009755A (en) How to compress and decompress data files
CN102982046A (en) Storage method and system for webpage data compression
US7733249B2 (en) Method and system of compressing and decompressing data
Broder et al. Pattern-based compression of text images
Shim et al. DH-LZW: lossless data hiding in LZW compression
KR20200121760A (en) Conditional transcoding for encoded data
JP2001169120A5 (en)
JP3152772B2 (en) Image data restoration device
Reif et al. REAL-TIME DYNAMIC COMPRESSION OF VIDEO ON A GRID-CONNECTED PAR-ALLEL COMPUTER
CN100530240C (en) Image compression and recovery method
JPS6118387B2 (en)
US8116580B2 (en) Embedded high frequency image details
Garg Compact Improvement in Proficiency of Huffman Coding
JP3796235B2 (en) Image compression and decompression method
JP2005260408A (en) Data compression apparatus and data compression program

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
C57 Notification of unclear or unknown address
DD01 Delivery of document by public notice

Addressee: Wang Jinbo

Document name: payment instructions

ASS Succession or assignment of patent right

Owner name: BEIJING JIUZHOU SOFTWARE CO., LTD.

Free format text: FORMER OWNER: JIUZHOU COMPUTER NETWORK CO., LTD., BEIJING

Effective date: 20050218

C41 Transfer of patent application or patent right or utility model
TR01 Transfer of patent right

Effective date of registration: 20050218

Address after: 100080, room 52, 1608 Haidian Road, Beijing, Haidian District

Patentee after: Jiuzhou Computer Network Co., Ltd., Beijing

Address before: 100080 Pacific Mansion, No. 52, Haidian Road, Beijing, Haidian District

Patentee before: Jiuzhou Computer Network Co., Ltd., Beijing

C56 Change in the name or address of the patentee

Owner name: BEIJING JIUZHOU HUIBAO SOFTWARE CO., LTD.

Free format text: FORMER NAME OR ADDRESS: BEIJING JIUZHOU SOFTWARE CO., LTD.

CP03 Change of name, title or address

Address after: 100080, room 52, 1608 Haidian Road, Beijing, Haidian District

Patentee after: Jiuzhou Huibao Software Co., Ltd., Beijing

Address before: 100080, room 52, 1608 Haidian Road, Beijing, Haidian District

Patentee before: Jiuzhou Computer Network Co., Ltd., Beijing

C56 Change in the name or address of the patentee

Owner name: BEIJING JIUZHOU S OF TWARE CO., LTD.

Free format text: FORMER NAME: BEIJING GLOGAL S OF TWARE CO., LTD.

CP03 Change of name, title or address

Address after: 100086 Beijing Haidian District Sanyi Temple No. 2 North Building 502-505

Patentee after: Jiuzhou Computer Network Co., Ltd., Beijing

Address before: 100029 Beijing city Chaoyang District Beitucheng West Road No. 3 building B block six layer Microelectronics

Patentee before: Jiuzhou Huibao Software Co., Ltd., Beijing

DD01 Delivery of document by public notice

Addressee: Wang Jinbo

Document name: Notification that Application Deemed not to be Proposed

C56 Change in the name or address of the patentee
CP02 Change in the address of a patent holder

Address after: 100044 Beijing city Xicheng District Xizhimen Street No. 135 Building No. 4 hospital 3

Patentee after: Jiuzhou Computer Network Co., Ltd., Beijing

Address before: 100086 Beijing Haidian District Sanyi Temple No. 2 North Building 502-505

Patentee before: Jiuzhou Computer Network Co., Ltd., Beijing

DD01 Delivery of document by public notice

Addressee: Jiuzhou Computer Network Co., Ltd., Beijing

Document name: Notification to Pay the Fees

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040225

Termination date: 20140820

EXPY Termination of patent right or utility model
C56 Change in the name or address of the patentee
CP02 Change in the address of a patent holder

Address after: 100081, Haidian District, Beijing, Zhongguancun South Street, No. 52, China Foreign Exchange Building 902

Patentee after: Jiuzhou Computer Network Co., Ltd., Beijing

Address before: 100044 Beijing city Xicheng District Xizhimen Street No. 135 Building No. 4 hospital 3

Patentee before: Jiuzhou Computer Network Co., Ltd., Beijing

REIN Reinstatement of patent application or patent right
REIN Reinstatement of patent application or patent right
RR01 Reinstatement of patent right

Former decision: cessation of patent right due to non-payment of the annual fee

Former decision publication date: 20151028

DD01 Delivery of document by public notice
DD01 Delivery of document by public notice

Addressee: Wang Jinbo

Document name: payment instructions

DD01 Delivery of document by public notice
DD01 Delivery of document by public notice

Addressee: Wang Jinbo

Document name: Notice of termination of patent right

CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20040225

Termination date: 20200820

CF01 Termination of patent right due to non-payment of annual fee