CA1226369A - Method and apparatus for data compression - Google Patents
Method and apparatus for data compressionInfo
- Publication number
- CA1226369A CA1226369A CA000465602A CA465602A CA1226369A CA 1226369 A CA1226369 A CA 1226369A CA 000465602 A CA000465602 A CA 000465602A CA 465602 A CA465602 A CA 465602A CA 1226369 A CA1226369 A CA 1226369A
- Authority
- CA
- Canada
- Prior art keywords
- word
- words
- text
- token
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 238000000034 method Methods 0.000 title claims abstract description 39
- 238000013144 data compression Methods 0.000 title description 2
- 230000005540 biological transmission Effects 0.000 claims 2
- 230000009467 reduction Effects 0.000 abstract description 6
- 150000002500 ions Chemical class 0.000 description 12
- CXENHBSYCFFKJS-OXYODPPFSA-N (Z,E)-alpha-farnesene Chemical compound CC(C)=CCC\C(C)=C\C\C=C(\C)C=C CXENHBSYCFFKJS-OXYODPPFSA-N 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 239000010977 jade Substances 0.000 description 3
- 101100042793 Gallus gallus SMC2 gene Proteins 0.000 description 2
- 239000004459 forage Substances 0.000 description 2
- 101150082527 ALAD gene Proteins 0.000 description 1
- XUKUURHRXDUEBC-KAYWLYCHSA-N Atorvastatin Chemical compound C=1C=CC=CC=1C1=C(C=2C=CC(F)=CC=2)N(CC[C@@H](O)C[C@@H](O)CC(O)=O)C(C(C)C)=C1C(=O)NC1=CC=CC=C1 XUKUURHRXDUEBC-KAYWLYCHSA-N 0.000 description 1
- 241001674044 Blattodea Species 0.000 description 1
- NLZUEZXRPGMBCV-UHFFFAOYSA-N Butylhydroxytoluene Chemical compound CC1=CC(C(C)(C)C)=C(O)C(C(C)(C)C)=C1 NLZUEZXRPGMBCV-UHFFFAOYSA-N 0.000 description 1
- 241000736839 Chara Species 0.000 description 1
- 101710200331 Cytochrome b-245 chaperone 1 Proteins 0.000 description 1
- 102100037186 Cytochrome b-245 chaperone 1 Human genes 0.000 description 1
- 101710119396 Cytochrome b-245 chaperone 1 homolog Proteins 0.000 description 1
- ZAKOWWREFLAJOT-CEFNRUSXSA-N D-alpha-tocopherylacetate Chemical compound CC(=O)OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C ZAKOWWREFLAJOT-CEFNRUSXSA-N 0.000 description 1
- 241000643923 Dictis Species 0.000 description 1
- 241001505295 Eros Species 0.000 description 1
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 1
- 241000276457 Gadidae Species 0.000 description 1
- 241000592718 Ibla Species 0.000 description 1
- 101100059652 Mus musculus Cetn1 gene Proteins 0.000 description 1
- 101100059655 Mus musculus Cetn2 gene Proteins 0.000 description 1
- 101100136062 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) PE10 gene Proteins 0.000 description 1
- 241000353355 Oreosoma atlanticum Species 0.000 description 1
- 101150095197 PALD1 gene Proteins 0.000 description 1
- 101150014691 PPARA gene Proteins 0.000 description 1
- 241000282860 Procaviidae Species 0.000 description 1
- 208000003251 Pruritus Diseases 0.000 description 1
- 101150057388 Reln gene Proteins 0.000 description 1
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 238000009924 canning Methods 0.000 description 1
- 235000019994 cava Nutrition 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 244000144980 herd Species 0.000 description 1
- 101150048143 lpl2 gene Proteins 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/42—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US54328683A | 1983-10-19 | 1983-10-19 | |
US543,286 | 1983-10-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA1226369A true CA1226369A (en) | 1987-09-01 |
Family
ID=24167358
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA000465602A Expired CA1226369A (en) | 1983-10-19 | 1984-10-17 | Method and apparatus for data compression |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP0160672A4 (it) |
JP (1) | JPS61500345A (it) |
CA (1) | CA1226369A (it) |
IT (1) | IT1180100B (it) |
WO (1) | WO1985001814A1 (it) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5099426A (en) * | 1989-01-19 | 1992-03-24 | International Business Machines Corporation | Method for use of morphological information to cross reference keywords used for information retrieval |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
AU5091485A (en) * | 1984-11-08 | 1986-06-03 | Datran Corp. | Symbolic tokenizer for words and phrases |
US4758955A (en) * | 1985-07-19 | 1988-07-19 | Carson Chen | Hand-held spelling checker and method for reducing redundant information in the storage of textural material |
US4949302A (en) * | 1986-11-17 | 1990-08-14 | International Business Machines Corporation | Message file formation for computer programs |
US4843389A (en) * | 1986-12-04 | 1989-06-27 | International Business Machines Corp. | Text compression and expansion method and apparatus |
WO1988009586A1 (en) * | 1987-05-25 | 1988-12-01 | Megaword International Pty. Ltd. | A method of processing a text in order to store the text in memory |
US5754847A (en) * | 1987-05-26 | 1998-05-19 | Xerox Corporation | Word/number and number/word mapping |
US5560037A (en) * | 1987-12-28 | 1996-09-24 | Xerox Corporation | Compact hyphenation point data |
DE3914589A1 (de) * | 1989-05-03 | 1990-11-08 | Bosch Gmbh Robert | Verfahren zur datenreduktion bei strassennamen |
US5325091A (en) * | 1992-08-13 | 1994-06-28 | Xerox Corporation | Text-compression technique using frequency-ordered array of word-number mappers |
CA2125337A1 (en) * | 1993-06-30 | 1994-12-31 | Marlin Jay Eller | Method and system for searching compressed data |
US6023679A (en) * | 1994-10-04 | 2000-02-08 | Amadeus Global Travel Distribution Llc | Pre- and post-ticketed travel reservation information management system |
GB2305746B (en) * | 1995-09-27 | 2000-03-29 | Canon Res Ct Europe Ltd | Data compression apparatus |
DE19681251T1 (de) * | 1995-12-14 | 1998-08-20 | Motorola Inc | Verfahren und Vorrichtung zum Speichern und Darstellen von Text |
US6012062A (en) * | 1996-03-04 | 2000-01-04 | Lucent Technologies Inc. | System for compression and buffering of a data stream with data extraction requirements |
US5883906A (en) * | 1997-08-15 | 1999-03-16 | Advantest Corp. | Pattern data compression and decompression for semiconductor test system |
DE19854179A1 (de) * | 1998-11-24 | 2000-05-25 | Siemens Ag | Verfahren und Anordnung zur Kompression bzw. Expansion von Zeichenketten durch eine DV-Einrichtung |
CN1732426A (zh) * | 2002-12-27 | 2006-02-08 | 诺基亚公司 | 用于移动通信终端的预测性文本条目和数据压缩方法 |
DE102008022184A1 (de) * | 2008-03-11 | 2009-09-24 | Navigon Ag | Verfahren zur Erzeugung einer elektronischen Adressdatenbank, Verfahren zur Durchsuchung einer elektronischen Adressdatenbank und Navigationsgerät mit einer elektronischen Adressdatenbank |
AU2013382910B2 (en) * | 2013-03-22 | 2017-01-05 | Fujitsu Limited | Compression device, compression method, decompression device, decompression method, and information processing system |
JP2020061641A (ja) * | 2018-10-09 | 2020-04-16 | 富士通株式会社 | 符号化プログラム、符号化方法、符号化装置、復号化プログラム、復号化方法および復号化装置 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3344405A (en) * | 1964-09-30 | 1967-09-26 | Ibm | Data storage and retrieval system |
US3717851A (en) * | 1971-03-03 | 1973-02-20 | Ibm | Processing of compacted data |
GB1516310A (en) * | 1974-10-29 | 1978-07-05 | Data Recording Instr Co | Information indexing and retrieval processes |
US4270182A (en) * | 1974-12-30 | 1981-05-26 | Asija Satya P | Automated information input, storage, and retrieval system |
US4189781A (en) * | 1977-01-25 | 1980-02-19 | International Business Machines Corporation | Segmented storage logging and controlling |
JPS55108075A (en) * | 1979-02-09 | 1980-08-19 | Sharp Corp | Data retrieval system |
US4356549A (en) * | 1980-04-02 | 1982-10-26 | Control Data Corporation | System page table apparatus |
US4358826A (en) * | 1980-06-30 | 1982-11-09 | International Business Machines Corporation | Apparatus for enabling byte or word addressing of storage organized on a word basis |
US4500955A (en) * | 1981-12-31 | 1985-02-19 | International Business Machines Corporation | Full word coding for information processing |
-
1984
- 1984-10-17 CA CA000465602A patent/CA1226369A/en not_active Expired
- 1984-10-17 EP EP19840903871 patent/EP0160672A4/en not_active Withdrawn
- 1984-10-17 JP JP59503813A patent/JPS61500345A/ja active Pending
- 1984-10-17 WO PCT/US1984/001667 patent/WO1985001814A1/en not_active Application Discontinuation
- 1984-10-19 IT IT68039/84A patent/IT1180100B/it active
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5099426A (en) * | 1989-01-19 | 1992-03-24 | International Business Machines Corporation | Method for use of morphological information to cross reference keywords used for information retrieval |
Also Published As
Publication number | Publication date |
---|---|
IT8468039A1 (it) | 1986-04-19 |
EP0160672A4 (en) | 1986-05-12 |
JPS61500345A (ja) | 1986-02-27 |
EP0160672A1 (en) | 1985-11-13 |
IT8468039A0 (it) | 1984-10-19 |
WO1985001814A1 (en) | 1985-04-25 |
IT1180100B (it) | 1987-09-23 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA1226369A (en) | Method and apparatus for data compression | |
US5999949A (en) | Text file compression system utilizing word terminators | |
US4610025A (en) | Cryptographic analysis system | |
US5109433A (en) | Compressing and decompressing text files | |
US5572423A (en) | Method for correcting spelling using error frequencies | |
US6119120A (en) | Computer implemented methods for constructing a compressed data structure from a data string and for using the data structure to find data patterns in the data string | |
US4384329A (en) | Retrieval of related linked linguistic expressions including synonyms and antonyms | |
US5229768A (en) | Adaptive data compression system | |
US8326605B2 (en) | Dictionary for textual data compression and decompression | |
US7026962B1 (en) | Text compression method and apparatus | |
US7010519B2 (en) | Method and system for expanding document retrieval information | |
Severance | A practitioner's guide to data base compression tutorial | |
JPH06208453A (ja) | テキスト圧縮駆動部構築方法及び入力テキスト列圧縮方法 | |
EP0052725B1 (en) | Method of reducing the print element changes in a text processing system | |
US8326604B2 (en) | Dictionary for textual data compression and decompression | |
EP0450049B1 (en) | Character encoding | |
EP0087871B1 (en) | Interactive chinese typewriter | |
EP0310147A2 (en) | Text comparator | |
JPH05324722A (ja) | 文書検索方式 | |
US8332209B2 (en) | Method and system for text compression and decompression | |
US7076423B2 (en) | Coding and storage of phonetical characteristics of strings | |
US7359850B2 (en) | Spelling and encoding method for ideographic symbols | |
Van Hintum | A computer compatible system for scoring heterogeneous populations | |
WO2002058379A1 (en) | Method for electronic transport of digital ink | |
US9143163B2 (en) | Method and system for text compression and decompression |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MKEX | Expiry | ||
MKEX | Expiry |
Effective date: 20041017 |