CA1226369A - Methode et dispositif de compression de donnees - Google Patents
Methode et dispositif de compression de donneesInfo
- Publication number
- CA1226369A CA1226369A CA000465602A CA465602A CA1226369A CA 1226369 A CA1226369 A CA 1226369A CA 000465602 A CA000465602 A CA 000465602A CA 465602 A CA465602 A CA 465602A CA 1226369 A CA1226369 A CA 1226369A
- Authority
- CA
- Canada
- Prior art keywords
- word
- words
- text
- token
- dictionary
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 238000000034 method Methods 0.000 title claims abstract description 39
- 238000013144 data compression Methods 0.000 title description 2
- 230000005540 biological transmission Effects 0.000 claims 2
- 230000009467 reduction Effects 0.000 abstract description 6
- 150000002500 ions Chemical class 0.000 description 12
- CXENHBSYCFFKJS-OXYODPPFSA-N (Z,E)-alpha-farnesene Chemical compound CC(C)=CCC\C(C)=C\C\C=C(\C)C=C CXENHBSYCFFKJS-OXYODPPFSA-N 0.000 description 5
- 238000012360 testing method Methods 0.000 description 5
- 239000010977 jade Substances 0.000 description 3
- 101100042793 Gallus gallus SMC2 gene Proteins 0.000 description 2
- 239000004459 forage Substances 0.000 description 2
- 101150082527 ALAD gene Proteins 0.000 description 1
- XUKUURHRXDUEBC-KAYWLYCHSA-N Atorvastatin Chemical compound C=1C=CC=CC=1C1=C(C=2C=CC(F)=CC=2)N(CC[C@@H](O)C[C@@H](O)CC(O)=O)C(C(C)C)=C1C(=O)NC1=CC=CC=C1 XUKUURHRXDUEBC-KAYWLYCHSA-N 0.000 description 1
- 241001674044 Blattodea Species 0.000 description 1
- NLZUEZXRPGMBCV-UHFFFAOYSA-N Butylhydroxytoluene Chemical compound CC1=CC(C(C)(C)C)=C(O)C(C(C)(C)C)=C1 NLZUEZXRPGMBCV-UHFFFAOYSA-N 0.000 description 1
- 241000736839 Chara Species 0.000 description 1
- 101710200331 Cytochrome b-245 chaperone 1 Proteins 0.000 description 1
- 102100037186 Cytochrome b-245 chaperone 1 Human genes 0.000 description 1
- 101710119396 Cytochrome b-245 chaperone 1 homolog Proteins 0.000 description 1
- ZAKOWWREFLAJOT-CEFNRUSXSA-N D-alpha-tocopherylacetate Chemical compound CC(=O)OC1=C(C)C(C)=C2O[C@@](CCC[C@H](C)CCC[C@H](C)CCCC(C)C)(C)CCC2=C1C ZAKOWWREFLAJOT-CEFNRUSXSA-N 0.000 description 1
- 241000643923 Dictis Species 0.000 description 1
- 241001505295 Eros Species 0.000 description 1
- ULGZDMOVFRHVEP-RWJQBGPGSA-N Erythromycin Chemical compound O([C@@H]1[C@@H](C)C(=O)O[C@@H]([C@@]([C@H](O)[C@@H](C)C(=O)[C@H](C)C[C@@](C)(O)[C@H](O[C@H]2[C@@H]([C@H](C[C@@H](C)O2)N(C)C)O)[C@H]1C)(C)O)CC)[C@H]1C[C@@](C)(OC)[C@@H](O)[C@H](C)O1 ULGZDMOVFRHVEP-RWJQBGPGSA-N 0.000 description 1
- 241000276457 Gadidae Species 0.000 description 1
- 241000592718 Ibla Species 0.000 description 1
- 101100059652 Mus musculus Cetn1 gene Proteins 0.000 description 1
- 101100059655 Mus musculus Cetn2 gene Proteins 0.000 description 1
- 101100136062 Mycobacterium tuberculosis (strain ATCC 25618 / H37Rv) PE10 gene Proteins 0.000 description 1
- 241000353355 Oreosoma atlanticum Species 0.000 description 1
- 101150095197 PALD1 gene Proteins 0.000 description 1
- 101150014691 PPARA gene Proteins 0.000 description 1
- 241000282860 Procaviidae Species 0.000 description 1
- 208000003251 Pruritus Diseases 0.000 description 1
- 101150057388 Reln gene Proteins 0.000 description 1
- ATJFFYVFTNAWJD-UHFFFAOYSA-N Tin Chemical compound [Sn] ATJFFYVFTNAWJD-UHFFFAOYSA-N 0.000 description 1
- 239000000956 alloy Substances 0.000 description 1
- 229910045601 alloy Inorganic materials 0.000 description 1
- 238000013459 approach Methods 0.000 description 1
- 230000001174 ascending effect Effects 0.000 description 1
- 238000009924 canning Methods 0.000 description 1
- 235000019994 cava Nutrition 0.000 description 1
- ZPUCINDJVBIVPJ-LJISPDSOSA-N cocaine Chemical compound O([C@H]1C[C@@H]2CC[C@@H](N2C)[C@H]1C(=O)OC)C(=O)C1=CC=CC=C1 ZPUCINDJVBIVPJ-LJISPDSOSA-N 0.000 description 1
- 238000004590 computer program Methods 0.000 description 1
- 238000013500 data storage Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 244000144980 herd Species 0.000 description 1
- 101150048143 lpl2 gene Proteins 0.000 description 1
- 230000000873 masking effect Effects 0.000 description 1
- 239000000463 material Substances 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000012546 transfer Methods 0.000 description 1
Classifications
-
- H—ELECTRICITY
- H03—ELECTRONIC CIRCUITRY
- H03M—CODING; DECODING; CODE CONVERSION IN GENERAL
- H03M7/00—Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
- H03M7/30—Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
- H03M7/40—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
- H03M7/42—Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Computational Linguistics (AREA)
- General Health & Medical Sciences (AREA)
- Physics & Mathematics (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US54328683A | 1983-10-19 | 1983-10-19 | |
US543,286 | 1983-10-19 |
Publications (1)
Publication Number | Publication Date |
---|---|
CA1226369A true CA1226369A (fr) | 1987-09-01 |
Family
ID=24167358
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA000465602A Expired CA1226369A (fr) | 1983-10-19 | 1984-10-17 | Methode et dispositif de compression de donnees |
Country Status (5)
Country | Link |
---|---|
EP (1) | EP0160672A4 (fr) |
JP (1) | JPS61500345A (fr) |
CA (1) | CA1226369A (fr) |
IT (1) | IT1180100B (fr) |
WO (1) | WO1985001814A1 (fr) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5099426A (en) * | 1989-01-19 | 1992-03-24 | International Business Machines Corporation | Method for use of morphological information to cross reference keywords used for information retrieval |
Families Citing this family (20)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO1986003039A1 (fr) * | 1984-11-08 | 1986-05-22 | Datran Corporation | Systeme d'identification symbolique de mots et de phrases |
US4758955A (en) * | 1985-07-19 | 1988-07-19 | Carson Chen | Hand-held spelling checker and method for reducing redundant information in the storage of textural material |
US4949302A (en) * | 1986-11-17 | 1990-08-14 | International Business Machines Corporation | Message file formation for computer programs |
US4843389A (en) * | 1986-12-04 | 1989-06-27 | International Business Machines Corp. | Text compression and expansion method and apparatus |
WO1988009586A1 (fr) * | 1987-05-25 | 1988-12-01 | Megaword International Pty. Ltd. | Procede de traitement d'un texte permettant de garder le texte en memoire |
US5754847A (en) * | 1987-05-26 | 1998-05-19 | Xerox Corporation | Word/number and number/word mapping |
US5560037A (en) * | 1987-12-28 | 1996-09-24 | Xerox Corporation | Compact hyphenation point data |
DE3914589A1 (de) * | 1989-05-03 | 1990-11-08 | Bosch Gmbh Robert | Verfahren zur datenreduktion bei strassennamen |
US5325091A (en) * | 1992-08-13 | 1994-06-28 | Xerox Corporation | Text-compression technique using frequency-ordered array of word-number mappers |
CA2125337A1 (fr) * | 1993-06-30 | 1994-12-31 | Marlin Jay Eller | Methode et systeme d'exploration de donnees comprimees |
US6023679A (en) * | 1994-10-04 | 2000-02-08 | Amadeus Global Travel Distribution Llc | Pre- and post-ticketed travel reservation information management system |
GB2305746B (en) * | 1995-09-27 | 2000-03-29 | Canon Res Ct Europe Ltd | Data compression apparatus |
AU1082397A (en) * | 1995-12-14 | 1997-07-03 | Motorola, Inc. | Apparatus and method for storing and presenting text |
US6012062A (en) * | 1996-03-04 | 2000-01-04 | Lucent Technologies Inc. | System for compression and buffering of a data stream with data extraction requirements |
US5883906A (en) * | 1997-08-15 | 1999-03-16 | Advantest Corp. | Pattern data compression and decompression for semiconductor test system |
DE19854179A1 (de) * | 1998-11-24 | 2000-05-25 | Siemens Ag | Verfahren und Anordnung zur Kompression bzw. Expansion von Zeichenketten durch eine DV-Einrichtung |
CN1732426A (zh) * | 2002-12-27 | 2006-02-08 | 诺基亚公司 | 用于移动通信终端的预测性文本条目和数据压缩方法 |
DE102008022184A1 (de) * | 2008-03-11 | 2009-09-24 | Navigon Ag | Verfahren zur Erzeugung einer elektronischen Adressdatenbank, Verfahren zur Durchsuchung einer elektronischen Adressdatenbank und Navigationsgerät mit einer elektronischen Adressdatenbank |
KR101750646B1 (ko) | 2013-03-22 | 2017-06-23 | 후지쯔 가부시끼가이샤 | 압축 장치, 압축 방법, 신장 장치, 신장 방법 및 정보 처리 시스템 |
JP2020061641A (ja) * | 2018-10-09 | 2020-04-16 | 富士通株式会社 | 符号化プログラム、符号化方法、符号化装置、復号化プログラム、復号化方法および復号化装置 |
Family Cites Families (9)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3344405A (en) * | 1964-09-30 | 1967-09-26 | Ibm | Data storage and retrieval system |
US3717851A (en) * | 1971-03-03 | 1973-02-20 | Ibm | Processing of compacted data |
GB1516310A (en) * | 1974-10-29 | 1978-07-05 | Data Recording Instr Co | Information indexing and retrieval processes |
US4270182A (en) * | 1974-12-30 | 1981-05-26 | Asija Satya P | Automated information input, storage, and retrieval system |
US4189781A (en) * | 1977-01-25 | 1980-02-19 | International Business Machines Corporation | Segmented storage logging and controlling |
JPS55108075A (en) * | 1979-02-09 | 1980-08-19 | Sharp Corp | Data retrieval system |
US4356549A (en) * | 1980-04-02 | 1982-10-26 | Control Data Corporation | System page table apparatus |
US4358826A (en) * | 1980-06-30 | 1982-11-09 | International Business Machines Corporation | Apparatus for enabling byte or word addressing of storage organized on a word basis |
US4500955A (en) * | 1981-12-31 | 1985-02-19 | International Business Machines Corporation | Full word coding for information processing |
-
1984
- 1984-10-17 CA CA000465602A patent/CA1226369A/fr not_active Expired
- 1984-10-17 EP EP19840903871 patent/EP0160672A4/fr not_active Withdrawn
- 1984-10-17 WO PCT/US1984/001667 patent/WO1985001814A1/fr not_active Application Discontinuation
- 1984-10-17 JP JP59503813A patent/JPS61500345A/ja active Pending
- 1984-10-19 IT IT68039/84A patent/IT1180100B/it active
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5099426A (en) * | 1989-01-19 | 1992-03-24 | International Business Machines Corporation | Method for use of morphological information to cross reference keywords used for information retrieval |
Also Published As
Publication number | Publication date |
---|---|
IT8468039A1 (it) | 1986-04-19 |
JPS61500345A (ja) | 1986-02-27 |
WO1985001814A1 (fr) | 1985-04-25 |
IT1180100B (it) | 1987-09-23 |
EP0160672A1 (fr) | 1985-11-13 |
IT8468039A0 (it) | 1984-10-19 |
EP0160672A4 (fr) | 1986-05-12 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CA1226369A (fr) | Methode et dispositif de compression de donnees | |
US5663721A (en) | Method and apparatus using code values and length fields for compressing computer data | |
EP0294950B1 (fr) | Procédé pour faciliter le triage par ordinateur | |
US4599612A (en) | Displaying and correcting method for machine translation system | |
WO1998040969A2 (fr) | Systeme de compression de fichiers texte | |
US6119120A (en) | Computer implemented methods for constructing a compressed data structure from a data string and for using the data structure to find data patterns in the data string | |
US5229768A (en) | Adaptive data compression system | |
US8326605B2 (en) | Dictionary for textual data compression and decompression | |
US7026962B1 (en) | Text compression method and apparatus | |
JPH06208453A (ja) | テキスト圧縮駆動部構築方法及び入力テキスト列圧縮方法 | |
GB2097974A (en) | Spelling error detector apparatus and methods | |
US20020169763A1 (en) | Method and system for expanding document retrieval information | |
US5585793A (en) | Order preserving data translation | |
EP0052725A1 (fr) | Procédé pour la réduction de changements de l'élément d'impression dans un système de traitement de texte | |
US8326604B2 (en) | Dictionary for textual data compression and decompression | |
White | Printed English compression by dictionary encoding | |
US5930756A (en) | Method, device and system for a memory-efficient random-access pronunciation lexicon for text-to-speech synthesis | |
EP0450049B1 (fr) | Codage de caracteres | |
US4531201A (en) | Text comparator | |
Alhawiti | Adaptive models of Arabic text | |
JPH05324722A (ja) | 文書検索方式 | |
US8332209B2 (en) | Method and system for text compression and decompression | |
Van Hintum | A computer compatible system for scoring heterogeneous populations | |
US20010032073A1 (en) | Coding and storage of phonetical characteristics of strings | |
US20050080612A1 (en) | Spelling and encoding method for ideographic symbols |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
MKEX | Expiry | ||
MKEX | Expiry |
Effective date: 20041017 |