CA1226369A - Method and apparatus for data compression - Google Patents

Method and apparatus for data compression

Info

Publication number
CA1226369A
CA1226369A CA000465602A CA465602A CA1226369A CA 1226369 A CA1226369 A CA 1226369A CA 000465602 A CA000465602 A CA 000465602A CA 465602 A CA465602 A CA 465602A CA 1226369 A CA1226369 A CA 1226369A
Authority
CA
Canada
Prior art keywords
word
words
text
token
dictionary
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
CA000465602A
Other languages
English (en)
French (fr)
Inventor
Louie D. Tague
Allen T. Cobb
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
TEXT SCIENCES Corp
Original Assignee
TEXT SCIENCES Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by TEXT SCIENCES Corp filed Critical TEXT SCIENCES Corp
Application granted granted Critical
Publication of CA1226369A publication Critical patent/CA1226369A/en
Expired legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • H03M7/40Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code
    • H03M7/42Conversion to or from variable length codes, e.g. Shannon-Fano code, Huffman code, Morse code using table look-up for the coding or decoding process, e.g. using read-only memory
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Artificial Intelligence (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • General Health & Medical Sciences (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
CA000465602A 1983-10-19 1984-10-17 Method and apparatus for data compression Expired CA1226369A (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US54328683A 1983-10-19 1983-10-19
US543,286 1983-10-19

Publications (1)

Publication Number Publication Date
CA1226369A true CA1226369A (en) 1987-09-01

Family

ID=24167358

Family Applications (1)

Application Number Title Priority Date Filing Date
CA000465602A Expired CA1226369A (en) 1983-10-19 1984-10-17 Method and apparatus for data compression

Country Status (5)

Country Link
EP (1) EP0160672A4 (it)
JP (1) JPS61500345A (it)
CA (1) CA1226369A (it)
IT (1) IT1180100B (it)
WO (1) WO1985001814A1 (it)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5099426A (en) * 1989-01-19 1992-03-24 International Business Machines Corporation Method for use of morphological information to cross reference keywords used for information retrieval

Families Citing this family (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU5091485A (en) * 1984-11-08 1986-06-03 Datran Corp. Symbolic tokenizer for words and phrases
US4758955A (en) * 1985-07-19 1988-07-19 Carson Chen Hand-held spelling checker and method for reducing redundant information in the storage of textural material
US4949302A (en) * 1986-11-17 1990-08-14 International Business Machines Corporation Message file formation for computer programs
US4843389A (en) * 1986-12-04 1989-06-27 International Business Machines Corp. Text compression and expansion method and apparatus
WO1988009586A1 (en) * 1987-05-25 1988-12-01 Megaword International Pty. Ltd. A method of processing a text in order to store the text in memory
US5754847A (en) * 1987-05-26 1998-05-19 Xerox Corporation Word/number and number/word mapping
US5560037A (en) * 1987-12-28 1996-09-24 Xerox Corporation Compact hyphenation point data
DE3914589A1 (de) * 1989-05-03 1990-11-08 Bosch Gmbh Robert Verfahren zur datenreduktion bei strassennamen
US5325091A (en) * 1992-08-13 1994-06-28 Xerox Corporation Text-compression technique using frequency-ordered array of word-number mappers
CA2125337A1 (en) * 1993-06-30 1994-12-31 Marlin Jay Eller Method and system for searching compressed data
US6023679A (en) * 1994-10-04 2000-02-08 Amadeus Global Travel Distribution Llc Pre- and post-ticketed travel reservation information management system
GB2305746B (en) * 1995-09-27 2000-03-29 Canon Res Ct Europe Ltd Data compression apparatus
DE19681251T1 (de) * 1995-12-14 1998-08-20 Motorola Inc Verfahren und Vorrichtung zum Speichern und Darstellen von Text
US6012062A (en) * 1996-03-04 2000-01-04 Lucent Technologies Inc. System for compression and buffering of a data stream with data extraction requirements
US5883906A (en) * 1997-08-15 1999-03-16 Advantest Corp. Pattern data compression and decompression for semiconductor test system
DE19854179A1 (de) * 1998-11-24 2000-05-25 Siemens Ag Verfahren und Anordnung zur Kompression bzw. Expansion von Zeichenketten durch eine DV-Einrichtung
CN1732426A (zh) * 2002-12-27 2006-02-08 诺基亚公司 用于移动通信终端的预测性文本条目和数据压缩方法
DE102008022184A1 (de) * 2008-03-11 2009-09-24 Navigon Ag Verfahren zur Erzeugung einer elektronischen Adressdatenbank, Verfahren zur Durchsuchung einer elektronischen Adressdatenbank und Navigationsgerät mit einer elektronischen Adressdatenbank
AU2013382910B2 (en) * 2013-03-22 2017-01-05 Fujitsu Limited Compression device, compression method, decompression device, decompression method, and information processing system
JP2020061641A (ja) * 2018-10-09 2020-04-16 富士通株式会社 符号化プログラム、符号化方法、符号化装置、復号化プログラム、復号化方法および復号化装置

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3344405A (en) * 1964-09-30 1967-09-26 Ibm Data storage and retrieval system
US3717851A (en) * 1971-03-03 1973-02-20 Ibm Processing of compacted data
GB1516310A (en) * 1974-10-29 1978-07-05 Data Recording Instr Co Information indexing and retrieval processes
US4270182A (en) * 1974-12-30 1981-05-26 Asija Satya P Automated information input, storage, and retrieval system
US4189781A (en) * 1977-01-25 1980-02-19 International Business Machines Corporation Segmented storage logging and controlling
JPS55108075A (en) * 1979-02-09 1980-08-19 Sharp Corp Data retrieval system
US4356549A (en) * 1980-04-02 1982-10-26 Control Data Corporation System page table apparatus
US4358826A (en) * 1980-06-30 1982-11-09 International Business Machines Corporation Apparatus for enabling byte or word addressing of storage organized on a word basis
US4500955A (en) * 1981-12-31 1985-02-19 International Business Machines Corporation Full word coding for information processing

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5099426A (en) * 1989-01-19 1992-03-24 International Business Machines Corporation Method for use of morphological information to cross reference keywords used for information retrieval

Also Published As

Publication number Publication date
IT8468039A1 (it) 1986-04-19
EP0160672A4 (en) 1986-05-12
JPS61500345A (ja) 1986-02-27
EP0160672A1 (en) 1985-11-13
IT8468039A0 (it) 1984-10-19
WO1985001814A1 (en) 1985-04-25
IT1180100B (it) 1987-09-23

Similar Documents

Publication Publication Date Title
CA1226369A (en) Method and apparatus for data compression
US5999949A (en) Text file compression system utilizing word terminators
US4610025A (en) Cryptographic analysis system
US5109433A (en) Compressing and decompressing text files
US5572423A (en) Method for correcting spelling using error frequencies
US6119120A (en) Computer implemented methods for constructing a compressed data structure from a data string and for using the data structure to find data patterns in the data string
US4384329A (en) Retrieval of related linked linguistic expressions including synonyms and antonyms
US5229768A (en) Adaptive data compression system
US8326605B2 (en) Dictionary for textual data compression and decompression
US7026962B1 (en) Text compression method and apparatus
US7010519B2 (en) Method and system for expanding document retrieval information
Severance A practitioner's guide to data base compression tutorial
JPH06208453A (ja) テキスト圧縮駆動部構築方法及び入力テキスト列圧縮方法
EP0052725B1 (en) Method of reducing the print element changes in a text processing system
US8326604B2 (en) Dictionary for textual data compression and decompression
EP0450049B1 (en) Character encoding
EP0087871B1 (en) Interactive chinese typewriter
EP0310147A2 (en) Text comparator
JPH05324722A (ja) 文書検索方式
US8332209B2 (en) Method and system for text compression and decompression
US7076423B2 (en) Coding and storage of phonetical characteristics of strings
US7359850B2 (en) Spelling and encoding method for ideographic symbols
Van Hintum A computer compatible system for scoring heterogeneous populations
WO2002058379A1 (en) Method for electronic transport of digital ink
US9143163B2 (en) Method and system for text compression and decompression

Legal Events

Date Code Title Description
MKEX Expiry
MKEX Expiry

Effective date: 20041017