ATE203604T1 - CATEGORIZING STRINGS IN CHARACTER RECOGNITION. - Google Patents

CATEGORIZING STRINGS IN CHARACTER RECOGNITION.

Info

Publication number
ATE203604T1
ATE203604T1 AT93906168T AT93906168T ATE203604T1 AT E203604 T1 ATE203604 T1 AT E203604T1 AT 93906168 T AT93906168 T AT 93906168T AT 93906168 T AT93906168 T AT 93906168T AT E203604 T1 ATE203604 T1 AT E203604T1
Authority
AT
Austria
Prior art keywords
string
ending
acceptance
categories
subsequence
Prior art date
Application number
AT93906168T
Other languages
German (de)
Inventor
Ronald M Kaplan
Robert Shuchatowitz
Atty T Mullins
Original Assignee
Xerox Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xerox Corp filed Critical Xerox Corp
Application granted granted Critical
Publication of ATE203604T1 publication Critical patent/ATE203604T1/en

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/90Details of database functions independent of the retrieved data types
    • G06F16/903Querying
    • G06F16/90335Query processing
    • G06F16/90344Query processing by using string matching techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Databases & Information Systems (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Data Mining & Analysis (AREA)
  • Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Character Discrimination (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Machine Translation (AREA)

Abstract

A system (10) for use in character (16) recognition (374, 382) in which a directed graph representing a combination of finite state machines implementing string recognition algorithms for a variety of string categories is used to process string data (12) containing a particular ending subsequence. The ending (18, 70, 108) subsequence includes acceptance (36, 80) information indicating whether (80) the string (12) is acceptable. If so, the ending subsequence (18, 70, 108) also includes information indicating a set of categories (36, 82) including words (204, 206), numbers (208, 212), coumpounds words (210), and so forth. The acceptance (18, 36, 108) information can include a bit indicating the character type of the ending character, and an acceptance (18) data unit which indicates an acceptable string ending. The acceptance (18) data unit can be followed by category (18) data units, each indicating a category, which can be used to obtain a bit vector for a string (12), each bit of which indicates whether the string (12) is in one of the categories.
AT93906168T 1993-02-23 1993-02-23 CATEGORIZING STRINGS IN CHARACTER RECOGNITION. ATE203604T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US1993/001590 WO1994019757A1 (en) 1993-02-23 1993-02-23 Categorizing strings in character recognition

Publications (1)

Publication Number Publication Date
ATE203604T1 true ATE203604T1 (en) 2001-08-15

Family

ID=22236346

Family Applications (1)

Application Number Title Priority Date Filing Date
AT93906168T ATE203604T1 (en) 1993-02-23 1993-02-23 CATEGORIZING STRINGS IN CHARACTER RECOGNITION.

Country Status (6)

Country Link
EP (1) EP0638187B1 (en)
JP (1) JPH07506207A (en)
AT (1) ATE203604T1 (en)
DE (1) DE69330493T2 (en)
DK (1) DK0638187T3 (en)
WO (1) WO1994019757A1 (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7119577B2 (en) 2002-08-28 2006-10-10 Cisco Systems, Inc. Method and apparatus for efficient implementation and evaluation of state machines and programmable finite state automata
US7451143B2 (en) 2002-08-28 2008-11-11 Cisco Technology, Inc. Programmable rule processing apparatus for conducting high speed contextual searches and characterizations of patterns in data
US7464254B2 (en) * 2003-01-09 2008-12-09 Cisco Technology, Inc. Programmable processor apparatus integrating dedicated search registers and dedicated state machine registers with associated execution hardware to support rapid application of rulesets to data
US7085918B2 (en) 2003-01-09 2006-08-01 Cisco Systems, Inc. Methods and apparatuses for evaluation of regular expressions of arbitrary size

Family Cites Families (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4034343A (en) * 1976-10-01 1977-07-05 Xerox Corporation Optical character recognition system
US4499553A (en) * 1981-09-30 1985-02-12 Dickinson Robert V Locating digital coded words which are both acceptable misspellings and acceptable inflections of digital coded query words
DE3523042A1 (en) * 1984-06-28 1986-01-02 Canon K.K., Tokio/Tokyo IMAGE PROCESSING SYSTEM
JPH0724055B2 (en) * 1984-07-31 1995-03-15 株式会社日立製作所 Word division processing method
JPH0797373B2 (en) * 1985-08-23 1995-10-18 株式会社日立製作所 Document matching system
US4862408A (en) * 1987-03-20 1989-08-29 International Business Machines Corporation Paradigm-based morphological text analysis for natural languages
JPH0664631B2 (en) * 1987-09-09 1994-08-22 インターナショナル・ビジネス・マシーンズ・コーポレーション Character recognition device

Also Published As

Publication number Publication date
DE69330493D1 (en) 2001-08-30
DE69330493T2 (en) 2001-11-22
EP0638187B1 (en) 2001-07-25
WO1994019757A1 (en) 1994-09-01
JPH07506207A (en) 1995-07-06
EP0638187A4 (en) 1995-08-23
DK0638187T3 (en) 2001-09-24
EP0638187A1 (en) 1995-02-15

Similar Documents

Publication Publication Date Title
HK1058588A1 (en) Information processing system and method.
ATE120291T1 (en) PORTABLE ELECTRONIC DEVICE TO LINK THE PUBLIC TO MEDIA OR SIMILAR.
PH27313A (en) Method and apparatus for concurrent modification of an index tree in a transaction processing system utilizing selective indication of structural modification operations
DK0644499T3 (en) Accessibility processor and method
SE418021B (en) DIGITAL REFERENCE MATTER FOR VERIFYING ALPHABETICAL WORDS AS VALID SPEECH EXPRESSIONS
DE59109042D1 (en) METHOD FOR AUTHENTICATING A USER USING A DATA STATION
EP0251056A3 (en) Cache tag lookaside
IT1244938B (en) DATA INTERROGATION SYSTEM IN THE DATABASES AND DATABASES.
EP0272821A3 (en) Method and apparatus for computation stack recovery in a calculator
ATE203604T1 (en) CATEGORIZING STRINGS IN CHARACTER RECOGNITION.
DE3576090D1 (en) PROCESSING ARRANGEMENT AND METHOD.
DE69401035D1 (en) Portable language learning device
FI865300A0 (en) PROCESS FOER OEVERVAKNING AV EN DATABEHANDLINGSENHET OCH SYSTEM FOER UTFOERANDE AV PROCESSEN.
Federici et al. Advances in analogy-based learning: False friends and exceptional items in pronunciation by paradigm-driven analogy
JPS5790782A (en) Retrieving system
JPS643772A (en) Sentence processing unit
JPH0410104B2 (en)
JPH04335464A (en) Dictionary storage device
HUP0201335A2 (en) Method of using at least one computer to construct a technical plan for a specific process, computer-based system as well as computer-readable medium
Dreyfus-Graf Redundancy analysis: A guide-line through speech recognition
JPS576967A (en) Data transfer system
JPS57121727A (en) Character assigning method of word processor
JPS6464048A (en) Information protecting device
JPS6448131A (en) Data base processor
JPS5790757A (en) Sort-merge process system

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties