JPS5521384B2 - - Google Patents

Info

Publication number
JPS5521384B2
Authority
JP
Japan
Prior art keywords
character
characters
word
running total
crowding
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired
Application number
JP14913674A
Other languages
Japanese (ja)
Other versions
JPS50137037A (en)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date

Links

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00 Speech recognition
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00 Handling natural language data
    • G06F40/20 Natural language analysis
    • G06F40/232 Orthographic correction, e.g. spell checking or vowelisation

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Multimedia (AREA)
  • Artificial Intelligence (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Character Discrimination (AREA)
  • Machine Translation (AREA)
  • Document Processing Apparatus (AREA)

Abstract

1454148 Character recognition INTERNATIONAL BUSINESS MACHINES CORP 16 Dec 1974 [10 April 1974] 54200/74 Heading G4R

A character recognition machine contains a store holding dictionary words flagged to indicate a propensity to be misread in one of three ways: by HSS, where a character in the original is split to form two characters in the recognized word; by CS, where two characters in the original are joined to form one character in the recognized word; or by CRS, where two characters in the original are crowded together and recognized as two different characters because the segmentation point is misread.

The recognized word is fed to a Word Separation Detector 4. Alphabetic character fields are fed to an OCR Word Shift Register 14; the number of characters fed to the register is counted by Counter 18, and the value is fed to Shift Control 20. The end character of the word is stored in cell K1 of register 14. A Dictionary Store 28 contains words which may be read (e.g. the contents of a dictionary or a directory of street names, depending on the application) and feeds words to a Shift Register 26 with the end character of the word in cell L1. The flag bits may be in register 26 or in a separate register 34. A Flag Decode 100 examines the flag of the character stored in cell L1 and has four output lines 102 indicating a probable simple substitution, a probable character splitting (HSS), a probable character pair concatenation (CS) or a probable character pair crowding (CRS). The shift registers 14, 26 feed respective multiplex units 94, 96 which, via respective address registers 116, 122, obtain probability values from a Conditional Probability Storage Matrix 124.

The matrix contains conditional probabilities P(Kn/Lm) that the character in cell Kn was really the character in cell Lm for values n=1, m=1; n=2, m=2; n=2, m=3; n=3, m=2; P(K1K2/L1) that characters K1K2 were split versions of L1; P(K1/L1L2) that character K1 was a combination of L1L2; and P(K1K2/L1L2) that characters K1K2 were misread by crowding of characters L1L2. The timing of the operations of multiplex units 94, 96 and of a multiplex unit 128 receiving the probability values from the matrix is controlled by timer 108, which receives the Flag Decode signals. The probabilities obtained from the matrix are supplied to registers 130, 132, 134, 136, and pairs of values are multiplied and compared to determine which L character has the highest probability of forming the OCR K character and to determine whether relative shifting of the characters in registers 14, 26 should occur.

Example: OCR word IWn*C, where * indicates a rejected or unrecognized character. This could be compared with Break, Wreck or Freak. Break carries a flag bit indicating that r and e can produce crowding errors. The probabilities that K was read as C and that A was not recognized give a running total of 7.3 × 10^-5. For crowding, the best running total is 2 × 10^-9; P(I/B) = 2.0 × 10^-4, and the running total is 4 × 10^-13. Wreck carries three flag bits indicating probability of crowding of r and e, joining of c and k to form one character and splitting of W to form two characters. The probabilities give a running total of 3 × 10^-7, and then a running total of 1.1 × 10^-9 for the original word to be Wreck. The running total for Freak is 1.5 × 10^-12.
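
The arithmetic in the example can be checked with a short Python sketch. It is illustrative only and does not model the hardware described above: the names `ERROR_SHAPES`, `running_total` and `best_candidate` are hypothetical, the numeric values are the ones quoted in the abstract, and choosing the candidate with the largest running total is an assumption that the abstract implies but does not state outright.

```python
from math import prod

# Hypothetical table: for each error type named in the abstract, how many
# recognized (K) characters correspond to how many dictionary (L) characters.
ERROR_SHAPES = {
    "substitution": (1, 1),  # P(K1/L1)
    "HSS": (2, 1),           # split: P(K1K2/L1)
    "CS": (1, 2),            # concatenation: P(K1/L1L2)
    "CRS": (2, 2),           # crowding: P(K1K2/L1L2)
}

# Final running totals quoted in the abstract for the three candidates.
RUNNING_TOTALS = {
    "Break": 4e-13,
    "Wreck": 1.1e-9,
    "Freak": 1.5e-12,
}


def running_total(factors):
    """Multiply per-step conditional probabilities into one running total."""
    return prod(factors)


def best_candidate(totals):
    """Return the dictionary word with the largest running total (assumed rule)."""
    return max(totals, key=totals.get)


# Last step for "Break": running total 2e-9 multiplied by P(I/B) = 2.0e-4.
break_total = running_total([2e-9, 2.0e-4])
winner = best_candidate(RUNNING_TOTALS)
```

Under these assumptions the last step for Break reproduces the quoted 4 × 10^-13, and Wreck has the largest of the three quoted totals.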
JP14913674A 1974-04-10 1974-12-27 Expired JPS5521384B2 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US45982074A 1974-04-10 1974-04-10

Publications (2)

Publication Number Publication Date
JPS50137037A (en) 1975-10-30
JPS5521384B2 (en) 1980-06-09

Family

ID=23826267

Family Applications (1)

Application Number Title Priority Date Filing Date
JP14913674A Expired JPS5521384B2 (en) 1974-04-10 1974-12-27

Country Status (8)

Country Link
JP (1) JPS5521384B2 (en)
BE (1) BE824366A (en)
CA (1) CA1062810A (en)
DE (1) DE2460757C2 (en)
FR (1) FR2267590B1 (en)
GB (1) GB1454148A (en)
IT (1) IT1033223B (en)
NL (1) NL7503946A (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH01167469U (en) * 1988-05-14 1989-11-24
JPH01176171U (en) * 1988-06-03 1989-12-15

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4201881A (en) * 1979-03-28 1980-05-06 Wisconsin Alumni Research Foundation 24,24-Difluoro-1α,25-dihydroxycholecalciferol
JPS6055866B2 (en) * 1983-05-09 1985-12-06 株式会社日立製作所 character recognition device
JPS6274181A (en) * 1985-09-27 1987-04-04 Sony Corp Character recognizing device
JPS6297081A (en) * 1986-10-08 1987-05-06 Hitachi Ltd Character recognizer
GB2289969A (en) * 1994-05-24 1995-12-06 Ibm Character segmentation

Also Published As

Publication number Publication date
DE2460757C2 (en) 1983-08-18
IT1033223B (en) 1979-07-10
GB1454148A (en) 1976-10-27
FR2267590A1 (en) 1975-11-07
FR2267590B1 (en) 1977-05-20
DE2460757A1 (en) 1975-10-23
CA1062810A (en) 1979-09-18
BE824366A (en) 1975-05-02
JPS50137037A (en) 1975-10-30
NL7503946A (en) 1975-10-14

Similar Documents

Publication Publication Date Title
GB1500203A (en) Cluster storage apparatus
JPS54500112A (en)
JPS5521384B2 (en)
GB1332631A (en) Data processing system
GB1018330A (en)
GB963554A (en) Systems for identifying manifestations, for example, speech
JPS57146380A (en) Address reader
JPS56147269A (en) Electronic translator
GB1536628A (en) Data encoding apparatus
Corfis Fernando de Rojas and Albrecht von Eyb's" Margarita Poetica"
GB1285288A (en) Multiple character generator
SU885078A1 (en) Unit for automatic syllabification
GB1070422A (en) Improvements in or relating to the comparison of data in data processing apparatus
Niklaus John Lough," The Contributors to the Encyclopédie"(Book Review)
Lavine ROSSLYN, FELICITY." Pope's Annotations to Tickell's" Iliad" Book One,"" RES", 30 (February 1979), 45-59
JPS6441061A (en) Automatic correcting device for wrong character of japanese sentence
JPS5624669A (en) System for converting kana (japanese syllabary) letter row into same sound word row
GB928989A (en) Improvements in and relating to knife-edge bearings
Rogers Two Unrecorded Letters by Daniel Defoe
CN1079829A (en) Chinese character input method for computer
Miharlovich Selig O. Wassner:" Treasure of Russian Short Stories 1900-1966"(Book Review)
Bieler Latin Bookhands of the Later Middle Ages 1100-1500
Prachar Review 13--No Title
Bülow-Jacobsen Bärbel Kramer, Michael Erler, Dieter Hagedorn, Robert Hübner: Kölner Papyri (P. Köln), Band 3.(Papyrologica Coloniensia, VII.) Pp. 218; 34 plates (halftone). Opladen: Westdeutscher Verlag, 1980. DM. 56.
Whallon The Song of Roland: Formulaic Style and Poetic Craft