JPS5521384B2 - - Google Patents
Info
- Publication number
- JPS5521384B2 (application JP14913674A)
- Authority
- JP
- Japan
- Prior art keywords
- character
- characters
- word
- running total
- crowding
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired
Links
- 239000011159 matrix material Substances 0.000 abstract 4
- 230000011218 segmentation Effects 0.000 abstract 1
- 238000000926 separation method Methods 0.000 abstract 1
- 238000006467 substitution reaction Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/20—Natural language analysis
- G06F40/232—Orthographic correction, e.g. spell checking or vowelisation
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Physics & Mathematics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Artificial Intelligence (AREA)
- General Health & Medical Sciences (AREA)
- General Engineering & Computer Science (AREA)
- General Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Character Discrimination (AREA)
- Machine Translation (AREA)
- Document Processing Apparatus (AREA)
Abstract
1454148 Character recognition INTERNATIONAL BUSINESS MACHINES CORP 16 Dec 1974 [10 April 1974] 54200/74 Heading G4R
A character recognition machine contains a store holding dictionary words flagged to indicate a propensity to be misread in one of three ways: by HSS, where a character in the original is split to form two characters in the recognized word; by CS, where two characters in the original are joined to form one character in the recognized word; or by CRS, where two characters in the original are crowded together and recognized as two different characters because of a misread of the segmentation point. The recognized word is fed to a Word Separation Detector 4. Alphabetic character fields are fed to an OCR Word Shift Register 14; the number of characters fed to the register is counted by Counter 18 and the value is fed to Shift Control 20. The end character of the word is stored in cell K1 of register 14. A dictionary store 28 contains words which may be read (e.g. the contents of a dictionary or a directory of street names, depending on the application) and feeds words to a Shift Register 26 with the end character of the word in cell L1. The flag bits may be in register 26 or in a separate register 34. A Flag Decode 100 examines the flag of the character stored in cell L1 and has four output lines 102 indicating a probable simple substitution, a probable character splitting (HSS), a probable character pair concatenation (CS) or a probable character pair crowding (CRS). The shift registers 14, 26 feed respective multiplex units 94, 96 which, via respective address registers 116, 122, obtain probability values from a Conditional Probability Storage Matrix 124.
The matrix contains conditional probabilities P(Kn/Lm) that the character in cell Kn was really the character in cell Lm for values n=1, m=1; n=2, m=2; n=2, m=3; n=3, m=2; P(K1K2/L1) that characters K1K2 were split versions of L1; P(K1/L1L2) that character K1 was a combination of L1L2; and P(K1K2/L1L2) that characters K1K2 were misread by crowding of characters L1L2. Timing of the operations of multiplex units 94, 96, and of a multiplex unit 128 receiving the probability values from the matrix, is controlled by timer 108, which receives the Flag Decode signals. The probabilities obtained from the matrix are supplied to registers 130, 132, 134, 136, and pairs of values are multiplied and compared to determine which L character is most likely to have produced the OCR K character and whether relative shifting of the characters in registers 14, 26 should occur.
Example: OCR word IWn*C, where * indicates a rejected or unrecognized character. This could be compared with Break, Wreck or Freak. Break carries a flag bit indicating that r and e can produce crowding errors. The probabilities that K was read as C and that A was not recognized give a running total of 7.3 × 10⁻⁵. For crowding, the best running total is 2 × 10⁻⁹; P(I/B) = 2.0 × 10⁻⁴, and the running total is 4 × 10⁻¹³. Wreck carries three flag bits indicating probability of crowding of r and e, joining of c and k to form one character, and splitting of W to form two characters. The probabilities give a running total of 3 × 10⁻⁷ and then a running total of 1.1 × 10⁻⁹ for the original word to be Wreck. The running total for Freak is 1.5 × 10⁻¹².
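The scoring idea in the abstract can be illustrated with a small Python sketch. It is not the patented circuit: the shift registers, multiplexers and timer are replaced here by a memoized recursion that works from the end of both words and tries every flag-permitted alignment (substitution, split, concatenation, crowding), keeping the largest running product. The function names, the flag representation and the probability table are assumptions made for illustration. Only the P(I/B) value is taken from the abstract's example; every other probability is invented, so the resulting totals do not reproduce the figures quoted in the abstract.

```python
from functools import lru_cache

# P(ocr_segment / true_segment). Only ("I", "B") comes from the abstract's
# example; all other values are invented purely for illustration.
PROB = {
    ("I", "B"): 2.0e-4,
    ("Wn", "re"): 1e-3,   # crowding (CRS): "re" misread as "Wn"
    ("n*", "re"): 1e-2,   # crowding (CRS): "re" misread as "n*"
    ("IW", "W"): 1e-2,    # splitting (HSS): "W" read as "IW"
    ("C", "ck"): 1e-2,    # concatenation (CS): "ck" read as "C"
    ("*", "a"): 5e-2,
    ("C", "k"): 1e-2,
    ("I", "F"): 1e-3,
    ("W", "r"): 1e-3,
    ("n", "e"): 1e-2,
}
MATCH = 0.9   # assumed probability that a character is read correctly
FLOOR = 1e-6  # assumed probability for any pair missing from the table


def p(ocr_seg, true_seg):
    """Look up a conditional probability, with assumed defaults."""
    default = MATCH if ocr_seg == true_seg else FLOOR
    return PROB.get((ocr_seg, true_seg), default)


def running_total(ocr, word, split=(), concat=(), crowd=()):
    """Best product of probabilities over all flag-permitted alignments.

    split, concat and crowd hold the dictionary characters or character
    pairs that the (hypothetical) flag bits mark as error-prone.
    """
    @lru_cache(maxsize=None)
    def best(i, j):
        # i, j = number of characters still unexplained in ocr / word
        if i == 0 and j == 0:
            return 1.0
        options = [0.0]
        if i >= 1 and j >= 1:  # simple substitution
            options.append(p(ocr[i - 1], word[j - 1]) * best(i - 1, j - 1))
        if i >= 2 and j >= 1 and word[j - 1] in split:
            options.append(p(ocr[i - 2:i], word[j - 1]) * best(i - 2, j - 1))
        if i >= 1 and j >= 2 and word[j - 2:j] in concat:
            options.append(p(ocr[i - 1], word[j - 2:j]) * best(i - 1, j - 2))
        if i >= 2 and j >= 2 and word[j - 2:j] in crowd:
            options.append(p(ocr[i - 2:i], word[j - 2:j]) * best(i - 2, j - 2))
        return max(options)

    return best(len(ocr), len(word))


# Candidate dictionary words with their flags, following the example.
CANDIDATES = {
    "Break": dict(crowd={"re"}),
    "Wreck": dict(split={"W"}, crowd={"re"}, concat={"ck"}),
    "Freak": dict(),
}
scores = {w: running_total("IWn*C", w, **f) for w, f in CANDIDATES.items()}
best_word = max(scores, key=scores.get)
print(best_word, scores)
```

With these invented values the winning alignment for Wreck is IW from W (split), n* from re (crowding) and C from ck (concatenation), which is the combination of flags the example attributes to that word.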
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US45982074A | 1974-04-10 | 1974-04-10 |
Publications (2)
Publication Number | Publication Date |
---|---|
JPS50137037A (en) | 1975-10-30 |
JPS5521384B2 (en) | 1980-06-09 |
Family
ID=23826267
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
JP14913674A (Expired; JPS5521384B2 (en)) | | 1974-04-10 | 1974-12-27 |
Country Status (8)
Country | Link |
---|---|
JP (1) | JPS5521384B2 (en) |
BE (1) | BE824366A (en) |
CA (1) | CA1062810A (en) |
DE (1) | DE2460757C2 (en) |
FR (1) | FR2267590B1 (en) |
GB (1) | GB1454148A (en) |
IT (1) | IT1033223B (en) |
NL (1) | NL7503946A (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPH01167469U (en) * | 1988-05-14 | 1989-11-24 | ||
JPH01176171U (en) * | 1988-06-03 | 1989-12-15 |
Families Citing this family (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US4201881A (en) * | 1979-03-28 | 1980-05-06 | Wisconsin Alumni Research Foundation | 24,24-Difluoro-1α,25-dihydroxycholecalciferol |
JPS6055866B2 (en) * | 1983-05-09 | 1985-12-06 | 株式会社日立製作所 | character recognition device |
JPS6274181A (en) * | 1985-09-27 | 1987-04-04 | Sony Corp | Character recognizing device |
JPS6297081A (en) * | 1986-10-08 | 1987-05-06 | Hitachi Ltd | Character recognizer |
GB2289969A (en) * | 1994-05-24 | 1995-12-06 | Ibm | Character segmentation |
1974
- 1974-12-05 FR: application FR7441665A, patent FR2267590B1, not active (Expired)
- 1974-12-16 GB: application GB5420074A, patent GB1454148A, not active (Expired)
- 1974-12-21 DE: application DE2460757A, patent DE2460757C2, not active (Expired)
- 1974-12-27 JP: application JP14913674A, patent JPS5521384B2, not active (Expired)
1975
- 1975-01-14 BE: application BE152362A, patent BE824366A, not active (IP Right Cessation)
- 1975-02-27 IT: application IT20692/75A, patent IT1033223B, active
- 1975-03-10 CA: application CA221,755A, patent CA1062810A, not active (Expired)
- 1975-04-03 NL: application NL7503946A, patent NL7503946A, not active (Application Discontinuation)
Also Published As
Publication number | Publication date |
---|---|
DE2460757C2 (en) | 1983-08-18 |
IT1033223B (en) | 1979-07-10 |
GB1454148A (en) | 1976-10-27 |
FR2267590A1 (en) | 1975-11-07 |
FR2267590B1 (en) | 1977-05-20 |
DE2460757A1 (en) | 1975-10-23 |
CA1062810A (en) | 1979-09-18 |
BE824366A (en) | 1975-05-02 |
JPS50137037A (en) | 1975-10-30 |
NL7503946A (en) | 1975-10-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
GB1500203A (en) | Cluster storage apparatus | |
JPS54500112A (en) | ||
JPS5521384B2 (en) | ||
GB1332631A (en) | Data processing system | |
GB1018330A (en) | ||
GB963554A (en) | Systems for identifying manifestations, for example, speech | |
JPS57146380A (en) | Address reader | |
JPS56147269A (en) | Electronic translator | |
GB1536628A (en) | Data encoding apparatus | |
Corfis | Fernando de Rojas and Albrecht von Eyb's" Margarita Poetica" | |
GB1285288A (en) | Multiple character generator | |
SU885078A1 (en) | Unit for automatic syllabification | |
GB1070422A (en) | Improvements in or relating to the comparison of data in data processing apparatus | |
Niklaus | John Lough," The Contributors to the Encyclopédie"(Book Review) | |
Lavine | ROSSLYN, FELICITY." Pope's Annotations to Tickell's" Iliad" Book One,"" RES", 30 (February 1979), 45-59 | |
JPS6441061A (en) | Automatic correcting device for wrong character of japanese sentence | |
JPS5624669A (en) | System for converting kana (japanese syllabary) letter row into same sound word row | |
GB928989A (en) | Improvements in and relating to knife-edge bearings | |
Rogers | Two Unrecorded Letters by Daniel Defoe | |
CN1079829A (en) | Chinese character input method for computer | |
Miharlovich | Selig O. Wassner:" Treasure of Russian Short Stories 1900-1966"(Book Review) | |
Bieler | Latin Bookhands of the Later Middle Ages 1100-1500 | |
Prachar | Review 13--No Title | |
Bülow-Jacobsen | Bärbel Kramer, Michael Erler, Dieter Hagedorn, Robert Hübner: Kölner Papyri (P. Köln), Band 3.(Papyrologica Coloniensia, VII.) Pp. 218; 34 plates (halftone). Opladen: Westdeutscher Verlag, 1980. DM. 56. | |
Whallon | The Song of Roland: Formulaic Style and Poetic Craft |