HK1179445A1 - 過程中將詞語位圖分割為單個字符或字形 - Google Patents

過程中將詞語位圖分割為單個字符或字形

Info

Publication number
HK1179445A1
HK1179445A1 HK13106062.5A HK13106062A HK1179445A1 HK 1179445 A1 HK1179445 A1 HK 1179445A1 HK 13106062 A HK13106062 A HK 13106062A HK 1179445 A1 HK1179445 A1 HK 1179445A1
Authority
HK
Hong Kong
Prior art keywords
ocr
segmentation
individual characters
word bitmap
glyphs
Prior art date
Application number
HK13106062.5A
Other languages
English (en)
Inventor
Djordje Nijemcevic
Original Assignee
Microsoft Technology Licensing Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Technology Licensing Llc filed Critical Microsoft Technology Licensing Llc
Publication of HK1179445A1 publication Critical patent/HK1179445A1/zh

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/14Image acquisition
    • G06V30/148Segmentation of character regions
    • G06V30/15Cutting or merging image elements, e.g. region growing, watershed or clustering-based techniques
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Theoretical Computer Science (AREA)
  • Character Input (AREA)
  • Character Discrimination (AREA)
HK13106062.5A 2010-05-10 2013-05-22 過程中將詞語位圖分割為單個字符或字形 HK1179445A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US12/776,576 US8571270B2 (en) 2010-05-10 2010-05-10 Segmentation of a word bitmap into individual characters or glyphs during an OCR process
PCT/US2011/034242 WO2011142977A2 (en) 2010-05-10 2011-04-28 Segmentation of a word bitmap into individual characters or glyphs during an ocr process

Publications (1)

Publication Number Publication Date
HK1179445A1 true HK1179445A1 (zh) 2013-09-27

Family

ID=44901973

Family Applications (1)

Application Number Title Priority Date Filing Date
HK13106062.5A HK1179445A1 (zh) 2010-05-10 2013-05-22 過程中將詞語位圖分割為單個字符或字形

Country Status (6)

Country Link
US (1) US8571270B2 (zh)
EP (1) EP2569930B1 (zh)
CN (1) CN102870399B (zh)
CA (1) CA2797363C (zh)
HK (1) HK1179445A1 (zh)
WO (1) WO2011142977A2 (zh)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8401293B2 (en) * 2010-05-03 2013-03-19 Microsoft Corporation Word recognition of text undergoing an OCR process
US9053361B2 (en) 2012-01-26 2015-06-09 Qualcomm Incorporated Identifying regions of text to merge in a natural image or video frame
US9064191B2 (en) 2012-01-26 2015-06-23 Qualcomm Incorporated Lower modifier detection and extraction from devanagari text images to improve OCR performance
US9076242B2 (en) 2012-07-19 2015-07-07 Qualcomm Incorporated Automatic correction of skew in natural images and video
US9141874B2 (en) 2012-07-19 2015-09-22 Qualcomm Incorporated Feature extraction and use with a probability density function (PDF) divergence metric
US9047540B2 (en) 2012-07-19 2015-06-02 Qualcomm Incorporated Trellis based word decoder with reverse pass
US9014480B2 (en) 2012-07-19 2015-04-21 Qualcomm Incorporated Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region
US9262699B2 (en) 2012-07-19 2016-02-16 Qualcomm Incorporated Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR
US9183636B1 (en) 2014-04-16 2015-11-10 I.R.I.S. Line segmentation method
US9443158B1 (en) * 2014-06-22 2016-09-13 Kristopher Haskins Method for computer vision to recognize objects marked for identification with a bigram of glyphs, and devices utilizing the method for practical purposes
CN107392260B (zh) * 2017-06-08 2020-03-17 中国民生银行股份有限公司 一种字符识别结果的错误标定方法和装置
US10482344B2 (en) 2018-01-04 2019-11-19 Wipro Limited System and method for performing optical character recognition
US10970848B2 (en) 2018-11-29 2021-04-06 Sap Se Font family and size aware character segmentation
CN112926334A (zh) * 2019-12-06 2021-06-08 北京三星通信技术研究有限公司 确定词表示向量的方法、装置及电子设备
CN111626302B (zh) * 2020-05-25 2022-07-29 西北民族大学 乌金体藏文古籍文档图像的粘连文本行切分方法及系统

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TW222337B (zh) 1992-09-02 1994-04-11 Motorola Inc
US5848184A (en) 1993-03-15 1998-12-08 Unisys Corporation Document page analyzer and method
US5384864A (en) * 1993-04-19 1995-01-24 Xerox Corporation Method and apparatus for automatic determination of text line, word and character cell spatial features
EP0677811A1 (en) * 1994-04-15 1995-10-18 Canon Kabushiki Kaisha Image processing system with on-the-fly JPEG compression
JP3805005B2 (ja) * 1994-11-09 2006-08-02 キヤノン株式会社 画像処理装置及び光学的文字認識装置及びそれらの方法
JP3428494B2 (ja) * 1999-05-19 2003-07-22 日本電気株式会社 文字認識装置及びその文字認識方法並びにその制御プログラムを記録した記録媒体
JP3425408B2 (ja) * 2000-05-31 2003-07-14 株式会社東芝 文書読取装置
JP4655335B2 (ja) 2000-06-20 2011-03-23 コニカミノルタビジネステクノロジーズ株式会社 画像認識装置、画像認識方法および画像認識プログラムを記録したコンピュータ読取可能な記録媒体
JP4181310B2 (ja) 2001-03-07 2008-11-12 昌和 鈴木 数式認識装置および数式認識方法
US7095894B2 (en) 2002-09-04 2006-08-22 Lockheed Martin Corporation Method and computer program product for recognizing italicized text
US20060008148A1 (en) * 2004-07-06 2006-01-12 Fuji Photo Film Co., Ltd. Character recognition device and method
US7911655B2 (en) 2004-10-06 2011-03-22 Iuval Hatzav System for extracting information from an identity card
US7808681B2 (en) 2004-10-06 2010-10-05 Iuval Hatzav Portable photocopy apparatus and method of use
US7702182B2 (en) 2006-02-16 2010-04-20 Adobe Systems, Incorporated Method and apparatus for creating a high-fidelity glyph prototype from low-resolution glyph images
US20080304113A1 (en) 2007-06-06 2008-12-11 Xerox Corporation Space font: using glyphless font for searchable text documents
GB0719964D0 (en) 2007-10-12 2007-11-21 Katholleke Universiteit Leuven Method for detecting and resolving hidden text salting
JP5376795B2 (ja) 2007-12-12 2013-12-25 キヤノン株式会社 画像処理装置、画像処理方法、そのプログラム及び記憶媒体
CN101251892B (zh) * 2008-03-07 2010-06-09 北大方正集团有限公司 一种字符切分方法和装置
US7471826B1 (en) * 2008-03-31 2008-12-30 International Business Machines Corporation Character segmentation by slices

Also Published As

Publication number Publication date
CN102870399B (zh) 2015-09-02
CA2797363A1 (en) 2011-11-17
US8571270B2 (en) 2013-10-29
US20110274354A1 (en) 2011-11-10
CN102870399A (zh) 2013-01-09
EP2569930A4 (en) 2017-08-09
WO2011142977A2 (en) 2011-11-17
EP2569930A2 (en) 2013-03-20
EP2569930B1 (en) 2023-01-11
WO2011142977A3 (en) 2012-01-12
CA2797363C (en) 2017-07-04

Similar Documents

Publication Publication Date Title
HK1179445A1 (zh) 過程中將詞語位圖分割為單個字符或字形
GB201105509D0 (en) Text, character encoding and language recognition
GB2500127B (en) Method and apparatus pertaining to an RFID tag reader antenna array
EP2828793A4 (en) ROTATION-FREE DETECTION OF HAND-WRITTEN CHARACTERS
GB201411082D0 (en) Reusable grreeting card and envelope
EP2932467A4 (en) METHOD FOR DETECTING COUNTERFEITING AND TABLET IDENTIFICATION
GB201415088D0 (en) Inspection method with barcode identification
ZA201503824B (en) Paper, labels made therefrom and methods of making paper and labels
ZA201502408B (en) Method for producing a contactless smart card with a transparent logo
EP2417558A4 (en) GENERATING AN INDIVIDUAL GLYPHE, SYSTEM AND METHOD FOR INSPECTING INDIVIDUAL GLYPHIDS
EP2824608A4 (en) IMAGE PROCESSING METHOD FOR RECOGNITION OF CHARACTERS, AND DEVICE AND CHARACTER RECOGNITION PROGRAM USING THE SAME
EP2892044A4 (en) TUBULAR SHRINK FIBER LABEL AND METHOD FOR THE PRODUCTION THEREOF
EP2706466A4 (en) EXTRACTION PROCESS, INFORMATION PROCESSING, EXTRACTION PROGRAM, INFORMATION PROCESSING, EXTRACTION DEVICE AND INFORMATION PROCESSING DEVICE
EP2431908A4 (en) METHOD AND SYSTEM FOR ANTI-COLLISION LABELS
SG11201404838UA (en) Method and apparatusfor text searching on a touchterminal
EP2592120A4 (en) AQUEOUS INK INK INK AND METHOD FOR FORMING AN INK INJECTION
EP2442273A4 (en) OBJECT IDENTIFICATION IMAGE DATABASE GENERATION PROCESS, GENERATION DEVICE AND PRODUCTION PROCESSING PROGRAM
EP2537120A4 (en) METHOD AND DEVICE FOR THE DISTINCTION OF RFID LABELS
AP3820A (en) Heap leaching method
EP2839661A4 (en) PROCESS AND DEVICE FOR SAMPLE ADAPTIVE OFFSET CODING WITH SEPARATE DISPLAY AND SIZE
ZA201402209B (en) Method and apparatus for sorting lidar data
EP2701334A4 (en) DATA RECEIVING DEVICE, METHOD FOR EXTRACTING MARKER INFORMATION AND MARKER POSITION DETECTION METHOD
EP2993056A4 (en) LABEL FOR BARCODES, LETTERS AND PICTURES AND METHOD FOR PRODUCING BARCODES, LETTERS AND PICTURES
SG11201502379UA (en) Dictionary creation device for monitoring text information, dictionary creation method for monitoring text information, and dictionary creation program for monitoring text information
EP2981056A4 (en) COLOR TRANSFORMATION METHOD, DEVICE FOR CORRECTING GRAY VALUE VALUES, COMPUTER PROGRAM AND DISPLAY DEVICE