EP2208170A1 - Procédé d'analyse d'image notamment destiné à un appareil de téléphonie mobile - Google Patents
Procédé d'analyse d'image notamment destiné à un appareil de téléphonie mobileInfo
- Publication number
- EP2208170A1 EP2208170A1 EP08848083A EP08848083A EP2208170A1 EP 2208170 A1 EP2208170 A1 EP 2208170A1 EP 08848083 A EP08848083 A EP 08848083A EP 08848083 A EP08848083 A EP 08848083A EP 2208170 A1 EP2208170 A1 EP 2208170A1
- Authority
- EP
- European Patent Office
- Prior art keywords
- pixel
- list
- letters
- value
- pixel groups
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
- G06V30/18—Extraction of features or characteristics of the image
- G06V30/1801—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections
- G06V30/18076—Detecting partial patterns, e.g. edges or contours, or configurations, e.g. loops, corners, strokes or intersections by analysing connectivity, e.g. edge linking, connected component analysis or slices
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V30/00—Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
- G06V30/10—Character recognition
Definitions
- the invention relates to a method for image analysis, in particular for a mobile device with built-in digital camera for automatic optical character recognition (OCR), according to the preamble of patent claim 1 or 2.
- OCR optical character recognition
- Handheld scanners that display, save or transfer scanned text to a computer screen. There are always problems when the original is scanned askew, or only letters of the fragments can be recognized (for example, flag inscribed in the wind). In addition, such techniques fail when direct scanning is not possible (e.g., roadside signage). According to the current state of the art, such an image could be recorded with a high resolution, which can be subsequently scanned. However, there is no direct OCR in the camera itself, as it is too computationally intensive with conventional methods.
- Pattern Matching or, as in the case of handwriting recognition, with the description of the letters by lines and crossing points Pattern matching can be used particularly well in the case of standardized letters (eg In the case of license plate recognition, the characters to be recognized are limited to a small number, which are also standardized.
- DE 10025017 A1 discloses a mobile phone, which in particular for a simpler application and use of additional services and functions such. As short message service, payment transactions, identity or security checks, etc. is suitable.
- the mobile phone has an integrated device for reading characters, symbols codes and / or identity features, which is a scanner, a bar code reader or a fingerprint reader in the form of a CCD sensor, thus providing a comfortable and fast input and capture of text, Symbols or safety-relevant characteristics possible.
- DE 202005018376 U1 discloses a mobile telephone with keyboard, screen, data processing system and an optical scanning system arranged behind an opening or a window of the housing, in particular a hand-held scanner, and an integrated translation program.
- the optical scanning system makes it possible to scan characters and / or words available in another language. Selecting the language translates the word or words.
- the user of the mobile phone is able to read him strange words and texts. This can be beneficial menus, warnings, operating instructions and maps and signs.
- users can also enter words themselves from the keyboard of the mobile phone or select from an encyclopedia contained in the memory of the data processing system. By interconnecting the data processing system with the screen and keyboard, the choice of language translates these words and displays them on the screen.
- DE 10163688 A1 discloses a method and a system for tracking goods which are provided with an optically readable, alphanumeric identification, as well as a detection device therefor.
- the marking is captured as an image by the recording device and converted into image data.
- These are sent by the detection device by radio to a receiver which is connected to a computer system, which further evaluates the image data.
- the image data are evaluated before being sent to the receiver in the detection device. How exactly the evaluation of the image data is done, is not disclosed in detail.
- Object of the present invention is therefore to provide a generic method for image processing in mobile devices with digital camera, which works much more accurate and faster.
- the invention is characterized by the features of independent claim 1 or 2.
- Advantage of the invention is a more robust OCR detection with optional translation in real time (real-time), which also manages with relatively little computing power.
- the robustness refers in particular to the fact that detection works better than conventional systems even under poor conditions (especially light conditions, overlapping interference).
- this is achieved by first performing an adaptive pixel group-optimized preprocessing, which searches the image for lines.
- the most important distinguishing feature of the previously known methods is that now no further direct pattern comparison takes place, but an attempt is made to trace the lines as optimally as possible. From the sequence of movements is then closed on the corresponding character. Because this sequence of movements scale well and describe with relatively little effort This technology is currently suitable for mobile use.
- the sequence of movements of known characters is stored in a search word, so that the movement can be concluded directly on the letter.
- a dictionary / lexicon can be used. If words are recognized by the dictionary / lexicon, the recognized letters can be used for even more optimized character recognition.
- Application scenarios are camera mobile phones for tourists abroad, in particular to read traffic signs, menu cards, general information signs.
- the content can be translated into a second language.
- the translation is shown to the user on the display, or read out via a "text to speech application”.
- the robustness of the recognition is based initially on a normalization of the line widths or letter sizes. Subsequently, the letters are traced, in which case the actual letters are recognized during the tracing.
- the robustness of the detection method results from the combination of different solution steps. Due to the normalization of the bar widths, shadow effects and poor lighting conditions have hardly any influence on the detection rate. Due to the size norms, the
- the image is converted into electrical signals with an image recording element (for example a CCD camera). These signals are then stored according to the method of the patent DE 101 13 880 B4 in a prioritized array.
- a position factor can also be included in the prioritization. The position factor is the greater, the closer the pixel group is to the starting pixel.
- the Startpixel is located in the western languages (English, German, French) first in the upper left corner of the array.
- the pixel groups here can also vary during the recognition process.
- An example of a pixel group is a one-line horizontal array of pixels whose length is dependent on a double change in brightness. For dark letters to be recognized on a light background, the distance between the first light-dark transition and the subsequent dark-light transition would then be a size for an assumed stroke width. Pixel groups of the same assumed bar widths are each compiled in a separate list.
- a low-pass filter In this filter, the sum of n adjacent pixels is taken in each case to find corresponding light-dark, or dark-light transitions. Due to the summation, any pixel errors or errors due to strong noise are greatly reduced.
- each list thus obtained is sorted in such a way that the pixel groups which have a lower Y position are sorted in descending order. If several similar pixel groups are at the same Y positions, new lists are created for them. From these lists, an attempt is now made to derive corresponding vectors. In the process, the pixel groups with the lowest and highest Y values are selected from the respective lists. Between these pixel group positions a line is calculated. Then the deviations of the other pixel groups to this line are determined. If all deviations are below a certain threshold, then a description vector has been found for this list.
- the list is split and an attempt is made to generate corresponding vectors for each sub-list. It makes sense to divide the list where the largest deviations from the calculated line occurred. In this way one obtains a number of vectors. Touching vectors are summarized in another vector list and sorted according to the Y values. This vector list then describes corresponding letters. The vector list is then normalized (eg to the maximum Y difference). Such a normalized vector list can then go through a solution tree in which the different letters are stored. With this approach you will first recognize only a part of the letters. However, you get in this way first information about the font to be recognized. For large characters you will get double letters.
- recognition errors with dictionaries could be partially detected and corrected.
- the output of the recognized characters can be realized both via a display and via a "speech-to-text program".
- the method described describes an optimized method which forms vectors from pixel-based images, wherein each individual pixel (with a one-line pixel group) only needs to be traversed once.
- an edge optimization is usually carried out to increase the recognition rate beforehand, and only then is the recognition process started. In the method described above, this is done in one step, so it is both less computationally intensive and more robust.
Landscapes
- Engineering & Computer Science (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Multimedia (AREA)
- Theoretical Computer Science (AREA)
- Character Discrimination (AREA)
- Character Input (AREA)
Abstract
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE102007052622A DE102007052622A1 (de) | 2007-11-05 | 2007-11-05 | Verfahren zur Bildanalyse, insbesondere für ein Mobilfunkgerät |
PCT/EP2008/009093 WO2009059715A1 (fr) | 2007-11-05 | 2008-10-28 | Procédé d'analyse d'image notamment destiné à un appareil de téléphonie mobile |
Publications (1)
Publication Number | Publication Date |
---|---|
EP2208170A1 true EP2208170A1 (fr) | 2010-07-21 |
Family
ID=40514367
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP08848083A Ceased EP2208170A1 (fr) | 2007-11-05 | 2008-10-28 | Procédé d'analyse d'image notamment destiné à un appareil de téléphonie mobile |
Country Status (10)
Country | Link |
---|---|
US (1) | US8532389B2 (fr) |
EP (1) | EP2208170A1 (fr) |
KR (1) | KR101606469B1 (fr) |
CN (1) | CN101855640B (fr) |
BR (1) | BRPI0820570A2 (fr) |
CA (1) | CA2704830C (fr) |
DE (1) | DE102007052622A1 (fr) |
MX (1) | MX2010004732A (fr) |
RU (1) | RU2454718C2 (fr) |
WO (1) | WO2009059715A1 (fr) |
Families Citing this family (12)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9028344B2 (en) * | 2010-01-28 | 2015-05-12 | Chsz, Llc | Electronic golf assistant utilizing electronic storing |
CN102364926A (zh) * | 2011-10-21 | 2012-02-29 | 镇江科大船苑计算机网络工程有限公司 | 基于Android智能化信息转换方法 |
US9064191B2 (en) | 2012-01-26 | 2015-06-23 | Qualcomm Incorporated | Lower modifier detection and extraction from devanagari text images to improve OCR performance |
US20130194448A1 (en) | 2012-01-26 | 2013-08-01 | Qualcomm Incorporated | Rules for merging blocks of connected components in natural images |
US9076242B2 (en) | 2012-07-19 | 2015-07-07 | Qualcomm Incorporated | Automatic correction of skew in natural images and video |
US9141874B2 (en) | 2012-07-19 | 2015-09-22 | Qualcomm Incorporated | Feature extraction and use with a probability density function (PDF) divergence metric |
US9262699B2 (en) | 2012-07-19 | 2016-02-16 | Qualcomm Incorporated | Method of handling complex variants of words through prefix-tree based decoding for Devanagiri OCR |
US9014480B2 (en) | 2012-07-19 | 2015-04-21 | Qualcomm Incorporated | Identifying a maximally stable extremal region (MSER) in an image by skipping comparison of pixels in the region |
US9047540B2 (en) | 2012-07-19 | 2015-06-02 | Qualcomm Incorporated | Trellis based word decoder with reverse pass |
RU2587406C2 (ru) | 2014-05-29 | 2016-06-20 | Общество С Ограниченной Ответственностью "Яндекс" | Способ обработки визуального объекта и электронное устройство, используемое в нем |
RU2582064C1 (ru) * | 2014-12-16 | 2016-04-20 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы эффективного автоматического распознавания символов с использованием леса решений |
RU2598300C2 (ru) | 2015-01-27 | 2016-09-20 | Общество с ограниченной ответственностью "Аби Девелопмент" | Способы и системы автоматического распознавания символов с использованием дерева решений |
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5581634A (en) * | 1992-08-18 | 1996-12-03 | Perception Software Corporation | Recognition system with an automated development tool |
Family Cites Families (19)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JPS5917668A (ja) | 1982-07-20 | 1984-01-28 | Ricoh Co Ltd | ストロ−ク抽出方法 |
JPS60217477A (ja) * | 1984-04-12 | 1985-10-31 | Toshiba Corp | 手書き文字認識装置 |
DE3815869A1 (de) * | 1987-05-08 | 1988-11-17 | Ricoh Kk | Verfahren zum extrahieren von merkmalsmengen eines zeichens |
CN1075565A (zh) | 1992-02-15 | 1993-08-25 | 曾怜玉 | 具有学习能力的文字识别方法 |
US5319721A (en) * | 1992-04-14 | 1994-06-07 | International Business Machines Corporation | Methods and apparatus for evolving a starter set of handwriting prototypes into a user-specific set |
CN1029534C (zh) | 1993-02-25 | 1995-08-16 | 清华大学 | 自由书写联机手写汉字识别方法及其系统 |
JP3260979B2 (ja) * | 1994-07-15 | 2002-02-25 | 株式会社リコー | 文字認識方法 |
US5940534A (en) * | 1995-07-17 | 1999-08-17 | Nippon Telegraph And Telephone Corporation | On-line handwritten character recognition using affine transformation to maximize overlapping of corresponding input and reference pattern strokes |
US5796867A (en) * | 1996-06-12 | 1998-08-18 | Industrial Technology Research Institute | Stroke-number-free and stroke-order-free on-line Chinese character recognition method |
US6970599B2 (en) * | 2002-07-25 | 2005-11-29 | America Online, Inc. | Chinese character handwriting recognition system |
RU2249251C2 (ru) * | 1999-06-21 | 2005-03-27 | Де ла рю Жиори С.А. | Автоматическое распознавание символов на структурированном фоне при помощи комбинирования моделей фона и символов |
DE10025017B4 (de) | 1999-06-21 | 2008-04-10 | T-Mobile Deutschland Gmbh | Verfahren zum Betrieb eines Mobiltelefons |
US6807309B1 (en) * | 2000-12-27 | 2004-10-19 | Canon Kabushiki Kaisha | Linear list compression |
DE10113880B4 (de) * | 2001-03-21 | 2004-04-29 | T-Mobile Deutschland Gmbh | Verfahren zur Komprimierung und Dekomprimierung von Videodaten |
CA2441372C (fr) * | 2001-03-21 | 2012-10-09 | T-Mobile Deutschland Gmbh | Procede de compression et de decompression de donnees d'image |
DE10163688A1 (de) | 2001-12-21 | 2003-07-03 | Blg Complements Gmbh & Co Kg | Verfahren und System zur Verfolgung von Gütern sowie Erfassungsgerät hierfür |
RU2260280C2 (ru) * | 2002-12-15 | 2005-09-20 | Всероссийский научно-исследовательский институт консервной и овощесушильной промышленности (Государственное научное учреждение) | Способ защиты вегетирующих пасленовых культур от вредных насекомых |
DE102005033001A1 (de) | 2005-07-14 | 2007-01-25 | Siemens Ag | Verfahren zur Optimierung von Kontrollvorgängen beim Einsatz mobiler Endgeräte |
DE202005018376U1 (de) | 2005-11-18 | 2006-02-02 | Alcatech Service Und Vertrieb Gmbh & Co Kg | Mobiles Gerät als mobiler Personalcomputer oder Mobiltelefon |
-
2007
- 2007-11-05 DE DE102007052622A patent/DE102007052622A1/de not_active Withdrawn
-
2008
- 2008-10-28 RU RU2010122947/08A patent/RU2454718C2/ru active
- 2008-10-28 CA CA2704830A patent/CA2704830C/fr active Active
- 2008-10-28 EP EP08848083A patent/EP2208170A1/fr not_active Ceased
- 2008-10-28 BR BRPI0820570-1A patent/BRPI0820570A2/pt not_active Application Discontinuation
- 2008-10-28 MX MX2010004732A patent/MX2010004732A/es active IP Right Grant
- 2008-10-28 KR KR1020107012446A patent/KR101606469B1/ko active IP Right Grant
- 2008-10-28 CN CN200880114728XA patent/CN101855640B/zh active Active
- 2008-10-28 US US12/740,047 patent/US8532389B2/en active Active
- 2008-10-28 WO PCT/EP2008/009093 patent/WO2009059715A1/fr active Application Filing
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5581634A (en) * | 1992-08-18 | 1996-12-03 | Perception Software Corporation | Recognition system with an automated development tool |
Also Published As
Publication number | Publication date |
---|---|
CN101855640A (zh) | 2010-10-06 |
US8532389B2 (en) | 2013-09-10 |
CN101855640B (zh) | 2013-12-04 |
MX2010004732A (es) | 2010-05-20 |
WO2009059715A1 (fr) | 2009-05-14 |
DE102007052622A1 (de) | 2009-05-07 |
RU2010122947A (ru) | 2011-12-20 |
BRPI0820570A2 (pt) | 2015-06-16 |
KR20100099154A (ko) | 2010-09-10 |
CA2704830C (fr) | 2014-09-30 |
RU2454718C2 (ru) | 2012-06-27 |
US20100296729A1 (en) | 2010-11-25 |
KR101606469B1 (ko) | 2016-03-25 |
CA2704830A1 (fr) | 2009-05-14 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
EP2208170A1 (fr) | Procédé d'analyse d'image notamment destiné à un appareil de téléphonie mobile | |
DE102006059663B4 (de) | Vorrichtung, Verfahren und Computerprogramm zum Identifizieren eines Verkehrszeichens in einem Bild | |
DE10195927B4 (de) | Verallgemeinerte Textlokalisation in Bildern | |
DE10195928B4 (de) | Abschätzen der Textfarbe und Segmentieren von Bildern | |
DE102009059264B4 (de) | Verfahren zum schnellen Lokalisieren eines entschlüsselbaren Musters | |
DE60204005T2 (de) | Verfahren und einrichtung zur erkennung eines handschriftlichen musters | |
DE60303202T2 (de) | System und verfahren zum identifizieren und extrahieren von zeichenketten aus erfassten bilddaten | |
DE60312572T2 (de) | Verfahren und Gerät zum Konvertieren digitaler Bilder von Handzeichnungen zur weiteren Verwendung in einem strukturierten Text/Graphik Editor. | |
DE10162156B4 (de) | Die Benutzernavigation durch Multimedia-Dateiinhalte unterstützendes System und Verfahren | |
DE60116442T2 (de) | System zur Zuordnung von Schlüsselwörtern zu Dokumenten | |
EP1665132B1 (fr) | Procede et systeme de detection de donnees provenant de plusieurs documents lisibles par ordinateur | |
DE4427992C2 (de) | Verfahren zum Erzeugen einer Dokumenten-Darstellung aus einem Bitmap-Bild eines Dokuments | |
DE19705757A1 (de) | Verfahren und Gerät für das Design eines hoch-zuverlässigen Mustererkennungs-Systems | |
DE19960555A1 (de) | Verfahren zum Auffinden und Lesen eines zweidimensionalen Strichcodes | |
DE102011005579A1 (de) | Bildverarbeitungsvorrichtung, Bildverarbeitungsverfahren und Speichermedium | |
DE19956158A1 (de) | Bild-Binärisierungsverfahren auf Bereichsbasis | |
EP2082357B1 (fr) | Dispositif, procédé et programme d'ordinateur pour identifier des caractères sur une image | |
DE112021000371T5 (de) | Digitale bildbearbeitung | |
DE19726592C2 (de) | Informationserkennungs-Vorrichtung | |
Baloun et al. | ChronSeg: Novel Dataset for Segmentation of Handwritten Historical Chronicles. | |
DE60225268T2 (de) | Verfahren zum bestimmen von textfeldern in einem videobild | |
EP0767441A2 (fr) | Méthode et dispositif de lecture d'un code à barres | |
DE102016119465A1 (de) | Verfahren und anordnung zum lesen von zweidimensionalen symbolen durch übersetzen von kodierten symbolzeichen von einem lokalen auf einen globalen zeichensatz | |
DE19507059A1 (de) | Verfahren zur omnidirektionalen Erfassung von OCR-Klarschrift auf Etiketten oder ähnlichen Datenträgern durch zufallsgesteuerte Suche und Dekodierung mit einem neuronalen Netzwerk | |
Poirriez et al. | Visual perception of unitary elements for layout analysis of unconstrained documents in heterogeneous databases |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PUAI | Public reference made under article 153(3) epc to a published international application that has entered the european phase |
Free format text: ORIGINAL CODE: 0009012 |
|
17P | Request for examination filed |
Effective date: 20100428 |
|
AK | Designated contracting states |
Kind code of ref document: A1 Designated state(s): AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MT NL NO PL PT RO SE SI SK TR |
|
AX | Request for extension of the european patent |
Extension state: AL BA MK RS |
|
DAX | Request for extension of the european patent (deleted) | ||
17Q | First examination report despatched |
Effective date: 20140404 |
|
REG | Reference to a national code |
Ref country code: DE Ref legal event code: R003 |
|
STAA | Information on the status of an ep patent application or granted ep patent |
Free format text: STATUS: THE APPLICATION HAS BEEN REFUSED |
|
18R | Application refused |
Effective date: 20160130 |