CN201191870Y - Mobile phone having OCR recognition function - Google Patents

Mobile phone having OCR recognition function

Info

Publication number
CN201191870Y
Authority
CN
China
Prior art keywords
ocr
handset
mobile phone
image
ocr recognition
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CNU2008200207577U
Other languages
Chinese (zh)
Inventor
王爱磊
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2008-04-25
Filing date
2008-04-25
Publication date
2009-02-04
Application filed by Individual
Priority to CNU2008200207577U
Application granted
Publication of CN201191870Y
Anticipated expiration
Expired - Fee Related

Landscapes

  • Character Discrimination (AREA)

Abstract

The utility model discloses a mobile phone, in particular a mobile phone with an OCR recognition function, comprising a mobile phone main body, a camera, and a mainboard hard disk. It is characterized in that an OCR chip integrating an OCR recognition system is installed on the mainboard hard disk. Taking image input from the camera, the OCR recognition system performs image preprocessing, character feature extraction, and comparison and recognition in sequence, corrects misrecognized characters through manual correction, and outputs the result. The phone's camera can be used to "shoot" material and store it in the phone, so that people can read it at any time, memorize it through repetition, and thereby master the knowledge. By building the OCR function into the mobile phone, the utility model greatly helps people who read books and newspapers, and students who need to store material from print media.

Description

Mobile phone with OCR recognition function
(1) Technical field
The utility model relates to a mobile phone, in particular to a mobile phone with an OCR recognition function.
(2) Background art
In daily life we come across much material in newspapers and magazines that is well worth keeping, but most people do not have a copier at hand, and copying the material out by hand is especially tedious. It would be very convenient if the material could be stored in a mobile phone and consulted at any time, but entering it into the phone with pinyin or handwriting input is both time-consuming and power-consuming.
(3) Summary of the invention
To remedy the deficiencies of the prior art, the utility model provides an easy-to-use mobile phone with an OCR recognition function.
The utility model is achieved through the following technical solution:
A mobile phone with an OCR recognition function comprises a mobile phone main body, a camera, and a mainboard hard disk, and is characterized in that an OCR chip integrating an OCR recognition system is installed on the mainboard hard disk.
In the mobile phone with an OCR recognition function of the utility model, the OCR recognition system takes image input from the camera, performs image preprocessing, character feature extraction, and comparison and recognition in sequence, corrects misrecognized characters through manual correction, and outputs the result.
OCR stands for Optical Character Recognition, that is, recognizing text by optical means. The technology lets a device recognize characters through an optical mechanism, much as humans recognize many things with their eyes, which is itself an optical mechanism.
The utility model adds the OCR function to a mobile phone, making it convenient for learners to store material. It uses the phone's camera to "shoot" material and store it in the phone; "shooting" here means recording the material in the phone by means of an existing OCR system. The OCR system captures the material as a picture, then uses its automatic editing function to store the material in the phone as text, where it can be browsed at any time, memorized through repetition, and thereby mastered. Once the OCR function is built into the mobile phone, it will greatly help people who often read books and newspapers, as well as the many students who need to memorize material from print media.
(4) Description of drawings
The utility model is further described below with reference to the accompanying drawings.
Fig. 1 is a structural schematic diagram of the utility model;
Fig. 2 is a structural block diagram of the OCR recognition system of the utility model.
In the figures: 1, mobile phone main body; 2, camera; 3, mainboard hard disk; 4, OCR chip.
(5) Embodiment
The accompanying drawings show a specific embodiment of the utility model. This embodiment comprises a mobile phone main body 1, a camera 2, and a mainboard hard disk 3, with an OCR chip 4 integrating an OCR recognition system installed on the mainboard hard disk 3. The OCR recognition system takes image input from the camera, performs image preprocessing, character feature extraction, and comparison and recognition in sequence, corrects misrecognized characters through manual correction, and outputs the result.
OCR (Optical Character Recognition) is a branch of pattern recognition. Its purpose is to let a computer know what it is seeing, especially text. Because OCR is a constant tug-of-war with the recognition rate, the most important problems in OCR are how to eliminate errors and how to use auxiliary information to raise recognition accuracy. Depending on the medium that carries the text and the way the data are acquired, a wide variety of applications has emerged.
The purpose of an OCR recognition system is simple: to convert an image so that the graphics in it are preserved as they are, while the data inside tables and the text in the image are all turned into text on the phone. This reduces the storage needed for the image data, lets the recognized text be reused and analyzed, and saves the labor and time of keyboard input. The processing flow is as follows:
From image to output, the data pass through image input, image preprocessing, character feature extraction, and comparison and recognition; misrecognized characters are then corrected through manual correction, and the result is output. Each stage is introduced below:
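The processing flow just described can be pictured as a fixed chain of stages. The following is only a minimal sketch: the patent gives no code, the names `STAGE_ORDER` and `run_pipeline` are hypothetical, and each stage is an identity placeholder rather than a real implementation.

```python
from typing import Callable, List, Tuple

# Stage names follow the flow in the description; everything else is assumed.
STAGE_ORDER = [
    "image input",
    "image preprocessing",
    "character feature extraction",
    "comparison and recognition",
    "manual correction",
    "result output",
]

def run_pipeline(data, stages: List[Tuple[str, Callable]]):
    """Pass data through each (name, function) stage in order.

    Returns the final data and the list of stage names that were run.
    """
    trace = []
    for name, fn in stages:
        data = fn(data)
        trace.append(name)
    return data, trace

def identity(x):
    """Placeholder stage: a real system would transform the data here."""
    return x

stages = [(name, identity) for name in STAGE_ORDER]
result, trace = run_pipeline("photo of a newspaper page", stages)
```

Keeping the stages as an ordered list makes the fixed sequence explicit and lets any single stage be swapped out without touching the others.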
Image input: material to be processed by OCR must first be captured by the camera and transferred into the phone as an image. The mid-range and high-end mobile phones now produced by the major communication equipment companies are generally equipped with a camera and offer both photo and video functions. As technology advances, camera resolution will keep rising and image quality will become clearer, which can greatly improve the reading and automatic editing speed of the OCR system.
Image preprocessing: in an OCR system, image preprocessing is the module that has the most problems to solve. It covers everything from obtaining a binarized (pure black-and-white), grayscale, or color image to isolating the image of each individual character. It includes image processing such as normalization, noise removal, and image rectification, and document preprocessing such as layout analysis (separating text from pictures) and the separation of text lines and characters. Image processing is mature in both principle and technique, and many link libraries are available commercially or on the web. Document preprocessing, by contrast, depends on each developer's skill: the image must first be divided into picture, table, and text regions, and the system may even distinguish the layout direction of the article, its outline, and its body text, and determine the size and typeface of the characters as in the original document.
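Two of the preprocessing steps named above, binarization and text-line separation, can be illustrated with a small sketch. It assumes a grayscale image stored as rows of 0-255 integers, a fixed threshold of 128, and a horizontal projection to find lines; none of these choices comes from the patent, and the function names are hypothetical.

```python
def binarize(gray, threshold=128):
    """Map a grayscale image (rows of 0-255 ints) to 1 = black ink, 0 = white."""
    return [[1 if px < threshold else 0 for px in row] for row in gray]

def split_lines(binary):
    """Find text lines as (start_row, end_row) pairs, end exclusive.

    A line is a maximal run of consecutive rows that contain any ink.
    """
    lines, start = [], None
    for y, row in enumerate(binary):
        has_ink = any(row)
        if has_ink and start is None:
            start = y                      # a new line begins
        elif not has_ink and start is not None:
            lines.append((start, y))       # a blank row closes the line
            start = None
    if start is not None:                  # image ends inside a line
        lines.append((start, len(binary)))
    return lines

gray = [
    [255, 255, 255],
    [0,   255, 0],
    [10,  10,  255],
    [255, 255, 255],
    [255, 0,   255],
]
binary = binarize(gray)
print(split_lines(binary))  # [(1, 3), (4, 5)]
```

The same projection idea applied column by column within each line would separate individual characters.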
Character feature extraction: in terms of recognition rate alone, feature extraction can be called the core of OCR. Which features are used and how they are extracted directly affect the quality of recognition, so in the early days of OCR research there were especially many research reports on feature extraction. Features are, so to speak, the bargaining chips of recognition, and can be roughly divided into two classes. The first is statistical features, such as the ratio of black to white pixels in the character region: when the character region is divided into several zones, the black/white pixel ratios of the zones taken together form a numerical vector in space, and basic mathematical theory is enough for comparison. The second is structural features: after the character image is thinned, the number and position of stroke endpoints and intersections are obtained, or stroke segments are used as features, and a special comparison method is applied. Many of the online handwriting input programs on the market base their recognition on this kind of structural approach.
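The statistical feature described above can be sketched as a zoning function. This is an illustrative assumption, not the patent's method: it uses the fraction of black pixels in each zone instead of the black/white ratio, so that an all-black zone does not cause a division by zero, and the name `zoning_features` and the 2 x 2 grid are hypothetical.

```python
def zoning_features(glyph, rows=2, cols=2):
    """Split a binary glyph (1 = black) into rows x cols zones.

    Returns one value per zone, in row-major order: the fraction of
    black pixels in that zone. Together the values form the feature vector.
    """
    h, w = len(glyph), len(glyph[0])
    feats = []
    for r in range(rows):
        for c in range(cols):
            y0, y1 = r * h // rows, (r + 1) * h // rows
            x0, x1 = c * w // cols, (c + 1) * w // cols
            zone = [glyph[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            feats.append(sum(zone) / len(zone) if zone else 0.0)
    return feats

glyph = [
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 1, 0],
    [0, 0, 0, 0],
]
print(zoning_features(glyph))  # [1.0, 0.0, 0.0, 0.25]
```

A finer grid gives a longer vector that separates similar characters better, at the cost of more sensitivity to small shifts in the glyph.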
Comparison database: once features have been computed for the input character, whether statistical or structural, there must be a comparison database (feature database) to compare against. The database should contain, for every character in the character set to be recognized, the feature set obtained by the same feature extraction method that is applied to the input character.
Comparison and recognition: this is the module where mathematical theory comes fully into play. Different mathematical distance functions are chosen for different features. Well-known comparison methods include Euclidean-space comparison, relaxation, dynamic programming (DP), neural-network database construction and comparison, and hidden Markov models (HMM). To make recognition results more stable, so-called expert systems have also been proposed, which exploit the complementarity of different feature comparison methods so that the recognized result has especially high confidence.
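Of the comparison methods listed, Euclidean-space comparison is the simplest to illustrate. The sketch below assumes the feature database is a dictionary from character label to feature vector and returns the nearest candidates; the database contents, the labels, and the function names are made up for illustration.

```python
import math

def euclidean(a, b):
    """Euclidean distance between two equal-length feature vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def recognize(features, database, top_k=3):
    """Rank database entries by distance to the input features.

    Returns up to top_k (label, distance) pairs, nearest first.
    """
    ranked = sorted(
        ((label, euclidean(features, ref)) for label, ref in database.items()),
        key=lambda pair: pair[1],
    )
    return ranked[:top_k]

# Hypothetical feature database: one reference vector per character.
database = {
    "A": [1.0, 0.0, 0.0, 0.25],
    "B": [0.5, 0.5, 0.5, 0.5],
    "C": [0.0, 0.0, 1.0, 1.0],
}
candidates = recognize([0.9, 0.1, 0.0, 0.3], database)
best_label = candidates[0][0]  # "A" is nearest for this input
```

Returning several ranked candidates rather than a single answer leaves room for a later stage to choose among similar-looking characters.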
Word post-processing: because the recognition rate of OCR cannot reach one hundred percent, and because one may want to strengthen the correctness and confidence of the comparison, functions that eliminate errors or help correct them have become a necessary module of an OCR system. Word post-processing is one example: using the recognized character and its similar candidate characters produced by the comparison, it finds the most plausible word according to the recognized characters before and after, and corrects accordingly.
Word database: the dictionary built for word post-processing.
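Word post-processing with a word database can be sketched as a search over candidate characters. This simplified illustration assumes each position carries a short string of candidates ordered best first and that the dictionary is a plain set of words; it tries combinations until one is found in the dictionary. The function name and the sample data are hypothetical, and a practical system would weight candidates by confidence rather than enumerate them.

```python
from itertools import product

def postprocess(candidates, dictionary):
    """Choose a dictionary word from per-position candidate characters.

    candidates: list of strings, one per position, best candidate first.
    Returns the first combination found in the dictionary, or the
    top-ranked characters joined together if no combination matches.
    """
    for combo in product(*candidates):
        word = "".join(combo)
        if word in dictionary:
            return word
    return "".join(c[0] for c in candidates)

dictionary = {"clock", "cloak"}
# The recognizer's first guess for position 2 is the digit "1", second is "l".
print(postprocess(["c", "1l", "o0", "c", "k"], dictionary))  # clock
```

Falling back to the top-ranked characters keeps the recognizer's own answer when the dictionary has nothing to offer, for example for names or numbers.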
Manual correction: in a good OCR program, besides a stable image processing and recognition core that keeps the error rate low, the workflow and functions of manual correction also affect the efficiency of OCR processing.
Result output: output is actually the simplest part, but it depends on what the user wants OCR for. Some users only need the text for partial reuse, so an ordinary text file is enough. Some want output that looks exactly like the input document, so a function that reproduces the original layout is needed. Others care about the text in tables, so the output must work with software such as Excel. Whatever the variation, it is only a change in the output file format.
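The point that output variants differ only in file format can be shown with two small writers. This sketch assumes recognized text arrives as a list of lines and table content as a list of rows; CSV is used here as one format that spreadsheet software such as Excel can open, which is an illustrative choice and not something the patent specifies.

```python
import csv
import io

def to_plain_text(lines):
    """Join recognized text lines into ordinary text."""
    return "\n".join(lines)

def to_csv(table):
    """Serialize recognized table rows as CSV text."""
    buf = io.StringIO()
    csv.writer(buf, lineterminator="\n").writerows(table)
    return buf.getvalue()

print(to_plain_text(["First line", "Second line"]))
print(to_csv([["Name", "Qty"], ["Pen", "2"]]))
```

Both functions consume the same recognition result, so adding another format means adding another writer without changing any earlier stage.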

Claims (2)

1. A mobile phone with an OCR recognition function, comprising a mobile phone main body (1), a camera (2), and a mainboard hard disk (3), characterized in that an OCR chip (4) integrating an OCR recognition system is installed on the mainboard hard disk (3).
2. The mobile phone with an OCR recognition function according to claim 1, characterized in that the OCR recognition system takes image input from the camera, performs image preprocessing, character feature extraction, and comparison and recognition in sequence, corrects misrecognized characters through manual correction, and outputs the result.
CNU2008200207577U 2008-04-25 2008-04-25 Mobile phone having OCR recognition function Expired - Fee Related CN201191870Y (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNU2008200207577U CN201191870Y (en) 2008-04-25 2008-04-25 Mobile phone having OCR recognition function

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNU2008200207577U CN201191870Y (en) 2008-04-25 2008-04-25 Mobile phone having OCR recognition function

Publications (1)

Publication Number Publication Date
CN201191870Y (en) 2009-02-04

Family

ID=40336029

Family Applications (1)

Application Number Title Priority Date Filing Date
CNU2008200207577U Expired - Fee Related CN201191870Y (en) 2008-04-25 2008-04-25 Mobile phone having OCR recognition function

Country Status (1)

Country Link
CN (1) CN201191870Y (en)

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2010105701A1 (en) * 2009-03-20 2010-09-23 Sony Ericsson Mobile Communications Ab System and method for providing text input to a communication device
CN101788849B (en) * 2009-12-31 2011-11-16 优视科技有限公司 Optical character recognition input method used for mobile communication equipment system
CN102364926A (en) * 2011-10-21 2012-02-29 镇江科大船苑计算机网络工程有限公司 Android-based intelligent information conversion method
CN102591477A (en) * 2012-01-18 2012-07-18 邓晓波 Character selection method and character selection device for typing in short sentence
CN103186589A (en) * 2011-12-30 2013-07-03 牟颖 A method for quickly judging the authenticity and alarming of drugs through mobile phones
CN103186593A (en) * 2011-12-30 2013-07-03 牟颖 Method for quickly acquiring quality authentication information of electric product through mobile phone
CN105096677A (en) * 2015-08-19 2015-11-25 北京京东方多媒体科技有限公司 Teaching system and work method thereof
CN106446882A (en) * 2016-08-31 2017-02-22 武汉颂大教育科技股份有限公司 method for intelligently marking paper with trace left based on 8-character code
CN106776069A (en) * 2016-12-14 2017-05-31 北京龙贝世纪科技股份有限公司 The automatic method and system for collecting transmission data between a kind of software systems
US9984287B2 (en) 2015-03-05 2018-05-29 Wipro Limited Method and image processing apparatus for performing optical character recognition (OCR) of an article

Legal Events

Date Code Title Description
C14 Grant of patent or utility model
GR01 Patent grant
C17 Cessation of patent right
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20090204

Termination date: 20110425