CN201191870Y - Mobile phone having OCR recognition function - Google Patents
Mobile phone having OCR recognition function Download PDFInfo
- Publication number
- CN201191870Y CN201191870Y CNU2008200207577U CN200820020757U CN201191870Y CN 201191870 Y CN201191870 Y CN 201191870Y CN U2008200207577 U CNU2008200207577 U CN U2008200207577U CN 200820020757 U CN200820020757 U CN 200820020757U CN 201191870 Y CN201191870 Y CN 201191870Y
- Authority
- CN
- China
- Prior art keywords
- ocr
- handset
- mobile phone
- image
- ocr recognition
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000000605 extraction Methods 0.000 claims abstract description 7
- 238000012937 correction Methods 0.000 claims abstract description 5
- 238000002203 pretreatment Methods 0.000 claims description 9
- 238000000034 method Methods 0.000 abstract description 13
- 230000009286 beneficial effect Effects 0.000 abstract 1
- 238000012015 optical character recognition Methods 0.000 description 33
- 230000006870 function Effects 0.000 description 15
- 238000005516 engineering process Methods 0.000 description 5
- 230000003287 optical effect Effects 0.000 description 3
- 230000008569 process Effects 0.000 description 3
- 238000012545 processing Methods 0.000 description 3
- 238000012958 reprocessing Methods 0.000 description 3
- 239000000284 extract Substances 0.000 description 2
- 239000000463 material Substances 0.000 description 2
- 230000007246 mechanism Effects 0.000 description 2
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000006243 chemical reaction Methods 0.000 description 1
- 238000004891 communication Methods 0.000 description 1
- 230000007812 deficiency Effects 0.000 description 1
- 230000004069 differentiation Effects 0.000 description 1
- 238000003384 imaging method Methods 0.000 description 1
- 238000010606 normalization Methods 0.000 description 1
- 238000004321 preservation Methods 0.000 description 1
- 230000003252 repetitive effect Effects 0.000 description 1
- 238000011160 research Methods 0.000 description 1
- 238000005728 strengthening Methods 0.000 description 1
Images
Landscapes
- Character Discrimination (AREA)
Abstract
The utility model discloses a handset, in particular to a handset with OCR identification function, which comprises a handset host, a cam and a main board hard disk. The handset is characterized in that an OCR chip which is integrated with an OCR identification system is arranged on the main board hard disk, the OCR identification system enters into the video-image fore-process, the character feature extraction and the comparison identification through the video-image input of the cam and corrects the false character by the artificial correction and outputs the result. The cam of the handset can store a plurality of 'shot' data in the handset, which provides convenience to read for people timely, further memorizing repeatedly and reaching the goal of getting hold of the knowledge. The utility model plants the OCR function into the handset, which is beneficial to read book and newspaper for people or to store the data of the plane media for students greatly.
Description
(1) technical field
The utility model relates to a kind of mobile phone, particularly a kind of mobile phone with OCR recognition function.
(2) background technology
In daily life, we see that from newspaper, magazine many data have value for preservation very much, and common people do not have copier at hand, record and the especially trouble that seems with notes.At any time to consult in the mobile phone to be very easily if can be stored into it, but spell them in the mobile phone not only consuming time but also consume power with phonetic or handwriting functions!
(3) summary of the invention
The utility model provides a kind of mobile phone with OCR recognition function easy to use in order to remedy the deficiencies in the prior art.
The utility model is achieved by the following technical solution:
A kind of mobile phone with OCR recognition function comprises mobile phone main body, camera, mainboard hard disk, and its special character is: an OCR chip that is integrated with the OCR recognition system is installed on the described mainboard hard disk.
Mobile phone with OCR recognition function of the present utility model, described OCR recognition system enter image pre-treatment, character features extraction, comparison identification successively through the image input of camera, through the word correction that manual synchronizing will be admitted one's mistake, the result are exported.
OCR (Optical Character Recognition) optical character identification.The OCR Chinese meaning is discerned literal by optical technology exactly.This technology can make equipment come identification character by the mechanism of optics.The mankind discern many things with eyes, and its mode is exactly a kind of optics mechanism.
The utility model is added to the OCR function in the mobile phone, makes things convenient for the scholar to store data.The utility model utilizes the camera of mobile phone that some data " bat " are stored in mobile phone, said " bat " be utilize existing OCR system data logging in mobile phone.The OCR system notes data with the form of picture, but utilize its Automatic Editing Function that has that the form of data with text is stored in the mobile phone, and can browse at any time, be convenient for people to browse at any time, and then repetitive memory, reach the purpose of grasping this knowledge! After the utility model is successfully implanted mobile phone with the OCR function, people who reads newspaper helping greatly often reading and the data on numerous students' memory plane medium.
(4) description of drawings
Below in conjunction with accompanying drawing the utility model is further described.
Fig. 1 is a structural representation of the present utility model;
Fig. 2 is the structured flowchart of the utility model OCR recognition system.
Among the figure, 1 mobile phone main body, 2 cameras, 3 mainboard hard disks, 4OCR chip.
(5) embodiment
Accompanying drawing is a kind of specific embodiment of the present utility model.This embodiment comprises mobile phone main body 1, camera 2, mainboard hard disk 3, and an OCR chip 4 that is integrated with the OCR recognition system is installed on the mainboard hard disk 3; The OCR recognition system enters image pre-treatment, character features extraction, comparison identification successively through the image input of camera, through the word correction that manual synchronizing will be admitted one's mistake, the result is exported.
OCR (Optical Character Recognition) optical character identification.It belongs to a knowledge of pattern identification, and its purpose will allow computer know what it has seen on earth exactly, especially written historical materials.Because OCR is a technology with the discrimination tug-of-war, therefore how debug or utilize supplementary raising recognition correct rate is the most important problem of OCR.And the media difference that exists according to written historical materials, and obtain the mode difference of these data, just derive of all kinds, various application.
An OCR recognition system, its purpose is very simple, just to do a conversion to image, make the figure in the image continue to preserve, have form then interior data of form and the interior literal of image, become the mobile phone literal without exception, the literal that the storage capacity that enables to reach image data reduces, identifies can re-use and analyze, and also can save manpower and time because of the keyboard input certainly.Its handling process is as follows:
From the image to result, export, must extract through image input, image pre-treatment, character features, comparison is discerned, after the word correction that manual synchronizing will be admitted one's mistake the result is exported.Make introductions all round at this:
The image input: the data that needs OCR to handle must be transferred to image in the mobile phone by camera.Now the medium-to-high grade mobile phone produced of each big communication apparatus company generally all is equipped with camera, has and takes a picture and camera function.Along with the progress of science and technology, the resolution of camera will be more and more higher, thereby make imaging quality also more and more clear simultaneously, and this can improve reading of OCR system and automatic editing speed greatly.
The image pre-treatment: the image pre-treatment is in the OCR system, the maximum module of must dealing with problems, from obtain one be not black be exactly white binaryzation image, or GTG, colored image, process to independently going out literal image one by one all belongs to the image pre-treatment.Comprise image normalization, removed the image processing of noise, image rectification etc., and the file pre-treatment that separates with word of picture and text analysis, literal line.Therefore aspect image processing, all reached the ripe stage in principle and technical elements, on the market or many available chained libraries are arranged on the website; Aspect the file pre-treatment, then with each tame ability; Image must be separated picture, form and character area earlier, even the layout direction of article, the outline and the content body of article can be distinguished, and the font of the size of literal and literal also can judging as original document.
Character features extracts: single with discrimination, and can the say so core of OCR of feature extraction, with what feature, how to extract, the direct quality discerned of influence, so also study the initial stage at OCR, the research report of feature extraction is many especially.And the chip that feature can be said so and be discerned, easy differentiation can be divided into two classes: a feature for statistics, as the ratio of counting of the black/white in the character area, when literal field is divided into several zones, this regional one by one black/white count than associating, just become a numerical value vector in space, when comparison, basic mathematical theory just is enough to deal with.And the another kind of feature that is characterized as structure, after literal image graph thinning, obtain the stroke end points of word, the quantity and the position in crosspoint, or be feature with the stroke section, cooperate special comparison method, compare the many methods of the recognition methods of hand-written Input Software on the line on the market based on this kind structure.
Comparison database: after input characters has been calculated feature, no matter be feature with statistics or structure, all must there be a comparison database or property data base to compare, the content of database should comprise the word collection literal of all desire identifications, according to the feature group of the feature extraction method gained the same with input characters.
Contrast identification: this is a module can giving full play to the mathematical operation theory, according to different features, select different mathematical distance functions for use, more famous comparison method has, the comparison method of theorem in Euclid space, lax comparison method (Relaxation), dynamic routine comparison method (Dynamic Programming, DP), and the database of neural network is set up and comparison, HMM (Hidden Markov Model) ... etc. famous method, for the result that makes identification more stable, also there is so-called expert system (Experts System) to be suggested, utilize the different complementarity of various feature comparison methods, make the result who identifies, its confidence degree is high especially.
The words reprocessing: because the discrimination of OCR and can't reaching absolutely, or want correctness and the confidence value strengthening comparing, some debugs or even the function of help corrigendum, also become necessary in an OCR system module.The words reprocessing is exactly an example, utilizes in identification literal and its possible similar candidate's sub-block after the comparison, finds out the most logical speech according to the identification literal of front and back, does the function of corrigendum.
Word database: the dictionary of being set up for the words reprocessing.
Manual synchronizing: a good OCR software, except a stable image processing and identification core are arranged, reducing outside the error rate, the operating process of manual synchronizing and function thereof also influence the treatment effeciency of OCR.
Result's output: output is the simple thing of part in fact, but must see the user with OCR on earth for what? someone is as long as text is made the usefulness that re-uses of segment word, so as long as general text file, someone will be beautiful bright with the input file striking resemblances, so the function, the someone that have original text to reappear pay attention to the literal in the form, thus will with software combination such as Excel.No matter how to change, all just export the variation of File Format.
Claims (2)
1. the mobile phone with OCR recognition function comprises mobile phone main body (1), camera (2), mainboard hard disk (3), it is characterized in that: an OCR chip (4) that is integrated with the OCR recognition system is installed on the described mainboard hard disk (3).
2. the mobile phone with OCR recognition function according to claim 1, it is characterized in that: described OCR recognition system is through the image input of camera, enter image pre-treatment, character features extraction, comparison identification successively,, the result is exported through the word correction that manual synchronizing will be admitted one's mistake.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNU2008200207577U CN201191870Y (en) | 2008-04-25 | 2008-04-25 | Mobile phone having OCR recognition function |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNU2008200207577U CN201191870Y (en) | 2008-04-25 | 2008-04-25 | Mobile phone having OCR recognition function |
Publications (1)
Publication Number | Publication Date |
---|---|
CN201191870Y true CN201191870Y (en) | 2009-02-04 |
Family
ID=40336029
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNU2008200207577U Expired - Fee Related CN201191870Y (en) | 2008-04-25 | 2008-04-25 | Mobile phone having OCR recognition function |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN201191870Y (en) |
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010105701A1 (en) * | 2009-03-20 | 2010-09-23 | Sony Ericsson Mobile Communications Ab | System and method for providing text input to a communication device |
CN101788849B (en) * | 2009-12-31 | 2011-11-16 | 优视科技有限公司 | Optical character recognition input method used for mobile communication equipment system |
CN102364926A (en) * | 2011-10-21 | 2012-02-29 | 镇江科大船苑计算机网络工程有限公司 | Android-based intelligent information conversion method |
CN102591477A (en) * | 2012-01-18 | 2012-07-18 | 邓晓波 | Character selection method and character selection device for typing in short sentence |
CN103186589A (en) * | 2011-12-30 | 2013-07-03 | 牟颖 | A method for quickly judging the authenticity and alarming of drugs through mobile phones |
CN103186593A (en) * | 2011-12-30 | 2013-07-03 | 牟颖 | Method for quickly acquiring quality authentication information of electric product through mobile phone |
CN105096677A (en) * | 2015-08-19 | 2015-11-25 | 北京京东方多媒体科技有限公司 | Teaching system and work method thereof |
CN106446882A (en) * | 2016-08-31 | 2017-02-22 | 武汉颂大教育科技股份有限公司 | method for intelligently marking paper with trace left based on 8-character code |
CN106776069A (en) * | 2016-12-14 | 2017-05-31 | 北京龙贝世纪科技股份有限公司 | The automatic method and system for collecting transmission data between a kind of software systems |
US9984287B2 (en) | 2015-03-05 | 2018-05-29 | Wipro Limited | Method and image processing apparatus for performing optical character recognition (OCR) of an article |
-
2008
- 2008-04-25 CN CNU2008200207577U patent/CN201191870Y/en not_active Expired - Fee Related
Cited By (10)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2010105701A1 (en) * | 2009-03-20 | 2010-09-23 | Sony Ericsson Mobile Communications Ab | System and method for providing text input to a communication device |
CN101788849B (en) * | 2009-12-31 | 2011-11-16 | 优视科技有限公司 | Optical character recognition input method used for mobile communication equipment system |
CN102364926A (en) * | 2011-10-21 | 2012-02-29 | 镇江科大船苑计算机网络工程有限公司 | Android-based intelligent information conversion method |
CN103186589A (en) * | 2011-12-30 | 2013-07-03 | 牟颖 | A method for quickly judging the authenticity and alarming of drugs through mobile phones |
CN103186593A (en) * | 2011-12-30 | 2013-07-03 | 牟颖 | Method for quickly acquiring quality authentication information of electric product through mobile phone |
CN102591477A (en) * | 2012-01-18 | 2012-07-18 | 邓晓波 | Character selection method and character selection device for typing in short sentence |
US9984287B2 (en) | 2015-03-05 | 2018-05-29 | Wipro Limited | Method and image processing apparatus for performing optical character recognition (OCR) of an article |
CN105096677A (en) * | 2015-08-19 | 2015-11-25 | 北京京东方多媒体科技有限公司 | Teaching system and work method thereof |
CN106446882A (en) * | 2016-08-31 | 2017-02-22 | 武汉颂大教育科技股份有限公司 | method for intelligently marking paper with trace left based on 8-character code |
CN106776069A (en) * | 2016-12-14 | 2017-05-31 | 北京龙贝世纪科技股份有限公司 | The automatic method and system for collecting transmission data between a kind of software systems |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN201191870Y (en) | Mobile phone having OCR recognition function | |
Liwicki et al. | IAM-OnDB-an on-line English sentence database acquired from handwritten text on a whiteboard | |
US20080008387A1 (en) | Method and apparatus for recognition of handwritten symbols | |
CN106161873A (en) | A kind of video information extracts method for pushing and system | |
Li et al. | Towards real-world writing assistance: A chinese character checking benchmark with faked and misspelled characters | |
CN111369980A (en) | Voice detection method and device, electronic equipment and storage medium | |
CN114357206A (en) | Education video color subtitle generation method and system based on semantic analysis | |
CN111932418B (en) | Student learning condition identification method and system, teaching terminal and storage medium | |
CN112149680A (en) | Wrong word detection and identification method and device, electronic equipment and storage medium | |
CN115988149A (en) | Method for generating video by AI intelligent graphics context | |
CN201251767Y (en) | Intelligent electronic dictionary | |
US20240211542A1 (en) | Text verification device with battery power supply | |
Xin et al. | Comic text detection and recognition based on deep learning | |
CN111274369A (en) | English word recognition method and device | |
KR102604122B1 (en) | Apparatus and method for classifying images | |
CN117152770A (en) | Handwriting input-oriented writing capability intelligent evaluation method and system | |
CN114118054A (en) | Method and system for reprocessing extracted bill information | |
CN110674859A (en) | Chinese short text similarity detection method and system based on Chinese character strokes | |
Sai et al. | Advanced handwritten text recognition for cursive writings with spelling correction module | |
Puigcerver et al. | Advances in handwritten keyword indexing and search technologies | |
CN114764437A (en) | User intention identification method and device and electronic equipment | |
Rychlik et al. | Development of a New Image-to-Text Conversion System for Pashto, Farsi and Traditional Chinese | |
Oliveira et al. | Comparing human and machine performances in transcribing 18th century handwritten venetian script | |
Bharadwaj et al. | Handwriting Recognition Using CNN | |
Ang et al. | Extracting Medication Information from Typewritten Philippine Medical Prescriptions Using Optical Character Recognition (OCR) and Named Entity Recognition (NER) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C17 | Cessation of patent right | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20090204 Termination date: 20110425 |