CN101339617A - Mobile phones photographing and translation device - Google Patents
Mobile phones photographing and translation device Download PDFInfo
- Publication number
- CN101339617A CN101339617A CNA2007100435408A CN200710043540A CN101339617A CN 101339617 A CN101339617 A CN 101339617A CN A2007100435408 A CNA2007100435408 A CN A2007100435408A CN 200710043540 A CN200710043540 A CN 200710043540A CN 101339617 A CN101339617 A CN 101339617A
- Authority
- CN
- China
- Prior art keywords
- unit
- engine
- translation
- photographing
- interface
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Images
Abstract
The invention discloses a cellphone photographing and translating device which comprises a photographing unit, a user interface unit, a character feature image identifying engine (OCR engine) unit, a translation engine unit and an image preprocessing unit. The cellphone photographing and translating device of the invention ensures that the loading of the character feature image identification on a handset device, the obtaining and the process of printed characters which are photographed by a photographing device, the identification of the character correction and the displayed human-computer interaction guideline, the identification to the digital image identification characters, the identification character translation and the identification result storage are all very convenient, in summary, the device can translate the required information in time, has high-efficient input function and provide a new developing opportunity for the handset functions.
Description
Technical field
The present invention relates to digital image processing field, area of pattern recognition, and embedded device field, particularly a kind of handheld device interpreting equipment of taking pictures.
Background technology
Along with the development of handheld device with popularize, mobile phone has more and more become the electronic equipment that people's daily life is carried, and the data that how will need to translate is efficiently imported, and data is in time translated, for cell-phone function provides new opportunity to develop.
Summary of the invention
The purpose of this invention is to provide a kind of photographing and translation device.
The objective of the invention is to be achieved through the following technical solutions:
A kind of photographing and translation device comprises:
Character feature image recognition (OCR) engine unit is used for the character conversion of written historical materials digital picture is become character standard ISN.
A translation engine unit is used to translate the written historical materials that OCR identifies;
At least a shooting unit that has 1,300,000 pixels and have macro function is used to take the digital picture of obtaining business card;
An image pretreatment unit, the picture format that the image transitions that is used for taking becomes the translation engine unit to discern carries out the binaryzation compression with image, is used to promote recognition speed;
A user interface elements is used for and user interactions, and the guide user uses this function interface.
Wherein, this OCR engine unit comprises:
The engine library unit is used for the store character proper vector;
Engine is provided with the unit, is used to be provided with operational mode or digital picture parameter;
The engine start unit is used to distribute running space, the engine storehouse is loaded into internal memory, and the startup engine is an executable state;
Engine printed page analysis unit is used to divide the space of a whole page, cuts apart the translation of words zone, and recognizable character is confined with connected region;
The engine recognition unit is used to discern the digital picture that is communicated with in the district, extracts feature according to the digital picture visual pattern and discerns the output character ISN; And
The engine-off unit, engine is closed in the releasing memory space.
This engine library unit comprises:
The translation library unit is used to deposit the speech and the table of comparisons of translating content; And
The translation interface unit is used to provide input translation word, obtains the interface of translation result.
Image pretreatment unit: obtain digital picture from the camera installation unit, this image resolution ratio is more than 1280 * 960, decoding transfers 16 rgb images to the Jpg image through hardware, is converted into 8 gray scale bmp format-patterns from 16 rgb images, then image is carried out binary conversion treatment;
Bianry image is meant the image of only deceiving (gray-scale value is 0) white (gray-scale value is 1) two-value in the entire image picture, does not present the variation of gray scale on them.In Digital Image Processing, bianry image occupies important status.This is because in the image processing system of practicality, requires speed height, the cost of processing low, and it is too big that the shading image that contains much information is handled cost, is not very wise move.And the notion of the image after the binaryzation in can enough geometry analyze and feature description, makes things convenient for manyly compared with gray level image.Thereby binary Images Processing become at present in the Flame Image Process one independently, important branch and obtain to use widely.
If remarked pixel is in that (binary conversion treatment is shown in the following formula for i, the j) gray-scale value of position.
Here t is binary-state threshold (Threshold).The 8-neighborhood (8-Neighbor) of pixel is removed outside d-neighbour's the pixel, and 4 pixels on the remaining diagonal line are called that (symbol is: the i-neighbour for i, non-direct neighborhood j).The linking number of certain pixel can be with the 8-neighborhood value f (x of this pixel
0) ... f (x
7) calculate.
Work as x
k=x
8The time, make x
8=x
0
For the 8-neighborhood of a pixel the value that might exist, calculate according to following formula, its linking number is always got the value between the 0-4.In the automatic identifying of literal, need carry out refinement to bianry image, can also significantly reduce redundant information.
The binary image refined image
Advantage of the present invention is: the data to required translation has efficient input, to data timely translation can be arranged, for cell-phone function provides new opportunity to develop.
User interface elements can comprise:
Preview interface, the speech interface is selected at the printed page analysis interface, the translation interface.
Description of drawings
Fig. 1 is the structured flowchart of the embodiment of the invention;
Fig. 2 is the schematic flow sheet of the embodiment of the invention;
Fig. 3 is the engine schematic flow sheet of the embodiment of the invention.
Fig. 4-1~Fig. 4-4 shows the operating process synoptic diagram of the embodiment of the invention.
Embodiment
Provide better embodiment of the present invention according to Fig. 1~Fig. 4-4 below, and described in detail,, rather than be used for limiting scope of the present invention so that those skilled in the art is easier to understand architectural feature of the present invention and function characteristics.
See also shown in Figure 1, mobile phone with the interpretative function of taking pictures, comprise image pretreatment component 1, user interface 2, image recognition engine 3, dictionary engine 4 and camera installation 5, wherein: user interface 2 comprises preview interface 21, participle interface 22, correction interface 23 and the translation interface 24 that is set up in parallel; The image recognition engine 3 comprises that the engine that is set up in parallel is provided with 31, data printed page analysis engine 32, character recognition engine 33 and engine-off 34; Dictionary engine 4 comprises dictionary library 41 and translation interface 42; Camera installation 5 comprises that camera withdraws from unit 51, camera take pictures unit 52, camera adjustments unit 53 and camera preview unit 54.
See also photograph flow process 100 shown in Figure 2,
S
1001, initialization one OCR engine initialization, camera initialization and the initialization of dictionary engine;
S
1002, data preview one camera preview, preview regulate;
S
1003, data takes that a camera is taken and image transitions;
S
1004, printed page analysis one word piecemeal;
S
1005, select speech identification one to select word, identified word and word correction;
S
1006, translation result one calls dictionary engine, display result;
In each step of above-mentioned flow process, at S
1006In the translation result, if judge that also need proceed just rebound carries out S
1005, select speech identification; If deterministic process finishes, then skip to S
1007, withdraw from mobile phone shooting translation and be in exit status.
At S
1005Select in the speech identification step, judge and select speech to carry out, just carry out S
1007, withdrawing from, mobile phone is taken translation and is in holding state.
At S
1002, in the data preview step, the judgement data need not taken, and just skips to S
1007, withdrawing from, mobile phone is taken translation and is in holding state.
At S
1001In the initialization step, differentiate to take to translate and need not carry out, just carry out S
1007, withdrawing from, mobile phone is taken translation and is in holding state.
At S
1004In the printed page analysis step, or S
1003The undesirable need of discovery image are taken again in the data shooting step, then rebound S
1002, carry out the data preview step again.
See also Fig. 3, it shows the synoptic diagram of engine flow process 200 of the present invention, as shown in the figure,
S
2001, start;
S
2002, start the business card image recognition engine;
S
2003, the business card image attribute is set.
S
2004, business card image is handled
S
2005, the output of business card word
S
2006, also have word output? if word output is arranged, then S is carried out in rebound
2004Otherwise, carry out S
2007,
S
2007, close business card image
S
2008, EOP (end of program).
See also Fig. 4-1~Fig. 4-4
It has provided user's operating process of the present invention: promptly 1 ', shooting data, the preview data image;
2 ', click " identification ", after several seconds, the column picture frame appears on the business card; For example, selected " precdent " column by keyboard or stylus; 3 ', eject English word by identification, precedent if discern wrong can the modification again, clicks translation.4 ', show the translator of Chinese of this word, repeat " continuation "+" translation " operation after, can finish translation to whole section each word of data.
Claims (4)
1, a kind of mobile phones photographing and translation device comprises:
An OCR engine unit is used for the character conversion of written historical materials digital picture is become character standard ISN;
A translation engine unit is used to translate OCR and identifies written historical materials;
At least a shooting unit that has 1,300,000 pixels and macro function is arranged is used to take the digital picture of obtaining business card;
An image pretreatment unit, the picture format that the image transitions that is used for taking becomes the translation engine unit to discern carries out the binaryzation compression with image, to promote recognition speed; And a user interface elements, being used for and user interactions, the guide user uses this interface.
2, mobile phones photographing and translation device according to claim 1 is characterized in that, described ORC engine unit comprises: the engine library unit is used for the store character proper vector; Engine is provided with the unit, is used to establish operational mode or digital picture parameter; The engine start unit is used to distribute running space, and the engine storehouse is loaded into internal memory, and starting this engine start unit is executable state; Engine face version analytic unit is used to divide the space of a whole page, cuts apart the translation of words zone, and recognizable character is confined with connected region; The engine recognition unit is used to discern the digital picture in the connected region, discerns the output character ISN according to the visual pattern extraction feature of digital picture; The engine-off unit, above-mentioned each engine unit is closed in the releasing memory space.
3, mobile phones photographing and translation device according to claim 2 is characterized in that, described engine library unit comprises:
The translation library unit is used to deposit the speech and the table of comparisons of translating content;
The translation interface unit is used to provide input translation word, is the interface that obtains translation result.
4, mobile phones photographing and translation device according to claim 1 is characterized in that, described user interface elements comprises: preview interface, printed page analysis interface, select the speech interface and the translation interface.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007100435408A CN101339617A (en) | 2007-07-06 | 2007-07-06 | Mobile phones photographing and translation device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2007100435408A CN101339617A (en) | 2007-07-06 | 2007-07-06 | Mobile phones photographing and translation device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101339617A true CN101339617A (en) | 2009-01-07 |
Family
ID=40213682
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2007100435408A Pending CN101339617A (en) | 2007-07-06 | 2007-07-06 | Mobile phones photographing and translation device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN101339617A (en) |
Cited By (13)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102223433A (en) * | 2010-04-19 | 2011-10-19 | 辜进荣 | Identification retrieval matching method of character information of camera mobile phone |
CN102355532A (en) * | 2011-10-21 | 2012-02-15 | 镇江科大船苑计算机网络工程有限公司 | Information transmission method based on Android intelligent business card videographing and scanning |
CN102737238A (en) * | 2011-04-01 | 2012-10-17 | 洛阳磊石软件科技有限公司 | Gesture motion-based character recognition system and character recognition method, and application thereof |
CN102982326A (en) * | 2011-09-02 | 2013-03-20 | 汉王科技股份有限公司 | A method and a device for word processing and an electronic translation pen |
CN103699527A (en) * | 2013-12-20 | 2014-04-02 | 上海合合信息科技发展有限公司 | Image translation system and method |
CN103716453A (en) * | 2012-10-02 | 2014-04-09 | Lg电子株式会社 | Mobile terminal and control method for the mobile terminal |
CN104881405A (en) * | 2015-05-22 | 2015-09-02 | 东莞中山大学研究院 | Photo translation implementation method based on smart phone and smart phone |
CN105468226A (en) * | 2014-09-11 | 2016-04-06 | 深圳富泰宏精密工业有限公司 | Picture browsing system and method |
CN106649294A (en) * | 2016-12-29 | 2017-05-10 | 北京奇虎科技有限公司 | Training of classification models and method and device for recognizing subordinate clauses of classification models |
CN106855854A (en) * | 2016-12-29 | 2017-06-16 | 北京奇虎科技有限公司 | A kind of recognition methods of english information and device |
CN108829644A (en) * | 2013-09-27 | 2018-11-16 | 夏普株式会社 | Information processing unit, recording medium and the method for showing translation result |
CN108985201A (en) * | 2018-06-29 | 2018-12-11 | 网易有道信息技术(北京)有限公司 | Image processing method, medium, device and calculating equipment |
CN110245362A (en) * | 2019-06-19 | 2019-09-17 | 京东方科技集团股份有限公司 | A kind of translating equipment and translation system |
-
2007
- 2007-07-06 CN CNA2007100435408A patent/CN101339617A/en active Pending
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102223433A (en) * | 2010-04-19 | 2011-10-19 | 辜进荣 | Identification retrieval matching method of character information of camera mobile phone |
CN102737238A (en) * | 2011-04-01 | 2012-10-17 | 洛阳磊石软件科技有限公司 | Gesture motion-based character recognition system and character recognition method, and application thereof |
CN102982326B (en) * | 2011-09-02 | 2016-05-25 | 汉王科技股份有限公司 | Literal processing method, device and electronic translation pen |
CN102982326A (en) * | 2011-09-02 | 2013-03-20 | 汉王科技股份有限公司 | A method and a device for word processing and an electronic translation pen |
CN102355532A (en) * | 2011-10-21 | 2012-02-15 | 镇江科大船苑计算机网络工程有限公司 | Information transmission method based on Android intelligent business card videographing and scanning |
CN103716453A (en) * | 2012-10-02 | 2014-04-09 | Lg电子株式会社 | Mobile terminal and control method for the mobile terminal |
CN109101467A (en) * | 2013-09-27 | 2018-12-28 | 夏普株式会社 | The method of operating of information processing unit, recording medium and information processing unit |
CN108829644A (en) * | 2013-09-27 | 2018-11-16 | 夏普株式会社 | Information processing unit, recording medium and the method for showing translation result |
CN103699527A (en) * | 2013-12-20 | 2014-04-02 | 上海合合信息科技发展有限公司 | Image translation system and method |
CN105468226A (en) * | 2014-09-11 | 2016-04-06 | 深圳富泰宏精密工业有限公司 | Picture browsing system and method |
CN104881405A (en) * | 2015-05-22 | 2015-09-02 | 东莞中山大学研究院 | Photo translation implementation method based on smart phone and smart phone |
CN106649294A (en) * | 2016-12-29 | 2017-05-10 | 北京奇虎科技有限公司 | Training of classification models and method and device for recognizing subordinate clauses of classification models |
CN106855854A (en) * | 2016-12-29 | 2017-06-16 | 北京奇虎科技有限公司 | A kind of recognition methods of english information and device |
CN108985201A (en) * | 2018-06-29 | 2018-12-11 | 网易有道信息技术(北京)有限公司 | Image processing method, medium, device and calculating equipment |
CN110245362A (en) * | 2019-06-19 | 2019-09-17 | 京东方科技集团股份有限公司 | A kind of translating equipment and translation system |
CN110245362B (en) * | 2019-06-19 | 2023-10-13 | 京东方科技集团股份有限公司 | Translation device and translation system |
US11853711B2 (en) | 2019-06-19 | 2023-12-26 | Boe Technology Group Co., Ltd. | Translation pen and translation system |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101339617A (en) | Mobile phones photographing and translation device | |
CN110188365B (en) | Word-taking translation method and device | |
US8626236B2 (en) | System and method for displaying text in augmented reality | |
CN101339618A (en) | Mobile phones name card recognition device | |
EP2472372A1 (en) | Input method of contact information and system | |
US9251428B2 (en) | Entering information through an OCR-enabled viewfinder | |
US20120131520A1 (en) | Gesture-based Text Identification and Selection in Images | |
US20140143721A1 (en) | Information processing device, information processing method, and computer program product | |
KR20100007722A (en) | Method of character recongnition and translation based on camera image | |
CN103678260A (en) | Portable electronic business card holder and processing method | |
US8897594B2 (en) | Image reader, mobile terminal apparatus, and non-transitory computer readable medium | |
CN110674814A (en) | Picture identification and translation method, terminal and medium | |
EP2439676A1 (en) | System and method for displaying text in augmented reality | |
US7623742B2 (en) | Method for processing document image captured by camera | |
Hung et al. | Implementing an android application for automatic vietnamese business card recognition | |
CN103279262A (en) | Method and device for extracting content from image | |
CN103186587A (en) | Method for quickly translating English word of book through mobile phone | |
US20060290789A1 (en) | File naming with optical character recognition | |
KR20150091948A (en) | A system for recognizing a font and providing its information and the method thereof | |
Kaur | Text recognition applications for mobile devices | |
CN104951749A (en) | Image content recognition device and image content recognition method | |
US20060210171A1 (en) | Image processing apparatus | |
US10965801B2 (en) | Method for inputting and processing phone number, mobile terminal and storage medium | |
CN103186581A (en) | Method for quickly acquiring pronunciation of uncommon word in book through mobile phone | |
JP4597644B2 (en) | Character recognition device, program and recording medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | ||
WD01 | Invention patent application deemed withdrawn after publication |
Open date: 20090107 |