CN101339617A - Mobile phones photographing and translation device - Google Patents

Info

Publication number
CN101339617A
Authority
CN
China
Prior art keywords
unit
engine
translation
photographing
interface
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CNA2007100435408A
Other languages
Chinese (zh)
Inventor
杨健
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SHANGHAI SPEED COMMUNICATION TECHNOLOGY Co Ltd
Original Assignee
SHANGHAI SPEED COMMUNICATION TECHNOLOGY Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SHANGHAI SPEED COMMUNICATION TECHNOLOGY Co Ltd
Priority to CNA2007100435408A
Publication of CN101339617A
Legal status: Pending

Abstract

The invention discloses a mobile phone photographing and translation device comprising a photographing unit, a user interface unit, a character feature image recognition (OCR) engine unit, a translation engine unit and an image preprocessing unit. The device makes it convenient to load character feature image recognition onto a handheld device, to capture and process printed text photographed by the camera, to guide the user through recognition and character correction with an on-screen interface, to recognize the characters in the digital image, to translate the recognized characters, and to store the recognition results. In summary, the device translates the required information promptly, provides an efficient input function, and offers a new development opportunity for handset functions.

Description

Mobile phones photographing and translation device
Technical field
The present invention relates to the fields of digital image processing, pattern recognition and embedded devices, and in particular to a handheld device for photographing and translating text.
Background art
With the development and spread of handheld devices, the mobile phone has increasingly become the electronic device that people carry in daily life. Efficiently inputting the material that needs to be translated, and translating it promptly, offers a new development opportunity for mobile phone functions.
Summary of the invention
The purpose of the present invention is to provide a photographing and translation device.
This purpose is achieved through the following technical solution:
A photographing and translation device comprises:
A character feature image recognition (OCR) engine unit, used to convert the characters in a digital image of written material into standard character internal codes;
A translation engine unit, used to translate the written material recognized by OCR;
At least one photographing unit having 1.3 megapixels and a macro function, used to capture a digital image of a business card;
An image preprocessing unit, used to convert the captured image into an image format that the translation engine unit can recognize, and to binarize and compress the image in order to improve recognition speed;
A user interface unit, used to interact with the user and to guide the user through the interface of this function.
Wherein, the OCR engine unit comprises:
An engine library unit, used to store character feature vectors;
An engine setting unit, used to set the operating mode or the digital image parameters;
An engine start unit, used to allocate running space, load the engine library into memory, and put the engine into an executable state;
An engine layout analysis unit, used to divide the page layout, segment the text regions to be translated, and frame recognizable characters with connected regions;
An engine recognition unit, used to recognize the digital image within a connected region by extracting features from the visual pattern of the digital image and outputting the recognized character internal codes; and
An engine shutdown unit, used to release the memory space and shut down the engine.
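The life cycle of the OCR engine unit listed above (library, settings, start, recognition, shutdown) might be pictured with the following minimal sketch. It is not the engine of the patent: the class name `OcrEngine`, its method names, and the nearest-feature-vector matching rule are all assumptions made for illustration, and layout analysis is left out.

```python
class OcrEngine:
    """Toy stand-in for the OCR engine unit; every name here is hypothetical."""

    def __init__(self, library):
        # Engine library unit: maps a character to its stored feature vector.
        self._library_source = library
        self._loaded = None          # filled in by start()
        self.settings = {}

    def configure(self, **settings):
        # Engine setting unit: operating mode or digital image parameters.
        self.settings.update(settings)

    def start(self):
        # Engine start unit: load the library into memory.
        self._loaded = dict(self._library_source)

    def recognize(self, features):
        # Engine recognition unit. Assumption: the character whose stored
        # vector has the smallest squared distance to `features` is returned.
        if self._loaded is None:
            raise RuntimeError("engine not started")

        def squared_distance(ch):
            return sum((a - b) ** 2 for a, b in zip(self._loaded[ch], features))

        return min(self._loaded, key=squared_distance)

    def shutdown(self):
        # Engine shutdown unit: release the loaded library.
        self._loaded = None
```

A caller would create the engine with a library, call `start()`, call `recognize()` once per segmented character, and finish with `shutdown()`.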
The engine library unit comprises:
A translation library unit, used to store words and the lookup table of their translations; and
A translation interface unit, used to provide an interface for entering a word to be translated and obtaining the translation result.
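A translation library with a lookup interface could be sketched as below. The class name `TranslationLibrary`, the case-insensitive matching and the sample entry are assumptions for illustration; the patent does not describe the storage format.

```python
class TranslationLibrary:
    """Hypothetical word-to-translation lookup table with a small interface."""

    def __init__(self, table):
        # Translation library unit: store words in lowercase so that
        # lookups do not depend on capitalization (an assumption).
        self._table = {word.lower(): meaning for word, meaning in table.items()}

    def translate(self, word):
        # Translation interface unit: take a word, return its translation,
        # or None when the word is not in the table.
        return self._table.get(word.strip().lower())
```

For example, `TranslationLibrary({"precedent": "先例"}).translate("Precedent")` returns the stored entry, and an unknown word returns `None`.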
Image preprocessing unit: obtains a digital image from the camera unit at a resolution of 1280 × 960 or higher; the JPG image is decoded by hardware into a 16-bit RGB image, the 16-bit RGB image is converted into an 8-bit grayscale BMP image, and the image is then binarized;
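The conversion chain in this step (16-bit RGB to 8-bit gray to binary) can be sketched in a few lines. The patent does not name the 16-bit layout, the gray conversion weights, or the threshold; the RGB565 layout, the common 0.299/0.587/0.114 luma weights and the default threshold of 128 used here are assumptions.

```python
def rgb565_to_gray(pixel):
    """Convert one 16-bit pixel (assumed RGB565) to an 8-bit gray value."""
    r5 = (pixel >> 11) & 0x1F
    g6 = (pixel >> 5) & 0x3F
    b5 = pixel & 0x1F
    # Expand each channel to 8 bits by bit replication.
    r = (r5 << 3) | (r5 >> 2)
    g = (g6 << 2) | (g6 >> 4)
    b = (b5 << 3) | (b5 >> 2)
    # Integer form of the luma weights 0.299, 0.587, 0.114 (assumed).
    return (299 * r + 587 * g + 114 * b) // 1000


def binarize(gray_rows, t=128):
    """Map gray values >= t to 1 (white) and values < t to 0 (black)."""
    return [[1 if g >= t else 0 for g in row] for row in gray_rows]


def preprocess(rgb565_rows, t=128):
    """Run the gray conversion and the binarization on a 2-D list of pixels."""
    gray = [[rgb565_to_gray(p) for p in row] for row in rgb565_rows]
    return binarize(gray, t)
```

A fixed threshold keeps the sketch short; a real device would more likely pick the threshold per image.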
A binary image is an image in which every pixel is either black (gray value 0) or white (gray value 1), with no gradation of gray. Binary images occupy an important position in digital image processing. This is because a practical image processing system requires high processing speed and low cost, and processing information-rich grayscale images is too expensive to be a wise choice. A binarized image can also be analyzed and described with geometric concepts, which is much more convenient than working with grayscale images. Binary image processing has therefore become an independent and important branch of image processing and is widely applied.
Let f(i, j) denote the gray value of the pixel at position (i, j). Binarization is then given by the following formula.
$$ f(i,j) = \begin{cases} 1, & f(i,j) \ge t \\ 0, & f(i,j) < t \end{cases} $$
Here t is the binarization threshold. Within the 8-neighborhood of pixel (i, j), apart from the direct neighbors (d-neighbors), the remaining four pixels on the diagonals are called the indirect neighbors (i-neighbors) of (i, j). The connectivity number of a pixel can be computed from its 8-neighborhood values f(x_0), ..., f(x_7):
$$ N_c = \sum_{k \in \{0,2,4,6\}} \left[ \bigl(1 - f(x_k)\bigr) - \bigl(1 - f(x_k)\bigr)\bigl(1 - f(x_{k+1})\bigr)\bigl(1 - f(x_{k+2})\bigr) \right] $$
where x_8 is taken to be x_0 when the index reaches 8.
For every possible combination of values in a pixel's 8-neighborhood, the connectivity number computed by this formula always lies between 0 and 4. In automatic character recognition the binary image needs to be thinned, which also significantly reduces redundant information.
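As an illustration, the connectivity number formula above can be evaluated directly. The function name and the neighbor ordering (x_0 to x_7 walking once around the pixel, with the even indices on the direct neighbors) are assumptions; the patent text does not show the ordering.

```python
from itertools import product


def connectivity_number(neigh):
    """Connectivity number from the 8 binary neighbor values f(x_0)..f(x_7).

    Black is 0 and white is 1, as in the text, so (1 - f) marks black pixels.
    """
    inv = [1 - v for v in neigh]
    total = 0
    for k in (0, 2, 4, 6):
        # x_8 wraps around to x_0.
        a, b, c = inv[k], inv[k + 1], inv[(k + 2) % 8]
        total += a - a * b * c
    return total


# Every one of the 256 neighborhoods gives a value from 0 to 4.
all_values = {connectivity_number(n) for n in product((0, 1), repeat=8)}
```

Each summand is either 0 or 1, which is why the result can never leave the range 0 to 4. Thinning algorithms typically use this number to decide whether deleting a pixel would break a stroke.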
[Figure A20071004354000052: the binary image and the thinned image]
The advantage of the present invention is that material requiring translation can be input efficiently and translated promptly, which offers a new development opportunity for mobile phone functions.
The user interface unit may comprise:
a preview interface, a layout analysis interface, a word selection interface and a translation interface.
Description of drawings
Fig. 1 is a structural block diagram of an embodiment of the invention;
Fig. 2 is a flow chart of an embodiment of the invention;
Fig. 3 is an engine flow chart of an embodiment of the invention;
Fig. 4-1 to Fig. 4-4 show the operating procedure of an embodiment of the invention.
Embodiment
A preferred embodiment of the present invention is given below with reference to Fig. 1 to Fig. 4-4 and described in detail, so that those skilled in the art can more easily understand the structural and functional features of the invention; it is not intended to limit the scope of the invention.
Referring to Fig. 1, a mobile phone with a photographing and translation function comprises an image preprocessing component 1, a user interface 2, an image recognition engine 3, a dictionary engine 4 and a camera device 5, wherein: the user interface 2 comprises a preview interface 21, a word segmentation interface 22, a correction interface 23 and a translation interface 24, arranged in parallel; the image recognition engine 3 comprises an engine setting 31, a layout analysis engine 32, a character recognition engine 33 and an engine shutdown 34, arranged in parallel; the dictionary engine 4 comprises a dictionary library 41 and a translation interface 42; the camera device 5 comprises a camera exit unit 51, a camera capture unit 52, a camera adjustment unit 53 and a camera preview unit 54.
Referring to the photographing flow 100 shown in Fig. 2:
S1001, initialization: OCR engine initialization, camera initialization and dictionary engine initialization;
S1002, preview: camera preview and preview adjustment;
S1003, capture: camera capture and image conversion;
S1004, layout analysis: dividing the text into word blocks;
S1005, word selection and recognition: selecting a word, recognizing the word and correcting the word;
S1006, translation result: calling the dictionary engine and displaying the result.
Within the above flow, at S1006 (translation result), if it is determined that translation should continue, the flow jumps back to S1005 (word selection and recognition); if it is determined that the process is finished, the flow jumps to S1007 (exit), and the photographing and translation function is in the exit state.
At S1005 (word selection and recognition), if it is determined that word selection need not be carried out, the flow goes to S1007 (exit), and the photographing and translation function is in the standby state.
At S1002 (preview), if it is determined that the material need not be photographed, the flow jumps to S1007 (exit), and the photographing and translation function is in the standby state.
At S1001 (initialization), if it is determined that photographing and translation need not be carried out, the flow goes to S1007 (exit), and the photographing and translation function is in the standby state.
At S1004 (layout analysis) or S1003 (capture), if the image is found to be unsatisfactory and needs to be retaken, the flow jumps back to S1002 and the preview step is performed again.
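The branching between the steps described above can be written down as a small transition table. This is only a reading aid: the event names ("ok", "cancel", "retake", "continue", "done") and the function `run` are hypothetical and do not appear in the patent.

```python
# step -> {event: next step}; event names are assumptions for illustration.
TRANSITIONS = {
    "S1001": {"ok": "S1002", "cancel": "S1007"},    # initialization
    "S1002": {"ok": "S1003", "cancel": "S1007"},    # preview
    "S1003": {"ok": "S1004", "retake": "S1002"},    # capture
    "S1004": {"ok": "S1005", "retake": "S1002"},    # layout analysis
    "S1005": {"ok": "S1006", "cancel": "S1007"},    # word selection and recognition
    "S1006": {"continue": "S1005", "done": "S1007"},  # translation result
}


def run(events, start="S1001"):
    """Return the list of steps visited for a sequence of events."""
    step = start
    path = [step]
    for event in events:
        step = TRANSITIONS[step][event]
        path.append(step)
    return path
```

S1007 (exit) has no outgoing transitions, so any event sequence that reaches it must end there.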
Referring to Fig. 3, which shows the engine flow 200 of the invention:
S2001, start;
S2002, start the business card image recognition engine;
S2003, set the business card image attributes;
S2004, process the business card image;
S2005, output the business card text;
S2006, is there more text to output? If so, jump back to S2004; otherwise go to S2007;
S2007, close the business card image;
S2008, end of program.
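The loop formed by S2004 to S2006 can be sketched as follows. The function name `run_engine_flow`, the idea of passing the segmented image blocks as a list, and the `recognize` callback are assumptions; engine start-up and settings (S2002, S2003) are taken as already done by the caller.

```python
def run_engine_flow(image_blocks, recognize):
    """Process blocks one by one and collect the recognized text.

    `recognize` is any callable that turns one block into a string.
    """
    words = []
    pending = list(image_blocks)
    while pending:                       # S2006: is there more text to output?
        block = pending.pop(0)           # S2004: process the next block
        words.append(recognize(block))   # S2005: output the recognized text
    return words                         # S2007/S2008: close and finish
```

For instance, `run_engine_flow(["ab", "cd"], str.upper)` returns the two blocks in upper case, in order.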
Referring to Fig. 4-1 to Fig. 4-4, which show the user's operating procedure of the invention:
1', photograph the material and preview the image;
2', click "Recognize"; after a few seconds, word frames appear on the business card; select a frame, for example the one around "precedent", by keyboard or stylus;
3', the recognized English word, precedent, pops up; if it was misrecognized it can be corrected; then click "Translate";
4', the Chinese translation of the word is displayed; by repeating the "Continue" and "Translate" operations, every word in the whole passage can be translated.

Claims (4)

1. A mobile phone photographing and translation device, comprising:
An OCR engine unit, used to convert the characters in a digital image of written material into standard character internal codes;
A translation engine unit, used to translate the written material recognized by OCR;
At least one photographing unit having 1.3 megapixels and a macro function, used to capture a digital image of a business card;
An image preprocessing unit, used to convert the captured image into an image format that the translation engine unit can recognize, and to binarize and compress the image in order to improve recognition speed; and a user interface unit, used to interact with the user and to guide the user in using this interface.
2. The mobile phone photographing and translation device according to claim 1, characterized in that the OCR engine unit comprises: an engine library unit, used to store character feature vectors; an engine setting unit, used to set the operating mode or the digital image parameters; an engine start unit, used to allocate running space, load the engine library into memory, and put the engine into an executable state; an engine layout analysis unit, used to divide the page layout, segment the text regions to be translated, and frame recognizable characters with connected regions; an engine recognition unit, used to recognize the digital image within a connected region by extracting features from the visual pattern of the digital image and outputting the recognized character internal codes; and an engine shutdown unit, used to release the memory space and shut down each of the above engine units.
3. The mobile phone photographing and translation device according to claim 2, characterized in that the engine library unit comprises:
A translation library unit, used to store words and the lookup table of their translations;
A translation interface unit, used to provide an interface for entering a word to be translated and obtaining the translation result.
4. The mobile phone photographing and translation device according to claim 1, characterized in that the user interface unit comprises: a preview interface, a layout analysis interface, a word selection interface and a translation interface.
CNA2007100435408A 2007-07-06 2007-07-06 Mobile phones photographing and translation device Pending CN101339617A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNA2007100435408A CN101339617A (en) 2007-07-06 2007-07-06 Mobile phones photographing and translation device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNA2007100435408A CN101339617A (en) 2007-07-06 2007-07-06 Mobile phones photographing and translation device

Publications (1)

Publication Number Publication Date
CN101339617A true CN101339617A (en) 2009-01-07

Family

ID=40213682

Family Applications (1)

Application Number Title Priority Date Filing Date
CNA2007100435408A Pending CN101339617A (en) 2007-07-06 2007-07-06 Mobile phones photographing and translation device

Country Status (1)

Country Link
CN (1) CN101339617A (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102223433A (en) * 2010-04-19 2011-10-19 辜进荣 Identification retrieval matching method of character information of camera mobile phone
CN102737238A (en) * 2011-04-01 2012-10-17 洛阳磊石软件科技有限公司 Gesture motion-based character recognition system and character recognition method, and application thereof
CN102982326B (en) * 2011-09-02 2016-05-25 汉王科技股份有限公司 Literal processing method, device and electronic translation pen
CN102982326A (en) * 2011-09-02 2013-03-20 汉王科技股份有限公司 A method and a device for word processing and an electronic translation pen
CN102355532A (en) * 2011-10-21 2012-02-15 镇江科大船苑计算机网络工程有限公司 Information transmission method based on Android intelligent business card videographing and scanning
CN103716453A (en) * 2012-10-02 2014-04-09 Lg电子株式会社 Mobile terminal and control method for the mobile terminal
CN109101467A (en) * 2013-09-27 2018-12-28 夏普株式会社 The method of operating of information processing unit, recording medium and information processing unit
CN108829644A (en) * 2013-09-27 2018-11-16 夏普株式会社 Information processing unit, recording medium and the method for showing translation result
CN103699527A (en) * 2013-12-20 2014-04-02 上海合合信息科技发展有限公司 Image translation system and method
CN105468226A (en) * 2014-09-11 2016-04-06 深圳富泰宏精密工业有限公司 Picture browsing system and method
CN104881405A (en) * 2015-05-22 2015-09-02 东莞中山大学研究院 Photo translation implementation method based on smart phone and smart phone
CN106649294A (en) * 2016-12-29 2017-05-10 北京奇虎科技有限公司 Training of classification models and method and device for recognizing subordinate clauses of classification models
CN106855854A (en) * 2016-12-29 2017-06-16 北京奇虎科技有限公司 A kind of recognition methods of english information and device
CN108985201A (en) * 2018-06-29 2018-12-11 网易有道信息技术(北京)有限公司 Image processing method, medium, device and calculating equipment
CN110245362A (en) * 2019-06-19 2019-09-17 京东方科技集团股份有限公司 A kind of translating equipment and translation system
CN110245362B (en) * 2019-06-19 2023-10-13 京东方科技集团股份有限公司 Translation device and translation system
US11853711B2 (en) 2019-06-19 2023-12-26 Boe Technology Group Co., Ltd. Translation pen and translation system

Similar Documents

Publication Publication Date Title
CN101339617A (en) Mobile phones photographing and translation device
CN110188365B (en) Word-taking translation method and device
US8626236B2 (en) System and method for displaying text in augmented reality
CN101339618A (en) Mobile phones name card recognition device
EP2472372A1 (en) Input method of contact information and system
US9251428B2 (en) Entering information through an OCR-enabled viewfinder
US20120131520A1 (en) Gesture-based Text Identification and Selection in Images
US20140143721A1 (en) Information processing device, information processing method, and computer program product
KR20100007722A (en) Method of character recongnition and translation based on camera image
CN103678260A (en) Portable electronic business card holder and processing method
US8897594B2 (en) Image reader, mobile terminal apparatus, and non-transitory computer readable medium
CN110674814A (en) Picture identification and translation method, terminal and medium
EP2439676A1 (en) System and method for displaying text in augmented reality
US7623742B2 (en) Method for processing document image captured by camera
Hung et al. Implementing an android application for automatic vietnamese business card recognition
CN103279262A (en) Method and device for extracting content from image
CN103186587A (en) Method for quickly translating English word of book through mobile phone
US20060290789A1 (en) File naming with optical character recognition
KR20150091948A (en) A system for recognizing a font and providing its information and the method thereof
Kaur Text recognition applications for mobile devices
CN104951749A (en) Image content recognition device and image content recognition method
US20060210171A1 (en) Image processing apparatus
US10965801B2 (en) Method for inputting and processing phone number, mobile terminal and storage medium
CN103186581A (en) Method for quickly acquiring pronunciation of uncommon word in book through mobile phone
JP4597644B2 (en) Character recognition device, program and recording medium

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Open date: 20090107