CN107180039A - Picture-based text information recognition method and device

Info

Publication number: CN107180039A
Application number: CN201610133793.3A
Authority: CN
Inventors: 邝野
Original assignee: Tencent Technology Shenzhen Co Ltd
Current assignee: Tencent Technology Shenzhen Co Ltd
Priority date: 2016-03-09
Filing date: 2016-03-09
Publication date: 2017-09-19

Abstract

An embodiment of the invention discloses a picture-based text information recognition method and device. The method is applied to a browser plug-in and includes: receiving a search keyword input by a user; performing text information recognition on each picture in a web interface to obtain the text information contained in each picture; comparing the search keyword with the text information contained in each picture to determine the first picture to which the text information matching the search keyword belongs; and displaying the text information contained in the first picture. With this embodiment, the text information contained in pictures in a web interface can be recognized online, and the operation is convenient.

Description

Picture-based text information recognition method and device

Technical field

The present invention relates to the field of Internet technology, and in particular to a picture-based text information recognition method and device.

Background

At present, to speed up web page publishing and avoid problems such as browser security, some web developers and publishers place the text information to be published directly in pictures. A user cannot directly view the text information contained in such a picture. The picture must first be saved locally from the web page, and a third-party picture parsing tool must then be used to recognize the text information it contains. The operation is cumbersome and the recognition efficiency is low.

Summary of the invention

The technical problem to be solved by the embodiments of the present invention is to provide a picture-based text information recognition method and device that can recognize online the text information contained in pictures in a web interface, with convenient operation.

To solve the above technical problem, an embodiment of the present invention provides a picture-based text information recognition method. The method is applied to a browser plug-in and includes:

receiving a search keyword input by a user;

performing text information recognition on each picture in a web interface to obtain the text information contained in each picture;

comparing the search keyword with the text information contained in each picture, and determining the first picture to which the text information matching the search keyword belongs;

displaying the text information contained in the first picture.

Correspondingly, an embodiment of the present invention further provides a picture-based text information recognition device, including:

a keyword receiving unit, for receiving a search keyword input by a user;

a text information acquiring unit, for performing text information recognition on each picture in a web interface to obtain the text information contained in each picture;

a comparing unit, for comparing the search keyword with the text information contained in each picture and determining the first picture to which the text information matching the search keyword belongs;

a text information display unit, for displaying the text information contained in the first picture.

By implementing the embodiments of the present invention, a search keyword input by a user is received, text information recognition is performed on each picture in a web interface to obtain the text information contained in each picture, the search keyword is compared with the text information contained in each picture to determine the first picture to which the matching text information belongs, and the text information contained in the first picture is displayed. The text information contained in pictures in a web interface can thus be recognized online, and the operation is convenient.

Brief description of the drawings

To describe the technical solutions in the embodiments of the present invention or in the prior art more clearly, the accompanying drawings required for describing the embodiments or the prior art are briefly introduced below. The drawings described below are only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort:

Fig. 1 is a schematic structural diagram of a terminal provided in an embodiment of the present invention;

Fig. 2 is a schematic structural diagram of a terminal provided in another embodiment of the present invention;

Fig. 3 is a schematic flowchart of a picture-based text information recognition method provided in an embodiment of the present invention;

Fig. 4 is a schematic structural diagram of a picture-based text information recognition device provided in an embodiment of the present invention;

Fig. 5 is a schematic structural diagram of a terminal provided in another embodiment of the present invention.

Detailed description

The technical solutions in the embodiments of the present invention are described clearly and completely below with reference to the accompanying drawings. The described embodiments are only some, not all, of the embodiments of the present invention. All other embodiments obtained by a person of ordinary skill in the art based on these embodiments without creative effort fall within the protection scope of the present invention.

The picture-based text information recognition method described above may run in a terminal such as a tablet computer, a mobile phone, or a personal computer (PC), and may also run in a client such as the QQ browser or the Google browser.

Referring to Fig. 1, Fig. 1 is a schematic structural diagram of a terminal in an embodiment of the present invention. The terminal may be a browser plug-in. As shown in the figure, the terminal may include an input module 101, a logic control module 102, a content storage module 103, an application programming interface (API) 104, and an output module 105. The input module 101 and the output module 105 are connected to the logic control module 102, the logic control module 102 is connected to the content storage module 103, and the content storage module 103 is connected to the API 104, wherein:

The input module 101 is used to implement interaction between the user and the terminal and/or to input information into the terminal. For example, the input module 101 may receive numeric or character information input by the user to generate signal input related to user settings or function control. In specific embodiments of the invention, the input module 101 includes at least a touch panel and/or other human-machine interaction interfaces, such as physical input keys or a microphone.

The touch panel, also called a touch screen, collects touch or proximity operations that the user performs on it, for example operations performed on or near the touch panel with a finger, a stylus, or any other suitable object or accessory, and drives the corresponding connection device according to a preset program. Optionally, the touch panel may include a touch detection device and a touch controller. The touch detection device detects the user's touch operation, converts the detected touch operation into an electrical signal, and sends the electrical signal to the touch controller. The touch controller receives the electrical signal from the touch detection device, converts it into contact coordinates, and passes them to the processor. The touch controller may also receive and execute commands sent by the processor. The touch panel may be implemented in several types, such as resistive, capacitive, infrared, and surface acoustic wave. In other embodiments of the invention, the physical input keys used by the input module 101 may include, but are not limited to, one or more of a physical keyboard, function keys (such as volume control keys and switch keys), a trackball, a mouse, and a joystick. An input module 101 in the form of a microphone may collect voice input by the user or the environment and convert it into a command, in the form of an electrical signal, that the logic control module can execute.

The logic control module 102 is the control center of the terminal and connects the various parts of the whole terminal through various interfaces and lines.

The content storage module 103 may be used to store software programs and data. It mainly includes a program storage area and a data storage area. The program storage area may store an operating system and the application programs required by at least one function, such as a sound playing program or an image playing program. The data storage area may store data created through use of the terminal, such as audio data or a phone book.

The output module 105 includes, but is not limited to, an image output unit and a voice output unit. The image output unit is used to output text, pictures, and/or video. In the embodiments of the present invention, the image output unit includes at least a display screen, for example a display screen configured in the form of an LCD (Liquid Crystal Display), an OLED (Organic Light-Emitting Diode), or a field emission display (FED). Alternatively, the image output unit may include a reflective display, such as an electrophoretic display or a display using interferometric modulation of light. The image output unit may include a single display or multiple displays of different sizes. In the embodiments of the present invention, the touch panel used by the input module 101 and the display screen used by the output module 105 may be collectively referred to as a display. When the touch panel detects a touch or proximity gesture on it, the gesture is passed to the logic control module 102 to determine the type of the touch event, and the logic control module 102 then provides the corresponding visual output on the display screen according to the type of the touch event. Although in Fig. 1 the input module 101 and the output module 105 implement the input and output functions of the terminal as two independent components, in some embodiments the touch panel and the display screen may be integrated to implement the input and output functions of the terminal. For example, the image output unit may display various graphical user interfaces (GUI) as virtual control components, including but not limited to windows, scroll bars, icons, and scrapbooks, for the user to operate by touch.

In one possible implementation, the user may input a search keyword through the input module 101. After the logic control module 102 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 103. The content storage module 103 determines each picture in the web interface according to the recognition instruction. The logic control module 102 calls the API and, through the API, dynamically loads and parses each picture to obtain the text information contained in each picture. The logic control module 102 then compares the search keyword with the text information contained in each picture, determines the first picture to which the text information matching the search keyword belongs, and sends the text information contained in the first picture to the content storage module 103, which displays it through the output module 105.

In one possible implementation, after the output module 105 displays a web interface containing pictures, the user may input, through the input module 101, a recognition event for a picture that requires text information recognition. After the logic control module 102 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 103. The content storage module 103 determines the picture that requires text information recognition according to the picture identification information carried in the recognition instruction. The logic control module 102 calls the API and, through the API, dynamically loads and parses that picture to obtain the text information it contains.

In one possible implementation, after the output module 105 displays a web interface containing pictures, an event for performing text information recognition on all pictures in the currently displayed web interface may be generated. After the logic control module 102 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 103. The content storage module 103 determines the pictures that require text information recognition according to the picture identification information of each picture carried in the recognition instruction. The logic control module 102 calls the API and, through the API, dynamically loads and parses those pictures to obtain the text information they contain.

Where the source code contained in the terminal is authorized to be modified, the API may be integrated into the terminal, so that text information recognition does not have to be performed on pictures through a text information recognition driver, which makes upgrading and maintenance easier. Specifically, the logic control module 102 may include a background.html file which, under the HTML language, is used to hold the background pattern and background color. The content storage module 103 may include a content script file (content script), which may be used to capture, through the background management page, the picture corresponding to the picture identification information. The API 104 may include a text information recognition function for performing text information recognition on the picture that requires it.

The picture identification information may be used to uniquely identify a picture, for example the picture name, the storage path, or the data size.
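As a minimal sketch of how picture identification information might be represented, the record below uses hypothetical field names (`name`, `storage_path`, `data_size`) and a hypothetical helper `find_picture`; the source does not define any concrete data structure.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class PictureId:
    """Hypothetical record of picture identification information."""
    name: str          # picture name, e.g. "notice.png"
    storage_path: str  # storage path of the picture (file path or URL)
    data_size: int     # data size in bytes


def find_picture(pictures: List[PictureId], wanted: PictureId) -> Optional[PictureId]:
    """Return the picture whose identification equals `wanted`, else None."""
    for pic in pictures:
        if pic == wanted:  # dataclass equality compares all three fields
            return pic
    return None


# Illustrative data only; these pictures do not come from the source.
page_pictures = [
    PictureId("banner.png", "/img/banner.png", 20480),
    PictureId("notice.png", "/img/notice.png", 10240),
]
match = find_picture(page_pictures, PictureId("notice.png", "/img/notice.png", 10240))
```

Comparing all fields together is one possible design choice; an implementation could equally treat any single field, such as the storage path, as the unique key.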

It can be understood that, in this implementation, the functions of the functional modules of the terminal may be implemented according to the method in the method embodiment of Fig. 3. For details, refer to the description of Fig. 3, which is not repeated here.

Referring to Fig. 2, Fig. 2 is a schematic structural diagram of a terminal in an embodiment of the present invention. The terminal may be a browser plug-in. As shown in the figure, the terminal may include an input module 201, a logic control module 202, a content storage module 203, a text information recognition driver 204, and an output module 205. The input module 201 and the output module 205 are connected to the logic control module 202, the logic control module 202 is connected to the content storage module 203, and the content storage module 203 is connected to the text information recognition driver 204 and the output module 205, wherein:

In one possible implementation, the user may input a search keyword through the input module 201. After the logic control module 202 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 203. The content storage module 203 determines each picture in the web interface according to the recognition instruction. The output module 205 format-converts each picture determined by the content storage module 203 into a picture stream and sends the picture stream to the text information recognition driver 204. The text information recognition driver 204 dynamically loads and parses each picture to obtain the text information contained in the pictures that require text information recognition, format-converts the text information into a data stream, and sends the data stream to the content storage module 203 through the output module 205. The content storage module 203 then sends the text information contained in each picture to the logic control module 202. The logic control module 202 compares the search keyword with the text information contained in each picture, determines the first picture to which the text information matching the search keyword belongs, and sends the text information contained in the first picture to the content storage module 203, which displays it through the output module 205.

In one possible implementation, after the output module 205 displays a web interface containing pictures, the user may input, through the input module 201, a recognition event for a picture that requires text information recognition. After the logic control module 202 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 203. The content storage module 203 determines the picture that requires text information recognition according to the picture identification information carried in the recognition instruction. The output module 205 format-converts the picture determined by the content storage module 203 into a picture stream and sends the picture stream to the text information recognition driver 204. The text information recognition driver 204 dynamically loads and parses the picture to obtain the text information it contains, format-converts the text information into a data stream, and sends the data stream to the content storage module 203 through the output module 205.

In one possible implementation, after the output module 205 displays a web interface, an event for performing text information recognition on all pictures in the currently displayed web interface may be generated. After the logic control module 202 receives the event and determines its type, it sends the recognition instruction in the event to the content storage module 203. The content storage module 203 determines the pictures that require text information recognition according to the picture identification information of each picture carried in the recognition instruction. The output module 205 format-converts the pictures determined by the content storage module 203 into a picture stream and sends the picture stream to the text information recognition driver 204. The text information recognition driver 204 dynamically loads and parses the pictures to obtain the text information they contain, format-converts the text information into a data stream, and sends the data stream to the content storage module 203 through the output module 205.

Where the source code contained in the terminal is subject to authorization for modification, text information recognition needs to be performed on pictures through the text information recognition driver in order to recognize online the text information contained in pictures in the web interface. Specifically, the logic control module 202 may include a background.html file which, under the HTML language, is used to hold the background pattern and background color. The content storage module 203 may include a content script file (content script), which may be used to capture, through the background management page, the picture corresponding to the picture identification information. The text information recognition driver 204 is used to perform text information recognition on the picture that requires it.

Referring to Fig. 3, Fig. 3 is a schematic flowchart of a picture-based text information recognition method in an embodiment of the present invention. The method is applied to a browser plug-in. As shown in the figure, the method may include:

S301: receive a search keyword input by a user.

The terminal may receive a search keyword input by the user. A keyword is the word or phrase that a user enters when using a search engine and that best summarizes the information the user wants to find, for example "k" or "intelligence". In a specific implementation, the terminal may draw a floating window on the web page for the user to input the search keyword. When the user needs to retrieve a picture containing specified text information, the user may input the search keyword through the floating window.

S302: perform text information recognition on each picture in the web interface to obtain the text information contained in each picture.

After receiving the search keyword input by the user, the terminal may obtain all pictures in the web interface and perform text information recognition on each of them to obtain the text information contained in each picture. A web page is the basic element that makes up a website and is the platform that carries the various website applications. The web page displayed on the display screen of the terminal may be the web interface. A web interface is the medium for transmitting and exchanging information between a person and a machine (such as a computer) and may include text, pictures, and/or animation. A picture may contain text information that the user cannot directly view in the currently displayed web interface. The web interface may be the currently displayed web interface. Optionally, the web interface may include all web interfaces of a website, such as the main page and its sub-pages.

In an optional embodiment, after obtaining the text information contained in each picture, the terminal may store the recognized text information locally. When the terminal again receives a search keyword input by the user, it can obtain the text information contained in each picture directly from local storage, instead of performing text information recognition on the pictures in the web interface every time a search keyword is received. This improves resource utilization.
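The local storage described above can be sketched as a small cache that runs recognition only once per picture. This is an illustration under assumptions: the class name `TextCache`, the string picture identifiers, and the stand-in recognizer are all hypothetical and do not come from the source.

```python
from typing import Callable, Dict


class TextCache:
    """Stores recognized text per picture so repeated searches skip recognition."""

    def __init__(self, recognize: Callable[[str], str]) -> None:
        self._recognize = recognize        # hypothetical recognizer: picture id -> text
        self._store: Dict[str, str] = {}   # local store of recognized text
        self.recognition_calls = 0         # how many times recognition actually ran

    def text_for(self, picture_id: str) -> str:
        """Return the stored text, recognizing the picture only on first use."""
        if picture_id not in self._store:
            self.recognition_calls += 1
            self._store[picture_id] = self._recognize(picture_id)
        return self._store[picture_id]


# Stand-in recognizer for illustration; a real one would analyse the picture.
fake_texts = {"a.png": "smart home", "b.png": "contact us"}
cache = TextCache(lambda pid: fake_texts[pid])

first = cache.text_for("a.png")    # runs recognition
second = cache.text_for("a.png")   # served from the local store
```

The sketch keeps the store in memory; persisting it across sessions, and deciding when stored text becomes stale, are left open because the source does not address them.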

S303: compare the search keyword with the text information contained in each picture, and determine the first picture to which the text information matching the search keyword belongs.

The terminal may compare the search keyword with the text information contained in each picture and determine the first picture to which the text information matching the search keyword belongs. For example, if the search keyword is "intelligence", the terminal may check whether the text information contained in each picture includes the characters "intelligence". If text information including those characters exists, the terminal may take the picture to which that text information belongs as the first picture.
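The comparison in S303 can be sketched as a containment check over the recognized text of each picture. The function name `find_matching_pictures` and the sample data are hypothetical, and plain case-sensitive substring matching is an assumption; the source only says that the text "includes" the keyword characters.

```python
from typing import Dict, List


def find_matching_pictures(keyword: str, picture_texts: Dict[str, str]) -> List[str]:
    """Return ids of pictures whose recognized text contains the keyword."""
    if not keyword:
        return []  # an empty keyword matches nothing in this sketch
    return [pid for pid, text in picture_texts.items() if keyword in text]


# Illustrative recognized text, keyed by a hypothetical picture id.
picture_texts = {
    "banner.png": "artificial intelligence lab",
    "footer.png": "contact us",
}
matches = find_matching_pictures("intelligence", picture_texts)
```

Any picture in `matches` can then be treated as the first picture of S303.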

In an optional embodiment, the terminal compares the search keyword with the text information contained in each picture. If no text information matches the search keyword, the terminal may compare the search keyword with each item of web page content in the web interface, determine the web page content that matches the search keyword, and display that content. Web page content may include text, animation, audio, or video, and may also include faces, buildings, or scenery contained in pictures.
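A minimal sketch of this fallback, assuming that both picture text and other web page content are available as strings keyed by hypothetical identifiers; the function name `search_with_fallback` and the result labels are illustrative only.

```python
from typing import Dict, List, Tuple


def search_with_fallback(
    keyword: str,
    picture_texts: Dict[str, str],
    page_contents: Dict[str, str],
) -> Tuple[str, List[str]]:
    """Search picture text first; use other page content only if nothing matches."""
    picture_hits = [pid for pid, text in picture_texts.items() if keyword in text]
    if picture_hits:
        return "picture", picture_hits
    content_hits = [cid for cid, text in page_contents.items() if keyword in text]
    return "content", content_hits


# Illustrative data only.
picture_texts = {"banner.png": "spring sale"}
page_contents = {
    "paragraph-1": "our intelligence products",
    "caption-2": "launch event",
}
source, hits = search_with_fallback("intelligence", picture_texts, page_contents)
```

Non-textual content such as audio, video, faces, or scenery would need its own matching method, which this sketch does not attempt.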

In an optional embodiment, after performing text information recognition on each picture in the web interface and obtaining the text information contained in each picture, the terminal may both compare the search keyword with the text information contained in each picture to determine the first picture to which the matching text information belongs, and compare the search keyword with each item of web page content in the web interface to determine the web page content that matches the search keyword.

S304: display the text information contained in the first picture.

After determining the first picture to which the text information matching the search keyword belongs, the terminal may display the text information contained in the first picture. For example, the terminal may highlight the text information contained in the first picture. As another example, the terminal may convert the text information contained in the first picture into voice and then play the voice through the voice output unit. As a further example, the terminal may create a floating frame and display the text information contained in the first picture in it, where the floating frame may be located at the very front of the terminal's display screen.

Optionally, after determining the first picture to which the text information matching the search keyword belongs, the terminal may display the first picture and/or the text information contained in the first picture.

In an optional embodiment, the terminal may determine, in the currently displayed web interface, a picture that requires text information recognition, and perform text information recognition on the picture to obtain the text information it contains. In a specific implementation, the terminal may display the web interface and determine, in the currently displayed web interface, the picture that requires text information recognition. If the text information contained in that picture is already stored locally, the terminal may obtain it directly from local storage. If it is not stored locally, the terminal may perform text information recognition on the picture, obtain the text information it contains, and then store that text information locally.

In an optional embodiment, after displaying the web interface, the terminal may determine all pictures in the web interface that contain text information, treat each of them as a picture that requires text information recognition, and perform text information recognition on each of them to obtain the text information contained in each picture.

In an optional embodiment, the terminal may receive a text information recognition instruction submitted by the user for a second picture in the web interface, and determine the second picture to be a picture that requires text information recognition. For example, after the terminal displays the web interface, when the user needs to view the text information contained in the second picture, the user may input a text information recognition instruction for the second picture, so that the terminal determines, according to the text information recognition instruction, that the second picture requires text information recognition. Specifically, the user may use the mouse to place the cursor over the second picture and click an item in the web page's right-click menu to input the text information recognition instruction. Optionally, the user may also long-press the second picture to input the text information recognition instruction. The specific manner is not limited by the embodiments of the present invention.

In an optional embodiment, the terminal may call a preset API and perform text information recognition on the picture through the preset API to obtain the text information contained in the picture. In this case no text information recognition driver needs to be installed, which makes upgrading and maintenance easier.

In an optional embodiment, the terminal may format-convert the picture to obtain a picture stream, and perform text information recognition on the picture stream through the text information recognition driver to obtain the text information contained in the picture.
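The driver path can be sketched as follows, assuming that a picture stream is simply a readable byte stream and that the driver returns UTF-8 encoded text as its data stream. Both are assumptions: the source does not specify either format, and `fake_driver` is a stand-in rather than a real recognition driver.

```python
import io
from typing import BinaryIO, Callable


def to_picture_stream(picture_bytes: bytes) -> BinaryIO:
    """Format conversion step: wrap the raw picture bytes in a readable stream."""
    return io.BytesIO(picture_bytes)


def recognize_via_driver(
    picture_bytes: bytes,
    driver: Callable[[BinaryIO], bytes],
) -> str:
    """Send a picture stream to the driver and decode the returned data stream."""
    stream = to_picture_stream(picture_bytes)
    data = driver(stream)        # driver consumes the picture stream
    return data.decode("utf-8")  # assumed encoding of the returned data stream


def fake_driver(stream: BinaryIO) -> bytes:
    # Stand-in for a text information recognition driver: it reads the stream
    # and returns fixed text. A real driver would analyse the picture content.
    stream.read()
    return "smart home".encode("utf-8")


text = recognize_via_driver(b"\x89PNG...", fake_driver)
```

Passing the driver in as a callable keeps the conversion steps independent of any particular driver, so the same flow could also wrap a preset API.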

In this embodiment of the present invention, a search keyword input by a user is received, text information recognition is performed on each picture in the web interface to obtain the text information contained in each picture, the search keyword is compared with the text information contained in each picture to determine the first picture to which the matching text information belongs, and the text information contained in the first picture is displayed. The text information contained in pictures in a web interface can thus be recognized online, and the operation is convenient.
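Steps S301 to S304 can be put together in a short end-to-end sketch. The function name `recognize_and_search`, the string picture identifiers, and the injected `recognize` callable are hypothetical; the sketch returns the first matching picture and leaves the display step to the caller.

```python
from typing import Callable, Dict, Iterable, Optional, Tuple


def recognize_and_search(
    keyword: str,
    pictures: Iterable[str],
    recognize: Callable[[str], str],
) -> Optional[Tuple[str, str]]:
    """Run S302 and S303 for a keyword received in S301.

    Returns (picture id, recognized text) for the first matching picture,
    or None when no picture text contains the keyword.
    """
    # S302: recognize the text information in each picture of the web interface.
    texts: Dict[str, str] = {pid: recognize(pid) for pid in pictures}
    # S303: compare the keyword with the text information of each picture.
    for pid, text in texts.items():
        if keyword in text:
            # S304 is left to the caller: display this text information.
            return pid, text
    return None


# Stand-in recognition results for illustration only.
fake_texts = {
    "logo.png": "Example Corp",
    "notice.png": "intelligence summit 2016",
}
result = recognize_and_search("intelligence", fake_texts.keys(), fake_texts.__getitem__)
no_hit = recognize_and_search("missing", fake_texts.keys(), fake_texts.__getitem__)
```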

Referring to Fig. 4, Fig. 4 is a schematic structural diagram of a picture-based text information recognition device provided in an embodiment of the present invention. The device may include a terminal such as a tablet computer, a mobile phone, or a personal computer, and may also include a client such as a browser. As shown in the figure, the device in this embodiment may include at least a keyword receiving unit 401, a text information acquiring unit 402, a comparing unit 403, and a text information display unit 404, wherein:

The keyword receiving unit 401 is used to receive a search keyword input by a user.

The text information acquiring unit 402 is used to perform text information recognition on each picture in the web interface to obtain the text information contained in each picture.

The comparing unit 403 is used to compare the search keyword with the text information contained in each picture and determine the first picture to which the text information matching the search keyword belongs.

The text information display unit 404 is used to display the text information contained in the first picture.

In an optional embodiment, the picture-based text information recognition device in this embodiment may further include:

a picture determining unit 405, for determining, in the currently displayed web interface, a picture that requires text information recognition.

The text information acquiring unit 402 is further used to perform text information recognition on the picture to obtain the text information contained in the picture.

In an optional embodiment, the picture determining unit 405 is specifically configured to:

Receive a text information recognition instruction submitted by the user for a second picture in the web interface.

Determine the second picture as the picture that requires text information recognition.
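
The two steps above might look like the following sketch. The description does not say how the user submits the instruction, so `RecognitionInstruction` and `determine_picture` are hypothetical names, and the check that the picture belongs to the current page is an added assumption.

```python
from dataclasses import dataclass
from typing import List


@dataclass(frozen=True)
class RecognitionInstruction:
    """Hypothetical event: the user asked for text recognition on one picture."""
    picture: str  # identifier of the picture the instruction targets


def determine_picture(page_pictures: List[str],
                      instruction: RecognitionInstruction) -> str:
    """Return the picture that requires text recognition.

    Assumption: an instruction that targets a picture outside the currently
    displayed page is rejected rather than silently ignored.
    """
    if instruction.picture not in page_pictures:
        raise ValueError(f"{instruction.picture!r} is not in the current page")
    return instruction.picture
```

For example, `determine_picture(["a.png", "b.png"], RecognitionInstruction("b.png"))` returns `"b.png"`, which would then be handed to the text information acquiring unit.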

In an optional embodiment, the text information acquiring unit 402 is specifically configured to:

Call a preset API, and perform text information recognition on each picture through the preset API to obtain the text information contained in the picture.
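
A hedged sketch of recognition through a preset API is given below. The description does not name the API, so it is modeled here as an injected function; `RecognitionError`, `recognize_with_api`, and `stub_api` are hypothetical, and treating a failed call as "no text" is an assumption made so that one unreadable picture does not abort the whole page.

```python
from typing import Callable, Dict, Iterable


class RecognitionError(Exception):
    """Hypothetical error raised by the preset API when recognition fails."""


def recognize_with_api(pictures: Iterable[str],
                       api: Callable[[str], str]) -> Dict[str, str]:
    """Call the preset API once per picture and collect the recognized text."""
    results: Dict[str, str] = {}
    for picture in pictures:
        try:
            results[picture] = api(picture)
        except RecognitionError:
            # Assumption: a picture that cannot be recognized contributes no text.
            results[picture] = ""
    return results


def stub_api(picture: str) -> str:
    """Stand-in for the preset API, used only for illustration."""
    if picture == "broken.png":
        raise RecognitionError(picture)
    return f"text in {picture}"


texts = recognize_with_api(["ok.png", "broken.png"], stub_api)
```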

In an optional embodiment, the text information acquiring unit 402 is specifically configured to:

Perform format conversion on each picture to obtain a picture stream.

Perform text information recognition on the picture stream through a text information recognition driver to obtain the text information contained in the picture.
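
The conversion-then-recognition sequence above can be illustrated as follows. The description does not define the format of the picture stream or the interface of the recognition driver, so this sketch assumes that the stream is an in-memory binary stream over the raw picture bytes and that the driver is a function reading from such a stream; `to_picture_stream`, `recognize_stream`, and `echo_driver` are hypothetical names.

```python
import io
from typing import BinaryIO, Callable


def to_picture_stream(picture_bytes: bytes) -> BinaryIO:
    """Format conversion (assumed): wrap raw picture bytes in a binary stream."""
    return io.BytesIO(picture_bytes)


def recognize_stream(stream: BinaryIO, driver: Callable[[BinaryIO], str]) -> str:
    """Hand the picture stream to the recognition driver and return its text."""
    stream.seek(0)  # make sure the driver reads from the start of the stream
    return driver(stream)


def echo_driver(stream: BinaryIO) -> str:
    """Stand-in driver: decodes the bytes as UTF-8 instead of doing real recognition."""
    return stream.read().decode("utf-8")


recognized = recognize_stream(to_picture_stream(b"hello"), echo_driver)
```

Separating the conversion from the driver call mirrors the two steps of this embodiment and lets a real driver be substituted without changing the conversion.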

In the embodiment of the present invention, the keyword receiving unit 401 receives the search keyword input by the user; the text information acquiring unit 402 performs text information recognition on each picture in the web interface to obtain the text information contained in each picture; the comparing unit 403 compares the search keyword with the text information contained in each picture to determine the first picture to which the text information matching the search keyword belongs; and the text information display unit 404 displays the text information contained in the first picture. The text information contained in pictures in the web interface can thus be recognized online, and the operation is simple.

Referring to Fig. 5, Fig. 5 is a schematic structural diagram of a terminal provided in another embodiment of the present invention. The terminal provided in this embodiment may be used to implement the method of the embodiment of the present invention shown in Fig. 3 above. For convenience of description, only the parts related to the embodiment of the present invention are shown; for specific technical details that are not disclosed here, refer to the embodiment of the present invention shown in Fig. 3.

As shown in Fig. 5, the terminal includes: at least one processor 501, such as a CPU; at least one input device 503; at least one output device 504; a memory 505; and at least one communication bus 502. The communication bus 502 is used to implement connection and communication between these components. The input device 503 may specifically be a network interface used to communicate with an external network. The output device 504 may specifically be a display screen used to display images. The memory 505 may include a high-speed RAM memory, and may also include a non-volatile memory, for example at least one magnetic disk memory, specifically used to store binarized images. The memory 505 may optionally include at least one storage device located remotely from the aforementioned processor 501. The processor 501 may be combined with the picture-based text information recognition device shown in Fig. 4. A set of program code is stored in the memory 505, and the processor 501 calls the program code stored in the memory 505 to perform the following operations:

The input device 503 receives a search keyword input by a user.

The processor 501 performs text information recognition on each picture in the web interface to obtain the text information contained in each picture.

The processor 501 compares the search keyword with the text information contained in each picture, and determines the first picture to which the text information matching the search keyword belongs.

The output device 504 displays the text information contained in the first picture.

In an optional embodiment, the processor 501 may further perform the following operations:

The processor 501 determines, in the currently displayed web interface, a picture that requires text information recognition.

The processor 501 performs text information recognition on the picture to obtain the text information contained in the picture.

In an optional embodiment, the processor 501 determining, in the currently displayed web interface, the picture that requires text information recognition may specifically be:

The input device 503 receives a text information recognition instruction submitted by the user for a second picture in the web interface.

The processor 501 determines the second picture as the picture that requires text information recognition.

In an optional embodiment, the processor 501 performing text information recognition on each picture in the web interface to obtain the text information contained in each picture may specifically be:

The processor 501 calls a preset API, and performs text information recognition on each picture through the preset API to obtain the text information contained in the picture.

The processor 501 performs format conversion on each picture to obtain a picture stream.

The processor 501 performs text information recognition on the picture stream through a text information recognition driver to obtain the text information contained in the picture.

Specifically, the terminal described in the embodiment of the present invention may be used to implement part or all of the flow in the method embodiment of the present invention described with reference to Fig. 3.

Those of ordinary skill in the art will understand that all or part of the flows in the methods of the above embodiments may be implemented by a computer program instructing the relevant hardware. The program may be stored in a computer-readable storage medium, and when executed, the program may include the flows of the embodiments of the above methods. The storage medium may be a magnetic disk, an optical disc, a read-only memory (Read-Only Memory, ROM), a random access memory (Random Access Memory, RAM), or the like.

The above disclosure is only a preferred embodiment of the present invention and certainly cannot be used to limit the scope of rights of the present invention; therefore, equivalent changes made according to the claims of the present invention still fall within the scope covered by the present invention.

Claims

1. A picture-based text information recognition method, characterized in that the method is applied to a browser plug-in, and the method comprises:

Receiving a search keyword input by a user;

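Performing text information recognition on each picture in a web interface to obtain the text information contained in each picture;

Comparing the search keyword with the text information contained in each picture, and determining the first picture to which the text information matching the search keyword belongs;

Displaying the text information contained in the first picture.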

2. The method according to claim 1, characterized in that the method further comprises:

Determining, in the currently displayed web interface, a picture that requires text information recognition;

Performing text information recognition on the picture to obtain the text information contained in the picture.

3. The method according to claim 2, characterized in that the determining, in the currently displayed web interface, a picture that requires text information recognition comprises:

Receiving a text information recognition instruction submitted by the user for a second picture in the web interface;

Determining the second picture as the picture that requires text information recognition.

4. The method according to claim 1, characterized in that the performing text information recognition on each picture in the web interface to obtain the text information contained in each picture comprises:

Calling a preset application programming interface, and performing text information recognition on each picture through the preset application programming interface to obtain the text information contained in the picture.

5. The method according to claim 1, characterized in that the performing text information recognition on each picture in the web interface to obtain the text information contained in each picture comprises:

Performing format conversion on each picture to obtain a picture stream;

Performing text information recognition on the picture stream through a text information recognition driver to obtain the text information contained in the picture.

6. A picture-based text information recognition device, characterized by comprising:

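A keyword receiving unit, configured to receive a search keyword input by a user;

A text information acquiring unit, configured to perform text information recognition on each picture in a web interface to obtain the text information contained in each picture;

A comparing unit, configured to compare the search keyword with the text information contained in each picture, and to determine the first picture to which the text information matching the search keyword belongs;

A text information display unit, configured to display the text information contained in the first picture.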

7. The device according to claim 6, characterized in that the device further comprises:

A picture determining unit, configured to determine, in the currently displayed web interface, a picture that requires text information recognition;

The text information acquiring unit is further configured to perform text information recognition on the picture to obtain the text information contained in the picture.

8. The device according to claim 7, characterized in that the picture determining unit is specifically configured to:

9. The device according to claim 6, characterized in that the picture recognition unit is specifically configured to:

10. The device according to claim 6, characterized in that the picture recognition unit is specifically configured to:

Perform format conversion on each picture to obtain a picture stream;