CN103838866B

CN103838866B - A kind of text conversion method and device

Info

Publication number: CN103838866B
Application number: CN201410105981.6A
Authority: CN
Inventors: 简文杰
Original assignee: Guangdong Genius Technology Co Ltd
Current assignee: Guangdong Genius Technology Co Ltd
Priority date: 2014-03-20
Filing date: 2014-03-20
Publication date: 2017-04-05
Anticipated expiration: 2034-03-20
Also published as: CN103838866A

Abstract

The invention discloses a kind of text conversion method and device.The method includes：Obtain target text image；Text region is carried out to the target text image and obtains corresponding Word message；Carry out participle to the Word message, and determine the part of speech of participle, the pictorial information and/or movement locus information of correspondence participle are obtained from picture database according to the part of speech；According to the type of the sentence in the Word message, the voice messaging corresponding to the Word message is obtained from speech database；The pictorial information and/or movement locus information are exported after being adapted to the voice messaging.Technical scheme proposed by the present invention can be realized for the article of word class being converted into corresponding voice and image, increased the display format of information.

Description

A kind of text conversion method and device

Technical field

The present embodiments relate to field of computer technology, more particularly to a kind of text conversion method and device.

Background technology

Books are carry in the middle of the process of people's acquisition information and study indispensable as a kind of main tool at present Role, such as children help typically by read books oneself understand the world, enlarge one's knowledge.But, for the age compared with Little child, it is inaccurate or completely illiterate because reading, cause them and be difficult to the complex word class text of reading stories Chapter, so that the content and scope of child's reading are restricted, reduces the interest of its reading.

With reaching its maturity for computer technology, electronic equipment miscellaneous（Such as learning machine, smart mobile phone, individual Digital assistants etc.）Swarm and show, be that the aspects such as daily life, working and learning bring great convenience.It is how sharp The article of word class is converted into into other with electronic equipment and can be seemed particularly significant by the form that child easily receives.

The content of the invention

The present invention provides a kind of text conversion method and device, so that realize for the article of word class being converted into other can be by Voice and image that child easily receives.

In a first aspect, embodiments providing a kind of text conversion method, the method includes：

Obtain target text image；

Text region is carried out to the target text image and obtains corresponding Word message；

Carry out participle to the Word message, and determine the part of speech of participle, obtained from picture database according to the part of speech Take the pictorial information and/or movement locus information of correspondence participle；

According to the type of the sentence in the Word message, obtain from speech database corresponding to the Word message Voice messaging；

The pictorial information and/or movement locus information are exported after being adapted to the voice messaging.

Second aspect, the embodiment of the present invention additionally provide a kind of text conversion device, and the device includes：

Text image acquiring unit, for obtaining target text image；

Word message recognition unit, obtains corresponding word letter for Text region is carried out to the target text image Breath；

Picture track acquiring unit, for carrying out participle to the Word message, and determines the part of speech of participle, according to described Part of speech obtains the pictorial information and/or movement locus information of correspondence participle from picture database；

Voice messaging acquiring unit, for the type according to the sentence in the Word message, obtains from speech database Take the voice messaging corresponding to the Word message；

Picture voice-output unit, for the pictorial information and/or movement locus information are existed with the voice messaging Exported after being adapted to.

Technical scheme proposed by the present invention can be realized for the article of word class being converted into corresponding voice and image, increase The display format of information.

Description of the drawings

Fig. 1 is a kind of schematic flow sheet of text conversion method that the embodiment of the present invention one is provided；

Fig. 2 is a kind of structural representation of text conversion device that the embodiment of the present invention two is provided.

Specific embodiment

The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just Part related to the present invention rather than entire infrastructure are illustrate only in description, accompanying drawing.

Embodiment one

Fig. 1 is a kind of schematic flow sheet of text conversion method that the embodiment of the present invention one is provided, and the method can be by text This conversion equipment performing, described device can be only fitted to learning machine, smart mobile phone, panel computer, personal digital assistant or During other any one have the electronic equipment of processor, memory and display, realized by software and/or hardware.Referring to Fig. 1, text conversion method specifically include following steps：

Step 110, acquisition target text image.

In the present embodiment, target text image is the image for including word content, can be that the books to papery enter Row shooting obtains, or directly reads local disk related data and obtains, or acquires from respective server.

Specifically, the process of text conversion device acquisition target text image can be：Shooting is sent to picture collection device Instruction；Receive the target text image that picture collection device shoots；Or can also be：Show text image；Reception acts on text Input instruction on this image；The target text image in text image is determined according to input instruction.Wherein, input instruction is used In it is determined that target text image to be converted in text image.When text conversion device is configured in point reader, input instruction Can be that user uses the specified scope on shown text image of talking pen.

Step 120, Text region is carried out to the target text image obtain corresponding Word message.

Text conversion device is carried out to the image using the Text region algorithm of setting after target text image is got Identification, produces corresponding Word message.Wherein, Text region algorithm can be：Line tilt correction, two-value are entered to the image first Change, denoising, the cutting of monocase region etc. are pre-processed；Then, extract the feature of each character after cutting；Further, adopt and set Fixed matching algorithm, each character feature for being extracted and locally stored template characteristic is compared, is recognized accordingly As a result（That is Word message）.

Certainly those skilled in the art should be understood that Text region algorithm can also be the algorithm of other forms, for example, After pre-processing to target text image, using this Text region algorithm of artificial neural network, directly by target text In image, the lattice information of each character zone is sent into network model and carries out learning training, so as to identify in target text image Word content, obtain corresponding Word message.

Step 130, participle is carried out to the Word message, and determine the part of speech of participle, according to the part of speech from picture number According to pictorial information and/or movement locus information that correspondence participle is obtained in storehouse.

In a specific embodiment of the present embodiment, text conversion device is to obtain target text image corresponding After Word message, participle can be carried out according to default participle technique to the Word message first；Then, further determine that each The part of speech of participle, obtains the noun and/or the verb corresponding to the noun included in Word message；Further, from the picture The movement locus information corresponding to the pictorial information and/or the verb corresponding to the noun is searched and is obtained in database. Wherein, the participle technique is included but is not limited to：By word traversal, the segmenting method matched based on dictionary dictionary or knowledge based The segmenting method of understanding.

Step 140, according to the type of the sentence in the Word message, obtain from speech database corresponding to the text The voice messaging of word information.

In a specific embodiment of the present embodiment, after the corresponding Word message of target text image is obtained, Text conversion device can recognize the type of each sentence in Word message according to default punctuation mark collection and/or crucial verb collection, Wherein described type includes declarative sentence and dialogue sentence；Further, from speech database search and obtain and the statement for being identified Voice messaging and its voice messaging matched with the dialogue sentence for being identified that sentence matches.For example, default punctuation mark Concentration includes colon double quotation marks, and crucial verb is concentrated including " saying ", " crying out " or " asking " etc..Text conversion device is being searched sentence by sentence When including colon double quotation marks in Word message, may recognize that the sentence between this group of punctuation mark is dialogue sentence；By text The sentence being not included in word information between colon double quotation marks is identified as declarative sentence.Text conversion device also can found sentence by sentence When including certain verb that crucial verb is concentrated in Word message, a corresponding behind sentence is judged to talk with sentence. Certainly, text conversion device can also recognize each sentence in Word message in combination with punctuation mark collection and crucial verb collection Type.

In this example, if certain sentence in identification text message is declarative sentence, search simultaneously from speech database The voice messaging matched with the declarative sentence is obtained, for example, the voice signal of the sentence is read aloud by background sound；If identification text The voice messaging matched with the dialogue sentence is searched and obtained to certain sentence in information from speech database to talk with during sentence, The voice signal for for example pronouncing by the sound of correspondence personage.

Step 150, the pictorial information and/or movement locus information are carried out after being adapted to the voice messaging Output.

In the present embodiment, text conversion device can be set up and perform according to Word message resulting during execution step 120 During step 130 between resulting pictorial information and/or resulting voice messaging when movement locus information and execution step 140 Fitting relation；It is according to this fitting relation, while showing to the pictorial information and/or movement locus information, right The voice messaging synchronizes broadcasting.

The technical scheme that the present embodiment is proposed can be realized for the article of word class being converted into corresponding voice and image, increase The display format of information is added.

Embodiment two

Fig. 2 is a kind of structural representation of text conversion device that the embodiment of the present invention two is provided.Referring to Fig. 2, the device Concrete structure it is as follows：

Text image acquiring unit 210, for obtaining target text image；

Word message recognition unit 220, obtains corresponding word for carrying out Text region to the target text image Information；

Picture track acquiring unit 230, for carrying out participle to the Word message, and determines the part of speech of participle, according to The part of speech obtains the pictorial information and/or movement locus information of correspondence participle from picture database；

Voice messaging acquiring unit 240, for the type according to the sentence in the Word message, from speech database Obtain the voice messaging corresponding to the Word message；

Picture voice-output unit 250, for by the pictorial information and/or movement locus information and the voice messaging Exported after being adapted to.

Further, text image acquiring unit 210, specifically for：

Shooting instruction is sent to picture collection device；Receive the target text image that the picture collection device shoots；Or

Show text image；Reception acts on the input instruction on the text image；Determined according to the input instruction Target text image in the text image.

Further, picture track acquiring unit 230, specifically for：

Participle is carried out according to default participle technique to the Word message；

Determine the part of speech of each participle, obtain the noun included in the Word message and/or moving corresponding to the noun Word；

Pictorial information and/or the verb institute searched from the picture database and obtain corresponding to the noun is right The movement locus information answered.

Further, the voice messaging acquiring unit 240, specifically for：

According to default punctuation mark collection and/or crucial verb collection, the type of each sentence in the Word message is recognized, its Described in type include declarative sentence and dialogue sentence；

Search from speech database and obtain the voice messaging that matches with the declarative sentence for being identified and its with known The voice messaging that the dialogue sentence not gone out matches.

Further, picture voice-output unit 250, specifically for：

According to the Word message, set up between the pictorial information and/or movement locus information and the voice messaging Fitting relation；

According to the fitting relation, while showing to the pictorial information and/or movement locus information, to institute State voice messaging and synchronize broadcasting.

The said goods can perform the method provided by any embodiment of the present invention, possess the corresponding functional module of execution method And beneficial effect.

Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also More other Equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims

1. a kind of text conversion method, it is characterised in that include：

Obtain target text image；

Carry out participle to the Word message, and determine the part of speech of participle, obtain right from picture database according to the part of speech Answer the pictorial information and/or movement locus information of participle；

According to the type of the sentence in the Word message, the voice corresponding to the Word message is obtained from speech database Information；

2. text conversion method according to claim 1, it is characterised in that obtain target text image, including：

Show text image；Reception acts on the input instruction on the text image；According to the input instruction determines Target text image in text image.

3. text conversion method according to claim 1, it is characterised in that carry out participle to the Word message, and really Determine the part of speech of participle, the pictorial information and/or movement locus letter of correspondence participle are obtained from picture database according to the part of speech Breath, including：

Determine the part of speech of each participle, obtain noun and/or the verb corresponding to the noun included in the Word message；

Search from the picture database and obtain corresponding to the pictorial information and/or the verb corresponding to the noun Movement locus information.

4. text conversion method according to claim 1, it is characterised in that according to the class of the sentence in the Word message Type, obtains the voice messaging corresponding to the Word message from speech database, including：

According to default punctuation mark collection and/or crucial verb collection, the type of each sentence in the Word message, wherein institute are recognized Stating type includes declarative sentence and dialogue sentence；

Search from speech database and obtain the voice messaging that matches with the declarative sentence for being identified and its with identified The dialogue voice messaging that matches of sentence.

5. text conversion method according to claim 1, it is characterised in that by the pictorial information and/or movement locus Information is exported after being adapted to the voice messaging, including：

According to the Word message, that what is set up between the pictorial information and/or movement locus information and the voice messaging is suitable With relation；

According to the fitting relation, while showing to the pictorial information and/or movement locus information, to institute's predicate Message breath synchronizes broadcasting.

6. a kind of text conversion device, it is characterised in that include：

Text image acquiring unit, for obtaining target text image；

Word message recognition unit, obtains corresponding Word message for carrying out Text region to the target text image；

Picture track acquiring unit, for carrying out participle to the Word message, and determines the part of speech of participle, according to the part of speech The pictorial information and/or movement locus information of correspondence participle are obtained from picture database；

Voice messaging acquiring unit, for the type according to the sentence in the Word message, obtains right from speech database The voice messaging of Word message described in Ying Yu；

Picture voice-output unit, for the pictorial information and/or movement locus information are being carried out with the voice messaging Exported after adaptation.

7. text conversion device according to claim 6, it is characterised in that the text image acquiring unit, it is concrete to use In：

8. text conversion device according to claim 6, it is characterised in that picture track acquiring unit, it is concrete to use In：

9. text conversion method according to claim 6, it is characterised in that the voice messaging acquiring unit, it is concrete to use In：

10. text conversion device according to claim 6, it is characterised in that picture voice-output unit, specifically for：