CN103838866B - A kind of text conversion method and device - Google Patents

A kind of text conversion method and device Download PDF

Info

Publication number
CN103838866B
CN103838866B CN201410105981.6A CN201410105981A CN103838866B CN 103838866 B CN103838866 B CN 103838866B CN 201410105981 A CN201410105981 A CN 201410105981A CN 103838866 B CN103838866 B CN 103838866B
Authority
CN
China
Prior art keywords
word message
participle
sentence
text image
speech
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201410105981.6A
Other languages
Chinese (zh)
Other versions
CN103838866A (en
Inventor
简文杰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Genius Technology Co Ltd
Original Assignee
Guangdong Genius Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Genius Technology Co Ltd filed Critical Guangdong Genius Technology Co Ltd
Priority to CN201410105981.6A priority Critical patent/CN103838866B/en
Publication of CN103838866A publication Critical patent/CN103838866A/en
Application granted granted Critical
Publication of CN103838866B publication Critical patent/CN103838866B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V30/00Character recognition; Recognising digital ink; Document-oriented image-based pattern recognition
    • G06V30/10Character recognition
    • G06V30/32Digital ink
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The invention discloses a kind of text conversion method and device.The method includes:Obtain target text image;Text region is carried out to the target text image and obtains corresponding Word message;Carry out participle to the Word message, and determine the part of speech of participle, the pictorial information and/or movement locus information of correspondence participle are obtained from picture database according to the part of speech;According to the type of the sentence in the Word message, the voice messaging corresponding to the Word message is obtained from speech database;The pictorial information and/or movement locus information are exported after being adapted to the voice messaging.Technical scheme proposed by the present invention can be realized for the article of word class being converted into corresponding voice and image, increased the display format of information.

Description

A kind of text conversion method and device
Technical field
The present embodiments relate to field of computer technology, more particularly to a kind of text conversion method and device.
Background technology
Books are carry in the middle of the process of people's acquisition information and study indispensable as a kind of main tool at present Role, such as children help typically by read books oneself understand the world, enlarge one's knowledge.But, for the age compared with Little child, it is inaccurate or completely illiterate because reading, cause them and be difficult to the complex word class text of reading stories Chapter, so that the content and scope of child's reading are restricted, reduces the interest of its reading.
With reaching its maturity for computer technology, electronic equipment miscellaneous(Such as learning machine, smart mobile phone, individual Digital assistants etc.)Swarm and show, be that the aspects such as daily life, working and learning bring great convenience.It is how sharp The article of word class is converted into into other with electronic equipment and can be seemed particularly significant by the form that child easily receives.
The content of the invention
The present invention provides a kind of text conversion method and device, so that realize for the article of word class being converted into other can be by Voice and image that child easily receives.
In a first aspect, embodiments providing a kind of text conversion method, the method includes:
Obtain target text image;
Text region is carried out to the target text image and obtains corresponding Word message;
Carry out participle to the Word message, and determine the part of speech of participle, obtained from picture database according to the part of speech Take the pictorial information and/or movement locus information of correspondence participle;
According to the type of the sentence in the Word message, obtain from speech database corresponding to the Word message Voice messaging;
The pictorial information and/or movement locus information are exported after being adapted to the voice messaging.
Second aspect, the embodiment of the present invention additionally provide a kind of text conversion device, and the device includes:
Text image acquiring unit, for obtaining target text image;
Word message recognition unit, obtains corresponding word letter for Text region is carried out to the target text image Breath;
Picture track acquiring unit, for carrying out participle to the Word message, and determines the part of speech of participle, according to described Part of speech obtains the pictorial information and/or movement locus information of correspondence participle from picture database;
Voice messaging acquiring unit, for the type according to the sentence in the Word message, obtains from speech database Take the voice messaging corresponding to the Word message;
Picture voice-output unit, for the pictorial information and/or movement locus information are existed with the voice messaging Exported after being adapted to.
Technical scheme proposed by the present invention can be realized for the article of word class being converted into corresponding voice and image, increase The display format of information.
Description of the drawings
Fig. 1 is a kind of schematic flow sheet of text conversion method that the embodiment of the present invention one is provided;
Fig. 2 is a kind of structural representation of text conversion device that the embodiment of the present invention two is provided.
Specific embodiment
The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just Part related to the present invention rather than entire infrastructure are illustrate only in description, accompanying drawing.
Embodiment one
Fig. 1 is a kind of schematic flow sheet of text conversion method that the embodiment of the present invention one is provided, and the method can be by text This conversion equipment performing, described device can be only fitted to learning machine, smart mobile phone, panel computer, personal digital assistant or During other any one have the electronic equipment of processor, memory and display, realized by software and/or hardware.Referring to Fig. 1, text conversion method specifically include following steps:
Step 110, acquisition target text image.
In the present embodiment, target text image is the image for including word content, can be that the books to papery enter Row shooting obtains, or directly reads local disk related data and obtains, or acquires from respective server.
Specifically, the process of text conversion device acquisition target text image can be:Shooting is sent to picture collection device Instruction;Receive the target text image that picture collection device shoots;Or can also be:Show text image;Reception acts on text Input instruction on this image;The target text image in text image is determined according to input instruction.Wherein, input instruction is used In it is determined that target text image to be converted in text image.When text conversion device is configured in point reader, input instruction Can be that user uses the specified scope on shown text image of talking pen.
Step 120, Text region is carried out to the target text image obtain corresponding Word message.
Text conversion device is carried out to the image using the Text region algorithm of setting after target text image is got Identification, produces corresponding Word message.Wherein, Text region algorithm can be:Line tilt correction, two-value are entered to the image first Change, denoising, the cutting of monocase region etc. are pre-processed;Then, extract the feature of each character after cutting;Further, adopt and set Fixed matching algorithm, each character feature for being extracted and locally stored template characteristic is compared, is recognized accordingly As a result(That is Word message).
Certainly those skilled in the art should be understood that Text region algorithm can also be the algorithm of other forms, for example, After pre-processing to target text image, using this Text region algorithm of artificial neural network, directly by target text In image, the lattice information of each character zone is sent into network model and carries out learning training, so as to identify in target text image Word content, obtain corresponding Word message.
Step 130, participle is carried out to the Word message, and determine the part of speech of participle, according to the part of speech from picture number According to pictorial information and/or movement locus information that correspondence participle is obtained in storehouse.
In a specific embodiment of the present embodiment, text conversion device is to obtain target text image corresponding After Word message, participle can be carried out according to default participle technique to the Word message first;Then, further determine that each The part of speech of participle, obtains the noun and/or the verb corresponding to the noun included in Word message;Further, from the picture The movement locus information corresponding to the pictorial information and/or the verb corresponding to the noun is searched and is obtained in database. Wherein, the participle technique is included but is not limited to:By word traversal, the segmenting method matched based on dictionary dictionary or knowledge based The segmenting method of understanding.
Step 140, according to the type of the sentence in the Word message, obtain from speech database corresponding to the text The voice messaging of word information.
In a specific embodiment of the present embodiment, after the corresponding Word message of target text image is obtained, Text conversion device can recognize the type of each sentence in Word message according to default punctuation mark collection and/or crucial verb collection, Wherein described type includes declarative sentence and dialogue sentence;Further, from speech database search and obtain and the statement for being identified Voice messaging and its voice messaging matched with the dialogue sentence for being identified that sentence matches.For example, default punctuation mark Concentration includes colon double quotation marks, and crucial verb is concentrated including " saying ", " crying out " or " asking " etc..Text conversion device is being searched sentence by sentence When including colon double quotation marks in Word message, may recognize that the sentence between this group of punctuation mark is dialogue sentence;By text The sentence being not included in word information between colon double quotation marks is identified as declarative sentence.Text conversion device also can found sentence by sentence When including certain verb that crucial verb is concentrated in Word message, a corresponding behind sentence is judged to talk with sentence. Certainly, text conversion device can also recognize each sentence in Word message in combination with punctuation mark collection and crucial verb collection Type.
In this example, if certain sentence in identification text message is declarative sentence, search simultaneously from speech database The voice messaging matched with the declarative sentence is obtained, for example, the voice signal of the sentence is read aloud by background sound;If identification text The voice messaging matched with the dialogue sentence is searched and obtained to certain sentence in information from speech database to talk with during sentence, The voice signal for for example pronouncing by the sound of correspondence personage.
Step 150, the pictorial information and/or movement locus information are carried out after being adapted to the voice messaging Output.
In the present embodiment, text conversion device can be set up and perform according to Word message resulting during execution step 120 During step 130 between resulting pictorial information and/or resulting voice messaging when movement locus information and execution step 140 Fitting relation;It is according to this fitting relation, while showing to the pictorial information and/or movement locus information, right The voice messaging synchronizes broadcasting.
The technical scheme that the present embodiment is proposed can be realized for the article of word class being converted into corresponding voice and image, increase The display format of information is added.
Embodiment two
Fig. 2 is a kind of structural representation of text conversion device that the embodiment of the present invention two is provided.Referring to Fig. 2, the device Concrete structure it is as follows:
Text image acquiring unit 210, for obtaining target text image;
Word message recognition unit 220, obtains corresponding word for carrying out Text region to the target text image Information;
Picture track acquiring unit 230, for carrying out participle to the Word message, and determines the part of speech of participle, according to The part of speech obtains the pictorial information and/or movement locus information of correspondence participle from picture database;
Voice messaging acquiring unit 240, for the type according to the sentence in the Word message, from speech database Obtain the voice messaging corresponding to the Word message;
Picture voice-output unit 250, for by the pictorial information and/or movement locus information and the voice messaging Exported after being adapted to.
Further, text image acquiring unit 210, specifically for:
Shooting instruction is sent to picture collection device;Receive the target text image that the picture collection device shoots;Or
Show text image;Reception acts on the input instruction on the text image;Determined according to the input instruction Target text image in the text image.
Further, picture track acquiring unit 230, specifically for:
Participle is carried out according to default participle technique to the Word message;
Determine the part of speech of each participle, obtain the noun included in the Word message and/or moving corresponding to the noun Word;
Pictorial information and/or the verb institute searched from the picture database and obtain corresponding to the noun is right The movement locus information answered.
Further, the voice messaging acquiring unit 240, specifically for:
According to default punctuation mark collection and/or crucial verb collection, the type of each sentence in the Word message is recognized, its Described in type include declarative sentence and dialogue sentence;
Search from speech database and obtain the voice messaging that matches with the declarative sentence for being identified and its with known The voice messaging that the dialogue sentence not gone out matches.
Further, picture voice-output unit 250, specifically for:
According to the Word message, set up between the pictorial information and/or movement locus information and the voice messaging Fitting relation;
According to the fitting relation, while showing to the pictorial information and/or movement locus information, to institute State voice messaging and synchronize broadcasting.
The said goods can perform the method provided by any embodiment of the present invention, possess the corresponding functional module of execution method And beneficial effect.
Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art various obvious changes, Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also More other Equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.

Claims (10)

1. a kind of text conversion method, it is characterised in that include:
Obtain target text image;
Text region is carried out to the target text image and obtains corresponding Word message;
Carry out participle to the Word message, and determine the part of speech of participle, obtain right from picture database according to the part of speech Answer the pictorial information and/or movement locus information of participle;
According to the type of the sentence in the Word message, the voice corresponding to the Word message is obtained from speech database Information;
The pictorial information and/or movement locus information are exported after being adapted to the voice messaging.
2. text conversion method according to claim 1, it is characterised in that obtain target text image, including:
Shooting instruction is sent to picture collection device;Receive the target text image that the picture collection device shoots;Or
Show text image;Reception acts on the input instruction on the text image;According to the input instruction determines Target text image in text image.
3. text conversion method according to claim 1, it is characterised in that carry out participle to the Word message, and really Determine the part of speech of participle, the pictorial information and/or movement locus letter of correspondence participle are obtained from picture database according to the part of speech Breath, including:
Participle is carried out according to default participle technique to the Word message;
Determine the part of speech of each participle, obtain noun and/or the verb corresponding to the noun included in the Word message;
Search from the picture database and obtain corresponding to the pictorial information and/or the verb corresponding to the noun Movement locus information.
4. text conversion method according to claim 1, it is characterised in that according to the class of the sentence in the Word message Type, obtains the voice messaging corresponding to the Word message from speech database, including:
According to default punctuation mark collection and/or crucial verb collection, the type of each sentence in the Word message, wherein institute are recognized Stating type includes declarative sentence and dialogue sentence;
Search from speech database and obtain the voice messaging that matches with the declarative sentence for being identified and its with identified The dialogue voice messaging that matches of sentence.
5. text conversion method according to claim 1, it is characterised in that by the pictorial information and/or movement locus Information is exported after being adapted to the voice messaging, including:
According to the Word message, that what is set up between the pictorial information and/or movement locus information and the voice messaging is suitable With relation;
According to the fitting relation, while showing to the pictorial information and/or movement locus information, to institute's predicate Message breath synchronizes broadcasting.
6. a kind of text conversion device, it is characterised in that include:
Text image acquiring unit, for obtaining target text image;
Word message recognition unit, obtains corresponding Word message for carrying out Text region to the target text image;
Picture track acquiring unit, for carrying out participle to the Word message, and determines the part of speech of participle, according to the part of speech The pictorial information and/or movement locus information of correspondence participle are obtained from picture database;
Voice messaging acquiring unit, for the type according to the sentence in the Word message, obtains right from speech database The voice messaging of Word message described in Ying Yu;
Picture voice-output unit, for the pictorial information and/or movement locus information are being carried out with the voice messaging Exported after adaptation.
7. text conversion device according to claim 6, it is characterised in that the text image acquiring unit, it is concrete to use In:
Shooting instruction is sent to picture collection device;Receive the target text image that the picture collection device shoots;Or
Show text image;Reception acts on the input instruction on the text image;According to the input instruction determines Target text image in text image.
8. text conversion device according to claim 6, it is characterised in that picture track acquiring unit, it is concrete to use In:
Participle is carried out according to default participle technique to the Word message;
Determine the part of speech of each participle, obtain noun and/or the verb corresponding to the noun included in the Word message;
Search from the picture database and obtain corresponding to the pictorial information and/or the verb corresponding to the noun Movement locus information.
9. text conversion method according to claim 6, it is characterised in that the voice messaging acquiring unit, it is concrete to use In:
According to default punctuation mark collection and/or crucial verb collection, the type of each sentence in the Word message, wherein institute are recognized Stating type includes declarative sentence and dialogue sentence;
Search from speech database and obtain the voice messaging that matches with the declarative sentence for being identified and its with identified The dialogue voice messaging that matches of sentence.
10. text conversion device according to claim 6, it is characterised in that picture voice-output unit, specifically for:
According to the Word message, that what is set up between the pictorial information and/or movement locus information and the voice messaging is suitable With relation;
According to the fitting relation, while showing to the pictorial information and/or movement locus information, to institute's predicate Message breath synchronizes broadcasting.
CN201410105981.6A 2014-03-20 2014-03-20 A kind of text conversion method and device Active CN103838866B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201410105981.6A CN103838866B (en) 2014-03-20 2014-03-20 A kind of text conversion method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201410105981.6A CN103838866B (en) 2014-03-20 2014-03-20 A kind of text conversion method and device

Publications (2)

Publication Number Publication Date
CN103838866A CN103838866A (en) 2014-06-04
CN103838866B true CN103838866B (en) 2017-04-05

Family

ID=50802362

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201410105981.6A Active CN103838866B (en) 2014-03-20 2014-03-20 A kind of text conversion method and device

Country Status (1)

Country Link
CN (1) CN103838866B (en)

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104536570A (en) * 2014-12-29 2015-04-22 广东小天才科技有限公司 Information processing method and device of intelligent watch
CN104599670B (en) * 2015-01-30 2017-12-26 泰顺县福田园艺玩具厂 The audio recognition method of talking pen
CN105516457A (en) * 2015-11-24 2016-04-20 小米科技有限责任公司 Communication message processing method and apparatus
CN106528742A (en) * 2016-11-04 2017-03-22 广东小天才科技有限公司 Information query method and device
CN106855854A (en) * 2016-12-29 2017-06-16 北京奇虎科技有限公司 A kind of recognition methods of english information and device
CN106911959B (en) * 2017-02-06 2020-01-14 深圳创维数字技术有限公司 Voice picture reading method and system based on smart television
CN107291676B (en) * 2017-06-20 2021-11-19 广东小天才科技有限公司 Method for cutting off voice file, terminal equipment and computer storage medium
CN107748744B (en) * 2017-10-31 2021-01-26 广东小天才科技有限公司 Method and device for establishing drawing box knowledge base
CN107885827B (en) * 2017-11-07 2021-06-01 Oppo广东移动通信有限公司 File acquisition method and device, storage medium and electronic equipment
CN107948405A (en) * 2017-11-13 2018-04-20 百度在线网络技术(北京)有限公司 A kind of information processing method, device and terminal device
CN108108412A (en) * 2017-12-12 2018-06-01 山东师范大学 Children cognition study interactive system and method based on AI open platforms
CN108470067B (en) * 2018-03-28 2019-03-01 掌阅科技股份有限公司 E-book shows the conversion method of form, calculates equipment and computer storage medium
CN109766826A (en) * 2019-01-08 2019-05-17 广东小天才科技有限公司 A kind of method and system of automatic identification job information
CN111866609B (en) * 2019-04-08 2022-12-13 百度(美国)有限责任公司 Method and apparatus for generating video
CN111968424A (en) * 2020-08-27 2020-11-20 北京大米科技有限公司 Interactive learning method, device, system and computer storage medium

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09152883A (en) * 1995-11-29 1997-06-10 Ricoh Co Ltd Accent phrase division position detecting method and text /voice converter
CN201213041Y (en) * 2008-06-30 2009-03-25 东莞市步步高教育电子产品有限公司 Optical click-to-read machine
CN101477699A (en) * 2008-01-04 2009-07-08 白涛 Basic programming method for converting literal sentences into corresponding animation cartoons
CN102479240A (en) * 2010-11-30 2012-05-30 英业达股份有限公司 System and method for providing example sentences according to input types
CN103035134A (en) * 2012-12-10 2013-04-10 张肃 Image touch and talk playing system and mage touch and talk playing method
CN103208207A (en) * 2013-04-12 2013-07-17 北京天奇健教育科技有限公司 Portable learning machine and using method thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09152883A (en) * 1995-11-29 1997-06-10 Ricoh Co Ltd Accent phrase division position detecting method and text /voice converter
CN101477699A (en) * 2008-01-04 2009-07-08 白涛 Basic programming method for converting literal sentences into corresponding animation cartoons
CN201213041Y (en) * 2008-06-30 2009-03-25 东莞市步步高教育电子产品有限公司 Optical click-to-read machine
CN102479240A (en) * 2010-11-30 2012-05-30 英业达股份有限公司 System and method for providing example sentences according to input types
CN103035134A (en) * 2012-12-10 2013-04-10 张肃 Image touch and talk playing system and mage touch and talk playing method
CN103208207A (en) * 2013-04-12 2013-07-17 北京天奇健教育科技有限公司 Portable learning machine and using method thereof

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
《盲人阅读机中图像字符识别方法的研究》;刘云曼等;《天津市生物医学工程学会学术年会2013年学术年会论文摘要》;20131231;第61页 *
捷通语音设计开发智能电子书;孙騕等;《中外玩具制造》;20051031(第10期);第70页 *

Also Published As

Publication number Publication date
CN103838866A (en) 2014-06-04

Similar Documents

Publication Publication Date Title
CN103838866B (en) A kind of text conversion method and device
CN106575500B (en) Method and apparatus for synthesizing speech based on facial structure
CN103761892B (en) A kind of method of speech play paper book content and device
US20190205708A1 (en) Method and apparatus for processing information
CN103955454B (en) A kind of method and apparatus that style conversion is carried out between writings in the vernacular and the writing in classical Chinese
CN113870635A (en) Voice answering method and device
CN108470188B (en) Interaction method based on image analysis and electronic equipment
Jaech et al. Phonological pun-derstanding
KR102043419B1 (en) Speech recognition based training system and method for child language learning
CN111524045A (en) Dictation method and device
Hoque et al. Automated Bangla sign language translation system: Prospects, limitations and applications
KR102222035B1 (en) Method, terminal and program of language education for infants
CN111079489B (en) Content identification method and electronic equipment
CN109473007B (en) English natural spelling teaching method and system combining phonemes with sound side
US20190304454A1 (en) Information providing device, information providing method, and recording medium
Selvaraj et al. Enhanced portable text to speech converter for visually impaired
KR102129089B1 (en) Method, terminal and program of language education for infants
Kapitanov et al. Slovo: Russian Sign Language Dataset
Bin Munir et al. A machine learning based sign language interpretation system for communication with deaf-mute people
CN111832412A (en) Sound production training correction method and system
KR102645783B1 (en) System for providing korean education service for foreigner
KR20210022288A (en) Method for providing english education service using step-by-step expanding sentence structure unit
Alam et al. A Machine Learning Based Sign Language Interpretation System for Communication with Deaf-mute People
JP6538399B2 (en) Voice processing apparatus, voice processing method and program
Bhatt et al. Reading Assistant: a reciter in your pocket

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant