CN109509464A - It is a kind of text to be read aloud the method and device for being recorded as audio - Google Patents

It is a kind of text to be read aloud the method and device for being recorded as audio Download PDF

Info

Publication number
CN109509464A
CN109509464A CN201710813854.5A CN201710813854A CN109509464A CN 109509464 A CN109509464 A CN 109509464A CN 201710813854 A CN201710813854 A CN 201710813854A CN 109509464 A CN109509464 A CN 109509464A
Authority
CN
China
Prior art keywords
text
content
converted
read
audio
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710813854.5A
Other languages
Chinese (zh)
Other versions
CN109509464B (en
Inventor
胡娟
黄兰花
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Jinshan Mobile Technology Co Ltd
Original Assignee
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Jinshan Mobile Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Office Software Inc, Zhuhai Kingsoft Office Software Co Ltd, Guangzhou Jinshan Mobile Technology Co Ltd filed Critical Beijing Kingsoft Office Software Inc
Priority to CN201710813854.5A priority Critical patent/CN109509464B/en
Publication of CN109509464A publication Critical patent/CN109509464A/en
Application granted granted Critical
Publication of CN109509464B publication Critical patent/CN109509464B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The embodiment of the invention provides a kind of text are read aloud to be recorded as the method and device of audio, wherein text is read aloud the method for being recorded as audio includes: to obtain content of text to be read;From reading the corresponding voice data of each character in content of text to be read in speech database, and successively read aloud voice data;It receives and according to audio conversion instruction, the determining origin identification that content of text to be converted in content of text to be read is converted to audio file;From origin identification, content of text to be converted is converted into audio file, and save audio file.Reading aloud simultaneously for content of text can be being listened to by this programme, content of text is recorded as audio file automatically, improve the interactive experience of operating efficiency and user.

Description

It is a kind of text to be read aloud the method and device for being recorded as audio
Technical field
The present invention relates to human-computer interaction technique field, more particularly to it is a kind of text read aloud be recorded as audio method and Device.
Background technique
With the rapid development of computer and mobile phone, people are increasingly dependent on computer and mobile phone, and reading habit is also gradually Gradually by paper reading be converted to e-text read, still, for example, drive, cook or hand in propose the scenes such as thing under, It is not easy to read content of text.Therefore, for the scene for being not easy to reading content of text, many class softwares of reading are provided at present The function of reading aloud, that is, content of text is converted into voice, then bright read out.User can listen to text without reading Content, it might even be possible to listen to and read aloud while doing other things, this is not easy in the scene for reading text in user, gives user It obtains text information and provides very big convenience.
But the existing reading class software that function of reading aloud can be provided, often only with the function of reading aloud of text, if User want with audio file by content of text be shared with other people or by content of text save as audio file so as to it is subsequent repeatedly It listens to, user needs to imported into text in audio switching software, and content of text is converted into audio by audio switching software File saves;Therefore, user is listening to when reading aloud of content of text by reading class software, only again by audio switching software Content of text could be converted to audio file preservation, such realization process is complicated, the operating efficiency of user is very low, thus shadow Ring the interactive experience of user.
Summary of the invention
The embodiment of the present invention is designed to provide a kind of method and device that text is read aloud and is recorded as audio, to realize Reading aloud simultaneously for content of text is being listened to, content of text audio file can be recorded as automatically, improve operating efficiency and user Interactive experience.Specific technical solution is as follows:
In a first aspect, the embodiment of the invention provides a kind of method that text is read aloud and is recorded as audio, the method packet It includes:
Obtain content of text to be read;
From reading the corresponding voice data of each character in the content of text to be read in speech database, and successively Read aloud the voice data;
It receives and according to audio conversion instruction, determines the content of text to be converted in the content of text to be read Be converted to the origin identification of audio file;
From the origin identification, the content of text to be converted is converted into audio file, and save the audio File.
It is optionally, described to obtain content of text to be read, comprising:
Obtain the reading instruction of text source and user's input, wherein the reading instruction includes: that the reading text source is complete The instruction of text or the instruction for reading partial content in the text source;
Full text or partial content are read from the text source, as content of text to be read.
Optionally, described from reading the corresponding voice of each character in the content of text to be read in speech database Data, and before successively reading aloud the voice data, the method also includes:
Obtain user's input reads aloud enabled instruction.
Optionally, the audio conversion instruction are as follows: the audio conversion instruction of user's input;Alternatively, by described to be read The audio conversion instruction that the paragraph of content of text generates;
The reception and according to audio conversion instruction, determines the text to be converted in the content of text to be read Content Transformation is the origin identification of audio file, comprising:
When receiving the audio conversion instruction of user's input, determine that the content of text to be read is text to be converted This content, and the first character in the content of text to be converted is determined as the origin identification that audio is converted;
Alternatively, determining institute when receiving the audio conversion instruction by the paragraph generation of the content of text to be read Stating each paragraph in content of text to be read is content of text to be converted, and by first in each content of text to be converted A character is determined as the origin identification of audio conversion;
It is described that the content of text to be converted is converted into audio file from the origin identification, and described in saving Audio file, comprising:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Optionally, the audio conversion instruction includes: the text selecting instruction of user's input;
The reception and according to audio conversion instruction, determines the text to be converted in the content of text to be read Content Transformation is the origin identification of audio file, comprising:
It receives and is instructed according to the text selecting of user's input, read the text from the content of text to be read The corresponding content of text to be read aloud of selection instruction;
The first character for determining the content of text to be converted in the content of text to be read aloud is origin identification;
It is described that the content of text to be converted is converted into audio file from the origin identification, and described in saving Audio file, comprising:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Optionally, described from the origin identification, the content of text to be converted is converted into audio file, and protect Before depositing the audio file, the method also includes:
It obtains text and records strategy;
It is described that the content of text to be converted is converted into audio file from the origin identification, and described in saving Audio file, comprising:
From the origin identification, strategy is recorded according to the text, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Second aspect, the embodiment of the invention also provides a kind of text are read aloud to be recorded as the device of audio, described device Include:
Module is obtained, for obtaining content of text to be read;
Bright read through model, for from reading the corresponding language of each character in the content of text to be read in speech database Sound data, and successively read aloud the voice data;
Determining module, for receiving and according to audio conversion instruction, determine by the content of text to be read to The content of text of conversion is converted to the origin identification of audio file;
Preserving module, for from the origin identification, the content of text to be converted to be converted to audio file, and Save the audio file.
Optionally, described device further include:
Read aloud enabled instruction obtain module, for obtain user input read aloud enabled instruction.
The third aspect, the embodiment of the invention provides a kind of electronic equipment, including processor, communication interface, memory and Communication bus, wherein the processor, the communication interface, the memory are completed each other by the communication bus Communication;
The memory, for storing computer program;
The processor realizes the step of method described in first aspect for executing the program stored on the memory Suddenly.
Fourth aspect, the embodiment of the invention provides a kind of computer readable storage medium, the computer-readable storage Dielectric memory contains computer program, and the step of method described in first aspect is realized when the computer program is executed by processor Suddenly.
It is provided in an embodiment of the present invention text to be read aloud the method and device for being recorded as audio, by from speech database The corresponding voice data of each character in content of text to be read is read, and successively reads aloud voice data, and according to reception Audio conversion instruction, from audio convert origin identification, by the content of text to be converted in content of text to be read Audio file is converted to, and saves audio file, while reading aloud content of text to be read, automatically by text to be converted Content Transformation is audio file, improves the operating efficiency read aloud content of text and save corresponding audio file, and simplify The operation of user, to improve the interactive experience of user.
Detailed description of the invention
In order to more clearly explain the embodiment of the invention or the technical proposal in the existing technology, to embodiment or will show below There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this Some embodiments of invention for those of ordinary skill in the art without creative efforts, can be with It obtains other drawings based on these drawings.
Fig. 1 is a kind of flow diagram for text being read aloud the method for being recorded as audio of the embodiment of the present invention;
Fig. 2 is another flow diagram for text being read aloud the method for being recorded as audio of the embodiment of the present invention;
Fig. 3 is the interface schematic diagram of the reading class software of the embodiment of the present invention;
Fig. 4 is a kind of flow diagram for text being read aloud the device for being recorded as audio of the embodiment of the present invention;
Fig. 5 is another flow diagram for text being read aloud the device for being recorded as audio of the embodiment of the present invention;
Fig. 6 is the structural schematic diagram of the electronic equipment of the embodiment of the present invention.
Specific embodiment
Following will be combined with the drawings in the embodiments of the present invention, and technical solution in the embodiment of the present invention carries out clear, complete Site preparation description, it is clear that described embodiments are only a part of the embodiments of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, it is obtained by those of ordinary skill in the art without making creative efforts every other Embodiment shall fall within the protection scope of the present invention.
In order to listen to reading aloud simultaneously for content of text, content of text is recorded as audio file automatically, improves operation effect The interactive experience of rate and user, the embodiment of the invention provides a kind of text are read aloud to be recorded as the method and device of audio.
A kind of method for being recorded as audio of reading aloud text is provided for the embodiments of the invention first below to be introduced.
It should be noted that a kind of provided by the embodiment of the present invention read aloud text the execution for being recorded as the method for audio Main body can be the equipment such as computer, tablet computer, mobile phone or E-book reader, and specific method flow, which can be, to be passed through Reading class application program in above equipment is realized.
As shown in Figure 1, a kind of provided by the embodiment of the present invention read aloud text the method for being recorded as audio, may include Following steps:
S101 obtains content of text to be read.
Wherein, content of text to be read be user from the local novel for importing or being obtained from network, prose, poem, Content in the text sources such as news, under normal circumstances, be user it is interested or wish read text source in content.
It should be noted that the format of content of text can be the text of word processing format, PDF (Portable Document Format, Portable document format) text, the text of table format or format slide text, it is right here The format of content of text is not construed as limiting, and the text file of arbitrary format belongs to the protection scope of the embodiment of the present invention.User from It is local to import or from the text source that network obtains, it is complete text source under normal circumstances, such as a novel, a series of new Hear etc., and user may be only interested in a certain partial content therein, therefore, content of text can be the institute of entire text source There is content, a part of content being also possible in text source.
Optionally, described the step of obtaining content of text to be read, may include:
The first step obtains the reading instruction of text source and user's input.
Wherein, reading instruction may include: partial content in the instruction or reading text source for read text source full text Instruction.It should be noted that user after getting the text source such as novel, news, needs according to oneself concern or sense Therefore the content of interest, the partial content that selection is read text source full text or read in text source according to the user's choice can It generates the instruction for reading text source full text or reads the instruction of partial content in text source.
Second step reads full text or partial content, as content of text to be read from text source.
It should be noted that in instruction of the reading instruction got for reading text source full text, text to be read Content is the full text of text source;It is to be read when the reading instruction got is reads the instruction of partial content in text source Content of text is the partial content in text source.It is emphasized that reading the instruction of partial content in text source should include: The bebinning character of the contents of the section and the length of content.
It is understood that such as users from networks obtains a series of news, 15 are contained in this series of news News, and user is only interested in 5 news in 15 news from discovery in the title of news, then user can input and read The instruction of 5 news is read, 5 news will only be read aloud by reading class application software, helped user filtering to fall user and do not felt emerging The content of interest, provide the user good usage experience.
S102, from reading the corresponding voice data of each character in content of text to be read in speech database, and according to It is secondary to read aloud voice data.
It should be noted that in equipment local or network, it is to be read getting there are a speech database After content of text, reading class application software can be corresponding from each character in content of text to be read is read in speech database Voice data, then read aloud each voice data.It is emphasized that under normal circumstances, in order to reduce data storage Process successively reads corresponding voice data according to the sequence of character in content of text from speech database, and successively bright It reads, it is, reading the corresponding voice data of a character, then reads aloud the voice data.And reading aloud voice data can be with Support multilingual, user can choose different language and read aloud, for example, Chinese, English, French, German etc..
It is emphasized that starting to read aloud to content of text to be read, it can be and opened when user imports content of text Beginning reads aloud, and is also possible to user's input one and reads aloud enabled instruction, starts to read aloud when receiving this and reading aloud enabled instruction, this is all It is reasonable.
S103, receives and according to audio conversion instruction, and determination will be in the text to be converted in content of text to be read Hold the origin identification for being converted to audio file.
Wherein, audio conversion instruction is that content of text is converted to the enabled instruction of audio file, the audio conversion instruction It can be what user inputted as needed, for example, user, listening to during reading aloud of content of text, discovery text content is very It is valuable, it is desirable to text content be preserved as audio file and be shared with other people, therefore, user can input one Audio conversion instruction;Alternatively, the audio conversion instruction is also in order to reduce the data volume for the audio file for saving text full content Can be by content of text to be read paragraph generation.Certainly, audio conversion instruction is also possible to reading aloud content of text At the beginning of default generate, audio is converted reads aloud synchronous progress with content of text;Audio conversion instruction can also be At the end of reading aloud content of text, read aloud what end of identification generated according to content of text.It is emphasized that any audio conversion The mode of instruction belongs to the protection scope of the present embodiment.
It should be noted that needing after receiving audio conversion instruction to be converted in content of text to be read Content of text carry out audio conversion, content of text to be converted can be the full content in content of text to be read, It can be the partial content in content of text to be read.Audio conversion needs to be converted since an origin identification, should Origin identification can be the first character of content of text full text, be also possible to the first character of each paragraph in content of text Symbol can also be the first character of the partial content of user's selection, all be reasonable.
Optionally, audio conversion instruction can be with are as follows: the audio conversion instruction of user's input;Alternatively, pressing text to be read The audio conversion instruction that the paragraph of content generates;Alternatively, the text selecting instruction of user's input.
It should be noted that the audio conversion instruction of user's input can be and input during content of text is read aloud , it is also possible to what input and bright reading instruction before content of text is read aloud were multiplexed, is not construed as limiting here.It is emphasized that When audio conversion instruction and bright reading instruction are multiplexed, audio conversion and content of text are read aloud and are started simultaneously at.User can be according to listening to To read aloud content, then determine whether to content of text being converted to audio file, meet the needs of users.Audio conversion instruction Can also be by content of text to be read paragraph generation, according to paragraph carry out audio conversion can reduce conversion all text When this content, loss of data risk caused by data volume is excessive.Audio conversion quality can also be the text selecting of user's input Instruction, that is, user select from content of text to be read according to interested degree in content of text to be read Certain selection instruction of a part of content of text as content of text to be converted.
Optionally, the reception and according to audio conversion instruction, determination will be to be converted in content of text to be read Content of text is converted to the step of origin identification of audio file, may include:
When receiving the audio conversion instruction of user's input, determine that content of text to be read is in text to be converted Hold, and the first character in content of text to be converted is determined as to the origin identification of audio conversion;
Alternatively,
When receiving the audio conversion instruction by the paragraph generation of content of text to be read, text to be read is determined Each paragraph in content is content of text to be converted, and the first character in each content of text to be converted is determined as sound The origin identification of frequency conversion;
Alternatively,
It receives and is instructed according to the text selecting of user's input, text selecting instruction is read from content of text to be read Corresponding content of text to be read aloud;The first character for determining the content of text to be converted in content of text to be read aloud is Origin identification.
It should be noted that, first according to the difference of audio conversion instruction, being determined wait read after receiving audio conversion instruction Then content of text to be converted in the content of text of reading starts to carry out audio conversion to content of text to be converted, need Determine that the origin identification of audio conversion selects the first character in content of text to be converted for audio conversion under normal circumstances The origin identification changed;Certainly, origin identification may not be first character, still, if origin identification is in content of text Any one character, may result in play save audio file when, content there is interrupted situation, influence play effect Fruit.Therefore, the present embodiment, the origin identification for selecting the first character of content of text to be converted to convert as audio guarantee The continuity for the audio file converted, the effect played after being conducive to.Also, user can treat and read according to demand The content of text of reading is selected, and listens to reading aloud and recording for partial content, user can specify to be read aloud in content of text The length of the bebinning character of content of text and content of text to be converted, for example, the of the specified content of text to be read aloud of user One character is the 5th section of first character in content of text to be read, and the length of content of text to be read aloud is 30 words Symbol.It is possible to be read aloud since the first character of content of text to be read aloud;Content of text to be converted can To be the full content of content of text to be read aloud, the partial content being also possible in content of text to be read aloud, to be converted The first character of content of text start, whole content of text to be converted is converted into audio file, and save.According to The specified of the content of text read aloud is treated at family, is more able to satisfy the demand of user.
It is emphasized that the content of text to be read aloud that user specifies can be used as audiovisual content, read aloud listening to It in the process, then can will be in text to be read if the user thinks that having the value saved as audio file in the text The first character of appearance saves as audio file as origin identification, by the full text of content of text to be read.
It is understood that the text selecting instruction of user's input, can be user by lighting, selecting in content of text Partial content, be also possible to user and selected by inputting the specific condition of reading aloud, for example, the content of user's input are as follows: The tenth character of first segment, the first row is to the 5th second segment, the third line character, then the content of audio conversion is to read aloud item The corresponding content of part.It is emphasized that the method that any user selects content of text belongs to the guarantor of the embodiment of the present invention Range is protected, is no longer repeated one by one here.
Content of text to be converted is converted to audio file, and save audio file from origin identification by S104.
It should be noted that origin identification is to start the starting point that content of text audio is converted therefore to open from origin identification Begin to carry out content of text to be converted audio conversion, and the audio file for completing conversion is saved, so that user is anti- The audio file of preservation is shared with other people by the corresponding content of text of audio file or user for listening to preservation again, is convenient for Other people listen to audio file, without obtaining original text source.
It is emphasized that the format of audio file can be the audio file of arbitrary format when audio conversion, for example, CD format, WAV format, WMA format, AAC format, APE format etc..Also, the audio file converted can be multilingual, example Such as, Chinese, English, French, German etc..
Optionally, described from origin identification, content of text to be converted is converted into audio file, and save audio text The step of part may include:
Since the first character of content of text to be converted, content of text to be converted is converted into audio file, And save the audio file.
It should be noted that content of text to be converted is turned since the first character of content of text to be converted It is changed to audio file.If content of text to be converted is the full text of content of text to be read, corresponding audio text is saved Part, if file occur lose or damage when, the audio file that may cause entire content of text is impacted, and if to The content of text converted is the partial content in content of text to be read, such as each section of content of text, then saves corresponding Audio file, if the content of text of certain a part does not influence other parts there is a situation where losing or damaging.
Content of text to be converted is converted, the process of the audio file after saving conversion can be understood as text Recording process, there are different demands for process that text is recorded by different users, for example, there is part not need to record in text It makes, the sequence of recording is had different needs, needing to repeat to record certain contents etc., in order to cope with these situations, described From origin identification, content of text to be converted is converted into audio file, and before the step of saving audio file, it can be with Include:
It obtains text and records strategy.
It is then described from origin identification, content of text to be converted is converted into audio file, and save audio file Step may include:
From origin identification, strategy is recorded according to text, content of text to be converted is converted into audio file, and save Audio file.
Wherein, text, which records strategy, can be what user configured according to demand, be also possible to the executing subject of the present embodiment It is generated according to the analysis to historical data.For example, it is interested in sports news to obtain user by historical data, then preferential record The corresponding content of text of sports news processed, saves corresponding audio file.Specifically, text records strategy may include but not only It is limited to: preferentially records certain content of text, do not record certain content of text and repeat to record certain content of text.
Using the present embodiment, by from reading the corresponding language of each character in content of text to be read in speech database Sound data, and voice data is successively read aloud, and audio conversion instruction based on the received, from the origin identification that audio is converted, Content of text to be converted in content of text to be read is converted into audio file, and saves audio file, read aloud to While the content of text of reading, content of text to be converted is converted into audio file automatically, improves and reads aloud text to be read This content and the operating efficiency for saving corresponding audio file, and the operation of user is simplified, to improve the interaction body of user It tests.
As shown in Fig. 2, a kind of provided by the present embodiment read aloud text the method for being recorded as audio, it is real described in Fig. 1 Before the step S102 for applying example, text being read aloud the method for being recorded as audio can also include:
S201, obtain user's input reads aloud enabled instruction.
It should be noted that enabled instruction is read aloud in user's input, start when receiving this and reading aloud enabled instruction to text Content is word for word read aloud.The mode that enabled instruction is read aloud in user's input can be through control input, be also possible to pass through External switch equipment input, it is any can be used as user input read aloud enabled instruction, belong to the guarantor of the embodiment of the present invention Protect range.
It is emphasized that step S101 to S104 is identical as embodiment illustrated in fig. 1, which is not described herein again.
Using the present embodiment, by from reading the corresponding language of each character in content of text to be read in speech database Sound data, and voice data is successively read aloud, and audio conversion instruction based on the received, from the origin identification that audio is converted, Content of text to be converted in content of text to be read is converted into audio file, and saves audio file, read aloud to While the content of text of reading, content of text to be converted is converted into audio file automatically, improves and reads aloud text to be read This content and the operating efficiency for saving corresponding audio file, and the operation of user is simplified, to improve the interaction body of user It tests.Also, enabled instruction is read aloud by acquisition user's input, starts to read aloud content of text, the starting and use read aloud The demand at family is related, improves the experience of user.
Below with reference to specific application example, it is provided for the embodiments of the invention the side that text is read aloud and is recorded as audio Method is introduced.
As shown in figure 3, the interface schematic diagram of the reading class software for the embodiment of the present invention.Include text in software interface 301 This content area 302 reads aloud start button 303 and audio switching button 304, and then news, importing should for selection 5 in users from networks Class software is read, 5 news are shown in content of text region 302, when user clicks and reads aloud start button 303, this is read Read class software since the 1st news, from speech database read news the corresponding voice data of each character, word for word by Item is read aloud;User is in the process listened to, it is believed that news is meaningful, it is desirable to save as audio file and be shared with friend, lead to Cross click audio switching button 304, the reading class software start to the news in content of text region 302 according to paragraph content into The conversion of row audio, that is, each paragraph are converted to an audio file, and save to the audio file after conversion.This Sample, user can find the audio file of preservation from local storage space, share, be sent to friend.
Compared with prior art, in the present solution, reading class software by reading content of text area from speech database The corresponding voice data of each character in domain, and word for word read aloud one by one, and according to the corresponding audio of audio switching button Each paragraph of text in content of text region is converted to audio since the first character of each paragraph by conversion instruction File, and audio file is saved, while reading aloud the text in content of text region, content of text is converted into audio text automatically Part improves the operating efficiency read aloud content of text and save corresponding audio file, and simplifies the operation of user, to mention The interactive experience of high user;And audio file is generated according to paragraph, avoids and is saving the corresponding audio file of full text, if When file occurs to lose or damage, the impacted situation of the audio file of entire content of text may cause, if a certain The content of paragraph does not influence other paragraphs there is a situation where losing or damaging.
Corresponding to above-described embodiment, the embodiment of the invention provides a kind of text are read aloud to be recorded as the device of audio, such as Shown in Fig. 4, text is read aloud and is recorded as the device of audio and may include:
Module 410 is obtained, for obtaining content of text to be read;
Bright read through model 420, for corresponding from each character in the content of text to be read is read in speech database Voice data, and successively read aloud the voice data;
Determining module 430, for receiving and according to audio conversion instruction, determination will be in the content of text to be read Content of text to be converted is converted to the origin identification of audio file;
Preserving module 440, for from the origin identification, the content of text to be converted to be converted to audio text Part, and save the audio file.
Using the present embodiment, by from reading the corresponding language of each character in content of text to be read in speech database Sound data, and voice data is successively read aloud, and audio conversion instruction based on the received, from the origin identification that audio is converted, Content of text to be converted in content of text to be read is converted into audio file, and saves audio file, read aloud to While the content of text of reading, content of text to be converted is converted into audio file automatically, improves and reads aloud text to be read This content and the operating efficiency for saving corresponding audio file, and the operation of user is simplified, to improve the interaction body of user It tests.
Optionally, the acquisition module 410, specifically can be used for:
Obtain the reading instruction of text source and user's input, wherein the reading instruction includes: that the reading text source is complete The instruction of text or the instruction for reading partial content in the text source;
Full text or partial content are read from the text source, as content of text to be read.
Optionally, the audio conversion instruction are as follows: the audio conversion instruction of user's input;Alternatively, by described to be read The audio conversion instruction that the paragraph of content of text generates;
The determining module 430, specifically can be used for:
When receiving the audio conversion instruction of user's input, determine that the content of text to be read is text to be converted This content, and the first character in the content of text to be converted is determined as the origin identification that audio is converted;
Alternatively, determining institute when receiving the audio conversion instruction by the paragraph generation of the content of text to be read Stating each paragraph in content of text to be read is content of text to be converted, and by first in each content of text to be converted A character is determined as the origin identification of audio conversion;
The preserving module 440, specifically can be used for:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Optionally, the audio conversion instruction includes: the text selecting instruction of user's input;
The determining module 430, specifically can be also used for:
It receives and is instructed according to the text selecting of user's input, read the text from the content of text to be read The corresponding content of text to be read aloud of selection instruction;
The first character for determining the content of text to be converted in the content of text to be read aloud is origin identification.
Optionally, described device can also include:
It records strategy and obtains module, record strategy for obtaining text;
The preserving module 440, specifically can be used for:
From the origin identification, strategy is recorded according to the text, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Further, in the base comprising obtaining module 410, bright read through model 420, determining module 430, preserving module 440 On plinth, as shown in figure 5, a kind of provided by the embodiment of the present invention read aloud text in the device for being recorded as audio, can also include:
Read aloud enabled instruction obtain module 510, for obtain user input read aloud enabled instruction.
Using the present embodiment, by from reading the corresponding language of each character in content of text to be read in speech database Sound data, and voice data is successively read aloud, and audio conversion instruction based on the received, from the origin identification that audio is converted, Content of text to be converted in content of text to be read is converted into audio file, and saves audio file, read aloud to While the content of text of reading, content of text to be converted is converted into audio file automatically, improves and reads aloud text to be read This content and the operating efficiency for saving corresponding audio file, and the operation of user is simplified, to improve the interaction body of user It tests.Also, enabled instruction is read aloud by acquisition user's input, starts to read aloud content of text, the starting and use read aloud The demand at family is related, improves the experience of user.
It should be noted that the embodiment of the present invention reads aloud text in the device for being recorded as audio as using above-mentioned text The device for being recorded as the method for audio is read aloud, then the above-mentioned all embodiments for text being read aloud the method for being recorded as audio are applicable in In the device, and it can reach the same or similar beneficial effect.
The embodiment of the invention also provides a kind of electronic equipment, as shown in fig. 6, include processor 610, communication interface 620, Memory 630 and communication bus 640, wherein the processor 610, the communication interface 620, the memory 630 pass through institute It states communication bus 640 and completes mutual communication;
The memory 630, for storing computer program;
The processor 610 promotes to realize following steps for executing the program stored on the memory:
Obtain content of text to be read;
From reading the corresponding voice data of each character in the content of text to be read in speech database, and successively Read aloud the voice data;
It receives and according to audio conversion instruction, determines the content of text to be converted in the content of text to be read Be converted to the origin identification of audio file;
From the origin identification, the content of text to be converted is converted into audio file, and save the audio File.
The processor 610 specifically may be implemented when realizing the step for obtaining content of text to be read:
Obtain the reading instruction of text source and user's input, wherein the reading instruction includes: that the reading text source is complete The instruction of text or the instruction for reading partial content in the text source;
Full text or partial content are read from the text source, as content of text to be read.
The processor 610 realize it is described from reading each word in the content of text to be read in speech database Corresponding voice data is accorded with, and before successively reading aloud the voice data, can also be realized:
Obtain user's input reads aloud enabled instruction.
Optionally, the audio conversion instruction are as follows: the audio conversion instruction of user's input;Alternatively, by described to be read The audio conversion instruction that the paragraph of content of text generates;
The processor 610 is determined in the realization reception and according to audio conversion instruction by the text to be read When the step for the origin identification that the content of text to be converted in content is converted to audio file, specifically it may be implemented:
When receiving the audio conversion instruction of user's input, determine that the content of text to be read is text to be converted This content, and the first character in the content of text to be converted is determined as the origin identification that audio is converted;
Alternatively, determining institute when receiving the audio conversion instruction by the paragraph generation of the content of text to be read Stating each paragraph in content of text to be read is content of text to be converted, and by first in each content of text to be converted A character is determined as the origin identification of audio conversion;
The processor 610 is described from the origin identification in realization, and the content of text to be converted is converted to Audio file, and when saving the step of the audio file, specifically it may be implemented:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Optionally, the audio conversion instruction includes: the text selecting instruction of user's input;
The processor 610 is determined in the realization reception and according to audio conversion instruction by the text to be read When the step for the origin identification that the content of text to be converted in content is converted to audio file, specifically it may be implemented:
It receives and is instructed according to the text selecting of user's input, read the text from the content of text to be read The corresponding content of text to be read aloud of selection instruction;
The first character for determining the content of text to be converted in the content of text to be read aloud is origin identification;
The processor 610 is described from the origin identification in realization, and the content of text to be converted is converted to Audio file, and when saving the step of the audio file, specifically it may be implemented:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
The processor 610 is described from the origin identification in realization, and the content of text to be converted is converted to Audio file, and before the step of saving the audio file, it can also realize:
It obtains text and records strategy;
The processor 610 is described from the origin identification in realization, and the content of text to be converted is converted to Audio file, and when saving the step of the audio file, specifically it may be implemented:
From the origin identification, strategy is recorded according to the text, the content of text to be converted is converted into sound Frequency file, and save the audio file.
In the present embodiment, processor passes through the operation computer by reading the computer program stored in memory Program can be realized: by from reading the corresponding voice data of each character in content of text to be read in speech database, And voice data is successively read aloud, and audio conversion instruction based on the received, it, will be to be read from the origin identification that audio is converted Content of text in content of text to be converted be converted to audio file, and save audio file, reading aloud text to be read While this content, content of text to be converted is converted into audio file automatically, improve read aloud content of text to be read and The operating efficiency of corresponding audio file is saved, and simplifies the operation of user, to improve the interactive experience of user.
Above-mentioned memory may include random access memory (Random Access Memory, RAM), also can wrap Include nonvolatile memory (Non-Volatile Memory, NVM), for example, at least a magnetic disk storage.Optionally, it stores Device can also be that at least one is located remotely from the storage device of aforementioned processor.
Above-mentioned processor can be general processor, including CPU (Central Processing Unit, central processing Device), NP (Network Processor, network processing unit) etc.;Can also be DSP (Digital Signal Processing, Digital signal processor), ASIC (Application Specific Integrated Circuit, specific integrated circuit), FPGA (Field-Programmable Gate Array, field programmable gate array) or other programmable logic device are divided Vertical door or transistor logic, discrete hardware components.
Corresponding to text being read aloud provided by above-described embodiment the method for being recorded as audio, the embodiment of the present invention is also provided A kind of machine readable storage medium, for being stored with machine-executable instruction, when being called and being executed by processor, the machine Device executable instruction promotes the processor to realize following steps:
Obtain content of text to be read;
From reading the corresponding voice data of each character in the content of text to be read in speech database, and successively Read aloud the voice data;
It receives and according to audio conversion instruction, determines the content of text to be converted in the content of text to be read Be converted to the origin identification of audio file;
From the origin identification, the content of text to be converted is converted into audio file, and save the audio File.
The processor specifically may be implemented when realizing the step for obtaining content of text to be read:
Obtain the reading instruction of text source and user's input, wherein the reading instruction includes: that the reading text source is complete The instruction of text or the instruction for reading partial content in the text source;
Full text or partial content are read from the text source, as content of text to be read.
The processor realize it is described from reading each character in the content of text to be read in speech database Corresponding voice data, and before successively reading aloud the voice data, it can also realize:
Obtain user's input reads aloud enabled instruction.
Optionally, the audio conversion instruction are as follows: the audio conversion instruction of user's input;Alternatively, by described to be read The audio conversion instruction that the paragraph of content of text generates;
The processor is determined in the realization reception and according to audio conversion instruction by the content of text to be read In the content of text to be converted origin identification that is converted to audio file step when, specifically may be implemented:
When receiving the audio conversion instruction of user's input, determine that the content of text to be read is text to be converted This content, and the first character in the content of text to be converted is determined as the origin identification that audio is converted;
Alternatively, determining institute when receiving the audio conversion instruction by the paragraph generation of the content of text to be read Stating each paragraph in content of text to be read is content of text to be converted, and by first in each content of text to be converted A character is determined as the origin identification of audio conversion;
The processor is described from the origin identification in realization, and the content of text to be converted is converted to audio File, and when saving the step of the audio file, specifically it may be implemented:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
Optionally, the audio conversion instruction includes: the text selecting instruction of user's input;
The processor is determined in the realization reception and according to audio conversion instruction by the content of text to be read In the content of text to be converted origin identification that is converted to audio file step when, specifically may be implemented:
It receives and is instructed according to the text selecting of user's input, read the text from the content of text to be read The corresponding content of text to be read aloud of selection instruction;
The first character for determining the content of text to be converted in the content of text to be read aloud is origin identification;
The processor is described from the origin identification in realization, and the content of text to be converted is converted to audio File, and when saving the step of the audio file, specifically it may be implemented:
Since the first character of the content of text to be converted, the content of text to be converted is converted into sound Frequency file, and save the audio file.
The processor is described from the origin identification in realization, and the content of text to be converted is converted to audio File, and before the step of saving the audio file, it can also realize:
It obtains text and records strategy;
The processor is described from the origin identification in realization, and the content of text to be converted is converted to audio File, and when saving the step of the audio file, specifically it may be implemented:
From the origin identification, strategy is recorded according to the text, the content of text to be converted is converted into sound Frequency file, and save the audio file.
In the present embodiment, machine readable storage medium, which is stored with, to be executed provided by the embodiment of the present application at runtime text Originally the application program for being recorded as the method for audio is read aloud, therefore can be realized: is to be read by being read from speech database The corresponding voice data of each character in content of text, and voice data is successively read aloud, and audio conversion refers to based on the received It enables, from the origin identification that audio is converted, the content of text to be converted in content of text to be read is converted into audio text Part, and audio file is saved, while reading aloud content of text to be read, content of text to be converted is converted into sound automatically Frequency file improves the operating efficiency read aloud content of text to be read and save corresponding audio file, and simplifies user's Operation, to improve the interactive experience of user.
For electronic equipment and machine readable storage medium embodiment, method content as involved in it is basic It is similar to embodiment of the method above-mentioned, so being described relatively simple, related place is referring to the part explanation of embodiment of the method It can.
It should be noted that, in this document, relational terms such as first and second and the like are used merely to a reality Body or operation are distinguished with another entity or operation, are deposited without necessarily requiring or implying between these entities or operation In any actual relationship or order or sequence.Moreover, the terms "include", "comprise" or its any other variant are intended to Non-exclusive inclusion, so that the process, method, article or equipment including a series of elements is not only wanted including those Element, but also including other elements that are not explicitly listed, or further include for this process, method, article or equipment Intrinsic element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that There is also other identical elements in process, method, article or equipment including the element.
Each embodiment in this specification is all made of relevant mode and describes, same and similar portion between each embodiment Dividing may refer to each other, and each embodiment focuses on the differences from other embodiments.Especially for system reality For applying example, since it is substantially similar to the method embodiment, so being described relatively simple, related place is referring to embodiment of the method Part explanation.
The foregoing is merely illustrative of the preferred embodiments of the present invention, is not intended to limit the scope of the present invention.It is all Any modification, equivalent replacement, improvement and so within the spirit and principles in the present invention, are all contained in protection scope of the present invention It is interior.

Claims (10)

1. a kind of read aloud text the method for being recorded as audio, which is characterized in that the described method includes:
Obtain content of text to be read;
From reading the corresponding voice data of each character in the content of text to be read in speech database, and successively read aloud The voice data;
It receives and according to audio conversion instruction, the determining content of text conversion to be converted by the content of text to be read For the origin identification of audio file;
From the origin identification, the content of text to be converted is converted into audio file, and save the audio file.
2. according to claim 1 read aloud text the method for being recorded as audio, which is characterized in that the acquisition is to be read Content of text, comprising:
Obtain the reading instruction of text source and user's input, wherein the reading instruction includes: to read the text source full text Instruct or read the instruction of partial content in the text source;
Full text or partial content are read from the text source, as content of text to be read.
3. according to claim 1 read aloud text the method for being recorded as audio, which is characterized in that described from voice data The corresponding voice data of each character in the content of text to be read is read in library, and successively read aloud the voice data it Before, the method also includes:
Obtain user's input reads aloud enabled instruction.
4. according to claim 1 read aloud text the method for being recorded as audio, which is characterized in that the audio conversion refers to It enables are as follows: the audio conversion instruction of user's input;Alternatively, the audio conversion generated by the paragraph of the content of text to be read refers to It enables;
The reception and according to audio conversion instruction, determines the content of text to be converted in the content of text to be read Be converted to the origin identification of audio file, comprising:
When receiving the audio conversion instruction of user's input, determine that the content of text to be read is in text to be converted Hold, and the first character in the content of text to be converted is determined as to the origin identification of audio conversion;
Alternatively, when receiving the audio conversion instruction by the paragraph generation of the content of text to be read, determine it is described to Each paragraph in the content of text of reading is content of text to be converted, and by the first character in each content of text to be converted Symbol is determined as the origin identification of audio conversion;
It is described from the origin identification, the content of text to be converted is converted into audio file, and save the audio File, comprising:
Since the first character of the content of text to be converted, the content of text to be converted is converted into audio text Part, and save the audio file.
5. according to claim 1 read aloud text the method for being recorded as audio, which is characterized in that the audio conversion refers to Order includes: the text selecting instruction of user's input;
The reception and according to audio conversion instruction, determines the content of text to be converted in the content of text to be read Be converted to the origin identification of audio file, comprising:
It receives and is instructed according to the text selecting of user's input, read the text selecting from the content of text to be read Instruct corresponding content of text to be read aloud;
The first character for determining the content of text to be converted in the content of text to be read aloud is origin identification;
It is described from the origin identification, the content of text to be converted is converted into audio file, and save the audio File, comprising:
Since the first character of the content of text to be converted, the content of text to be converted is converted into audio text Part, and save the audio file.
6. according to claim 1 read aloud text the method for being recorded as audio, which is characterized in that described from the starting It identifies, the content of text to be converted is converted into audio file, and before saving the audio file, the method is also Include:
It obtains text and records strategy;
It is described from the origin identification, the content of text to be converted is converted into audio file, and save the audio File, comprising:
From the origin identification, strategy is recorded according to the text, the content of text to be converted is converted into audio text Part, and save the audio file.
7. a kind of read aloud text in the device for being recorded as audio, which is characterized in that described device includes:
Module is obtained, for obtaining content of text to be read;
Bright read through model, for from reading the corresponding voice number of each character in the content of text to be read in speech database According to, and successively read aloud the voice data;
Determining module, for receiving and according to audio conversion instruction, determination will be to be converted in the content of text to be read Content of text be converted to the origin identification of audio file;
Preserving module, for the content of text to be converted being converted to audio file, and save from the origin identification The audio file.
8. according to claim 7 read aloud text in the device for being recorded as audio, which is characterized in that described device is also wrapped It includes:
Read aloud enabled instruction obtain module, for obtain user input read aloud enabled instruction.
9. a kind of electronic equipment, which is characterized in that including processor, communication interface, memory and communication bus, wherein described Processor, the communication interface, the memory complete mutual communication by the communication bus;
The memory, for storing computer program;
The processor realizes any method of claim 1-6 for executing the program stored on the memory Step.
10. a kind of computer readable storage medium, which is characterized in that be stored with computer in the computer readable storage medium Program realizes claim 1-6 any method and step when the computer program is executed by processor.
CN201710813854.5A 2017-09-11 2017-09-11 Method and device for recording text reading as audio Active CN109509464B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710813854.5A CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710813854.5A CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Publications (2)

Publication Number Publication Date
CN109509464A true CN109509464A (en) 2019-03-22
CN109509464B CN109509464B (en) 2022-11-04

Family

ID=65744232

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710813854.5A Active CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Country Status (1)

Country Link
CN (1) CN109509464B (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113626013A (en) * 2021-08-04 2021-11-09 中国人民解放军战略支援部队航天工程大学 Automatic interpretation method and device for slides

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060106609A1 (en) * 2004-07-21 2006-05-18 Natsuki Saito Speech synthesis system
CN1855223A (en) * 2005-04-18 2006-11-01 株式会社理光 Audio font output device, font database, and language input front end processor
CN101141666A (en) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 Method of converting text note to voice broadcast in mobile phone
CN102280104A (en) * 2010-06-11 2011-12-14 北大方正集团有限公司 File phoneticization processing method and system based on intelligent indexing
CN104765714A (en) * 2014-01-08 2015-07-08 中国移动通信集团浙江有限公司 Switching method and device for electronic reading and listening
CN104810015A (en) * 2015-03-24 2015-07-29 深圳市创世达实业有限公司 Voice converting device, voice synthesis method and sound box using voice converting device and supporting text storage
CN106856091A (en) * 2016-12-21 2017-06-16 北京智能管家科技有限公司 The automatic broadcasting method and system of a kind of multi-language text

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20060106609A1 (en) * 2004-07-21 2006-05-18 Natsuki Saito Speech synthesis system
CN1906660A (en) * 2004-07-21 2007-01-31 松下电器产业株式会社 Speech synthesis device
CN1855223A (en) * 2005-04-18 2006-11-01 株式会社理光 Audio font output device, font database, and language input front end processor
CN101141666A (en) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 Method of converting text note to voice broadcast in mobile phone
CN102280104A (en) * 2010-06-11 2011-12-14 北大方正集团有限公司 File phoneticization processing method and system based on intelligent indexing
CN104765714A (en) * 2014-01-08 2015-07-08 中国移动通信集团浙江有限公司 Switching method and device for electronic reading and listening
CN104810015A (en) * 2015-03-24 2015-07-29 深圳市创世达实业有限公司 Voice converting device, voice synthesis method and sound box using voice converting device and supporting text storage
CN106856091A (en) * 2016-12-21 2017-06-16 北京智能管家科技有限公司 The automatic broadcasting method and system of a kind of multi-language text

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113626013A (en) * 2021-08-04 2021-11-09 中国人民解放军战略支援部队航天工程大学 Automatic interpretation method and device for slides

Also Published As

Publication number Publication date
CN109509464B (en) 2022-11-04

Similar Documents

Publication Publication Date Title
US9104290B2 (en) Method for controlling screen of mobile terminal
CN105335455A (en) Text reading method and apparatus
CN103916704A (en) Dialog-type interface apparatus and method for controlling the same
CN103905644A (en) Generating method and equipment of mobile terminal call interface
US20200097528A1 (en) Method and Device for Quickly Inserting Text of Speech Carrier
CN102917119A (en) Method and system for processing music by mobile terminal according to voice recognition
CN109634501B (en) Electronic book annotation adding method, electronic equipment and computer storage medium
CN109074821A (en) Speech is to Text enhancement media editing
CN101739437A (en) Implementation method for network sound-searching unit and specific device thereof
CN108614851A (en) Notes content display methods in tutoring system and device
JP2020003774A (en) Method and apparatus for processing speech
CN108460120A (en) Data save method, device, terminal device and storage medium
KR101156934B1 (en) Method for Creating and Playing Sound-Recorded File with Keyword and Portable Device thereof
CN101465146B (en) Method and equipment for playing media file
CN111540370A (en) Audio processing method and device, computer equipment and computer readable storage medium
CN105637586A (en) Method and apparatus for editing audio files
CN108305622A (en) A kind of audio summary texts creation method and its creating device based on speech recognition
CN104575545B (en) A kind of generation method of list to be played
CN109509464A (en) It is a kind of text to be read aloud the method and device for being recorded as audio
CN108682426A (en) Voice sensual pleasure conversion method and device
CN104866186B (en) A kind of word playback method and electronic equipment
CN104598229B (en) A kind of terminal
CN112000254B (en) Corpus resource playing method and device, storage medium and electronic device
US20200348804A1 (en) Sectional user interface for controlling a mobile terminal
CN113765754A (en) Audio synchronous playing method and device and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant