CN109509464B - Method and device for recording text reading as audio - Google Patents

Method and device for recording text reading as audio Download PDF

Info

Publication number
CN109509464B
CN109509464B CN201710813854.5A CN201710813854A CN109509464B CN 109509464 B CN109509464 B CN 109509464B CN 201710813854 A CN201710813854 A CN 201710813854A CN 109509464 B CN109509464 B CN 109509464B
Authority
CN
China
Prior art keywords
text
text content
read
converted
reading
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201710813854.5A
Other languages
Chinese (zh)
Other versions
CN109509464A (en
Inventor
胡娟
黄兰花
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Original Assignee
Beijing Kingsoft Office Software Inc
Zhuhai Kingsoft Office Software Co Ltd
Guangzhou Kingsoft Mobile Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Office Software Inc, Zhuhai Kingsoft Office Software Co Ltd, Guangzhou Kingsoft Mobile Technology Co Ltd filed Critical Beijing Kingsoft Office Software Inc
Priority to CN201710813854.5A priority Critical patent/CN109509464B/en
Publication of CN109509464A publication Critical patent/CN109509464A/en
Application granted granted Critical
Publication of CN109509464B publication Critical patent/CN109509464B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00Speech synthesis; Text to speech systems
    • G10L13/08Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination

Abstract

The embodiment of the invention provides a method and a device for recording text reading as audio, wherein the method for recording the text reading as the audio comprises the following steps: acquiring text content to be read; reading voice data corresponding to each character in text content to be read from a voice database, and reading the voice data in sequence; receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction; and converting the text content to be converted into an audio file from the starting identifier, and storing the audio file. By the scheme, the text content can be automatically recorded into the audio file while listening to the reading of the text content, so that the operation efficiency and the interaction experience of a user are improved.

Description

Method and device for recording text reading as audio
Technical Field
The invention relates to the technical field of man-machine interaction, in particular to a method and a device for recording text reading as audio.
Background
With the rapid development of computers and mobile phones, people increasingly rely on computers and mobile phones, and reading habits are gradually changed from paper reading to electronic text reading, but text content is inconvenient to read in scenes such as driving, cooking or carrying things in hands. Therefore, for scenes in which text content is inconvenient to read, many reading software provide a reading function at present, that is, text content is converted into voice and then read. The user can listen to the content of the text without reading, and can even listen to the reading while doing other things, so that great convenience is provided for the user to acquire the text information in a scene that the user is not convenient to read the text.
However, the existing reading software capable of providing a reading function often only has a text reading function, and if a user wants to share text contents with others by using an audio file or store the text contents as an audio file for listening repeatedly in the following, the user needs to import a text into audio conversion software, and convert the text contents into the audio file for storage through the audio conversion software; therefore, when a user listens to the text content by reading software, the text content can be converted into an audio file for storage only by the aid of audio conversion software, and the implementation process is complex, the operation efficiency of the user is low, and the interaction experience of the user is influenced.
Disclosure of Invention
The embodiment of the invention aims to provide a method and a device for recording text reading aloud as audio, so that the text content can be automatically recorded as an audio file while listening to the reading aloud of the text content, and the operation efficiency and the interactive experience of a user are improved. The specific technical scheme is as follows:
in a first aspect, an embodiment of the present invention provides a method for recording text reads as audio, where the method includes:
acquiring text content to be read;
reading voice data corresponding to each character in the text content to be read from a voice database, and reading the voice data in sequence;
receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction;
and converting the text content to be converted into an audio file from the starting identifier, and storing the audio file.
Optionally, the obtaining of the text content to be read includes:
acquiring a text source and a reading instruction input by a user, wherein the reading instruction comprises: reading the text source complete text or reading part of the content in the text source;
and reading the full text or partial content from the text source to serve as the text content to be read.
Optionally, before reading the voice data corresponding to each character in the text content to be read from the voice database and sequentially reading the voice data, the method further includes:
and acquiring a reading starting instruction input by a user.
Optionally, the audio conversion instruction is: an audio conversion instruction input by a user; or, generating an audio conversion instruction according to the paragraph of the text content to be read;
the receiving and determining, according to an audio conversion instruction, a start identifier for converting the text content to be converted in the text content to be read into an audio file includes:
when an audio conversion instruction input by a user is received, determining the text content to be read as the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
or when receiving an audio conversion instruction generated according to the paragraphs of the text content to be read, determining each paragraph in the text content to be read as the text content to be converted, and determining a first character in each text content to be converted as an initial identifier of audio conversion;
the converting the text content to be converted into an audio file from the starting identifier, and storing the audio file, including:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Optionally, the audio conversion instruction includes: a text selection instruction input by a user;
the receiving and determining, according to an audio conversion instruction, a start identifier for converting the text content to be converted in the text content to be read into an audio file includes:
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by a user;
determining a first character of the text content to be converted in the text content to be read aloud as a starting identifier;
the converting the text content to be converted into an audio file from the starting identifier, and saving the audio file includes:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Optionally, before the converting the text content to be converted into the audio file from the start identifier and saving the audio file, the method further includes:
acquiring a text recording strategy;
the converting the text content to be converted into an audio file from the starting identifier, and storing the audio file, including:
and converting the text content to be converted into an audio file according to the text recording strategy from the starting identifier, and storing the audio file.
In a second aspect, an embodiment of the present invention further provides an apparatus for recording text reads as audio, where the apparatus includes:
the acquisition module is used for acquiring text contents to be read;
the reading module is used for reading the voice data corresponding to each character in the text content to be read from a voice database and sequentially reading the voice data;
the determining module is used for receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction;
and the storage module is used for converting the text content to be converted into an audio file from the starting identifier and storing the audio file.
Optionally, the apparatus further comprises:
and the reading starting instruction acquisition module is used for acquiring the reading starting instruction input by the user.
In a third aspect, an embodiment of the present invention provides an electronic device, including a processor, a communication interface, a memory, and a communication bus, where the processor, the communication interface, and the memory complete mutual communication through the communication bus;
the memory is used for storing a computer program;
the processor is configured to execute the program stored in the memory, and implement the method steps of the first aspect.
In a fourth aspect, the present invention provides a computer-readable storage medium, in which a computer program is stored, and the computer program, when executed by a processor, implements the method steps of the first aspect.
According to the method and the device for recording the text read aloud as the audio, the voice data corresponding to each character in the text content to be read are read from the voice database, the voice data are read aloud in sequence, the text content to be converted in the text content to be read is converted into the audio file from the initial identification of audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read aloud, the operation efficiency of reading the text content aloud and storing the corresponding audio file is improved, the operation of a user is simplified, and the interaction experience of the user is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, it is obvious that the drawings in the following description are only some embodiments of the present invention, and for those skilled in the art, other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a schematic flowchart of a method for recording text reads as audio according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of another method for recording text reads as audio according to an embodiment of the present invention;
FIG. 3 is a schematic interface diagram of the reading software according to the embodiment of the present invention;
fig. 4 is a flowchart illustrating an apparatus for recording text reads as audio according to an embodiment of the present invention;
fig. 5 is a schematic flow chart of an apparatus for recording text reads as audio according to an embodiment of the present invention;
fig. 6 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In order to automatically record text content into an audio file while listening to the reading of the text content and improve the operation efficiency and the interaction experience of a user, the embodiment of the invention provides a method and a device for recording the text reading into audio.
First, a method for recording text as audio by reading text is described below.
It should be noted that an execution main body of the method for reading and recording text as audio provided by the embodiment of the present invention may be a computer, a tablet computer, a mobile phone, or an electronic book reader, and the specific method flow may be implemented by a reading application program in the above-mentioned device.
As shown in fig. 1, a method for recording text reads as audio according to an embodiment of the present invention may include the following steps:
s101, acquiring text content to be read.
The text content to be read is the content in the text sources of novels, prose, poetry, news and the like imported from the local or acquired from the network by the user, and is generally the content in the text sources which the user is interested in or wants to read.
It should be noted that the Format of the text content may be a text in a word processing Format, a PDF (Portable Document Format) text, a text in a table Format, or a text in a slide Format, and the Format of the text content is not limited herein, and a text file in any Format falls within the protection scope of the embodiment of the present invention. The text source imported locally by the user or obtained from the network is generally a complete text source, such as a novel, a series of news, etc., and the user may only be interested in a certain part of the text source, so the text content may be all the content of the entire text source or a part of the content of the text source.
Optionally, the step of acquiring the text content to be read may include:
firstly, a text source and a reading instruction input by a user are obtained.
Wherein the reading instructions may include: instructions to read the full text of the text source, or instructions to read a portion of the content of the text source. It should be noted that after obtaining text sources such as novels and news, a user needs to select to read the full text of the text source or read part of the content in the text source according to the content concerned or interested by the user, so that an instruction for reading the full text of the text source or an instruction for reading part of the content in the text source is generated according to the selection of the user.
And secondly, reading the full text or partial content from the text source as the text content to be read.
It should be noted that, when the obtained reading instruction is an instruction for reading a text source full text, the text content to be read is the text source full text; and when the obtained reading instruction is an instruction for reading part of the content in the text source, the text content to be read is the part of the content in the text source. It is emphasized that the instructions for reading the part of the content in the text source should include: the starting character of the portion of content and the length of the content.
It can be understood that, for example, a user acquires a series of news from the network, the series of news includes 15 news, and the user finds that only 5 news of the 15 news are interested in the title of the news, the user can input an instruction for reading the 5 news, and the reading application software will only read the 5 news, thereby helping the user filter out content that is not interested in the user, and providing a good use experience for the user.
S102, reading voice data corresponding to each character in the text content to be read from the voice database, and reading the voice data in sequence.
It should be noted that, a voice database exists in the local device or the network, and after the text content to be read is obtained, the reading application software may read the voice data corresponding to each character in the text content to be read from the voice database, and then read each voice data. It should be emphasized that, in general, in order to reduce the data storage process, corresponding voice data is read from the voice database in sequence according to the order of characters in the text content and is read out in sequence, that is, voice data corresponding to one character is read out and then is read out. And the reading voice data can support multiple languages, and the user can select different languages to read, such as Chinese, english, french, german, and the like.
It should be emphasized that it is reasonable to start reading the text content to be read, starting reading when the user imports the text content, or starting reading when the user inputs a reading start instruction and receives the reading start instruction.
S103, receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to the audio conversion instruction.
The audio conversion instruction is a starting instruction for converting the text content into the audio file, and the audio conversion instruction can be input by a user according to needs, for example, in the process that the user listens to the reading process of the text content, the text content is found to be valuable, and the text content is required to be stored as the audio file to be shared with other people, so that the user can input the audio conversion instruction; alternatively, in order to reduce the data amount of an audio file that stores the entire content of the text, the audio conversion instruction may also be generated by a paragraph of the text content to be read. Of course, the audio conversion instruction may also be generated by default at the start time of reading the text content, and the audio conversion is performed in synchronization with the reading of the text content; the audio conversion instruction can also be generated according to the reading ending identification of the text content when the text content is read. It is emphasized that any manner of audio conversion instruction falls within the scope of the present embodiment.
It should be noted that after receiving the audio conversion instruction, the text content to be converted in the text content to be read needs to be subjected to audio conversion, and the text content to be converted may be all of the text content to be read or may be part of the text content to be read. The audio conversion needs to be started from a start identifier, which may be the first character of the full text content, the first character of each paragraph in the text content, or the first character of the part of the content selected by the user.
Optionally, the audio conversion instruction may be: an audio conversion instruction input by a user; or, an audio conversion instruction is generated according to the paragraph of the text content to be read; alternatively, a text selection instruction entered by the user.
It should be noted that the audio conversion command input by the user may be input during the text content reading process, or may be input before the text content reading process and multiplexed with the reading command, which is not limited herein. It is emphasized that when the audio conversion instructions are multiplexed with the speakable instructions, the audio conversion and the text content speaks simultaneously. The user can determine whether to convert the text content into the audio file according to the received reading content, so that the requirements of the user are met. The audio conversion instruction can also be generated according to the paragraph of the text content to be read, and the audio conversion according to the paragraph can reduce the risk of data loss caused by overlarge data quantity when the whole text content is converted. The audio conversion quality may also be a text selection instruction input by the user, that is, the user selects a certain part of text content from the text content to be read as a selection instruction of the text content to be converted according to the degree of interest in the text content to be read.
Optionally, the step of receiving and determining, according to the audio conversion instruction, a start identifier for converting the text content to be converted in the text content to be read into an audio file may include:
when an audio conversion instruction input by a user is received, determining that the text content to be read is the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
alternatively, the first and second electrodes may be,
when receiving an audio conversion instruction generated according to paragraphs of text contents to be read, determining each paragraph in the text contents to be read as the text contents to be converted, and determining a first character in each text content to be converted as an initial identifier of audio conversion;
alternatively, the first and second liquid crystal display panels may be,
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by the user; and determining the first character of the text content to be converted in the text content to be read as a starting identifier.
It should be noted that after receiving the audio conversion instruction, determining the text content to be converted in the text content to be read according to the difference of the audio conversion instruction, then starting to perform audio conversion on the text content to be converted, and needing to determine an initial identifier of the audio conversion, generally, selecting a first character in the text content to be converted as the initial identifier of the audio conversion; of course, the start identifier may not be the first character, but if the start identifier is any character in the text content, it may cause a situation that the content is discontinuous when the saved audio file is played, and affect the playing effect. Therefore, in the embodiment, the first character of the text content to be converted is selected as the initial identifier of the audio conversion, so that the continuity of the converted audio file is ensured, and the effect of playing the converted audio file later is facilitated. And the user can select the text content to be read according to the requirement, listen to the reading and recording of part of the content, and specify the starting character of the text content to be read and the length of the text content to be converted in the text content, for example, the user specifies that the first character of the text content to be read is the first character of the fifth segment in the text content to be read, and the length of the text content to be read is 30 characters. Then, reading can be performed starting from the first character of the text content to be read; the text content to be converted may be all the content of the text content to be read aloud, or may be a part of the content of the text content to be read aloud, and all the text content to be converted is converted into an audio file and stored from the first character of the text content to be converted. According to the designation of the text content to be read aloud by the user, the requirement of the user can be better met.
It should be emphasized that the text content to be read specified by the user may be regarded as trial listening content, and in the process of listening to reading, if the user considers that the text content has a value saved as an audio file, the first character of the text content to be read may be regarded as a start identifier, and the full text of the text content to be read may be saved as an audio file.
It is understood that the text selection instruction input by the user may be that the user selects a part of the text content by lighting up, or that the user selects by inputting a specific reading condition, for example, the content input by the user is: and the content of the audio conversion is the content corresponding to the reading condition from the first segment, the tenth character in the first line to the second segment and the fifth character in the third line. It should be emphasized that any method for selecting text content by a user belongs to the scope of the embodiments of the present invention, and details are not repeated here.
And S104, converting the text content to be converted into an audio file from the initial identifier, and storing the audio file.
It should be noted that the start identifier is a start point for starting audio conversion of the text content, so that audio conversion is performed on the text content to be converted from the start identifier, and the converted audio file is stored, so that the user repeatedly listens to the text content corresponding to the stored audio file, or the user shares the stored audio file with other people, so that the other people listen to the audio file without obtaining an original text source.
It is emphasized that the format of the audio file in the audio conversion may be any format of audio file, such as CD format, WAV format, WMA format, AAC format, APE format, etc. Also, the converted audio file may be in a variety of languages, such as chinese, english, french, german, etc.
Optionally, the step of converting the text content to be converted into an audio file from the start identifier and storing the audio file may include:
starting from the first character of the text content to be converted, the text content to be converted is converted into an audio file, and the audio file is saved.
It should be noted that, starting from the first character of the text content to be converted, the text content to be converted is converted into an audio file. If the text content to be converted is the full text of the text content to be read, the corresponding audio file is saved, if the file is lost or damaged, the audio file of the whole text content may be affected, and if the text content to be converted is a part of the text content to be read, for example, each section of text content, the corresponding audio file is saved, and if the text content of a certain part is lost or damaged, the other part is not affected.
In order to deal with the above situations, before the steps of converting the text content to be converted into the audio file and saving the audio file from the start identifier, the method may further include:
and acquiring a text recording strategy.
The step of converting the text content to be converted into an audio file from the start identifier and saving the audio file may include:
and converting the text content to be converted into an audio file according to a text recording strategy from the initial identifier, and storing the audio file.
The text recording policy may be configured by the user according to the requirement, or may be generated by the execution main body of this embodiment according to analysis of the history data. For example, if the user is interested in sports news through the historical data, the text content corresponding to the sports news is recorded preferentially, and the corresponding audio file is saved. In particular, the text recording strategy may include, but is not limited to: some text content is recorded preferentially, some text content is not recorded, and some text content is recorded repeatedly.
By applying the embodiment, the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data is read in sequence, the text content to be converted in the text content to be read is converted into the audio file from the starting identifier of the audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and the interaction experience of the user is improved.
As shown in fig. 2, in the method for recording text reads as audio provided in this embodiment, before step S102 in the embodiment shown in fig. 1, the method for recording text reads as audio may further include:
s201, obtaining a reading starting instruction input by a user.
It should be noted that, the user inputs a reading start instruction, and starts to read the text content word by word when receiving the reading start instruction. The mode of inputting the reading starting instruction by the user can be input through the control or through the external switch device, and any reading starting instruction which can be input by the user belongs to the protection scope of the embodiment of the invention.
It should be emphasized that steps S101 to S104 are the same as the embodiment shown in fig. 1, and are not repeated here.
By applying the embodiment, the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data is read in sequence, the text content to be converted in the text content to be read is converted into the audio file from the initial identification of the audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and the interaction experience of the user is improved. And the text content is read aloud by acquiring the reading starting instruction input by the user, the reading starting is related to the requirement of the user, and the user experience is improved.
The method for recording text reads as audio provided by the embodiment of the present invention is described below with reference to specific application examples.
Fig. 3 is a schematic interface diagram of the reading software according to the embodiment of the present invention. The software interface 301 comprises a text content area 302, a reading start button 303 and an audio conversion button 304, a user selects 5 news from the network, the reading software is imported, the 5 news are displayed in the text content area 302, and when the user clicks the reading start button 303, the reading software reads voice data corresponding to each character of the news from the 1 st news and from a voice database, and reads the voice data word by word and strip by strip; when a user listens to a book, which considers news as meaningful and wants to be stored as an audio file shared with friends, the reading software starts to audio-convert the news in the text content area 302 according to the content of the paragraphs, that is, each paragraph is converted into an audio file, and stores the converted audio file by clicking the audio conversion button 304. Therefore, the user can find the saved audio file from the local storage space, share and send the audio file to friends.
Compared with the prior art, in the scheme, the reading software reads the voice data corresponding to each character in the text content area from the voice database, reads the voice data word by word and item by item, and converts each paragraph of the text in the text content area into an audio file starting from the first character of each paragraph according to the audio conversion instruction corresponding to the audio conversion button, stores the audio file, automatically converts the text content into the audio file while reading the text in the text content area, improves the operation efficiency of reading the text content and storing the corresponding audio file, and simplifies the operation of a user, thereby improving the interaction experience of the user; and the audio files are generated according to the paragraphs, so that the situation that the audio files of the whole text content are affected if the audio files corresponding to the whole text are saved and the other paragraphs are not affected if the content of one paragraph is lost or damaged is avoided.
Corresponding to the foregoing embodiments, an apparatus for recording text reads as audio according to an embodiment of the present invention is provided, and as shown in fig. 4, the apparatus for recording text reads as audio may include:
an obtaining module 410, configured to obtain text content to be read;
a reading module 420, configured to read voice data corresponding to each character in the text content to be read from a voice database, and sequentially read the voice data;
the determining module 430 is configured to receive and determine, according to an audio conversion instruction, a starting identifier for converting to-be-converted text content in the to-be-read text content into an audio file;
a saving module 440, configured to convert the text content to be converted into an audio file from the start identifier, and save the audio file.
By applying the embodiment, the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data is read in sequence, the text content to be converted in the text content to be read is converted into the audio file from the initial identification of the audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and the interaction experience of the user is improved.
Optionally, the obtaining module 410 may be specifically configured to:
acquiring a text source and a reading instruction input by a user, wherein the reading instruction comprises: reading the text source complete text or reading part of content in the text source;
and reading the full text or partial content from the text source to serve as the text content to be read.
Optionally, the audio conversion instruction is: an audio conversion instruction input by a user; or, generating an audio conversion instruction according to the paragraph of the text content to be read;
the determining module 430 may be specifically configured to:
when an audio conversion instruction input by a user is received, determining the text content to be read as the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
or when receiving an audio conversion instruction generated according to the paragraphs of the text content to be read, determining each paragraph in the text content to be read as the text content to be converted, and determining a first character in each text content to be converted as an initial identifier of audio conversion;
the saving module 440 may specifically be configured to:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Optionally, the audio conversion instruction includes: a text selection instruction input by a user;
the determining module 430 may be further configured to:
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by a user;
and determining a first character of the text content to be converted in the text content to be read as a starting identifier.
Optionally, the apparatus may further include:
the recording strategy acquisition module is used for acquiring a text recording strategy;
the saving module 440 may be specifically configured to:
and converting the text content to be converted into an audio file according to the text recording strategy from the starting identifier, and storing the audio file.
Further, on the basis of including the obtaining module 410, the reading module 420, the determining module 430, and the storing module 440, as shown in fig. 5, an apparatus for recording text reading as audio provided by the embodiment of the present invention may further include:
and a reading starting instruction obtaining module 510, configured to obtain a reading starting instruction input by a user.
By applying the embodiment, the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data is read in sequence, the text content to be converted in the text content to be read is converted into the audio file from the starting identifier of the audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and the interaction experience of the user is improved. And the text content is read aloud by acquiring a reading starting instruction input by the user, the reading starting is related to the requirement of the user, and the user experience is improved.
It should be noted that, the apparatus for recording text reading aloud as audio in the embodiment of the present invention is an apparatus that applies the method for recording text reading aloud as audio, and all embodiments of the method for recording text reading aloud as audio are applicable to the apparatus and can achieve the same or similar beneficial effects.
An embodiment of the present invention further provides an electronic device, as shown in fig. 6, including a processor 610, a communication interface 620, a memory 630, and a communication bus 640, where the processor 610, the communication interface 620, and the memory 630 complete mutual communication through the communication bus 640;
the memory 630 is used for storing computer programs;
the processor 610 is configured to execute the program stored in the memory, and to cause the following steps to be implemented:
acquiring text content to be read;
reading voice data corresponding to each character in the text content to be read from a voice database, and reading the voice data in sequence;
receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction;
and converting the text content to be converted into an audio file from the starting identifier, and storing the audio file.
When the processor 610 implements the step of acquiring the text content to be read, it may specifically implement:
acquiring a text source and a reading instruction input by a user, wherein the reading instruction comprises: reading the text source complete text or reading part of content in the text source;
and reading the full text or partial content from the text source to serve as the text content to be read.
Before the processor 610 reads the voice data corresponding to each character in the text content to be read from the voice database and reads the voice data in sequence, the following may also be implemented:
and acquiring a reading starting instruction input by a user.
Optionally, the audio conversion instruction is: an audio conversion instruction input by a user; or, generating an audio conversion instruction according to the paragraph of the text content to be read;
when the processor 610 implements the receiving and determines, according to the audio conversion instruction, to convert the text content to be converted in the text content to be read into the start identifier of the audio file, the following steps may be specifically implemented:
when an audio conversion instruction input by a user is received, determining the text content to be read as the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
or when receiving an audio conversion instruction generated according to the paragraphs of the text content to be read, determining each paragraph in the text content to be read as the text content to be converted, and determining a first character in each text content to be converted as an initial identifier of audio conversion;
when the processor 610 implements the steps of converting the text content to be converted into an audio file from the start identifier, and saving the audio file, it may specifically implement:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Optionally, the audio conversion instruction includes: a text selection instruction input by a user;
when the processor 610 implements the receiving and determines, according to the audio conversion instruction, to convert the text content to be converted in the text content to be read into the start identifier of the audio file, specifically, the following steps may be implemented:
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by a user;
determining a first character of the text content to be converted in the text content to be read aloud as a starting identifier;
when the processor 610 implements the steps of converting the text content to be converted into an audio file from the start identifier, and saving the audio file, the following may be specifically implemented:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Before implementing the steps of converting the text content to be converted into an audio file from the start identifier and saving the audio file, the processor 610 may further implement:
acquiring a text recording strategy;
when the processor 610 implements the steps of converting the text content to be converted into an audio file from the start identifier, and saving the audio file, it may specifically implement:
and converting the text content to be converted into an audio file according to the text recording strategy from the starting identifier, and storing the audio file.
In this embodiment, the processor can realize that: the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data are read in sequence, the text content to be converted in the text content to be read is converted into an audio file from the initial identification of audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and therefore the interaction experience of the user is improved.
The Memory may include a Random Access Memory (RAM) or a Non-Volatile Memory (NVM), such as at least one disk Memory. Optionally, the memory may also be at least one memory device located remotely from the processor.
The Processor may be a general-purpose Processor, including a Central Processing Unit (CPU), a Network Processor (NP), and the like; but also a DSP (Digital Signal Processing), an ASIC (Application Specific Integrated Circuit), an FPGA (Field Programmable Gate Array) or other Programmable logic device, discrete Gate or transistor logic device, discrete hardware component.
In response to the method for recording text reads as audio provided in the foregoing embodiments, embodiments of the present invention further provide a machine-readable storage medium storing machine-executable instructions that, when invoked and executed by a processor, cause the processor to implement the following steps:
acquiring text content to be read;
reading voice data corresponding to each character in the text content to be read from a voice database, and reading the voice data in sequence;
receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction;
and converting the text content to be converted into an audio file from the starting identifier, and storing the audio file.
When the processor implements the step of obtaining the text content to be read, the following may be specifically implemented:
acquiring a text source and a reading instruction input by a user, wherein the reading instruction comprises: reading the text source complete text or reading part of content in the text source;
and reading the full text or partial content from the text source to serve as the text content to be read.
Before the processor reads the voice data corresponding to each character in the text content to be read from the voice database and reads the voice data in sequence, the processor may further:
and acquiring a reading starting instruction input by a user.
Optionally, the audio conversion instruction is: an audio conversion instruction input by a user; or, generating an audio conversion instruction according to a paragraph of the text content to be read;
when the processor implements the step of receiving and determining, according to the audio conversion instruction, to convert the text content to be converted in the text content to be read into the start identifier of the audio file, the following may be specifically implemented:
when an audio conversion instruction input by a user is received, determining the text content to be read as the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
or when receiving an audio conversion instruction generated according to the paragraphs of the text content to be read, determining each paragraph in the text content to be read as the text content to be converted, and determining a first character in each text content to be converted as an initial identifier of audio conversion;
when the processor implements the steps of converting the text content to be converted into an audio file from the start identifier, and storing the audio file, the following may be specifically implemented:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Optionally, the audio conversion instruction includes: a text selection instruction input by a user;
when the processor implements the step of receiving and determining, according to the audio conversion instruction, to convert the text content to be converted in the text content to be read into the start identifier of the audio file, the following may be specifically implemented:
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by a user;
determining a first character of the text content to be converted in the text content to be read aloud as a starting identifier;
when the processor implements the steps of converting the text content to be converted into an audio file from the start identifier and saving the audio file, the following may be specifically implemented:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file, and storing the audio file.
Before the step of converting the text content to be converted into an audio file from the start identifier and saving the audio file is implemented by the processor, the following steps may be further implemented:
acquiring a text recording strategy;
when the processor implements the steps of converting the text content to be converted into an audio file from the start identifier and saving the audio file, the following may be specifically implemented:
and converting the text content to be converted into an audio file according to the text recording strategy from the starting identifier, and storing the audio file.
In this embodiment, the machine-readable storage medium stores an application program that executes the method for reading and recording text as audio provided in the embodiment of the present application when running, so that the following can be implemented: the voice data corresponding to each character in the text content to be read is read from the voice database, the voice data are read in sequence, the text content to be converted in the text content to be read is converted into an audio file from the initial identification of audio conversion according to the received audio conversion instruction, the audio file is stored, the text content to be converted is automatically converted into the audio file while the text content to be read is read, the operation efficiency of reading the text content to be read and storing the corresponding audio file is improved, the operation of a user is simplified, and therefore the interaction experience of the user is improved.
As for the embodiments of the electronic device and the machine-readable storage medium, since the contents of the related methods are substantially similar to those of the foregoing method embodiments, the description is relatively simple, and reference may be made to the partial description of the method embodiments for relevant points.
It is noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising a … …" does not exclude the presence of another identical element in a process, method, article, or apparatus that comprises the element.
All the embodiments in the present specification are described in a related manner, and the same and similar parts among the embodiments may be referred to each other, and each embodiment focuses on differences from other embodiments. In particular, for the system embodiment, since it is substantially similar to the method embodiment, the description is simple, and for the relevant points, reference may be made to the partial description of the method embodiment.
The above description is only for the preferred embodiment of the present invention, and is not intended to limit the scope of the present invention. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention shall fall within the protection scope of the present invention.

Claims (9)

1. A method of recording text reads as audio, the method comprising:
acquiring text content to be read;
reading voice data corresponding to each character in the text content to be read from a voice database, and reading the voice data in sequence;
receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction; the starting identifier is arranged in the text content to be read of the voice data which is not already read or the text content to be read of the voice data which is already read; the audio conversion instruction is input in the process of reading the text content, or is input before the text content is read and is multiplexed with the reading instruction;
under the condition that the starting identifier is arranged in the text content to be read of the voice data, the text content to be converted is converted into an audio file according to a text recording strategy from the starting identifier while the text content to be read is read, and the audio file is stored;
under the condition that the starting identifier is arranged in the text content to be read of the voice data, converting the text content to be converted into an audio file according to a text recording strategy from the starting identifier, and storing the audio file, wherein the method comprises the following steps: converting the read text content to be read as the text content to be converted into an audio file;
the text recording strategy comprises the following steps: and preferentially recording, not recording and repeatedly recording part of the text content in the text content to be converted.
2. The method for recording text reads as audio according to claim 1, wherein the obtaining text content to be read comprises:
acquiring a text source and a reading instruction input by a user, wherein the reading instruction comprises: reading the text source complete text or reading part of content in the text source;
and reading the full text or partial content from the text source to serve as the text content to be read.
3. The method for recording text reads as audio according to claim 1, wherein before reading the voice data corresponding to each character in the text content to be read from the voice database and reading the voice data in sequence, the method further comprises:
and acquiring a reading starting instruction input by a user.
4. The method for recording text reads as audio according to claim 1, wherein the audio conversion instructions are: an audio conversion instruction input by a user; or, generating an audio conversion instruction according to a paragraph of the text content to be read;
the receiving and determining, according to an audio conversion instruction, a start identifier for converting the text content to be converted in the text content to be read into an audio file includes:
when an audio conversion instruction input by a user is received, determining the text content to be read as the text content to be converted, and determining a first character in the text content to be converted as an initial identifier of audio conversion;
or when receiving an audio conversion instruction generated according to the paragraphs of the text contents to be read, determining each paragraph in the text contents to be read as the text contents to be converted, and determining a first character in each text contents to be converted as a start identifier of audio conversion;
the converting the text content to be converted into an audio file according to a text recording strategy while reading the text content to be read from the starting identifier, and storing the audio file includes:
starting from the first character of the text content to be converted, converting the text content to be converted into an audio file according to a text recording strategy while reading the text content to be read, and storing the audio file;
converting the text content to be converted into an audio file according to a text recording strategy from the starting identifier, and storing the audio file, wherein the converting comprises the following steps:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file according to a text recording strategy, and storing the audio file.
5. The method for recording text reads as audio of claim 1, wherein the audio conversion instructions comprise: a text selection instruction input by a user;
the receiving and determining, according to an audio conversion instruction, a start identifier for converting the text content to be converted in the text content to be read into an audio file includes:
receiving and reading the text content to be read corresponding to the text selection instruction from the text content to be read according to the text selection instruction input by a user;
determining a first character of the text content to be converted in the text content to be read aloud as a starting identifier;
the converting the text content to be converted into an audio file according to a text recording strategy while reading the text content to be read from the starting identifier, and storing the audio file includes:
starting from the first character of the text content to be converted, converting the text content to be converted into an audio file according to a text recording strategy while reading the text content to be read, and storing the audio file;
converting the text content to be converted into an audio file according to a text recording strategy from the starting identifier, and storing the audio file, wherein the converting comprises the following steps:
and starting from the first character of the text content to be converted, converting the text content to be converted into an audio file according to a text recording strategy, and storing the audio file.
6. An apparatus for recording text reads as audio, the apparatus comprising:
the acquisition module is used for acquiring text contents to be read;
the reading module is used for reading the voice data corresponding to each character in the text content to be read from a voice database and sequentially reading the voice data;
the determining module is used for receiving and determining a starting identifier for converting the text content to be converted in the text content to be read into an audio file according to an audio conversion instruction; the starting identifier is arranged in the text content to be read of the voice data which is not already read or the text content to be read of the voice data which is already read; the audio conversion instruction is input in the process of reading the text content aloud, or is input before the text content is read aloud and is multiplexed with the reading instruction;
the storage module is used for converting the text content to be converted into an audio file according to a text recording strategy from the starting identifier when the starting identifier is arranged in the text content to be read of the voice data which is not read; and when the starting identifier is set in the text content to be read of the voice data, converting the text content to be converted into an audio file according to a text recording strategy from the starting identifier, and storing the audio file, including: converting the text content to be read after being read as the text content to be converted into an audio file; the text recording strategy comprises the following steps: and preferentially recording, not recording and repeatedly recording part of the text content in the text content to be converted.
7. The apparatus for recording text reads as audio according to claim 6, further comprising:
and the reading starting instruction acquisition module is used for acquiring the reading starting instruction input by the user.
8. An electronic device, comprising a processor, a communication interface, a memory and a communication bus, wherein the processor, the communication interface and the memory complete communication with each other through the communication bus;
the memory is used for storing a computer program;
the processor, configured to execute the program stored in the memory, implements the method steps of any of claims 1-5.
9. A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, which computer program, when being executed by a processor, carries out the method steps of any one of the claims 1-5.
CN201710813854.5A 2017-09-11 2017-09-11 Method and device for recording text reading as audio Active CN109509464B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710813854.5A CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710813854.5A CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Publications (2)

Publication Number Publication Date
CN109509464A CN109509464A (en) 2019-03-22
CN109509464B true CN109509464B (en) 2022-11-04

Family

ID=65744232

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710813854.5A Active CN109509464B (en) 2017-09-11 2017-09-11 Method and device for recording text reading as audio

Country Status (1)

Country Link
CN (1) CN109509464B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113626013A (en) * 2021-08-04 2021-11-09 中国人民解放军战略支援部队航天工程大学 Automatic interpretation method and device for slides

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1855223A (en) * 2005-04-18 2006-11-01 株式会社理光 Audio font output device, font database, and language input front end processor
CN1906660A (en) * 2004-07-21 2007-01-31 松下电器产业株式会社 Speech synthesis device
CN101141666A (en) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 Method of converting text note to voice broadcast in mobile phone
CN102280104A (en) * 2010-06-11 2011-12-14 北大方正集团有限公司 File phoneticization processing method and system based on intelligent indexing
CN104765714A (en) * 2014-01-08 2015-07-08 中国移动通信集团浙江有限公司 Switching method and device for electronic reading and listening
CN104810015A (en) * 2015-03-24 2015-07-29 深圳市创世达实业有限公司 Voice converting device, voice synthesis method and sound box using voice converting device and supporting text storage
CN106856091A (en) * 2016-12-21 2017-06-16 北京智能管家科技有限公司 The automatic broadcasting method and system of a kind of multi-language text

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1906660A (en) * 2004-07-21 2007-01-31 松下电器产业株式会社 Speech synthesis device
CN1855223A (en) * 2005-04-18 2006-11-01 株式会社理光 Audio font output device, font database, and language input front end processor
CN101141666A (en) * 2006-09-05 2008-03-12 中兴通讯股份有限公司 Method of converting text note to voice broadcast in mobile phone
CN102280104A (en) * 2010-06-11 2011-12-14 北大方正集团有限公司 File phoneticization processing method and system based on intelligent indexing
CN104765714A (en) * 2014-01-08 2015-07-08 中国移动通信集团浙江有限公司 Switching method and device for electronic reading and listening
CN104810015A (en) * 2015-03-24 2015-07-29 深圳市创世达实业有限公司 Voice converting device, voice synthesis method and sound box using voice converting device and supporting text storage
CN106856091A (en) * 2016-12-21 2017-06-16 北京智能管家科技有限公司 The automatic broadcasting method and system of a kind of multi-language text

Also Published As

Publication number Publication date
CN109509464A (en) 2019-03-22

Similar Documents

Publication Publication Date Title
US10782856B2 (en) Method and device for displaying application function information, and terminal device
US9104290B2 (en) Method for controlling screen of mobile terminal
JP6060989B2 (en) Voice recording apparatus, voice recording method, and program
CN101567186B (en) Speech synthesis apparatus, method, program, system, and portable information terminal
US11868710B2 (en) Method and apparatus for displaying a text string copied from a first application in a second application
US20200097528A1 (en) Method and Device for Quickly Inserting Text of Speech Carrier
CN103853778A (en) Methods for updating music label information and pushing music, as well as corresponding device and system
WO2015154577A1 (en) Application icon setting method and device
WO2016179128A1 (en) Techniques to automatically generate bookmarks for media files
KR101567449B1 (en) E-Book Apparatus Capable of Playing Animation on the Basis of Voice Recognition and Method thereof
CN109509464B (en) Method and device for recording text reading as audio
CN108492826B (en) Audio processing method and device, intelligent equipment and medium
US20140372455A1 (en) Smart tags for content retrieval
CN108897584B (en) Application starting method and device and intelligent terminal
CN114020197B (en) Cross-application message processing method, electronic device and readable storage medium
US11249619B2 (en) Sectional user interface for controlling a mobile terminal
CN104318923B (en) Voice processing method and device and terminal
US8639514B2 (en) Method and apparatus for accessing information identified from a broadcast audio signal
CN112035739B (en) Knowledge pushing method and device based on calendar and computer storage medium
CN111818214A (en) Terminal device control method, terminal device, and medium
KR20170037302A (en) Electronic device and method of controlling thereof
CN111368099A (en) Core information semantic map generation method and device
CN115877997A (en) Interactive element-oriented voice interaction method, system and storage medium
CN114548055A (en) Description document editing method, description document display method, terminal and computer-readable storage medium
CN112750432A (en) Voice instruction response method and device and terminal equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant