CN108595470B - Audio paragraph collection method, device and system and computer equipment - Google Patents

Audio paragraph collection method, device and system and computer equipment Download PDF

Info

Publication number
CN108595470B
CN108595470B CN201810184584.0A CN201810184584A CN108595470B CN 108595470 B CN108595470 B CN 108595470B CN 201810184584 A CN201810184584 A CN 201810184584A CN 108595470 B CN108595470 B CN 108595470B
Authority
CN
China
Prior art keywords
paragraph
audio
data
collection
information
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810184584.0A
Other languages
Chinese (zh)
Other versions
CN108595470A (en
Inventor
常哲珲
黄仕强
高铭瑜
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Kingsoft Internet Security Software Co Ltd
Original Assignee
Beijing Kingsoft Internet Security Software Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Kingsoft Internet Security Software Co Ltd filed Critical Beijing Kingsoft Internet Security Software Co Ltd
Priority to CN201810184584.0A priority Critical patent/CN108595470B/en
Publication of CN108595470A publication Critical patent/CN108595470A/en
Application granted granted Critical
Publication of CN108595470B publication Critical patent/CN108595470B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)

Abstract

The invention relates to an audio paragraph collection method, device and system, a computer readable storage medium and computer equipment. The method comprises the following steps: receiving an audio paragraph collection request of an audio playing device, analyzing the audio paragraph collection request to obtain audio paragraph information to be collected, obtaining corresponding paragraph data according to a sentence forming label corresponding to the audio paragraph information to be collected, and collecting the paragraph data. The method collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.

Description

Audio paragraph collection method, device and system and computer equipment
Technical Field
The present invention relates to the field of voice control technologies, and in particular, to a method, an apparatus, a system, a computer-readable storage medium, and a computer device for storing an audio paragraph.
Background
At present, when a user listens to the audio of a network resource, the user wants to collect audio paragraphs such as interesting words and sentences when finding out the audio paragraphs. Usually, a screen of an intelligent terminal, such as a mobile phone, a tablet, a computer, etc., is clicked or pressed to trigger a related function instruction, so as to implement a collection operation of the whole audio (equivalent to a network link for storing the audio), and at a later stage, an image of a corresponding audio is obtained through the network link and rendered into a format suitable for a favorite display frame.
However, the currently adopted method for collecting the audio by the touch intelligent terminal has the technical problem that audio paragraphs cannot be collected.
Disclosure of Invention
Therefore, it is necessary to provide an audio paragraph collecting method, an apparatus, a system, a computer readable storage medium and a computer device capable of collecting audio paragraphs for solving the technical problem that audio paragraphs cannot be collected when the touch intelligent terminal collects audio.
An audio paragraph collection method comprising the steps of:
receiving an audio paragraph collection request of an audio playing device;
analyzing the audio paragraph collection request to obtain audio paragraph information to be collected, and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected;
and collecting the paragraph data.
In one embodiment, the step of obtaining corresponding paragraph data according to the sentence forming tag corresponding to the to-be-collected audio paragraph information includes: determining a sentence forming label corresponding to the audio paragraph information to be stored; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag.
In one embodiment, the step of obtaining corresponding paragraph data according to the sentence forming tag corresponding to the to-be-collected audio paragraph information includes: determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the audio paragraph collection request is a request generated according to a collection voice instruction after an audio playing device monitors the collection voice instruction;
and/or the presence of a gas in the gas,
after the step of querying a pre-established data source according to the sentence forming tag, the method further comprises the following steps: if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the step of collecting the paragraph data includes: and determining a collection format corresponding to the paragraph data, and collecting the paragraph data according to the corresponding collection format.
In one embodiment, the step of receiving an audio paragraph collection request of an audio playing device is preceded by:
receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request or not;
the audio paragraph collection method further comprises:
and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected.
In one embodiment, the step of parsing the audio paragraph collection request to obtain the information of the audio paragraph to be collected includes:
analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof;
the step of determining the collection format corresponding to the paragraph data includes:
and determining a collection format corresponding to the paragraph data according to the data type.
In one embodiment, the step of collecting the paragraph data according to the corresponding collection format includes:
if the collection format is a text format, collecting the text information corresponding to the paragraph data;
and/or the presence of a gas in the gas,
if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data;
and/or the presence of a gas in the gas,
and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In one embodiment, the step of collecting text information corresponding to the paragraph data includes:
inquiring whether a data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
An audio paragraph collection method comprising the steps of:
the audio playing device monitors a collection voice instruction, generates an audio paragraph collection request according to the collection voice instruction, and sends the audio paragraph collection request to a corresponding server;
the server analyzes the audio paragraph collection request to obtain audio paragraph information to be collected, and corresponding paragraph data are obtained according to the sentence forming labels corresponding to the audio paragraph information to be collected;
and collecting the paragraph data.
An audio paragraph stowage arrangement, the arrangement comprising:
the request receiving module is used for receiving an audio paragraph collection request sent by the audio playing equipment;
the request analysis module is used for analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected;
and the paragraph collection module is used for collecting the paragraph data.
The audio paragraph collection method and device receive an audio paragraph collection request of an audio playing device, analyze the audio paragraph collection request to obtain the audio paragraph information to be collected, obtain the corresponding paragraph data according to the sentence forming tags corresponding to the audio paragraph information to be collected, and finally collect the paragraph data. The method and the device collect the audio paragraph data through the voice command, are convenient and fast, and overcome the defect that the audio paragraph cannot be collected in the traditional technology.
An audio passage collection system, the system comprising: the system comprises an audio playing device and a server;
the audio playing device is used for monitoring a collection voice instruction, generating an audio paragraph collection request according to the collection voice instruction, and sending the audio paragraph collection request to a corresponding server;
the server is used for analyzing the audio paragraph collection request to obtain audio paragraph information to be collected and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; and collecting the paragraph data.
The audio paragraph collection system collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
In one embodiment, the server is further configured to send collection result information to a corresponding client after collecting the paragraph data, so as to display the collection result information at the client; the client is a client associated with the audio playing device.
In the embodiment, after the server collects the paragraph data, the server sends the collection result information to the client for the client to display, so that the method is convenient and quick, and the technical effect of synchronously collecting the audio paragraphs by the server and the client is achieved.
A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the audio paragraph collection method described above.
When the steps of the audio paragraph collection method are executed, the computer readable storage medium collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, the processor implementing the steps of the audio paragraph collection method when executing the program.
When the steps of the audio paragraph collection method are executed, the computer equipment collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
Drawings
FIG. 1 is a diagram of an application environment for an audio paragraph collection method of an embodiment;
FIG. 2 is an internal block diagram of an audio playback device according to an embodiment;
FIG. 3 is a schematic flow chart diagram of an audio paragraph collection method of one embodiment;
FIG. 4 is a schematic flow chart diagram of an audio paragraph collection method of another embodiment;
FIG. 5 is a schematic flow chart diagram of an audio paragraph collection method of yet another embodiment;
FIG. 6 is a schematic block diagram of an audio paragraph stowage arrangement of an embodiment.
Detailed Description
In order to make the objects, technical solutions and advantages of the present application more apparent, the present application is described in further detail below with reference to the accompanying drawings and embodiments. It should be understood that the specific embodiments described herein are merely illustrative of the present application and are not intended to limit the present application.
In one embodiment, the audio paragraph collection method provided by the present application can be applied to the application environment shown in fig. 1. Wherein the audio playing device 110 communicates with the server 120 through a network. As shown in fig. 2, the audio playing device 110 includes an audio collecting unit, an audio decoding unit, a processor, a wireless communication module (i.e., a network interface), and an internal memory; the audio acquisition unit can monitor voice information, the audio acquisition unit is still used for sending the voice information who gathers to the audio decoding unit, the audio decoding unit can be followed the voice information and discerned semantic information to send for the treater. The processor is capable of parsing the semantic information and generating an audio paragraph collection request. The internal memory is used for storing operating system data, software programs and/or file data. The audio playing device 110 further establishes a connection with the server 120 through the wireless communication module, and is configured to send an audio paragraph collection request to the server 120, so as to collect the audio paragraph through the server 120. The server 120 can also allocate a corresponding sentence tag to the audio paragraph, and obtain corresponding paragraph data according to the sentence tag. The audio playing device 110 may be, but not limited to, various smart speakers, smart televisions, personal computers, notebook computers, smart phones, tablet computers, and portable wearable devices, and the server 120 may be implemented by an independent server or a server cluster formed by a plurality of servers.
Based on the above description of the application environment and the internal structure of the audio playback device, the following description describes an embodiment of an audio paragraph collection method.
In one embodiment, as shown in fig. 3, an audio paragraph collection method is provided, which is described by taking the audio playing device in fig. 1 as an example, and includes the following steps:
step S101, an audio paragraph collection request of an audio playing device is received.
In this step, the audio paragraph collection request is a request generated by the audio playing device according to the collected voice instruction after monitoring the collected voice instruction. In addition, the collection voice instruction is an instruction generated by sending voice information to the audio playing device when a user wants to collect an audio paragraph of interest of the user, for example, "help me collect this paragraph", and triggering the audio playing device to semantically convert the voice information. The audio playing device monitors a voice collecting instruction, and generates an audio paragraph collecting request according to the voice collecting instruction, wherein the audio paragraph collecting request is used for triggering the server to carry out corresponding collecting operation according to the request.
The voice information of the user is converted into a collection voice instruction by triggering the audio playing device, and an ASR (Automatic Speech Recognition) technology is adopted to convert the vocabulary content in the voice information of the user into computer readable input, such as a key, a binary code or a character sequence, so as to facilitate Recognition and information communication, thereby triggering the collection voice instruction. In addition, the audio playing device comprises a smart sound box, a smart television and the like.
In one embodiment, an audio paragraph collection request of a smart sound box is received; the audio paragraph collection request is a request generated according to the collection voice instruction after the intelligent sound box monitors the collection voice instruction. According to the embodiment, the audio paragraph collection request of the intelligent sound box is received, so that the corresponding collection operation can be executed according to the request, convenience and rapidness are realized, and the technical effect of collecting paragraph data is achieved.
Step S102, the audio paragraph collection request is analyzed to obtain the audio paragraph information to be collected, and corresponding paragraph data is obtained according to the sentence forming labels corresponding to the audio paragraph information to be collected.
In this step, the audio paragraph information to be stored includes an audio address, an audio name, an audio author, an audio tag, and the like. The audio paragraph information to be collected can be songs, novels, poems, lectures and the like. The phrase tag refers to identification information of the phrase audio, such as the lyrics of the first phrase of a certain song.
In one embodiment, the server is pre-configured with corresponding sentence-forming tags for audio resources, and the audio resources and the corresponding sentence-forming tags are stored in the data source. The sentence label processing is mainly through a voice recognition technology, namely, the sentence audio in the audio file is labeled, so that the corresponding paragraph data can be conveniently acquired according to the sentence label corresponding to the paragraph information to be collected. For example, the audio content is: the intelligent sound box can realize the collection function, has other strong functions and is worthy of being affirmed and trusted by users. Moreover, under the vigorous marketing of intelligent sound boxes, the sound boxes are believed to be capable of obtaining good market effects. "for the audio content, the server will identify and identify the punctuation locations of the corresponding audio. In addition, the natural pause of language mood in audio frequency does not affect the mark of punctuation mark position, the main application is the existing semantic recognition technology, according to the current sentence content and the extensive Chinese word library, automatically synthesizing into the form of sentence, mainly according to the common format of Chinese, such as subject + verb; preposition + verb + adjective and other natural formats. The method and the device have the advantages that three factors of the Chinese common format, the pause of language mood in the audio and the punctuation mark position of the audio are combined, the audio of the sentence in the audio file is labeled, and the accuracy of configuring corresponding sentence labels for audio resources can be improved.
In another embodiment, the server may further determine whether the preceding and following sentences are a complete sentence according to a time limit of a pause between the preceding and following sentences in the audio file. Of course, the question and the like can be judged, and the method of converting the text by voice input in science fiction may be specifically referred to.
No matter which method is adopted to label the sentence-forming audio in the audio file, a set of preset symbol rules is adopted, so that the accuracy of configuring corresponding sentence-forming labels for the audio resources is further improved.
In one embodiment, the audio paragraph information to be collected is analyzed to obtain a corresponding sentence formation tag, for example, the first sentence of the song lyrics, and the corresponding data source is queried according to the sentence formation tag, so as to obtain the corresponding paragraph data. Namely, according to the sentence forming label corresponding to the audio paragraph information to be collected, the corresponding paragraph data can be conveniently searched, and the collection of the paragraph data in the later process is facilitated.
And step S103, collecting the paragraph data.
In one embodiment, paragraph data may be collected in a specified format, and paragraph data may also be collected in a specified collection.
The audio paragraph collection method in the above embodiment receives an audio paragraph collection request of an audio playing device, analyzes the audio paragraph collection request to obtain audio paragraph information to be collected, obtains corresponding paragraph data according to a sentence forming tag corresponding to the audio paragraph information to be collected, and collects the paragraph data. The method collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
In an embodiment, in the step S102, the step of obtaining corresponding paragraph data according to the sentence forming tag corresponding to the to-be-stored audio paragraph information includes: determining a sentence forming label corresponding to the audio paragraph information to be stored; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag. For example, the song played by the current audio playing device is the book of heaven's words to mom, the user wants to collect paragraph data, namely, the book of heaven's words to mom, but is injured and can be protected as soon as possible, a sentence label (for example, the words of the first sentence in the words of the book of heaven's words) corresponding to the paragraph information of the audio to be collected is determined according to the paragraph information of the audio to be collected, a corresponding data source is searched according to the sentence label, paragraph data corresponding to the sentence label is obtained, and the paragraph data is the paragraph data corresponding to the paragraph information of the audio to be collected.
Of course, the audio paragraph information to be stored is not necessarily audio played by the current audio playing device, but may also be audio played by the audio playing device, such as the previous song; or the audio to be played by the audio playing device, such as the next song; or, not the audio in the audio playback device playlist (i.e., there is no audio in the audio playback device playlist).
In the embodiment, the sentence forming tag corresponding to the audio paragraph information to be collected is determined, and then the data source is searched according to the sentence forming tag corresponding to the audio paragraph information to be collected, so that the paragraph data corresponding to the audio paragraph information to be collected is obtained, the method and the device are convenient and quick, the accuracy of obtaining the corresponding paragraph data is achieved, and meanwhile, a basis is provided for subsequently collecting the paragraph data.
In another embodiment, in the step S102, the step of obtaining corresponding paragraph data according to the sentence forming tag corresponding to the to-be-stored audio paragraph information includes: determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which audio paragraph information to be collected belongs, for example, acquiring a song file corresponding to a certain sentence of lyrics from a cloud, acquiring a plurality of paragraph data according to the audio file, and generating sentence forming tags corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file. For example, if the user wants to get the lyrics "listen to mother, she is injured and wants to grow fast and fast, she can protect the" song file "listen to mother" to which she belongs, and get a plurality of sentence-forming lyric paragraphs according to the song file, and mark the sentence-forming lyric paragraphs with sentence-forming labels. And searching for a plurality of sentence-forming lyric paragraphs and corresponding sentence-forming labels contained in the song file according to the sentence-forming labels corresponding to the audio paragraph information to be stored, and obtaining paragraph data corresponding to the audio paragraph information to be stored according to the matching relationship between the sentence-forming labels corresponding to the audio paragraph information to be stored and the sentence-forming lyric paragraphs.
In the embodiment, before the audio file to which the audio paragraph information to be collected belongs is acquired, whether paragraph data corresponding to the audio paragraph information to be collected exists in the data source or not is directly generated according to the acquired audio file, the accuracy of the paragraph data corresponding to the audio paragraph information to be collected can be ensured, and the defect that the audio file to which the audio paragraph information to be collected belongs needs to be acquired when the paragraph data corresponding to the audio paragraph information to be collected cannot be searched in the data source is avoided. And meanwhile, according to the generated plurality of paragraph data and the sentence forming labels corresponding to the plurality of paragraph data, the paragraph data corresponding to the audio paragraph information to be stored is obtained, so that the method is convenient and quick, and the accuracy of obtaining the corresponding paragraph data is improved.
Further, sentence forming labels corresponding to the paragraph information of the audio to be stored are determined, the audio file to which the paragraph information of the audio to be stored belongs is obtained, a plurality of paragraph data are obtained according to the audio file, sentence forming labels corresponding to the paragraph data are generated, no specific sequence exists between the sentence forming labels and the paragraph data, and the sentence forming labels and the paragraph data can be executed independently.
Therefore, in another embodiment, the obtaining of corresponding paragraph data according to the sentence forming tag corresponding to the to-be-collected audio paragraph information may be implemented as follows: acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; determining a sentence forming label corresponding to the audio paragraph information to be stored; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file. According to the embodiment, the corresponding paragraph data is obtained according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file, so that the method and the device are convenient and quick, and the accuracy of obtaining the corresponding paragraph data is improved.
In another embodiment, the obtaining of the corresponding paragraph data according to the sentence forming tag corresponding to the paragraph information of the audio to be collected may also be implemented by: meanwhile, determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which the to-be-collected audio paragraph information belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file. According to the embodiment, the corresponding paragraph data is obtained according to the sentence forming label corresponding to the paragraph information of the audio to be collected, the method and the device are convenient and quick, and the accuracy of obtaining the corresponding paragraph data is improved.
In one embodiment, after the step of querying the pre-established data source according to the sentence tag, the method further includes: if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file. In the embodiment, when it is determined that there is no paragraph data corresponding to the to-be-stored audio paragraph information in the data source, the paragraph data corresponding to the to-be-stored audio paragraph information is obtained according to the generated multiple paragraph data and the sentence forming tags corresponding to the multiple paragraph data, so that the accuracy of obtaining the corresponding paragraph data can be improved, the defect that the corresponding paragraph data cannot be obtained if there is no paragraph data corresponding to the to-be-stored audio paragraph information in the data source is avoided, and a basis is provided for subsequently collecting the paragraph data.
It should be noted that, in the above embodiments, obtaining the audio file to which the audio paragraph information to be collected belongs may be implemented in various ways, and may be directly obtained from a cloud (e.g., the internet), may also be obtained from a local collection information library, may also be obtained from other servers, and the like.
In one embodiment, after the step of obtaining an audio file to which the to-be-stored audio paragraph information belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming tags corresponding to the plurality of paragraph data, the method further includes: and storing the obtained paragraph data and sentence forming labels corresponding to the paragraph data into the data source. In the embodiment, the obtained paragraph data and the sentence forming tags corresponding to the paragraph data are stored in the data source, and the data source is updated, so that the defect that the corresponding paragraph data cannot be obtained because the paragraph data corresponding to the audio paragraph information to be collected is not stored in advance can be avoided, and meanwhile, the accuracy of subsequently obtaining the paragraph data corresponding to the audio paragraph information to be collected is ensured.
Further, for a data source, after a certain time, for example, 1 month, an audio file on the internet is re-acquired, a plurality of paragraph data are obtained according to the audio file, and sentence formation tags corresponding to the plurality of paragraph data are generated, and for the same audio file, the re-acquired plurality of paragraph data and the sentence formation tags corresponding to the plurality of paragraph data are used to cover the original plurality of paragraph data corresponding to each audio file and the sentence formation tags corresponding to the plurality of paragraph data. And further updating the data source according to the actual situation. According to the embodiment, the data source is updated in time within the set time, so that the accuracy and timeliness of obtaining the paragraph data corresponding to the to-be-stored audio paragraph information can be improved, and the situation that the corresponding paragraph data cannot be obtained in time is avoided.
In one embodiment, the step of collecting the paragraph data includes: determining a collection format corresponding to the paragraph data, such as an audio format and a text format, and collecting the paragraph data according to the corresponding collection format. In the above embodiment, the paragraph data is collected according to the corresponding collection format, so that not only the paragraph data can be collected, but also the paragraph data can be collected according to the specified format, and the defect that the audio paragraph cannot be collected in the conventional technology is overcome.
In an embodiment, in the step S102, the step of analyzing the audio paragraph collection request to obtain the information of the audio paragraph to be collected includes: and analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof, such as song type, lecture type, recitation type and the like. The data type refers to the specific type of the paragraph information to be collected, for example, a piece of lyric belongs to the song type, and an ancient poem belongs to the reciting type. In the embodiment, the data type corresponding to the paragraph information to be collected is obtained by analyzing the request, and a basis is provided for determining the collection format corresponding to the paragraph data according to the data type.
And determining a collection format corresponding to the paragraph data according to the obtained data type. In one embodiment, the step of determining the collection format corresponding to the paragraph data includes: and determining a collection format corresponding to the paragraph data according to the data type. Specifically, the specific type of the to-be-collected paragraph information and the collection format corresponding to the specific type are preset, for example, the to-be-collected paragraph information belonging to the song type and the collection format of the corresponding paragraph data are audio formats. Of course, a paragraph data may have a variety of collection formats. By identifying the specific type of the paragraph information to be collected, the collection format corresponding to the paragraph data can be quickly determined.
In one embodiment, after semantic conversion, a collection voice instruction is triggered according to voice information sent by a user, for example, "help me collect this paragraph", and the collection voice instruction carries information of the paragraph to be collected and its collection format (the collection format is determined by characteristic information of vocabulary content in the voice information and can be preset); the audio playing device determines an audio paragraph collection request according to the collection voice instruction and sends the audio paragraph collection request to a corresponding server; the audio paragraph collection request comprises paragraph information to be collected and a collection format thereof; the server analyzes the audio paragraph collection request to obtain the information of the paragraphs to be collected and the collection format thereof, and the collection format corresponding to the paragraph data can be determined according to the collection format. In short, according to the voice information sent by the user, the collection format corresponding to the paragraph data can be determined. The specific voice information sent by the user can be predefined, and the collection format of the triggered corresponding paragraph data, for example, the voice "help me collect this paragraph of characters", "help me collect this paragraph of words", "help me write the just-spoken prescription" and the like sent by the user are the paragraph data which triggers to save the text format; the words of 'help me collect the just music', 'help me collect the just article', 'help me record the words of the exercise in the past', and the like are paragraph data which trigger the storage of the audio format.
Furthermore, different collection formats can be determined according to different data types, and paragraph data are collected according to the corresponding collection formats.
In one embodiment, the step of collecting the paragraph data according to the corresponding collection format includes: and if the collection format is a text format, collecting the text information corresponding to the paragraph data.
In another embodiment, the step of collecting the paragraph data according to the corresponding collection format includes: and if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data.
In another embodiment, the step of collecting the paragraph data according to the corresponding collection format includes: and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In the embodiment, the collection format corresponding to the paragraph data is determined according to the data type, and the paragraph data is collected according to the corresponding collection format, so that the technical effect of collecting the paragraph data according to the specified format is achieved.
Furthermore, the collection of the text information corresponding to the paragraph data can be realized in various ways. In one embodiment, the step of collecting text information corresponding to the paragraph data includes: inquiring whether a data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information. In the embodiment, whether the text information corresponding to the paragraph data exists is judged by searching the data source, and the text information corresponding to the paragraph data is acquired in various ways, so that the text information corresponding to the paragraph data can be ensured to be acquired, and a basis is provided for subsequently collecting the acquired text information.
Wherein, after the step of converting the paragraph data into corresponding text information and collecting the converted text information, the method comprises the following steps: and storing the converted text information into a data source, updating the data source, and avoiding the situation that the text information corresponding to the paragraph data cannot be obtained in time.
It should be noted that the data source storing the text information corresponding to the paragraph data and the data source storing the multiple paragraph data and the sentence forming tags corresponding to the multiple paragraph data belong to the same data source.
In one embodiment, the step of receiving an audio paragraph collection request of an audio playing device is preceded by: receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request. The audio paragraph collection method further comprises: and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected. According to the embodiment, different collection operations are executed according to different collection requests, so that not only paragraph data but also the whole audio file can be collected.
In one embodiment, in step S101, the audio paragraph collection request includes collection address information. In the step S102, the step of analyzing the audio paragraph collection request to obtain the information of the audio paragraph to be collected includes: and analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the collection address information. For example, for the voice information "help me put this lyric into the favorite" sent by the user, "after semantic conversion, the address information of the favorite and the audio paragraph information to be stored (for example, a certain lyric in the above" listen to mom's speech ") are determined. In the step S103, the step of collecting the paragraph data includes: and collecting the paragraph data to an address corresponding to the collection address information. In the above embodiment, according to the audio paragraph information to be collected and the collection address information obtained by analyzing the instruction, a basis is provided for subsequently collecting the paragraph data to the specified address.
In addition, if the collection address information carried in the voice sent by the user comprises a multilayer address, for example,// total favorite/music/favorite lyrics, semantically converting the voice information, analyzing a corresponding audio paragraph collection request, determining the name of the favorite lyrics and the corresponding collection address information thereof, and collecting the paragraph data to the corresponding collection address.
In one embodiment, the audio paragraph collection method further comprises: receiving an audio paragraph deletion request of an audio playing device, for example, the audio playing device can help me to delete the lyrics aiming at the 'elegant' voice information sent by a user, and triggering the corresponding audio paragraph deletion request after semantic conversion; and analyzing the audio paragraph deletion request to obtain audio paragraph information to be deleted, obtaining corresponding paragraph data according to the sentence forming label corresponding to the audio paragraph information to be deleted, and deleting the paragraph data. Meanwhile, the audio paragraph deletion request carries paragraph data already collected by the server. In the above embodiment, the corresponding paragraph data is deleted through the audio paragraph deletion request, that is, the paragraph data in the favorite corresponding to the user personal account is controlled and sorted through the audio playing device.
In one embodiment, the audio paragraph collection method further comprises: judging whether an audio paragraph collecting request of the audio playing device is received, if the audio paragraph collecting request cannot be received, re-receiving the audio paragraph collecting request of the audio playing device according to a preset frequency until the audio paragraph collecting request of the audio playing device is received. In the above embodiment, the audio paragraph collection request of the audio playing device is received again according to the preset frequency, which provides a basis for obtaining the paragraph data corresponding to the audio paragraph information to be collected according to the audio paragraph collection request later.
In one embodiment, an account associated with the audio playing device may be logged in through a mobile phone APP, collection result information sent by a corresponding server, such as paragraph data in a favorite, is obtained, and the collection result information is displayed. According to the embodiment, the paragraph data collected by the server can be synchronized to the favorite corresponding to the account number associated with the audio playing device, the operation is convenient and quick, and the technical effect of synchronous collection of the server and the client is achieved.
From the perspective of an audio playing device, the invention also provides another audio paragraph collection method.
FIG. 4 is a schematic flow chart diagram of an audio paragraph collection method of another embodiment; as shown in fig. 4, the audio paragraph collection method in this embodiment includes the following steps:
step S401, a collection voice command is monitored, and an audio paragraph collection request is generated according to the collection voice command.
Step S402, sending the audio paragraph collection request to a corresponding server; the audio paragraph collection request is used for instructing the server to analyze the audio paragraph collection request to obtain audio paragraph information to be collected, and corresponding paragraph data is obtained according to a sentence forming label corresponding to the audio paragraph information to be collected; and collecting the paragraph data.
According to the audio paragraph collection method, after the voice collection instruction is monitored, the audio paragraph data are collected through the server, the method is convenient and fast, and the defect that the audio paragraphs cannot be collected in the traditional technology is overcome.
From the perspective of interaction between the audio playing device and the corresponding server, the invention also provides another audio paragraph collection method.
FIG. 5 is a schematic flow chart diagram of an audio paragraph collection method of yet another embodiment; as shown in fig. 5, the audio paragraph collection method in this embodiment includes the following steps:
step S501, the audio playing device monitors a collection voice command, generates an audio paragraph collection request according to the collection voice command, and sends the audio paragraph collection request to a corresponding server.
Step S502, the server analyzes the audio paragraph collection request to obtain the audio paragraph information to be collected, and obtains corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; and collecting the paragraph data.
In the embodiment, the audio playing device monitors the collection voice command, generates the audio paragraph collection request, and sends the audio paragraph collection request to the corresponding server; analyzing the audio paragraph collection request through a server to obtain audio paragraph information to be collected, and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; and collecting the paragraph data. The audio paragraph data is collected through the voice command, the operation is convenient and fast, and the defect that the audio paragraph cannot be collected in the traditional technology is overcome.
In an embodiment, based on the same principle as that of the server collecting paragraph data, the present invention further provides another audio paragraph collecting method, where the audio paragraph collecting method in this embodiment includes the following steps: the audio playing device monitors a voice collecting instruction; determining audio paragraph information to be collected according to the voice collecting instruction, and obtaining corresponding paragraph data according to a sentence forming label corresponding to the audio paragraph information to be collected; and collecting the paragraph data. According to the embodiment, the paragraph data is collected through the voice command, the convenience and the rapidness are realized, the defect that the audio paragraphs cannot be collected in the traditional technology is overcome, and meanwhile, the technical effect of collecting the paragraph data through the audio playing equipment is achieved.
In one embodiment, after the step of monitoring a collection voice instruction and determining information of an audio paragraph to be collected according to the collection voice instruction, the audio playing device further includes: the audio playing device determines a collection mode according to the collection voice instruction, if the collection mode is local collection, corresponding paragraph data is obtained according to a sentence forming label corresponding to the to-be-collected audio paragraph information, and the paragraph data is collected; and if the collection mode is server collection, generating an audio paragraph collection request according to the audio paragraph information to be collected and the collection mode, and sending the audio paragraph collection request to a corresponding server, wherein the audio paragraph collection request is used for indicating the server to obtain corresponding paragraph data according to the sentence forming tags corresponding to the audio paragraph information to be collected, and collecting the paragraph data. According to the embodiment, the corresponding collection operation is executed according to the collection mode, and the technical effect of collecting the audio paragraph data in different modes is achieved.
It should be understood that although the various steps in the flow charts of fig. 3-5 are shown in order as indicated by the arrows, the steps are not necessarily performed in order as indicated by the arrows. The steps are not performed in the exact order shown and described, and may be performed in other orders, unless explicitly stated otherwise. Moreover, at least some of the steps in fig. 3-5 may include multiple sub-steps or multiple stages that are not necessarily performed at the same time, but may be performed at different times, and the order of performance of the sub-steps or stages is not necessarily sequential, but may be performed in turn or alternating with other steps or at least some of the sub-steps or stages of other steps.
In one embodiment, as shown in fig. 6, there is provided an audio paragraph stowage arrangement comprising: a request receiving module 610, a request parsing module 620 and a paragraph collection module 630, wherein:
a request receiving module 610, configured to receive an audio paragraph collection request of an audio playing device;
and the request analysis module 620 is configured to analyze the audio paragraph collection request to obtain the audio paragraph information to be collected, and obtain corresponding paragraph data according to the sentence forming tag corresponding to the audio paragraph information to be collected.
A paragraph collection module 630, configured to collect the paragraph data.
In one embodiment, the audio paragraph collection request is a request generated according to a collection voice instruction after the audio playing device monitors the collection voice instruction.
In an embodiment, the request parsing module 620 may be configured to determine a sentence forming tag corresponding to the to-be-collected audio paragraph information; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag.
In an embodiment, the request parsing module 620 may be configured to determine a sentence forming tag corresponding to the to-be-collected audio paragraph information; acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In an embodiment, the request parsing module 620 may be further configured to, if there is no paragraph data corresponding to the to-be-stored audio paragraph information in the data source, obtain an audio file to which the to-be-stored audio paragraph information belongs, obtain a plurality of paragraph data according to the audio file, and generate sentence forming tags corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In an embodiment, the paragraph collection module 630 may be configured to determine a collection format corresponding to the paragraph data, and collect the paragraph data according to the corresponding collection format.
In an embodiment, the audio paragraph collection apparatus further includes a request determining module, configured to receive a collection request of an audio playing device, and determine whether the collection request is an audio file collection request.
In an embodiment, the audio paragraph collection device further includes an audio file collection module, configured to, if the request determination module determines that the collection request is an audio file collection request, parse the audio file collection request to obtain the information of the audio file to be collected, and collect the information of the audio file to be collected.
In an embodiment, the request parsing module 620 may be configured to parse the audio paragraph collection request to obtain the information of the audio paragraph to be collected and the data type thereof. The paragraph collection module 630 may be further configured to determine a collection format corresponding to the paragraph data according to the data type.
In an embodiment, the paragraph collection module 630 may be further configured to collect text information corresponding to the paragraph data if the collection format is a text format; if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data; and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In an embodiment, the paragraph collection module 630 is further configured to query whether a data source has text information corresponding to the paragraph data, and if so, obtain the text information corresponding to the paragraph data from the data source and collect the obtained text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
In the above embodiments, the audio paragraph collection device receives an audio paragraph collection request of the audio playing device, analyzes the audio paragraph collection request to obtain the audio paragraph information to be collected, obtains the corresponding paragraph data according to the sentence forming tag corresponding to the audio paragraph information to be collected, and finally collects the paragraph data. The device collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
For specific limitations of the audio paragraph collection device, reference may be made to the above limitations of the audio paragraph collection method, which will not be described herein again. The modules in the audio paragraph collection device can be wholly or partially implemented by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
With reference to the foregoing embodiments, the present invention further provides an audio paragraph collection system.
In one embodiment, as shown in FIG. 1, an audio passage collection system is illustrated, the system comprising: audio playback device 110 and server 120, wherein:
the audio playing device 110 is configured to monitor a collection voice instruction, generate an audio paragraph collection request according to the collection voice instruction, and send the audio paragraph collection request to the corresponding server 120. The server 120 is configured to parse the audio paragraph collection request to obtain the to-be-collected audio paragraph information, obtain corresponding paragraph data according to the sentence forming tag corresponding to the to-be-collected audio paragraph information, and collect the paragraph data.
In one embodiment, the server 120 is further configured to: determining a sentence forming label corresponding to the audio paragraph information to be stored; and inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information. The data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag.
In one embodiment, the server 120 is further operable to: determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the server 120 is further operable to: if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file. And storing the obtained paragraph data and sentence forming labels corresponding to the paragraph data into the data source.
In one embodiment, the server 120 is further configured to: and determining a collection format corresponding to the paragraph data, and collecting the paragraph data according to the corresponding collection format.
In one embodiment, the server 120 is further configured to: and analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof, and determining the collection format corresponding to the paragraph data according to the data type.
In one embodiment, the server 120 is further configured to: if the collection format is a text format, collecting the text information corresponding to the paragraph data; if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data; and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In one embodiment, the server 120 is further configured to: inquiring whether a data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
In one embodiment, the server 120 is further configured to: receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request or not; and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected.
In one embodiment, the server 120 is further configured to: after the paragraph data is collected, sending collection result information to a corresponding client to display the collection result information on the client; the client is a client associated with the audio playing device. For example, an account associated with the audio playing device is logged in through the mobile phone APP, collection result information sent by the corresponding server is obtained, and the collection result information, such as collection records in the cloud collection database, is displayed.
In the above embodiments, the audio paragraph collection system first monitors a collected voice instruction, generates an audio paragraph collection request according to the collected voice instruction, analyzes the audio paragraph collection request to obtain the audio paragraph information to be collected, obtains corresponding paragraph data according to the sentence formation tag corresponding to the audio paragraph information to be collected, and finally collects the paragraph data. The system collects the audio paragraph data through the voice command, is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
For specific definitions of the audio paragraph collection system, reference may be made to the above definitions of the audio paragraph collection method, which is not described herein again. The modules in the audio paragraph collection system can be implemented in whole or in part by software, hardware and a combination thereof. The modules can be embedded in a hardware form or independent from a processor in the computer device, and can also be stored in a memory in the computer device in a software form, so that the processor can call and execute operations corresponding to the modules.
In one embodiment, a computer device is provided, comprising a memory and a processor, the memory having a computer program stored therein, the processor implementing the following steps when executing the computer program: receiving an audio paragraph collection request of an audio playing device; analyzing the audio paragraph collection request to obtain audio paragraph information to be collected, and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; and collecting the paragraph data.
In one embodiment, the processor, when executing the computer program, further performs the steps of: determining a sentence forming label corresponding to the audio paragraph information to be stored; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag.
In one embodiment, the processor, when executing the computer program, further performs the steps of: determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the processor, when executing the computer program, further performs the steps of: if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the processor, when executing the computer program, further performs the steps of: and determining a collection format corresponding to the paragraph data, and collecting the paragraph data according to the corresponding collection format.
In one embodiment, the processor, when executing the computer program, further performs the steps of: receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request or not; and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected.
In one embodiment, the processor, when executing the computer program, further performs the steps of: analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof; and determining a collection format corresponding to the paragraph data according to the data type.
In one embodiment, the processor, when executing the computer program, further performs the steps of: and if the collection format is a text format, collecting the text information corresponding to the paragraph data.
In one embodiment, the processor, when executing the computer program, further performs the steps of: and if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data.
In one embodiment, the processor, when executing the computer program, further performs the steps of: and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In one embodiment, the processor, when executing the computer program, further performs the steps of: inquiring whether the data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
In the embodiments, the audio paragraph collection method is realized when the processor executes the computer program, and the audio paragraph data is collected through the voice command, so that the method is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
In one embodiment, a computer-readable storage medium is provided, having a computer program stored thereon, which when executed by a processor, performs the steps of: receiving an audio paragraph collection request of an audio playing device; analyzing the audio paragraph collection request to obtain audio paragraph information to be collected, and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; and collecting the paragraph data.
In one embodiment, the computer program when executed by the processor further performs the steps of: determining a sentence forming label corresponding to the audio paragraph information to be stored; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag.
In one embodiment, the computer program when executed by the processor further performs the steps of: determining a sentence forming label corresponding to the audio paragraph information to be stored; acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the computer program when executed by the processor further performs the steps of: if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
In one embodiment, the computer program when executed by the processor further performs the steps of: and determining a collection format corresponding to the paragraph data, and collecting the paragraph data according to the corresponding collection format.
In one embodiment, the computer program when executed by the processor further performs the steps of: receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request or not; and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected.
In one embodiment, the computer program when executed by the processor further performs the steps of: analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof; and determining a collection format corresponding to the paragraph data according to the data type.
In one embodiment, the computer program when executed by the processor further performs the steps of: and if the collection format is a text format, collecting the text information corresponding to the paragraph data.
In one embodiment, the computer program when executed by the processor further performs the steps of: and if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data.
In one embodiment, the computer program when executed by the processor further performs the steps of: and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
In one embodiment, the computer program when executed by the processor further performs the steps of: inquiring whether the data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
In the embodiments, the computer program is executed by the processor to realize the audio paragraph collection method, and the audio paragraph data is collected through the voice instruction, so that the method is convenient and quick, and overcomes the defect that the audio paragraph cannot be collected in the traditional technology.
It will be understood by those skilled in the art that all or part of the processes of the methods of the embodiments described above can be implemented by hardware instructions of a computer program, which can be stored in a non-volatile computer-readable storage medium, and when executed, can include the processes of the embodiments of the methods described above. Any reference to memory, storage, database, or other medium used in the embodiments provided herein may include non-volatile and/or volatile memory, among others. Non-volatile memory can include read-only memory (ROM), Programmable ROM (PROM), Electrically Programmable ROM (EPROM), Electrically Erasable Programmable ROM (EEPROM), or flash memory. Volatile memory can include Random Access Memory (RAM) or external cache memory. By way of illustration and not limitation, RAM is available in a variety of forms such as Static RAM (SRAM), Dynamic RAM (DRAM), Synchronous DRAM (SDRAM), Double Data Rate SDRAM (DDRSDRAM), Enhanced SDRAM (ESDRAM), Synchronous Link DRAM (SLDRAM), Rambus Direct RAM (RDRAM), direct bus dynamic RAM (DRDRAM), and memory bus dynamic RAM (RDRAM).
The technical features of the above embodiments can be arbitrarily combined, and for the sake of brevity, all possible combinations of the technical features in the above embodiments are not described, but should be considered as the scope of the present specification as long as there is no contradiction between the combinations of the technical features.
The above-mentioned embodiments only express several embodiments of the present application, and the description thereof is more specific and detailed, but not construed as limiting the scope of the invention. It should be noted that, for a person skilled in the art, several variations and modifications can be made without departing from the concept of the present application, which falls within the scope of protection of the present application. Therefore, the protection scope of the present patent shall be subject to the appended claims.

Claims (13)

1. An audio paragraph collection method, comprising the steps of:
receiving an audio paragraph collection request of an audio playing device; analyzing the audio paragraph collection request to obtain audio paragraph information to be collected, and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected;
collecting the paragraph data;
the step of obtaining corresponding paragraph data according to the sentence forming label corresponding to the paragraph information of the audio to be stored comprises the following steps:
determining a sentence forming label corresponding to the audio paragraph information to be collected;
inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag; and the sentence label is obtained by labeling the audio frequency of the sentence in the audio file through a voice recognition technology.
2. The audio paragraph collection method according to claim 1, wherein the step of obtaining corresponding paragraph data according to the sentence formation tag corresponding to the audio paragraph information to be collected comprises:
determining a sentence forming label corresponding to the audio paragraph information to be stored;
acquiring an audio file to which audio paragraph information to be collected belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the paragraph data;
and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
3. The audio paragraph collection method according to claim 2, wherein the audio paragraph collection request is a request generated by an audio playing device according to a collection voice instruction after monitoring the collection voice instruction;
and/or the presence of a gas in the gas,
after the step of querying a pre-established data source according to the sentence forming tag, the method further comprises the following steps:
if the data source does not have paragraph data corresponding to the paragraph information of the audio to be stored, acquiring an audio file to which the paragraph information of the audio to be stored belongs, obtaining a plurality of paragraph data according to the audio file, and generating sentence forming labels corresponding to the plurality of paragraph data; and obtaining paragraph data corresponding to the paragraph information of the audio to be stored according to the sentence forming label corresponding to the paragraph information of the audio to be stored and the sentence forming label contained in the audio file.
4. The audio paragraph collection method according to any one of claims 1 to 3, wherein the step of collecting the paragraph data comprises:
determining a collection format corresponding to the paragraph data, and collecting the paragraph data according to the corresponding collection format;
and/or the presence of a gas in the gas,
the step of receiving an audio paragraph collection request of an audio playing device comprises:
receiving a collection request of an audio playing device, and judging whether the collection request is an audio file collection request or not;
the audio paragraph collection method further comprises:
and if the collection request is judged to be the audio file collection request, analyzing the audio file collection request to obtain the information of the audio file to be collected, and collecting the information of the audio file to be collected.
5. The method according to claim 4, wherein the step of parsing the audio paragraph collection request to obtain the audio paragraph information to be collected comprises:
analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and the data type thereof;
the step of determining the collection format corresponding to the paragraph data includes:
and determining a collection format corresponding to the paragraph data according to the data type.
6. The audio paragraph collection method according to claim 4, wherein the step of collecting the paragraph data according to the corresponding collection format comprises:
if the collection format is a text format, collecting the text information corresponding to the paragraph data;
and/or the presence of a gas in the gas,
if the collection format is an audio format, collecting the audio paragraphs corresponding to the paragraph data;
and/or the presence of a gas in the gas,
and if the collection format is a text and audio format, collecting the text information corresponding to the paragraph data and the corresponding audio paragraph.
7. The method of claim 6, wherein the step of collecting the text information corresponding to the paragraph data comprises:
inquiring whether the data source has text information corresponding to the paragraph data, if so, acquiring the text information corresponding to the paragraph data from the data source, and collecting the acquired text information; if not, converting the paragraph data into corresponding text information, and collecting the converted text information.
8. An audio paragraph collection method, comprising the steps of:
the audio playing device monitors a collection voice instruction, generates an audio paragraph collection request according to the collection voice instruction, and sends the audio paragraph collection request to a corresponding server;
the server analyzes the audio paragraph collection request to obtain audio paragraph information to be collected, and corresponding paragraph data are obtained according to sentence forming labels corresponding to the audio paragraph information to be collected; collecting the paragraph data;
the server is also used for determining a sentence forming label corresponding to the to-be-collected audio paragraph information; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag; and the sentence label is obtained by labeling the audio frequency of the sentence in the audio file through a voice recognition technology.
9. An audio paragraph stowage apparatus, characterized in that the apparatus comprises:
the request receiving module is used for receiving an audio paragraph collection request of the audio playing device; the audio paragraph collection request is a request generated by the audio playing device according to a collection voice instruction after monitoring the collection voice instruction;
the request analysis module is used for analyzing the audio paragraph collection request to obtain the audio paragraph information to be collected and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected;
the paragraph collection module is used for collecting the paragraph data;
the request analysis module is also used for determining a sentence forming label corresponding to the to-be-collected audio paragraph information; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag; and the sentence label is obtained by labeling the audio frequency of the sentence in the audio file through a voice recognition technology.
10. An audio paragraph collection system, the system comprising: the system comprises an audio playing device and a server;
the audio playing device is used for monitoring a collection voice instruction, generating an audio paragraph collection request according to the collection voice instruction, and sending the audio paragraph collection request to a corresponding server;
the server is used for analyzing the audio paragraph collection request to obtain audio paragraph information to be collected and obtaining corresponding paragraph data according to the sentence forming labels corresponding to the audio paragraph information to be collected; collecting the paragraph data;
the server is also used for determining a sentence forming label corresponding to the to-be-collected audio paragraph information; inquiring a pre-established data source according to the sentence forming label, thereby obtaining paragraph data corresponding to the to-be-stored audio paragraph information; the data source stores a plurality of paragraph data, and each paragraph data corresponds to a sentence forming tag; and the sentence label is obtained by labeling the audio frequency of the sentence in the audio file through a voice recognition technology.
11. The audio paragraph collection system of claim 10, wherein the server is further configured to send collection result information to the corresponding client after collecting the paragraph data, so as to display the collection result information at the client; the client is a client associated with the audio playing device.
12. A computer device comprising a memory, a processor and a computer program stored on the memory and executable on the processor, characterized in that the processor, when executing the program, carries out the steps of the audio paragraph collection method according to any one of claims 1 to 8.
13. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the steps of the audio paragraph collection method according to any one of claims 1 to 8.
CN201810184584.0A 2018-03-06 2018-03-06 Audio paragraph collection method, device and system and computer equipment Active CN108595470B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810184584.0A CN108595470B (en) 2018-03-06 2018-03-06 Audio paragraph collection method, device and system and computer equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810184584.0A CN108595470B (en) 2018-03-06 2018-03-06 Audio paragraph collection method, device and system and computer equipment

Publications (2)

Publication Number Publication Date
CN108595470A CN108595470A (en) 2018-09-28
CN108595470B true CN108595470B (en) 2020-11-06

Family

ID=63625782

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810184584.0A Active CN108595470B (en) 2018-03-06 2018-03-06 Audio paragraph collection method, device and system and computer equipment

Country Status (1)

Country Link
CN (1) CN108595470B (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114168764B (en) * 2021-11-04 2024-05-17 海南视联通信技术有限公司 Multimedia data processing method and device

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070256008A1 (en) * 2006-04-26 2007-11-01 Bedingfield James C Sr Methods, systems, and computer program products for managing audio information
CN104853251A (en) * 2015-04-30 2015-08-19 北京奇艺世纪科技有限公司 Online collection method and device for multimedia data
CN105260382A (en) * 2015-09-09 2016-01-20 百度在线网络技术(北京)有限公司 Searching result processing method and searching result processing system
CN106294547A (en) * 2016-07-25 2017-01-04 惠州Tcl移动通信有限公司 A kind of method and system intercepting audio file
CN106484856A (en) * 2016-10-09 2017-03-08 北京小米移动软件有限公司 Audio frequency playing method and device
CN106847315A (en) * 2017-01-24 2017-06-13 广州朗锐数字传媒科技有限公司 A kind of talking book synchronous methods of exhibiting sentence by sentence

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070256008A1 (en) * 2006-04-26 2007-11-01 Bedingfield James C Sr Methods, systems, and computer program products for managing audio information
CN104853251A (en) * 2015-04-30 2015-08-19 北京奇艺世纪科技有限公司 Online collection method and device for multimedia data
CN105260382A (en) * 2015-09-09 2016-01-20 百度在线网络技术(北京)有限公司 Searching result processing method and searching result processing system
CN106294547A (en) * 2016-07-25 2017-01-04 惠州Tcl移动通信有限公司 A kind of method and system intercepting audio file
CN106484856A (en) * 2016-10-09 2017-03-08 北京小米移动软件有限公司 Audio frequency playing method and device
CN106847315A (en) * 2017-01-24 2017-06-13 广州朗锐数字传媒科技有限公司 A kind of talking book synchronous methods of exhibiting sentence by sentence

Also Published As

Publication number Publication date
CN108595470A (en) 2018-09-28

Similar Documents

Publication Publication Date Title
US20220214775A1 (en) Method for extracting salient dialog usage from live data
US10643610B2 (en) Voice interaction based method and apparatus for generating multimedia playlist
JP7335062B2 (en) Voice service providing method and apparatus
US8521766B1 (en) Systems and methods for providing information discovery and retrieval
WO2018045646A1 (en) Artificial intelligence-based method and device for human-machine interaction
US20080177536A1 (en) A/v content editing
CN107943877B (en) Method and device for generating multimedia content to be played
US20090327272A1 (en) Method and System for Searching Multiple Data Types
US10108698B2 (en) Common data repository for improving transactional efficiencies of user interactions with a computing device
JP2018504727A (en) Reference document recommendation method and apparatus
US8825661B2 (en) Systems and methods for two stream indexing of audio content
CN112395420A (en) Video content retrieval method and device, computer equipment and storage medium
US20140324858A1 (en) Information processing apparatus, keyword registration method, and program
CN110717337A (en) Information processing method, device, computing equipment and storage medium
US10255321B2 (en) Interactive system, server and control method thereof
CN109857901B (en) Information display method and device, and method and device for information search
US11693900B2 (en) Method and system for providing resegmented audio content
CN111159546A (en) Event pushing method and device, computer readable storage medium and computer equipment
CN114328996A (en) Method and device for publishing information
CN107145509B (en) Information searching method and equipment thereof
CN107844587B (en) Method and apparatus for updating multimedia playlist
CN111723235B (en) Music content identification method, device and equipment
US11437038B2 (en) Recognition and restructuring of previously presented materials
CN108595470B (en) Audio paragraph collection method, device and system and computer equipment
CN113407775A (en) Video searching method and device and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right

Effective date of registration: 20190118

Address after: 100085 East District, Second Floor, 33 Xiaoying West Road, Haidian District, Beijing

Applicant after: BEIJING KINGSOFT INTERNET SECURITY SOFTWARE Co.,Ltd.

Address before: 511400 Tian'an Science and Technology Industrial Building, Panyu Energy-saving Science Park, 555 North Panyu Avenue, Donghuan Street, Panyu District, Guangzhou City, Guangdong Province

Applicant before: GUANGZHOU LANBO INTELLIGENT TECHNOLOGY CO.,LTD.

TA01 Transfer of patent application right
GR01 Patent grant
GR01 Patent grant