CN101996195A - Searching method and device of voice information in audio files and equipment - Google Patents

Searching method and device of voice information in audio files and equipment Download PDF

Info

Publication number
CN101996195A
CN101996195A CN2009100916619A CN200910091661A CN101996195A CN 101996195 A CN101996195 A CN 101996195A CN 2009100916619 A CN2009100916619 A CN 2009100916619A CN 200910091661 A CN200910091661 A CN 200910091661A CN 101996195 A CN101996195 A CN 101996195A
Authority
CN
China
Prior art keywords
audio file
audio
correlation
degree
key words
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN2009100916619A
Other languages
Chinese (zh)
Other versions
CN101996195B (en
Inventor
薛頔
樊科
刘威
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Mobile Communications Group Co Ltd
Original Assignee
China Mobile Communications Group Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Mobile Communications Group Co Ltd filed Critical China Mobile Communications Group Co Ltd
Priority to CN2009100916619A priority Critical patent/CN101996195B/en
Publication of CN101996195A publication Critical patent/CN101996195A/en
Application granted granted Critical
Publication of CN101996195B publication Critical patent/CN101996195B/en
Expired - Fee Related legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Landscapes

  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention discloses searching method and device of voice information in audio files and equipment, which are used for realizing the text search on the contents of the audio files, improving the accuracy and the efficiency of the audio file searching and improving the usability of the audio file searching. The searching method comprises the following steps of: carrying out voice identification on each audio file comprising voice information in the audio resource base, converting the audio files into text files comprising text information, and carrying out participle processing on text information of each text file; extracting key words included by corresponding audio files according to words included by each text file, determining the relevance of key words included by each audio file, and establishing an index database of the key words through being combined with the relevance information of each audio file; carrying out specific key word matching in the index database while receiving the voice information searching request carrying the specific key words, and providing the corresponding audio files according to the relevant information of the audio files with the relevance with the specific key words.

Description

The searching method of voice messaging, device and equipment in the audio file
Technical field
The present invention relates to the audio search technical field, relate in particular to searching method, device and the equipment of voice messaging in a kind of audio file.
Background technology
Become the information age of geometric growth in quantity of information, search technique has become one of requisite gordian technique in people's work and the life, make the information that people can fast search exactly oneself to be needed from the information ocean, thereby greatly improved work and life efficient.Along with search technique reaches its maturity, it is used more and more widely, and people, increase the demand of audio search also in continuous lifting day by day to the requirement of search technique.
Existing audio search technology mainly comprises following dual mode:
Mode one, be audio file and set up label for audio file interpolation Word message by artificial in advance, the label of audio file is searched for based on special key words.This mode can't satisfy the demand of audio file being carried out full-text search according to the content of audio file.Simultaneously, because the label of audio file can't be contained the full content of audio file, and label is by artificial foundation, subjective factor is bigger, cause the accuracy of audio search low, be difficult to guarantee the integrality of Search Results, also can't accurately locate the particular location of special key words in Search Results; If the enormous amount of audio resource storehouse sound intermediate frequency file with making that the workload of manually setting up label is huge, causes expending of a large amount of human resources.
Mode two, audio file is searched for based on the audio frequency matching technique, at first need to extract the eigenwert of the frequency spectrum or the energy of audio-frequency information to be searched, extract the eigenwert of the frequency spectrum or the energy of the audio-frequency information of each audio file in the audio resource storehouse then, carry out the coupling of eigenwert at last.The audio frequency matching technique lays particular emphasis on the coupling of the eigenwert of audio frequency itself, and this mode can't satisfy the demand of audio file being carried out full-text search according to the content of audio file equally.Simultaneously, the audio-frequency information that this mode is imported search requires harshness, not only the content of the audio-frequency information of requirement input is consistent with the content of audio resource storehouse sound intermediate frequency file, but also require the frequency of audio-frequency information and the frequency and the energy of energy and audio resource storehouse sound intermediate frequency file to be close, could successfully mate, cause the efficient of audio search low, ease for use is poor.
The audio search technology that provides in the prior art does not provide the scheme of carrying out full-text search based on the content of audio file, and the accuracy of audio search is low, efficient is low, ease for use is poor.
Summary of the invention
The invention provides the searching method and the device of voice messaging in a kind of audio file,, improve the accuracy and the efficient of audio search, promote the ease for use of audio search in order to realize that the content of audio file is carried out full-text search.
Accordingly, the present invention also provides a kind of terminal device and Website server.
The invention provides the searching method of voice messaging in a kind of audio file, comprising:
To each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse, be converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;
The speech included according to each text extracts the included key word of corresponding audio files, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word, each key word of storage and the degree of correlation of each audio file and the relevant information of each audio file in the described index data base in conjunction with the relevant information of each audio file;
When receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.
The invention provides the searcher of voice messaging in a kind of audio file, comprising:
Sound identification module is used for that each comprises that the audio file of voice messaging carries out speech recognition to the audio resource storehouse, is converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;
Module set up in index, be used for extracting the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word in conjunction with the relevant information of each audio file;
Index data base is used to store the degree of correlation of each key word and each audio file and the relevant information of each audio file;
The search processing module, be used for when receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.
The invention provides a kind of terminal device, comprise the searcher of voice messaging in this audio file.
The invention provides a kind of Website server, comprise the searcher of voice messaging in this audio file.
The searching method of voice messaging, device and equipment in the audio file provided by the invention, to comprise that by speech recognition the audio file of voice messaging is converted into the text that comprises Word message, text according to the audio file correspondence is the full content of audio file, sets up the index data base of key word; When the user imports the search operation of special key words initiation voice messaging, index data base based on key word provides the audio file that has the degree of correlation with this special key words, thereby realized the content of audio file is carried out full-text search, remedied the deficiency of existing audio search technology; Because the index data base of key word is set up based on speech recognition technology, and has contained the full content of audio file, thereby has improved the accuracy of audio search, has also improved the efficient of audio search based on the search of key word; When the user initiates to search for, only need the input special key words to get final product, promoted the ease for use of audio search.
Description of drawings
The searcher block diagram of voice messaging in the audio file that Fig. 1 provides for the embodiment of the invention;
The searching method process flow diagram of voice messaging in the audio file that Fig. 2 provides for the embodiment of the invention;
The local search method process flow diagram of voice messaging in the audio file that Fig. 3 provides for embodiment one;
The network search method process flow diagram of voice messaging in the audio file that Fig. 4 provides for embodiment two.
Embodiment
The embodiment of the invention aims to provide a kind of scheme of the content of audio file being carried out full-text search based on key word, can be according to the special key words of user's input, content to each audio file in the audio resource storehouse is carried out full-text search, and provides corresponding audio file to the user.Based on key word the content of audio file is carried out full-text search, can effectively improve the accuracy and the efficient of audio search, promote the ease for use of audio search.
As shown in Figure 1, the embodiment of the invention at first provides the searcher of voice messaging in a kind of audio file, comprising:
Sound identification module 101 is used for that each comprises that the audio file of voice messaging carries out speech recognition to the audio resource storehouse, is converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;
Speech is minimum in the Chinese, independent movable, the significant language element of energy, and speech can comprise a Chinese character, two Chinese characters or a plurality of Chinese character.Various minutes word algorithms can be realized the word segmentation processing to Word message in the prior art, divide word algorithm mainly to comprise three types: based on the branch word algorithm of string matching, based on the branch word algorithm of understanding with based on the branch word algorithm of statistics;
Module 102 set up in index, be used for extracting the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base 103 of key word in conjunction with the relevant information of each audio file;
Index data base 103 is used to store the degree of correlation of each key word and each audio file and the relevant information of each audio file;
Search processing module 104, be used for when receiving the voice messaging searching request of carrying special key words, in index data base 103, carry out the coupling of this special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with this special key words.
In concrete the enforcement, in order to promote the degree of accuracy of audio search, when providing corresponding audio file to the user, the particular location that can also provide this special key words in corresponding audio file, to occur to the user, under this application scenarios, the temporal information that module 102 also occurs in this audio file in conjunction with the included key word of each audio file set up in index when setting up index data base 103; Accordingly, index data base 103 also is used for storing the temporal information that each key word occurs at the audio file with degree of correlation; Search processing module 104, also be used for provide have the audio file of the degree of correlation with this special key words in, the temporal information that also provides this special key words in having the audio file of the degree of correlation, to occur.In order accurately to determine the temporal information that the included key word of each audio file occurs in this audio file, in concrete the enforcement, sound identification module 101, the Word message that also is used for each text carries out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files, be the included speech of each text and add a timestamp.
In concrete the enforcement, may have the audio file that does not comprise voice messaging in the audio resource storehouse, for example only comprise the audio file of music rhythm, under this application scenarios, the searcher of voice messaging also comprises in this audio file:
Audio frequency parsing module 105 is used for that each audio file of audio resource storehouse is carried out voice and resolves, and extracts the audio file that comprises voice messaging according to the voice analysis result.
Filter out after the audio file that does not comprise voice messaging, can be to each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse.
In concrete the enforcement, the audio file in the audio resource storehouse may change, and for the accuracy and the completeness that guarantee Search Results, the searcher of voice messaging also comprises in this audio file:
Update module 106 is used for regularly or the audio file in the audio resource storehouse when changing, and index data base 103 is upgraded;
Concrete, if added new audio file in the audio resource storehouse, then this new audio file is carried out speech recognition, word segmentation processing, keyword extraction, determine the degree of correlation of audio file that this is new and included key word, and in index data base 103, increase the degree of correlation of this new audio file and included key word and the relevant information of this new audio file in conjunction with the relevant information of this new audio file; If deleted existing audio file in the audio resource storehouse, then in index data base 103, delete all information relevant with this existing audio file.
The searcher of voice messaging is all applicable at local search and web search in the audio file that the embodiment of the invention provides.If the searcher of voice messaging is arranged in the terminal device that end side is the user in this audio file, can realize that the user carries out local search to the content of each audio file in the local audio resources bank.The local audio resources bank is meant the local storage in user's the terminal device, for example local hard drive, local disk etc.In the local audio resources bank, the relevant information of audio file comprises the file name and the local store path of audio file, and described local store path is " E: music " for example, and expression is stored in local E dish name and is called under the file of " music ".At local search, provide have the audio file of the degree of correlation with this special key words in, the file name and the local store path that have the audio file of the degree of correlation with this special key words also are provided.In concrete the enforcement, the relevant information of audio file can also comprise other relevant informations such as the size, type, modification time of audio file, accordingly, provide have the audio file of the degree of correlation with this special key words in, above-mentioned other relevant information that has the audio file of the degree of correlation with this special key words can also be provided.
Promptly provide in the Website server of website of audio search business if the searcher of voice messaging is arranged on network side in this audio file, by Website server and be installed in cooperatively interacting between the browser of end side, can realize that the user carries out web search to the content of each audio file in the network audio resources bank.The network audio resources bank is meant site databases, and in the network audio resources bank, the relevant information of audio file comprises the file name and the URL (URL(uniform resource locator)) of audio file.At web search, the relevant information that described basis and this special key words have an audio file of the degree of correlation provides corresponding audio file to be meant provides the hyperlink that has the audio file of the degree of correlation with this special key words.
Based on same technical conceive, the embodiment of the invention provides the searching method of voice messaging in a kind of audio file simultaneously, as shown in Figure 2, comprising:
S200, each audio file in the audio resource storehouse is carried out voice resolve, extract the audio file that comprises voice messaging according to the voice analysis result;
In concrete the enforcement,, then need not to carry out this step, directly begin to carry out from S201 if each audio file includes voice messaging in the audio resource storehouse.
S201, to each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse, be converted into the text that comprises Word message, and the Word message of each text carried out word segmentation processing;
In concrete the enforcement, the Word message of each text is carried out can also adding the temporal information that it occurs in corresponding audio files for the included speech of each text after the word segmentation processing.
S202, extract the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word in conjunction with the relevant information of each audio file, accordingly, each key word of storage and the degree of correlation of each audio file and the relevant information of each audio file in the index data base of key word;
In concrete the enforcement, audio file is definite based on degree of correlation algorithm with the degree of correlation of included key word, and the degree of correlation of audio file and included key word is relevant with the number of times that this key word occurs in audio file, and occurrence number is many more, and the degree of correlation is high more;
In concrete the enforcement, in order to promote the degree of accuracy of audio search, the temporal information that when setting up the index data base of key word, also in this audio file, occurs in conjunction with the included key word of each audio file, accordingly, also store the temporal information that each key word occurs in the index data base of key word in having the audio file of the degree of correlation.
So far, finished the search preparatory stage of voice messaging in the audio file, in the search preparatory stage, need handle each audio file in the audio resource storehouse, identify voice messaging and voice messaging is converted to word information relates based on speech recognition technology; Word message through word segmentation processing and keyword extraction and determine each audio file and the degree of correlation of included key word after set up the index data base of key word.
After the index data base of key word is set up and is finished, can enter the search execute phase of voice messaging in the audio file, the search execute phase is initiated by the user, and by the search operation of input special key words initiation voice messaging, then this method also comprises the steps:
S203, when receiving the voice messaging searching request of carrying special key words, in the index data base of key word, carry out the coupling of this special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with this special key words;
In concrete the enforcement, generally from high to low the audio file that has a degree of correlation with this special key words is sorted according to the degree of correlation, the high more ordering of the degree of correlation is forward more;
If also store the temporal information that each key word occurs in the index data base of key word in having the audio file of the degree of correlation, for the ease of the user special key words in the Search Results is accurately located, provide have the audio file of the degree of correlation with this special key words in, the temporal information that also provides special key words to occur in having the audio file of the degree of correlation, specifically the form with time shaft provides.
In concrete the enforcement, also comprise regularly or the audio file in the audio resource storehouse when changing, the index data base of key word is carried out updating steps.
To be example with local search and web search respectively below, the search plan of voice messaging in the audio file that the detailed description embodiment of the invention provides.
Embodiment one
Present embodiment provides the local search scheme of voice messaging in the audio file, corresponding audio resources bank (can be called the local audio resources bank) is arranged on end side, be specially the local storage in user's the terminal device, in order to realize local search, the searcher of voice messaging in the audio file that the embodiment of the invention provides need be set in user's terminal device to voice messaging in the audio file.The local search flow process of voice messaging in the audio file as shown in Figure 3, comprises local search preparatory stage and local search execute phase.The local search preparatory stage, comprise the steps:
S301, terminal device extract a untreated audio file from the audio resource storehouse, current audio file is carried out voice resolve;
S302, terminal device judge according to the voice analysis result whether current audio file comprises voice messaging, if, then carry out S303, if not, then turn to and carry out S307;
S303, terminal device carry out speech recognition to current audio file, are converted into the text that comprises Word message;
S304, terminal device carry out word segmentation processing to the Word message of current text, and add the temporal information that it occurs for the current included speech of text in corresponding audio files;
S305, terminal device extract the included key word of corresponding audio files according to the included speech of current text, determine the degree of correlation of current audio file and included key word;
S306, terminal device store the temporal information that the file name of the degree of correlation of current audio file and included key word, current audio file and local store path and the current included key word of audio file occur in the index data base of key word in this audio file;
S307, the current audio file of terminal device are set to handle;
S308, terminal device judge whether also there is untreated audio file in the audio resource storehouse, if then return and carry out S301, if not, then the index data base of key word is set up and to be finished, and promptly the local search preparatory stage finishes, and follow-uply can enter the local search execute phase.
If the user imports special key words in the local search toolbar, initiate the local search of voice messaging, then the local search execute phase, comprise the steps:
S309, when receiving the local search query of the voice messaging that carries special key words, terminal device carries out the coupling of this special key words in the index data base of key word;
S310, terminal device are according to having the file name and the local store path of the audio file of the degree of correlation with this special key words, the temporal information that provides corresponding audio file and this special key words to occur in having the audio file of the degree of correlation can also provide the file name and the local store path of this audio file certainly in the lump;
Accordingly, the temporal information that audio file and this special key words occur in having the audio file of the degree of correlation, the file name of this audio file and local store path represent on terminal device for the user and check.
It is to be noted, in concrete the enforcement since the audio file in the local audio resources bank can change, for example the user has added new audio file or has deleted existing audio file in the local storage in the local storage of terminal device, therefore need regularly or the audio file in the local audio resources bank when changing, index data base to key word upgrades, to guarantee the accuracy and the completeness of local search results.
Embodiment two
Present embodiment provides the web search scheme of voice messaging in the audio file.Corresponding audio resources bank (can be called the local audio resources bank) is arranged on network side, be specially site databases, in order to realize web search, the searcher of voice messaging in the audio file that the embodiment of the invention provides need be set in the Website server of the website that the audio search business is provided to voice messaging in the audio file.The web search flow process of voice messaging in the audio file as shown in Figure 4, comprises web search preparatory stage and web search execute phase.The web search preparatory stage, comprise the steps:
S401, Website server extract a untreated audio file from the audio resource storehouse, current audio file is carried out voice resolve;
S402, Website server judge according to the voice analysis result whether current audio file comprises voice messaging, if, then carry out S403, if not, then turn to and carry out S407;
S403, Website server carry out speech recognition to current audio file, are converted into the text that comprises Word message;
S404, Website server carry out word segmentation processing to the Word message of current text, and add the temporal information that it occurs for the current included speech of text in corresponding audio files;
S405, Website server extract the included key word of corresponding audio files according to the included speech of current text, determine the degree of correlation of current audio file and included key word;
S406, Website server store the temporal information that the file name of the degree of correlation of current audio file and included key word, current audio file and URL and the current included key word of audio file occur in the index data base of key word in this audio file
S407, the current audio file of Website server are set to handle;
S408, Website server judge whether also there is untreated audio file in the audio resource storehouse, if then return and carry out S401, if not, then the index data base of key word is set up and to be finished, and promptly the web search preparatory stage finishes, and follow-uply can enter the web search execute phase.
If the user imports special key words in the cyber stalker hurdle of the browser of end side, initiate the web search of voice messaging, then the web search execute phase, comprise the steps:
S409, when receiving the network search request of the voice messaging that carries special key words, Website server carries out the coupling of this special key words in the index data base of key word;
S410, Website server provide the hyperlink of corresponding audio file and the temporal information that this special key words occurs according to having the file name and the URL of the audio file of the degree of correlation with this special key words in having the audio file of the degree of correlation;
Accordingly, the temporal information that the hyperlink of audio file and this special key words occur in having the audio file of the degree of correlation sends to the browser of end side by transmission network, represents on terminal device for the user and checks.
It is to be noted, in concrete the enforcement since the audio file in the network audio resources bank can change, for example add new audio file in the site databases or deleted existing audio file, therefore need regularly or the audio file in the network audio resources bank when changing, index data base to key word upgrades, to guarantee the accuracy and the completeness of web search results.
The searching method of voice messaging, device and equipment in the audio file provided by the invention, to comprise that by speech recognition the audio file of voice messaging is converted into the text that comprises Word message, text according to the audio file correspondence is the full content of audio file, sets up the index data base of key word; When the user imports the search of special key words initiation voice messaging, index data base based on key word provides the audio file that has the degree of correlation with this special key words, thereby realized the content of audio file is carried out full-text search, remedied the deficiency of existing audio search technology; Because the index data base of key word is set up based on speech recognition technology, and has contained the full content of audio file, thereby has improved the accuracy of audio search, has also improved the efficient of audio search based on the search of key word; When the user initiates to search for, only need the input special key words to get final product, promoted the ease for use of audio search.
The searching method of voice messaging, device and equipment in the audio file provided by the invention, in the index data base of key word, also store each key word and have the temporal information that occurs in the audio file of the degree of correlation, when the user imports the search of special key words initiation voice messaging, the temporal information that can also provide this special key words in having the audio file of the degree of correlation, to occur based on the index data base of key word, thus realized the particular location of accurate location special key words in Search Results.
It will be understood by those skilled in the art that embodiments of the invention can be provided as method, device, equipment or computer program.Therefore, the present invention can adopt complete hardware embodiment, complete software implementation example or in conjunction with the form of the embodiment of software and hardware aspect.And the present invention can adopt the form that goes up the computer program of implementing in one or more computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) that wherein include computer usable program code.
The present invention is that reference is described according to the process flow diagram and/or the block scheme of method, device, equipment and the computer program of the embodiment of the invention.Should understand can be by the flow process in each flow process in computer program instructions realization flow figure and/or the block scheme and/or square frame and process flow diagram and/or the block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out by the processor of computing machine or other programmable data processing device produce to be used for the device of the function that is implemented in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.
These computer program instructions also can be stored in energy vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work, make the instruction that is stored in this computer-readable memory produce the manufacture that comprises command device, this command device is implemented in the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
These computer program instructions also can be loaded on computing machine or other programmable data processing device, make on computing machine or other programmable devices and to carry out the sequence of operations step producing computer implemented processing, thereby the instruction of carrying out on computing machine or other programmable devices is provided for being implemented in the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.
Although described the preferred embodiments of the present invention, in a single day those skilled in the art get the basic creative notion of cicada, then can make other change and modification to these embodiment.So claims are intended to all changes and the modification that are interpreted as comprising preferred embodiment and fall into the scope of the invention.
Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims (15)

1. the searching method of voice messaging in the audio file is characterized in that, comprising:
To each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse, be converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;
The speech included according to each text extracts the included key word of corresponding audio files, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word, each key word of storage and the degree of correlation of each audio file and the relevant information of each audio file in the described index data base in conjunction with the relevant information of each audio file;
When receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.
2. the method for claim 1, it is characterized in that, the temporal information that also occurs in this audio file in conjunction with the included key word of each audio file when setting up described index data base is also stored the temporal information that each key word occurs in having the audio file of the degree of correlation in the described index data base; And
Provide have the audio file of the degree of correlation with described special key words in, the temporal information that also provides described special key words in having the audio file of the degree of correlation, to occur.
3. method as claimed in claim 2 is characterized in that, also comprises:
The Word message of each text is carried out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files.
4. as claim 1,2 or 3 arbitrary described methods, it is characterized in that, from high to low the audio file that has a degree of correlation with described special key words is sorted according to the degree of correlation.
5. the method for claim 1, it is characterized in that, described audio resource lab setting is in end side, and described voice messaging searching request is the local search query of voice messaging, and the relevant information of described audio file comprises the file name and the local store path of audio file; And
Provide have the audio file of the degree of correlation with described special key words in, the file name and the local store path that have the audio file of the degree of correlation with described special key words also are provided.
6. the method for claim 1, it is characterized in that, described audio resource lab setting is at network side, and described voice messaging searching request is the network search request of voice messaging, and the relevant information of described audio file comprises the file name and the uniform resource position mark URL of audio file; And
The relevant information that described basis and described special key words have an audio file of the degree of correlation provides corresponding audio file to be meant provides the hyperlink that has the audio file of the degree of correlation with described special key words.
7. the method for claim 1 is characterized in that, also comprises:
Each audio file in the audio resource storehouse is carried out voice resolve, extract the audio file that comprises voice messaging according to the voice analysis result.
8. the method for claim 1 is characterized in that, also comprises:
When regular the or audio file in described audio resource storehouse changes, described index data base is upgraded.
9. the searcher of voice messaging in the audio file is characterized in that, comprising:
Sound identification module is used for that each comprises that the audio file of voice messaging carries out speech recognition to the audio resource storehouse, is converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;
Module set up in index, be used for extracting the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word in conjunction with the relevant information of each audio file;
Index data base is used to store the degree of correlation of each key word and each audio file and the relevant information of each audio file;
The search processing module, be used for when receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.
10. device as claimed in claim 9 is characterized in that,
The temporal information that module also occurs in this audio file in conjunction with the included key word of each audio file set up in described index when setting up described index data base;
Described index data base also is used for storing the temporal information that each key word occurs at the audio file with degree of correlation;
Described search processing module, also be used for provide have the audio file of the degree of correlation with described special key words in, the temporal information that also provides described special key words in having the audio file of the degree of correlation, to occur.
11. device as claimed in claim 10 is characterized in that,
Described sound identification module, the Word message that also is used for each text carries out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files.
12. device as claimed in claim 9 is characterized in that, also comprises:
The audio frequency parsing module is used for that each audio file of audio resource storehouse is carried out voice and resolves, and extracts the audio file that comprises voice messaging according to the voice analysis result.
13. device as claimed in claim 9 is characterized in that, also comprises:
Update module is used for regularly or the audio file in described audio resource storehouse when changing, and described index data base is upgraded.
14. a terminal device is characterized in that, comprises as the arbitrary described searcher of claim 9 to 13.
15. a Website server is characterized in that, comprises as the arbitrary described searcher of claim 9 to 13.
CN2009100916619A 2009-08-28 2009-08-28 Searching method and device of voice information in audio files and equipment Expired - Fee Related CN101996195B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2009100916619A CN101996195B (en) 2009-08-28 2009-08-28 Searching method and device of voice information in audio files and equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2009100916619A CN101996195B (en) 2009-08-28 2009-08-28 Searching method and device of voice information in audio files and equipment

Publications (2)

Publication Number Publication Date
CN101996195A true CN101996195A (en) 2011-03-30
CN101996195B CN101996195B (en) 2012-07-11

Family

ID=43786362

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2009100916619A Expired - Fee Related CN101996195B (en) 2009-08-28 2009-08-28 Searching method and device of voice information in audio files and equipment

Country Status (1)

Country Link
CN (1) CN101996195B (en)

Cited By (43)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102316361A (en) * 2011-07-04 2012-01-11 深圳市子栋科技有限公司 Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof
CN102867511A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN102867512A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN103218454A (en) * 2013-05-06 2013-07-24 百度在线网络技术(北京)有限公司 Voice-data-based file searching method, voice-data-based file device and voice-data-based file system
CN103366010A (en) * 2013-07-25 2013-10-23 北京小米科技有限责任公司 Method and device for searching audio file
CN103365959A (en) * 2013-06-03 2013-10-23 深圳市爱渡飞科技有限公司 Voice search method and voice search device
CN103425668A (en) * 2012-05-16 2013-12-04 联想(北京)有限公司 Information search method and electronic equipment
WO2014169731A1 (en) * 2013-09-11 2014-10-23 中兴通讯股份有限公司 Information query method and terminal device
CN104375997A (en) * 2013-08-13 2015-02-25 腾讯科技(深圳)有限公司 Method and device for adding note information to instant messaging audio information
CN104391924A (en) * 2014-11-21 2015-03-04 南京讯思雅信息科技有限公司 Mixed audio and video search method and system
CN104572714A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Learning video inquiring system and learning video inquiring method
CN104809115A (en) * 2014-01-24 2015-07-29 贝壳网际(北京)安全技术有限公司 Searching method and terminal device
CN104834740A (en) * 2015-05-20 2015-08-12 深圳市东方泰明科技有限公司 Full-automatic audio/video structuralized accurate searching method
CN104978366A (en) * 2014-04-14 2015-10-14 深圳市北科瑞声科技有限公司 Voice data index building method and system based on mobile terminal
CN105550217A (en) * 2015-12-03 2016-05-04 腾讯科技(深圳)有限公司 Scene music searching method and scene music searching apparatus
CN105760399A (en) * 2014-12-19 2016-07-13 华为软件技术有限公司 Data retrieval method and device
CN105824930A (en) * 2016-03-17 2016-08-03 深圳市金立通信设备有限公司 Voice message processing method and terminal
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106202204A (en) * 2016-06-24 2016-12-07 维沃移动通信有限公司 The lookup method of a kind of voice document and mobile terminal
WO2017101266A1 (en) * 2015-12-15 2017-06-22 深圳Tcl数字技术有限公司 Voice control method and system
CN106911832A (en) * 2017-04-28 2017-06-30 上海与德科技有限公司 A kind of method and device of voice record
CN107016109A (en) * 2017-04-14 2017-08-04 维沃移动通信有限公司 A kind of photo film making method and mobile terminal
CN108121715A (en) * 2016-11-28 2018-06-05 中国移动通信集团公司 A kind of word tag method and word tag device
CN108170691A (en) * 2016-12-07 2018-06-15 北京国双科技有限公司 It is associated with the determining method and apparatus of document
CN108257597A (en) * 2017-12-28 2018-07-06 合肥凯捷技术有限公司 A kind of audio retrieval system based on speech recognition
CN108320318A (en) * 2018-01-15 2018-07-24 腾讯科技(深圳)有限公司 Image processing method, device, computer equipment and storage medium
CN108829765A (en) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 A kind of information query method, device, computer equipment and storage medium
CN108833971A (en) * 2018-06-06 2018-11-16 北京奇艺世纪科技有限公司 A kind of method for processing video frequency and device
CN109034418A (en) * 2018-07-26 2018-12-18 国家电网公司 Operation field information transferring method and system
CN109274586A (en) * 2018-11-14 2019-01-25 深圳市云歌人工智能技术有限公司 Storage method, device and the storage medium of chat message
CN109460209A (en) * 2018-12-20 2019-03-12 广东小天才科技有限公司 A kind of control method and electronic equipment for dictating the progress that enters for
CN109559764A (en) * 2017-09-27 2019-04-02 北京国双科技有限公司 The treating method and apparatus of audio file
CN109741750A (en) * 2018-05-09 2019-05-10 北京字节跳动网络技术有限公司 A kind of method of speech recognition, document handling method and terminal device
CN110275978A (en) * 2019-07-01 2019-09-24 成都启英泰伦科技有限公司 Quick storage of the voice big data on redundant arrays of inexpensive disks and access amending method
CN110275979A (en) * 2019-07-01 2019-09-24 成都启英泰伦科技有限公司 A kind of mapping management process of voice data and text data
CN110287364A (en) * 2019-06-28 2019-09-27 合肥讯飞读写科技有限公司 Voice search method, system, equipment and computer readable storage medium
CN110335598A (en) * 2019-06-26 2019-10-15 重庆金美通信有限责任公司 A kind of wireless narrow band channel speech communication method based on speech recognition
CN111008300A (en) * 2019-11-20 2020-04-14 四川互慧软件有限公司 Keyword-based timestamp positioning search method in audio and video
WO2020093720A1 (en) * 2018-11-07 2020-05-14 平安医疗健康管理股份有限公司 Speech recognition-based information query method and device
CN111161738A (en) * 2019-12-27 2020-05-15 苏州欧孚网络科技股份有限公司 Voice file retrieval system and retrieval method thereof
CN111353065A (en) * 2018-12-20 2020-06-30 北京嘀嘀无限科技发展有限公司 Voice archive storage method, device, equipment and computer readable storage medium
CN112115282A (en) * 2020-09-17 2020-12-22 北京达佳互联信息技术有限公司 Question answering method, device, equipment and storage medium based on search
CN113299279A (en) * 2021-05-18 2021-08-24 上海明略人工智能(集团)有限公司 Method, apparatus, electronic device and readable storage medium for associating voice data and retrieving voice data

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7177795B1 (en) * 1999-11-10 2007-02-13 International Business Machines Corporation Methods and apparatus for semantic unit based automatic indexing and searching in data archive systems
US7809568B2 (en) * 2005-11-08 2010-10-05 Microsoft Corporation Indexing and searching speech with text meta-data
NO325191B1 (en) * 2005-12-30 2008-02-18 Tandberg Telecom As Sociable multimedia stream
CN100565532C (en) * 2008-05-28 2009-12-02 叶睿智 A kind of multimedia resource search method based on the audio content retrieval
CN101510222B (en) * 2009-02-20 2012-05-30 北京大学 Multilayer index voice document searching method

Cited By (50)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102867511A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN102867512A (en) * 2011-07-04 2013-01-09 余喆 Method and device for recognizing natural speech
CN102316361A (en) * 2011-07-04 2012-01-11 深圳市子栋科技有限公司 Audio-frequency / video-frequency on demand method based on natural speech recognition and system thereof
CN103425668A (en) * 2012-05-16 2013-12-04 联想(北京)有限公司 Information search method and electronic equipment
CN103218454A (en) * 2013-05-06 2013-07-24 百度在线网络技术(北京)有限公司 Voice-data-based file searching method, voice-data-based file device and voice-data-based file system
CN103365959A (en) * 2013-06-03 2013-10-23 深圳市爱渡飞科技有限公司 Voice search method and voice search device
CN103366010A (en) * 2013-07-25 2013-10-23 北京小米科技有限责任公司 Method and device for searching audio file
CN104375997A (en) * 2013-08-13 2015-02-25 腾讯科技(深圳)有限公司 Method and device for adding note information to instant messaging audio information
WO2014169731A1 (en) * 2013-09-11 2014-10-23 中兴通讯股份有限公司 Information query method and terminal device
CN104572714A (en) * 2013-10-18 2015-04-29 英业达科技有限公司 Learning video inquiring system and learning video inquiring method
CN104809115A (en) * 2014-01-24 2015-07-29 贝壳网际(北京)安全技术有限公司 Searching method and terminal device
CN104978366A (en) * 2014-04-14 2015-10-14 深圳市北科瑞声科技有限公司 Voice data index building method and system based on mobile terminal
CN104391924A (en) * 2014-11-21 2015-03-04 南京讯思雅信息科技有限公司 Mixed audio and video search method and system
CN105760399A (en) * 2014-12-19 2016-07-13 华为软件技术有限公司 Data retrieval method and device
CN104834740A (en) * 2015-05-20 2015-08-12 深圳市东方泰明科技有限公司 Full-automatic audio/video structuralized accurate searching method
CN105550217A (en) * 2015-12-03 2016-05-04 腾讯科技(深圳)有限公司 Scene music searching method and scene music searching apparatus
CN105550217B (en) * 2015-12-03 2021-05-07 腾讯科技(深圳)有限公司 Scene music searching method and scene music searching device
WO2017101266A1 (en) * 2015-12-15 2017-06-22 深圳Tcl数字技术有限公司 Voice control method and system
CN105824930A (en) * 2016-03-17 2016-08-03 深圳市金立通信设备有限公司 Voice message processing method and terminal
CN106024013A (en) * 2016-04-29 2016-10-12 努比亚技术有限公司 Voice data searching method and system
CN106024013B (en) * 2016-04-29 2022-01-14 努比亚技术有限公司 Voice data searching method and system
CN106202204A (en) * 2016-06-24 2016-12-07 维沃移动通信有限公司 The lookup method of a kind of voice document and mobile terminal
CN108121715A (en) * 2016-11-28 2018-06-05 中国移动通信集团公司 A kind of word tag method and word tag device
CN108121715B (en) * 2016-11-28 2022-01-25 中国移动通信集团公司 Character labeling method and character labeling device
CN108170691A (en) * 2016-12-07 2018-06-15 北京国双科技有限公司 It is associated with the determining method and apparatus of document
CN107016109A (en) * 2017-04-14 2017-08-04 维沃移动通信有限公司 A kind of photo film making method and mobile terminal
CN107016109B (en) * 2017-04-14 2018-11-30 维沃移动通信有限公司 A kind of photo film making method and mobile terminal
CN106911832A (en) * 2017-04-28 2017-06-30 上海与德科技有限公司 A kind of method and device of voice record
CN109559764A (en) * 2017-09-27 2019-04-02 北京国双科技有限公司 The treating method and apparatus of audio file
CN108257597A (en) * 2017-12-28 2018-07-06 合肥凯捷技术有限公司 A kind of audio retrieval system based on speech recognition
CN108320318A (en) * 2018-01-15 2018-07-24 腾讯科技(深圳)有限公司 Image processing method, device, computer equipment and storage medium
CN109741750A (en) * 2018-05-09 2019-05-10 北京字节跳动网络技术有限公司 A kind of method of speech recognition, document handling method and terminal device
CN108829765A (en) * 2018-05-29 2018-11-16 平安科技(深圳)有限公司 A kind of information query method, device, computer equipment and storage medium
CN108833971A (en) * 2018-06-06 2018-11-16 北京奇艺世纪科技有限公司 A kind of method for processing video frequency and device
CN109034418A (en) * 2018-07-26 2018-12-18 国家电网公司 Operation field information transferring method and system
CN109034418B (en) * 2018-07-26 2021-05-28 国家电网公司 Operation site information transmission method and system
WO2020093720A1 (en) * 2018-11-07 2020-05-14 平安医疗健康管理股份有限公司 Speech recognition-based information query method and device
CN109274586A (en) * 2018-11-14 2019-01-25 深圳市云歌人工智能技术有限公司 Storage method, device and the storage medium of chat message
CN111353065A (en) * 2018-12-20 2020-06-30 北京嘀嘀无限科技发展有限公司 Voice archive storage method, device, equipment and computer readable storage medium
CN109460209B (en) * 2018-12-20 2022-03-01 广东小天才科技有限公司 Control method for dictation and reading progress and electronic equipment
CN109460209A (en) * 2018-12-20 2019-03-12 广东小天才科技有限公司 A kind of control method and electronic equipment for dictating the progress that enters for
CN110335598A (en) * 2019-06-26 2019-10-15 重庆金美通信有限责任公司 A kind of wireless narrow band channel speech communication method based on speech recognition
CN110287364A (en) * 2019-06-28 2019-09-27 合肥讯飞读写科技有限公司 Voice search method, system, equipment and computer readable storage medium
CN110287364B (en) * 2019-06-28 2021-10-08 合肥讯飞读写科技有限公司 Voice search method, system, device and computer readable storage medium
CN110275978A (en) * 2019-07-01 2019-09-24 成都启英泰伦科技有限公司 Quick storage of the voice big data on redundant arrays of inexpensive disks and access amending method
CN110275979A (en) * 2019-07-01 2019-09-24 成都启英泰伦科技有限公司 A kind of mapping management process of voice data and text data
CN111008300A (en) * 2019-11-20 2020-04-14 四川互慧软件有限公司 Keyword-based timestamp positioning search method in audio and video
CN111161738A (en) * 2019-12-27 2020-05-15 苏州欧孚网络科技股份有限公司 Voice file retrieval system and retrieval method thereof
CN112115282A (en) * 2020-09-17 2020-12-22 北京达佳互联信息技术有限公司 Question answering method, device, equipment and storage medium based on search
CN113299279A (en) * 2021-05-18 2021-08-24 上海明略人工智能(集团)有限公司 Method, apparatus, electronic device and readable storage medium for associating voice data and retrieving voice data

Also Published As

Publication number Publication date
CN101996195B (en) 2012-07-11

Similar Documents

Publication Publication Date Title
CN101996195B (en) Searching method and device of voice information in audio files and equipment
CN102193917B (en) Method and device for processing and querying data
CN108304444B (en) Information query method and device
CN102096717B (en) Search method and search engine
CN101179472B (en) Network resource searching method and searching system
CN102063469B (en) Method and device for acquiring relevant keyword message and computer equipment
CN103425687A (en) Retrieval method and system based on queries
CN102591880A (en) Information providing method and device
CN104376406A (en) Enterprise innovation resource management and analysis system and method based on big data
CN103136228A (en) Image search method and image search device
JP6355840B2 (en) Stopword identification method and apparatus
CN102023989A (en) Information retrieval method and system thereof
CN101876981A (en) Method and device for establishing knowledge base
CN103473230A (en) Service range determining method, logistics service provider recommending method and corresponding device
CN102722499B (en) Search engine and implementation method thereof
CN102043843A (en) Method and obtaining device for obtaining target entry based on target application
CN103810224A (en) Information persistence and query method and device
CN102722498A (en) Search engine and implementation method thereof
CN104850554A (en) Searching method and system
CN102722501A (en) Search engine and realization method thereof
CN103092943A (en) Method of advertisement dispatch and advertisement dispatch server
CN102955802B (en) The method and apparatus of data is obtained from data sheet
CN102737021A (en) Search engine and realization method thereof
CN105302807A (en) Method and apparatus for obtaining information category
CN104484413A (en) Method and device for obtaining searching results

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20120711

Termination date: 20210828

CF01 Termination of patent right due to non-payment of annual fee