CN101996195A

CN101996195A - Searching method and device of voice information in audio files and equipment

Info

Publication number: CN101996195A
Application number: CN2009100916619A
Authority: CN
Inventors: 薛頔; 樊科; 刘威
Original assignee: China Mobile Communications Group Co Ltd
Current assignee: China Mobile Communications Group Co Ltd
Priority date: 2009-08-28
Filing date: 2009-08-28
Publication date: 2011-03-30
Anticipated expiration: 2029-08-28
Also published as: CN101996195B

Abstract

The invention discloses searching method and device of voice information in audio files and equipment, which are used for realizing the text search on the contents of the audio files, improving the accuracy and the efficiency of the audio file searching and improving the usability of the audio file searching. The searching method comprises the following steps of: carrying out voice identification on each audio file comprising voice information in the audio resource base, converting the audio files into text files comprising text information, and carrying out participle processing on text information of each text file; extracting key words included by corresponding audio files according to words included by each text file, determining the relevance of key words included by each audio file, and establishing an index database of the key words through being combined with the relevance information of each audio file; carrying out specific key word matching in the index database while receiving the voice information searching request carrying the specific key words, and providing the corresponding audio files according to the relevant information of the audio files with the relevance with the specific key words.

Description

The searching method of voice messaging, device and equipment in the audio file

Technical field

The present invention relates to the audio search technical field, relate in particular to searching method, device and the equipment of voice messaging in a kind of audio file.

Background technology

Become the information age of geometric growth in quantity of information, search technique has become one of requisite gordian technique in people's work and the life, make the information that people can fast search exactly oneself to be needed from the information ocean, thereby greatly improved work and life efficient.Along with search technique reaches its maturity, it is used more and more widely, and people, increase the demand of audio search also in continuous lifting day by day to the requirement of search technique.

Existing audio search technology mainly comprises following dual mode:

Mode one, be audio file and set up label for audio file interpolation Word message by artificial in advance, the label of audio file is searched for based on special key words.This mode can't satisfy the demand of audio file being carried out full-text search according to the content of audio file.Simultaneously, because the label of audio file can't be contained the full content of audio file, and label is by artificial foundation, subjective factor is bigger, cause the accuracy of audio search low, be difficult to guarantee the integrality of Search Results, also can't accurately locate the particular location of special key words in Search Results; If the enormous amount of audio resource storehouse sound intermediate frequency file with making that the workload of manually setting up label is huge, causes expending of a large amount of human resources.

Mode two, audio file is searched for based on the audio frequency matching technique, at first need to extract the eigenwert of the frequency spectrum or the energy of audio-frequency information to be searched, extract the eigenwert of the frequency spectrum or the energy of the audio-frequency information of each audio file in the audio resource storehouse then, carry out the coupling of eigenwert at last.The audio frequency matching technique lays particular emphasis on the coupling of the eigenwert of audio frequency itself, and this mode can't satisfy the demand of audio file being carried out full-text search according to the content of audio file equally.Simultaneously, the audio-frequency information that this mode is imported search requires harshness, not only the content of the audio-frequency information of requirement input is consistent with the content of audio resource storehouse sound intermediate frequency file, but also require the frequency of audio-frequency information and the frequency and the energy of energy and audio resource storehouse sound intermediate frequency file to be close, could successfully mate, cause the efficient of audio search low, ease for use is poor.

The audio search technology that provides in the prior art does not provide the scheme of carrying out full-text search based on the content of audio file, and the accuracy of audio search is low, efficient is low, ease for use is poor.

Summary of the invention

The invention provides the searching method and the device of voice messaging in a kind of audio file,, improve the accuracy and the efficient of audio search, promote the ease for use of audio search in order to realize that the content of audio file is carried out full-text search.

Accordingly, the present invention also provides a kind of terminal device and Website server.

The invention provides the searching method of voice messaging in a kind of audio file, comprising:

To each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse, be converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;

The speech included according to each text extracts the included key word of corresponding audio files, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word, each key word of storage and the degree of correlation of each audio file and the relevant information of each audio file in the described index data base in conjunction with the relevant information of each audio file;

When receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.

The invention provides the searcher of voice messaging in a kind of audio file, comprising:

Sound identification module is used for that each comprises that the audio file of voice messaging carries out speech recognition to the audio resource storehouse, is converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;

Module set up in index, be used for extracting the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word in conjunction with the relevant information of each audio file;

Index data base is used to store the degree of correlation of each key word and each audio file and the relevant information of each audio file;

The search processing module, be used for when receiving the voice messaging searching request of carrying special key words, in described index data base, carry out the coupling of described special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with described special key words.

The invention provides a kind of terminal device, comprise the searcher of voice messaging in this audio file.

The invention provides a kind of Website server, comprise the searcher of voice messaging in this audio file.

The searching method of voice messaging, device and equipment in the audio file provided by the invention, to comprise that by speech recognition the audio file of voice messaging is converted into the text that comprises Word message, text according to the audio file correspondence is the full content of audio file, sets up the index data base of key word; When the user imports the search operation of special key words initiation voice messaging, index data base based on key word provides the audio file that has the degree of correlation with this special key words, thereby realized the content of audio file is carried out full-text search, remedied the deficiency of existing audio search technology; Because the index data base of key word is set up based on speech recognition technology, and has contained the full content of audio file, thereby has improved the accuracy of audio search, has also improved the efficient of audio search based on the search of key word; When the user initiates to search for, only need the input special key words to get final product, promoted the ease for use of audio search.

Description of drawings

The searcher block diagram of voice messaging in the audio file that Fig. 1 provides for the embodiment of the invention;

The searching method process flow diagram of voice messaging in the audio file that Fig. 2 provides for the embodiment of the invention;

The local search method process flow diagram of voice messaging in the audio file that Fig. 3 provides for embodiment one;

The network search method process flow diagram of voice messaging in the audio file that Fig. 4 provides for embodiment two.

Embodiment

The embodiment of the invention aims to provide a kind of scheme of the content of audio file being carried out full-text search based on key word, can be according to the special key words of user's input, content to each audio file in the audio resource storehouse is carried out full-text search, and provides corresponding audio file to the user.Based on key word the content of audio file is carried out full-text search, can effectively improve the accuracy and the efficient of audio search, promote the ease for use of audio search.

As shown in Figure 1, the embodiment of the invention at first provides the searcher of voice messaging in a kind of audio file, comprising:

Sound identification module 101 is used for that each comprises that the audio file of voice messaging carries out speech recognition to the audio resource storehouse, is converted into the text that comprises Word message, and the Word message of each text is carried out word segmentation processing;

Speech is minimum in the Chinese, independent movable, the significant language element of energy, and speech can comprise a Chinese character, two Chinese characters or a plurality of Chinese character.Various minutes word algorithms can be realized the word segmentation processing to Word message in the prior art, divide word algorithm mainly to comprise three types: based on the branch word algorithm of string matching, based on the branch word algorithm of understanding with based on the branch word algorithm of statistics;

Module 102 set up in index, be used for extracting the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base 103 of key word in conjunction with the relevant information of each audio file;

Index data base 103 is used to store the degree of correlation of each key word and each audio file and the relevant information of each audio file;

Search processing module 104, be used for when receiving the voice messaging searching request of carrying special key words, in index data base 103, carry out the coupling of this special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with this special key words.

In concrete the enforcement, in order to promote the degree of accuracy of audio search, when providing corresponding audio file to the user, the particular location that can also provide this special key words in corresponding audio file, to occur to the user, under this application scenarios, the temporal information that module 102 also occurs in this audio file in conjunction with the included key word of each audio file set up in index when setting up index data base 103; Accordingly, index data base 103 also is used for storing the temporal information that each key word occurs at the audio file with degree of correlation; Search processing module 104, also be used for provide have the audio file of the degree of correlation with this special key words in, the temporal information that also provides this special key words in having the audio file of the degree of correlation, to occur.In order accurately to determine the temporal information that the included key word of each audio file occurs in this audio file, in concrete the enforcement, sound identification module 101, the Word message that also is used for each text carries out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files, be the included speech of each text and add a timestamp.

In concrete the enforcement, may have the audio file that does not comprise voice messaging in the audio resource storehouse, for example only comprise the audio file of music rhythm, under this application scenarios, the searcher of voice messaging also comprises in this audio file:

Audio frequency parsing module 105 is used for that each audio file of audio resource storehouse is carried out voice and resolves, and extracts the audio file that comprises voice messaging according to the voice analysis result.

Filter out after the audio file that does not comprise voice messaging, can be to each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse.

In concrete the enforcement, the audio file in the audio resource storehouse may change, and for the accuracy and the completeness that guarantee Search Results, the searcher of voice messaging also comprises in this audio file:

Update module 106 is used for regularly or the audio file in the audio resource storehouse when changing, and index data base 103 is upgraded;

Concrete, if added new audio file in the audio resource storehouse, then this new audio file is carried out speech recognition, word segmentation processing, keyword extraction, determine the degree of correlation of audio file that this is new and included key word, and in index data base 103, increase the degree of correlation of this new audio file and included key word and the relevant information of this new audio file in conjunction with the relevant information of this new audio file; If deleted existing audio file in the audio resource storehouse, then in index data base 103, delete all information relevant with this existing audio file.

The searcher of voice messaging is all applicable at local search and web search in the audio file that the embodiment of the invention provides.If the searcher of voice messaging is arranged in the terminal device that end side is the user in this audio file, can realize that the user carries out local search to the content of each audio file in the local audio resources bank.The local audio resources bank is meant the local storage in user's the terminal device, for example local hard drive, local disk etc.In the local audio resources bank, the relevant information of audio file comprises the file name and the local store path of audio file, and described local store path is " E: music " for example, and expression is stored in local E dish name and is called under the file of " music ".At local search, provide have the audio file of the degree of correlation with this special key words in, the file name and the local store path that have the audio file of the degree of correlation with this special key words also are provided.In concrete the enforcement, the relevant information of audio file can also comprise other relevant informations such as the size, type, modification time of audio file, accordingly, provide have the audio file of the degree of correlation with this special key words in, above-mentioned other relevant information that has the audio file of the degree of correlation with this special key words can also be provided.

Promptly provide in the Website server of website of audio search business if the searcher of voice messaging is arranged on network side in this audio file, by Website server and be installed in cooperatively interacting between the browser of end side, can realize that the user carries out web search to the content of each audio file in the network audio resources bank.The network audio resources bank is meant site databases, and in the network audio resources bank, the relevant information of audio file comprises the file name and the URL (URL(uniform resource locator)) of audio file.At web search, the relevant information that described basis and this special key words have an audio file of the degree of correlation provides corresponding audio file to be meant provides the hyperlink that has the audio file of the degree of correlation with this special key words.

Based on same technical conceive, the embodiment of the invention provides the searching method of voice messaging in a kind of audio file simultaneously, as shown in Figure 2, comprising:

S200, each audio file in the audio resource storehouse is carried out voice resolve, extract the audio file that comprises voice messaging according to the voice analysis result;

In concrete the enforcement,, then need not to carry out this step, directly begin to carry out from S201 if each audio file includes voice messaging in the audio resource storehouse.

S201, to each comprises that the audio file of voice messaging carries out speech recognition in the audio resource storehouse, be converted into the text that comprises Word message, and the Word message of each text carried out word segmentation processing;

In concrete the enforcement, the Word message of each text is carried out can also adding the temporal information that it occurs in corresponding audio files for the included speech of each text after the word segmentation processing.

S202, extract the included key word of corresponding audio files according to the included speech of each text, determine the degree of correlation of each audio file and included key word, and set up the index data base of key word in conjunction with the relevant information of each audio file, accordingly, each key word of storage and the degree of correlation of each audio file and the relevant information of each audio file in the index data base of key word;

In concrete the enforcement, audio file is definite based on degree of correlation algorithm with the degree of correlation of included key word, and the degree of correlation of audio file and included key word is relevant with the number of times that this key word occurs in audio file, and occurrence number is many more, and the degree of correlation is high more;

In concrete the enforcement, in order to promote the degree of accuracy of audio search, the temporal information that when setting up the index data base of key word, also in this audio file, occurs in conjunction with the included key word of each audio file, accordingly, also store the temporal information that each key word occurs in the index data base of key word in having the audio file of the degree of correlation.

So far, finished the search preparatory stage of voice messaging in the audio file, in the search preparatory stage, need handle each audio file in the audio resource storehouse, identify voice messaging and voice messaging is converted to word information relates based on speech recognition technology; Word message through word segmentation processing and keyword extraction and determine each audio file and the degree of correlation of included key word after set up the index data base of key word.

After the index data base of key word is set up and is finished, can enter the search execute phase of voice messaging in the audio file, the search execute phase is initiated by the user, and by the search operation of input special key words initiation voice messaging, then this method also comprises the steps:

S203, when receiving the voice messaging searching request of carrying special key words, in the index data base of key word, carry out the coupling of this special key words, and provide corresponding audio file according to the relevant information that has an audio file of the degree of correlation with this special key words;

In concrete the enforcement, generally from high to low the audio file that has a degree of correlation with this special key words is sorted according to the degree of correlation, the high more ordering of the degree of correlation is forward more;

If also store the temporal information that each key word occurs in the index data base of key word in having the audio file of the degree of correlation, for the ease of the user special key words in the Search Results is accurately located, provide have the audio file of the degree of correlation with this special key words in, the temporal information that also provides special key words to occur in having the audio file of the degree of correlation, specifically the form with time shaft provides.

In concrete the enforcement, also comprise regularly or the audio file in the audio resource storehouse when changing, the index data base of key word is carried out updating steps.

To be example with local search and web search respectively below, the search plan of voice messaging in the audio file that the detailed description embodiment of the invention provides.

Embodiment one

Present embodiment provides the local search scheme of voice messaging in the audio file, corresponding audio resources bank (can be called the local audio resources bank) is arranged on end side, be specially the local storage in user's the terminal device, in order to realize local search, the searcher of voice messaging in the audio file that the embodiment of the invention provides need be set in user's terminal device to voice messaging in the audio file.The local search flow process of voice messaging in the audio file as shown in Figure 3, comprises local search preparatory stage and local search execute phase.The local search preparatory stage, comprise the steps:

S301, terminal device extract a untreated audio file from the audio resource storehouse, current audio file is carried out voice resolve;

S302, terminal device judge according to the voice analysis result whether current audio file comprises voice messaging, if, then carry out S303, if not, then turn to and carry out S307;

S303, terminal device carry out speech recognition to current audio file, are converted into the text that comprises Word message;

S304, terminal device carry out word segmentation processing to the Word message of current text, and add the temporal information that it occurs for the current included speech of text in corresponding audio files;

S305, terminal device extract the included key word of corresponding audio files according to the included speech of current text, determine the degree of correlation of current audio file and included key word;

S306, terminal device store the temporal information that the file name of the degree of correlation of current audio file and included key word, current audio file and local store path and the current included key word of audio file occur in the index data base of key word in this audio file;

S307, the current audio file of terminal device are set to handle;

S308, terminal device judge whether also there is untreated audio file in the audio resource storehouse, if then return and carry out S301, if not, then the index data base of key word is set up and to be finished, and promptly the local search preparatory stage finishes, and follow-uply can enter the local search execute phase.

If the user imports special key words in the local search toolbar, initiate the local search of voice messaging, then the local search execute phase, comprise the steps:

S309, when receiving the local search query of the voice messaging that carries special key words, terminal device carries out the coupling of this special key words in the index data base of key word;

S310, terminal device are according to having the file name and the local store path of the audio file of the degree of correlation with this special key words, the temporal information that provides corresponding audio file and this special key words to occur in having the audio file of the degree of correlation can also provide the file name and the local store path of this audio file certainly in the lump;

Accordingly, the temporal information that audio file and this special key words occur in having the audio file of the degree of correlation, the file name of this audio file and local store path represent on terminal device for the user and check.

It is to be noted, in concrete the enforcement since the audio file in the local audio resources bank can change, for example the user has added new audio file or has deleted existing audio file in the local storage in the local storage of terminal device, therefore need regularly or the audio file in the local audio resources bank when changing, index data base to key word upgrades, to guarantee the accuracy and the completeness of local search results.

Embodiment two

Present embodiment provides the web search scheme of voice messaging in the audio file.Corresponding audio resources bank (can be called the local audio resources bank) is arranged on network side, be specially site databases, in order to realize web search, the searcher of voice messaging in the audio file that the embodiment of the invention provides need be set in the Website server of the website that the audio search business is provided to voice messaging in the audio file.The web search flow process of voice messaging in the audio file as shown in Figure 4, comprises web search preparatory stage and web search execute phase.The web search preparatory stage, comprise the steps:

S401, Website server extract a untreated audio file from the audio resource storehouse, current audio file is carried out voice resolve;

S402, Website server judge according to the voice analysis result whether current audio file comprises voice messaging, if, then carry out S403, if not, then turn to and carry out S407;

S403, Website server carry out speech recognition to current audio file, are converted into the text that comprises Word message;

S404, Website server carry out word segmentation processing to the Word message of current text, and add the temporal information that it occurs for the current included speech of text in corresponding audio files;

S405, Website server extract the included key word of corresponding audio files according to the included speech of current text, determine the degree of correlation of current audio file and included key word;

S406, Website server store the temporal information that the file name of the degree of correlation of current audio file and included key word, current audio file and URL and the current included key word of audio file occur in the index data base of key word in this audio file

S407, the current audio file of Website server are set to handle;

S408, Website server judge whether also there is untreated audio file in the audio resource storehouse, if then return and carry out S401, if not, then the index data base of key word is set up and to be finished, and promptly the web search preparatory stage finishes, and follow-uply can enter the web search execute phase.

If the user imports special key words in the cyber stalker hurdle of the browser of end side, initiate the web search of voice messaging, then the web search execute phase, comprise the steps:

S409, when receiving the network search request of the voice messaging that carries special key words, Website server carries out the coupling of this special key words in the index data base of key word;

S410, Website server provide the hyperlink of corresponding audio file and the temporal information that this special key words occurs according to having the file name and the URL of the audio file of the degree of correlation with this special key words in having the audio file of the degree of correlation;

Accordingly, the temporal information that the hyperlink of audio file and this special key words occur in having the audio file of the degree of correlation sends to the browser of end side by transmission network, represents on terminal device for the user and checks.

It is to be noted, in concrete the enforcement since the audio file in the network audio resources bank can change, for example add new audio file in the site databases or deleted existing audio file, therefore need regularly or the audio file in the network audio resources bank when changing, index data base to key word upgrades, to guarantee the accuracy and the completeness of web search results.

The searching method of voice messaging, device and equipment in the audio file provided by the invention, to comprise that by speech recognition the audio file of voice messaging is converted into the text that comprises Word message, text according to the audio file correspondence is the full content of audio file, sets up the index data base of key word; When the user imports the search of special key words initiation voice messaging, index data base based on key word provides the audio file that has the degree of correlation with this special key words, thereby realized the content of audio file is carried out full-text search, remedied the deficiency of existing audio search technology; Because the index data base of key word is set up based on speech recognition technology, and has contained the full content of audio file, thereby has improved the accuracy of audio search, has also improved the efficient of audio search based on the search of key word; When the user initiates to search for, only need the input special key words to get final product, promoted the ease for use of audio search.

The searching method of voice messaging, device and equipment in the audio file provided by the invention, in the index data base of key word, also store each key word and have the temporal information that occurs in the audio file of the degree of correlation, when the user imports the search of special key words initiation voice messaging, the temporal information that can also provide this special key words in having the audio file of the degree of correlation, to occur based on the index data base of key word, thus realized the particular location of accurate location special key words in Search Results.

It will be understood by those skilled in the art that embodiments of the invention can be provided as method, device, equipment or computer program.Therefore, the present invention can adopt complete hardware embodiment, complete software implementation example or in conjunction with the form of the embodiment of software and hardware aspect.And the present invention can adopt the form that goes up the computer program of implementing in one or more computer-usable storage medium (including but not limited to magnetic disk memory, CD-ROM, optical memory etc.) that wherein include computer usable program code.

The present invention is that reference is described according to the process flow diagram and/or the block scheme of method, device, equipment and the computer program of the embodiment of the invention.Should understand can be by the flow process in each flow process in computer program instructions realization flow figure and/or the block scheme and/or square frame and process flow diagram and/or the block scheme and/or the combination of square frame.Can provide these computer program instructions to the processor of multi-purpose computer, special purpose computer, Embedded Processor or other programmable data processing device to produce a machine, make the instruction of carrying out by the processor of computing machine or other programmable data processing device produce to be used for the device of the function that is implemented in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame appointments.

These computer program instructions also can be stored in energy vectoring computer or the computer-readable memory of other programmable data processing device with ad hoc fashion work, make the instruction that is stored in this computer-readable memory produce the manufacture that comprises command device, this command device is implemented in the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.

These computer program instructions also can be loaded on computing machine or other programmable data processing device, make on computing machine or other programmable devices and to carry out the sequence of operations step producing computer implemented processing, thereby the instruction of carrying out on computing machine or other programmable devices is provided for being implemented in the step of the function of appointment in flow process of process flow diagram or a plurality of flow process and/or square frame of block scheme or a plurality of square frame.

Although described the preferred embodiments of the present invention, in a single day those skilled in the art get the basic creative notion of cicada, then can make other change and modification to these embodiment.So claims are intended to all changes and the modification that are interpreted as comprising preferred embodiment and fall into the scope of the invention.

Obviously, those skilled in the art can carry out various changes and modification to the present invention and not break away from the spirit and scope of the present invention.Like this, if of the present invention these are revised and modification belongs within the scope of claim of the present invention and equivalent technologies thereof, then the present invention also is intended to comprise these changes and modification interior.

Claims

1. the searching method of voice messaging in the audio file is characterized in that, comprising:

2. the method for claim 1, it is characterized in that, the temporal information that also occurs in this audio file in conjunction with the included key word of each audio file when setting up described index data base is also stored the temporal information that each key word occurs in having the audio file of the degree of correlation in the described index data base; And

Provide have the audio file of the degree of correlation with described special key words in, the temporal information that also provides described special key words in having the audio file of the degree of correlation, to occur.

3. method as claimed in claim 2 is characterized in that, also comprises:

The Word message of each text is carried out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files.

4. as claim 1,2 or 3 arbitrary described methods, it is characterized in that, from high to low the audio file that has a degree of correlation with described special key words is sorted according to the degree of correlation.

5. the method for claim 1, it is characterized in that, described audio resource lab setting is in end side, and described voice messaging searching request is the local search query of voice messaging, and the relevant information of described audio file comprises the file name and the local store path of audio file; And

Provide have the audio file of the degree of correlation with described special key words in, the file name and the local store path that have the audio file of the degree of correlation with described special key words also are provided.

6. the method for claim 1, it is characterized in that, described audio resource lab setting is at network side, and described voice messaging searching request is the network search request of voice messaging, and the relevant information of described audio file comprises the file name and the uniform resource position mark URL of audio file; And

The relevant information that described basis and described special key words have an audio file of the degree of correlation provides corresponding audio file to be meant provides the hyperlink that has the audio file of the degree of correlation with described special key words.

7. the method for claim 1 is characterized in that, also comprises:

Each audio file in the audio resource storehouse is carried out voice resolve, extract the audio file that comprises voice messaging according to the voice analysis result.

8. the method for claim 1 is characterized in that, also comprises:

When regular the or audio file in described audio resource storehouse changes, described index data base is upgraded.

9. the searcher of voice messaging in the audio file is characterized in that, comprising:

10. device as claimed in claim 9 is characterized in that,

The temporal information that module also occurs in this audio file in conjunction with the included key word of each audio file set up in described index when setting up described index data base;

Described index data base also is used for storing the temporal information that each key word occurs at the audio file with degree of correlation;

Described search processing module, also be used for provide have the audio file of the degree of correlation with described special key words in, the temporal information that also provides described special key words in having the audio file of the degree of correlation, to occur.

11. device as claimed in claim 10 is characterized in that,

Described sound identification module, the Word message that also is used for each text carries out after the word segmentation processing, for the included speech of each text adds the temporal information that it occurs in corresponding audio files.

12. device as claimed in claim 9 is characterized in that, also comprises:

The audio frequency parsing module is used for that each audio file of audio resource storehouse is carried out voice and resolves, and extracts the audio file that comprises voice messaging according to the voice analysis result.

13. device as claimed in claim 9 is characterized in that, also comprises:

Update module is used for regularly or the audio file in described audio resource storehouse when changing, and described index data base is upgraded.

14. a terminal device is characterized in that, comprises as the arbitrary described searcher of claim 9 to 13.

15. a Website server is characterized in that, comprises as the arbitrary described searcher of claim 9 to 13.