CN104021151A

CN104021151A - Information processing method and electronic equipment

Info

Publication number: CN104021151A
Application number: CN201410212083.0A
Authority: CN
Inventors: 张守鹏
Original assignee: Lenovo Beijing Ltd
Current assignee: Lenovo Beijing Ltd
Priority date: 2014-05-19
Filing date: 2014-05-19
Publication date: 2014-09-03

Abstract

The invention discloses an information processing method applied to electronic equipment provided with an audio input unit and an audio output unit. The method comprises: while the electronic equipment plays multimedia data that comprises at least first audio data, detecting second audio data input through the audio input unit; judging whether the input second audio data matches the first audio data in the multimedia data being played; and, if so, performing a predetermined operation on the multimedia data. The invention further discloses the electronic equipment. With this technical scheme, predetermined operations on the audio data, such as automatic collection, setting for a specific function, and recommendation, are carried out without manual involvement, which improves the user experience and makes the electronic equipment more user-friendly.

Description

Information processing method and electronic equipment

Technical field

The present invention relates to information processing technology, and in particular to an information processing method and electronic equipment.

Background art

Users can listen to currently popular music on electronic equipment such as mobile terminals and PCs. When a user hears a song they like, they can add it to a collection through an add operation, or download it to the electronic equipment through a download operation, for later listening. The methods users currently use to collect music of interest therefore require too much manual operation, which degrades the user experience.

Summary of the invention

To solve the existing technical problem, embodiments of the present invention provide an information processing method and electronic equipment that can collect music automatically, without manual operation, improving the user experience and making the electronic equipment more user-friendly.

The technical scheme of the embodiments of the present invention is implemented as follows:

An embodiment of the present invention provides an information processing method applied to electronic equipment having an audio input unit and an audio output unit. The method comprises:

while the electronic equipment plays multimedia data that comprises at least first audio data, detecting second audio data input through the audio input unit;

judging whether the input second audio data matches the first audio data in the multimedia data being played;

when a match is determined, performing a predetermined operation on the multimedia data.

An embodiment of the present invention also provides electronic equipment comprising an audio input unit and an audio output unit. The electronic equipment further comprises:

a first detecting unit, configured to detect second audio data input through the audio input unit while the audio output unit plays multimedia data, wherein the multimedia data comprises at least first audio data;

a first judging unit, configured to judge whether the second audio data matches the first audio data of the multimedia data and, when a match is determined, to trigger the first execution unit;

a first execution unit, configured to perform a predetermined operation on the multimedia data.

In the information processing method and electronic equipment provided by the embodiments of the present invention, the method is applied to electronic equipment having an audio input unit and an audio output unit and comprises: while the electronic equipment plays multimedia data that comprises at least first audio data, detecting second audio data input through the audio input unit; judging whether the input second audio data matches the first audio data in the multimedia data being played; and, when a match is determined, performing a predetermined operation on the multimedia data. With this technical scheme, predetermined operations on the audio data, such as automatic collection, setting for a specific function, and recommendation, are carried out without manual involvement, which improves the user experience and makes the electronic equipment more user-friendly.

Brief description of the drawings

Fig. 1 is a flowchart of the first embodiment of the information processing method of the present invention;

Fig. 2 is a flowchart of the second embodiment of the information processing method of the present invention;

Fig. 3 is a flowchart of the third embodiment of the information processing method of the present invention;

Fig. 4 is a flowchart of the fourth embodiment of the information processing method of the present invention;

Fig. 5 is a schematic structural diagram of the first embodiment of the electronic equipment of the present invention;

Fig. 6 is a schematic structural diagram of the second embodiment of the electronic equipment of the present invention;

Fig. 7 is a schematic structural diagram of the third embodiment of the electronic equipment of the present invention;

Fig. 8 is a schematic structural diagram of the fourth embodiment of the electronic equipment of the present invention.

Detailed description of the embodiments

Preferred embodiments of the present invention are described in detail below with reference to the accompanying drawings. It should be understood that the preferred embodiments described below serve only to describe and explain the present invention and are not intended to limit it.

The first embodiment of the information processing method provided by the present invention is applied to electronic equipment having an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone of the electronic equipment or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 1 is a flowchart of the first embodiment of the information processing method of the present invention. As shown in Fig. 1, the method comprises:

Step 101: while the electronic equipment plays multimedia data that comprises at least first audio data, detect second audio data input through the audio input unit.

Here, the multimedia data may be a song, or a video such as a film or a TV series; correspondingly, the first audio data is the song, or the music played in the video. The multimedia data is played by the electronic equipment, specifically by its audio output unit.

The second audio data may be music hummed by the user; that is, the audio that the user inputs through the audio input unit is detected.

Step 102: judge whether the input second audio data matches the first audio data in the multimedia data being played.

Here, the first audio data is parsed from the multimedia data; the second audio data is compared with the first audio data to obtain a first similarity; when the first similarity is greater than a preset similarity threshold, the second audio data is determined to match the first audio data.
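The threshold test in step 102 can be illustrated with a minimal sketch. The patent does not specify how the similarity is computed; cosine similarity over equal-length feature vectors, the function names `cosine_similarity` and `is_match`, and the default threshold of 0.8 are all assumptions made here for illustration only.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two equal-length feature vectors (assumed measure)."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a silent (all-zero) input matches nothing
    return dot / (norm_a * norm_b)


def is_match(first_audio: Sequence[float],
             second_audio: Sequence[float],
             threshold: float = 0.8) -> bool:
    """Return True when the first similarity exceeds the preset threshold."""
    first_similarity = cosine_similarity(first_audio, second_audio)
    return first_similarity > threshold
```

In this sketch the comparison is strictly "greater than", mirroring the wording of step 102; how the feature vectors are extracted from the played and hummed audio is left open.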

Step 103: when a match is determined, perform a predetermined operation on the multimedia data.

Here, the predetermined operation may be marking or collecting the multimedia data. For example, the multimedia data is marked as the user's favorite music and is given priority in subsequent music playback; or the multimedia data is added to a multimedia collection folder, such as a music collection folder, so that the user can easily find it later.

The predetermined operation may also be designating the multimedia data as the output audio of a predetermined function, for example setting it as the incoming-call ringtone or the alarm tone.

The predetermined operation may also be recommending the multimedia data.

Recommending the multimedia data may mean recommending it to a network associated with the electronic equipment. For example, when a network on which the electronic equipment is registered is treated as the associated network, the electronic equipment uploads the multimedia data over that network to a microblog or QQ Space and sets it as the background music, decorating the microblog or QQ Space page.

Recommending the multimedia data may also mean recommending it to other electronic equipment associated with the electronic equipment. For example, when the electronic equipment and the other electronic equipment have established a friend relationship through a network hotspot, the electronic equipment can send the multimedia data to the other electronic equipment through QQ, WeChat or similar means, for use by the other electronic equipment.
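The predetermined operations listed above (collecting, setting as the audio of a specific function, recommending) can be sketched as a simple dispatch. Everything named here, including `Operation`, `DeviceState` and `perform_operation`, is hypothetical; the patent describes the operations but not a data model, and sending over QQ or WeChat is represented only by recording a (recipient, track) pair.

```python
from dataclasses import dataclass, field
from enum import Enum, auto
from typing import List, Optional, Tuple


class Operation(Enum):
    COLLECT = auto()       # mark or add to the collection folder
    SET_RINGTONE = auto()  # use as the output audio of a predetermined function
    RECOMMEND = auto()     # recommend to a network or to another device


@dataclass
class DeviceState:
    favorites: List[str] = field(default_factory=list)
    ringtone: Optional[str] = None
    sent: List[Tuple[str, str]] = field(default_factory=list)


def perform_operation(state: DeviceState, track: str,
                      operation: Operation,
                      recipient: Optional[str] = None) -> None:
    """Apply one predetermined operation to the matched multimedia item."""
    if operation is Operation.COLLECT:
        if track not in state.favorites:
            state.favorites.append(track)
    elif operation is Operation.SET_RINGTONE:
        state.ringtone = track
    elif operation is Operation.RECOMMEND:
        if recipient is None:
            raise ValueError("a recipient is required for a recommendation")
        # Stand-in for an upload or message; no real network call is made.
        state.sent.append((recipient, track))
```

A caller would invoke `perform_operation` only after step 102 has reported a match.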

As can be seen from the above, in the first embodiment of the method the user does not need to manually collect music of interest. The electronic equipment only needs to judge whether the music it is playing matches the music that the user inputs through the audio input unit and, when a match is determined, perform a predetermined operation on that music, such as collecting it, setting it for a specific function, or recommending it. This diversifies the functions of the electronic equipment, makes it more user-friendly, and improves the user's experience of the device.

The second embodiment of the information processing method provided by the present invention is applied to electronic equipment having an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone of the electronic equipment or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 2 is a flowchart of the second embodiment of the information processing method of the present invention. As shown in Fig. 2, the method comprises:

Step 201: while the electronic equipment plays multimedia data that comprises at least first audio data, detect second audio data input through the audio input unit.

Step 202: parse the first audio data from the multimedia data; compare the second audio data with the first audio data to obtain a first similarity; when the first similarity is greater than a preset similarity threshold, determine that the second audio data matches the first audio data.

Here, step 202 serves as a further description of how the electronic equipment judges whether the input second audio data matches the first audio data in the multimedia data being played.

In this embodiment, the comparison between the second audio data and the first audio data can be carried out in the following three modes.

Mode one: the electronic equipment parses the audio waveform in the second audio data and compares it with the audio waveform in the first audio data, generating a first sub-similarity of the first similarity; when the first sub-similarity is greater than a first sub-threshold of the similarity threshold, the second audio data is determined to match the first audio data.

Mode two: speech recognition is performed on the input second audio data to obtain semantic information; the semantic information obtained is compared with the semantic information contained in the first audio data, generating a second sub-similarity of the first similarity; when the second sub-similarity is greater than a second sub-threshold of the similarity threshold, the second audio data is determined to match the first audio data.

Mode three: mode one and mode two are used together; when the first sub-similarity is greater than the first sub-threshold and the second sub-similarity is greater than the second sub-threshold, the second audio data is determined to match the first audio data.
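The three modes can be sketched as follows. This is an illustration under assumptions, not the patented implementation: the waveform sub-similarity is taken as an already computed number, the semantic sub-similarity is approximated by comparing recognized text with reference lyrics using the standard-library `difflib.SequenceMatcher`, and the names `Mode`, `semantic_similarity`, `audio_matches` and both default sub-thresholds are hypothetical.

```python
from difflib import SequenceMatcher
from enum import Enum


class Mode(Enum):
    WAVEFORM = 1  # mode one: waveform comparison only
    SEMANTIC = 2  # mode two: speech recognition and text comparison only
    BOTH = 3      # mode three: both sub-similarities must pass


def semantic_similarity(recognized_text: str, reference_lyrics: str) -> float:
    """Second sub-similarity, approximated as a text similarity ratio in [0, 1]."""
    return SequenceMatcher(None, recognized_text, reference_lyrics).ratio()


def audio_matches(mode: Mode,
                  waveform_similarity: float,
                  semantic_sim: float,
                  waveform_threshold: float = 0.7,
                  semantic_threshold: float = 0.6) -> bool:
    """Decide a match from the sub-similarities according to the chosen mode."""
    waveform_ok = waveform_similarity > waveform_threshold
    semantic_ok = semantic_sim > semantic_threshold
    if mode is Mode.WAVEFORM:
        return waveform_ok
    if mode is Mode.SEMANTIC:
        return semantic_ok
    return waveform_ok and semantic_ok
```

Mode three is the strictest of the three, since a humming input that follows the melody but carries no recognizable lyrics would pass mode one and fail mode three.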

Step 203: perform a predetermined operation on the multimedia data.

As can be seen from the above, in the second embodiment of the method, the audio waveform in the second audio data is compared with the audio waveform in the first audio data, and/or the semantic information in the second audio data is compared with the semantic information in the first audio data. When the first sub-similarity is greater than the first sub-threshold, and/or the second sub-similarity is greater than the second sub-threshold, the second audio data is determined to match the first audio data, and a predetermined operation such as collecting, setting for a specific function, or recommending is then performed on the multimedia data. The multimedia data can thus be collected automatically without user involvement, which diversifies the functions of the electronic equipment, makes it more user-friendly, and improves the user's experience of the device.

The third embodiment of the information processing method provided by the present invention is applied to electronic equipment having an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone of the electronic equipment or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 3 is a flowchart of the third embodiment of the information processing method of the present invention. As shown in Fig. 3, the method comprises:

Step 301: while the electronic equipment plays multimedia data that comprises at least first audio data, detect second audio data input through the audio input unit.

Step 302: judge whether the input second audio data matches the first audio data in the multimedia data being played.

Further, step 302 may be implemented as follows: parse the first audio data from the multimedia data; compare the second audio data with the first audio data to obtain a first similarity; when the first similarity is greater than a preset similarity threshold, determine that the second audio data matches the first audio data.

Specifically, the comparison between the second audio data and the first audio data in the above scheme can be carried out in the three modes described in the second embodiment.

Step 303: when a match is determined, perform a predetermined operation on the multimedia data.

Step 304: according to a preset rule, recommend third audio data associated with the second audio data.

Here, before the third audio data is recommended, it can be searched for in the local resources of the electronic equipment, for example among the songs the electronic equipment has already collected.

It can also be searched for in the resources of other electronic equipment associated with the electronic equipment, for example in the resources of a computer or personal digital assistant (PDA) that works together with the electronic equipment, or in the resources of other electronic equipment that forms a group with the electronic equipment through a network hotspot.

It can also be searched for in the resources of a network associated with the electronic equipment, for example in a network on which the electronic equipment is registered.

In the above scheme, which of the searched data can serve as the third audio data is determined mainly in the following two ways:

First way: search for audio data whose similarity to the second audio data is greater than a preset second threshold, and determine the audio data found to be the third audio data associated with the second audio data. This way mainly takes into account that the user's singing may be slightly out of tune: audio similar to the out-of-tune audio is searched for, and the audio found is taken as the third audio data.

Second way: obtain a first feature of the second audio data, the first feature characterizing the rhythmic characteristics of the second audio data; search for audio data that satisfies the first feature, and determine the audio data found to be the third audio data associated with the second audio data. This way mainly aims to find, for the user, music with the same rhythm attribute as the second audio data: if the rhythm attribute of the second audio data is lively or sad, songs of the lively or sad type are searched for in the local resources of the electronic equipment, and the songs found are taken as the third audio data.
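The two ways of selecting the third audio data can be sketched as two filters over a candidate library. The `Track` record, the `closeness` measure (which assumes features normalized to the range 0 to 1), and the representation of the rhythm attribute as a plain string such as "lively" or "sad" are all assumptions introduced for illustration; the patent does not define them.

```python
from dataclasses import dataclass
from typing import Callable, List, Sequence


@dataclass(frozen=True)
class Track:
    title: str
    features: Sequence[float]  # hypothetical normalized audio features
    rhythm: str                # hypothetical rhythm attribute, e.g. "lively"


def closeness(a: Sequence[float], b: Sequence[float]) -> float:
    """Toy similarity: 1 minus the mean absolute difference of the features."""
    return 1.0 - sum(abs(x - y) for x, y in zip(a, b)) / len(a)


def find_by_similarity(candidates: List[Track],
                       second_audio: Sequence[float],
                       second_threshold: float,
                       similarity: Callable[..., float] = closeness) -> List[Track]:
    """First way: keep tracks whose similarity exceeds the second threshold."""
    return [t for t in candidates
            if similarity(t.features, second_audio) > second_threshold]


def find_by_rhythm(candidates: List[Track], rhythm: str) -> List[Track]:
    """Second way: keep tracks that share the rhythm attribute of the input."""
    return [t for t in candidates if t.rhythm == rhythm]
```

The candidate list could come from any of the three search scopes described above (local resources, associated equipment, or an associated network); the filters themselves do not depend on where the candidates were found.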

The third audio data is recommended in the same way as the multimedia data is recommended above, so the details are not repeated here.

As can be seen from the above, in the third embodiment of the method, not only can a predetermined operation be performed on the multimedia data, but third audio data that has a certain similarity to, or the same rhythm attribute as, the second audio data can also be recommended to a network or to other electronic equipment. Operations on the audio data such as automatic collection, setting for a specific function, and recommendation are carried out without user involvement, which diversifies the functions of the electronic equipment, makes it more user-friendly, and improves the user's experience of the device.

The fourth embodiment of the information processing method provided by the present invention is applied to electronic equipment having an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone of the electronic equipment or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 4 is a flowchart of the fourth embodiment of the information processing method of the present invention. As shown in Fig. 4, the method comprises:

Step 401: while the electronic equipment plays multimedia data that comprises at least first audio data, detect second audio data input through the audio input unit.

Step 402: judge whether the input second audio data matches the first audio data in the multimedia data being played.

Further, step 402 may be implemented as follows: parse the first audio data from the multimedia data; compare the second audio data with the first audio data to obtain a first similarity; when the first similarity is greater than a preset similarity threshold, determine that the second audio data matches the first audio data.

Step 403: when a match is determined, perform a predetermined operation on the multimedia data.

Here, the predetermined operation may be marking or collecting the multimedia data. For example, the multimedia data is marked as the user's favorite music and is given priority in subsequent music playback; or the multimedia data is added to a multimedia collection folder, such as a music collection folder, so that the user can easily find it later.

Step 404: replace the first audio data in the multimedia data that matches the second audio data with the second audio data, forming fourth audio data.

Here, consider the following situation: the multimedia data is audio that combines rapping and singing, in which the first audio data is the lyrics that are sung and the remaining part is the lyrics that are spoken. The part hummed by the user, that is, the second audio data, can then replace the first audio data in the multimedia data, so that the resulting multimedia data, that is, the fourth audio data, comprises two parts: the lyrics hummed by the user and the lyrics that are spoken.
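Step 404 can be illustrated as a splice over sample sequences. This sketch assumes the matched first audio data occupies a known half-open sample range `[start, end)` within the multimedia audio; the function name `replace_segment` and the list-of-samples representation are hypothetical, and real audio would additionally need sample-rate and format alignment, which is omitted here.

```python
from typing import List, Sequence


def replace_segment(multimedia_samples: Sequence[float],
                    start: int,
                    end: int,
                    second_audio: Sequence[float]) -> List[float]:
    """Replace samples [start, end) with the user's audio to form the fourth audio data."""
    if not 0 <= start <= end <= len(multimedia_samples):
        raise ValueError("segment bounds fall outside the multimedia audio")
    return (list(multimedia_samples[:start])
            + list(second_audio)
            + list(multimedia_samples[end:]))
```

The parts outside the replaced range, which correspond to the spoken lyrics in the example above, are carried over unchanged, and the original sequence is not modified.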

As can be seen from the above, in the fourth embodiment of the method, not only can a predetermined operation be performed on the multimedia data when the second audio data is judged to match the first audio data in the multimedia data, but the first audio data in the multimedia data can also be replaced with the second audio data to form new audio data, which improves the user's experience of the device.

In the first embodiment of the electronic equipment provided by the present invention, the electronic equipment comprises an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 5 is a schematic structural diagram of the first embodiment of the electronic equipment of the present invention. As shown in Fig. 5, the electronic equipment comprises a first detecting unit 501, a first judging unit 502 and a first execution unit 503, wherein:

The first detecting unit 501 is configured to detect second audio data input through the audio input unit while the audio output unit plays multimedia data, wherein the multimedia data comprises at least first audio data.

Here, the multimedia data may be a song, or a video such as a film or a TV series; correspondingly, the first audio data is the song, or the music played in the video.

The second audio data may be music hummed by the user; the first detecting unit 501 detects the audio that the user inputs through the audio input unit.

The first judging unit 502 is configured to judge whether the second audio data matches the first audio data of the multimedia data and, when a match is determined, to trigger the first execution unit 503.

Here, the first judging unit 502 parses the first audio data from the multimedia data; compares the second audio data with the first audio data to obtain a first similarity; and, when the first similarity is greater than a preset similarity threshold, determines that the second audio data matches the first audio data.

The first execution unit 503 is configured to perform a predetermined operation on the multimedia data.

Here, the first execution unit 503 can mark or collect the multimedia data. For example, the multimedia data is marked as the user's favorite music and is given priority in subsequent music playback; or the multimedia data is added to a multimedia collection folder, such as a music collection folder, so that the user can easily find it later.

The first execution unit 503 can also designate the multimedia data as the output audio of a predetermined function, for example setting it as the incoming-call ringtone or the alarm tone.

The first execution unit 503 can also recommend the multimedia data.

The first execution unit 503 can recommend the multimedia data to a network associated with the electronic equipment. For example, when a network on which the electronic equipment is registered is treated as the associated network, the electronic equipment uploads the multimedia data over that network to a microblog or QQ Space and sets it as the background music, decorating the microblog or QQ Space page.

The first execution unit 503 can also recommend the multimedia data to other electronic equipment associated with the electronic equipment. For example, when the electronic equipment and the other electronic equipment have established a friend relationship through a network hotspot, the electronic equipment can send the multimedia data to the other electronic equipment through QQ, WeChat or similar means, for use by the other electronic equipment.

As can be seen from the above, in the first embodiment of the electronic equipment the user does not need to manually collect music of interest. The electronic equipment only needs to judge whether the music it is playing matches the music that the user inputs through the audio input unit and, when a match is determined, perform an operation on that music such as collecting it, setting it for a specific function, or recommending it. This diversifies the functions of the electronic equipment, makes it more user-friendly, and improves the user's experience of the device.
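The cooperation of the three units in this embodiment (detecting, judging, executing) can be sketched as three small classes. This is only one possible arrangement under stated assumptions: the class and method names are hypothetical, the toy measure `fraction_equal` stands in for whatever similarity the judging unit actually uses, and "collecting" is reduced to appending an identifier to a list.

```python
from typing import Callable, List, Sequence


def fraction_equal(a: Sequence[int], b: Sequence[int]) -> float:
    """Toy similarity: share of positions at which two equal-length sequences agree."""
    if len(a) != len(b) or not a:
        return 0.0
    return sum(1 for x, y in zip(a, b) if x == y) / len(a)


class ExecutionUnit:
    """Performs the predetermined operation (here: collecting the item)."""

    def __init__(self) -> None:
        self.collected: List[str] = []

    def execute(self, multimedia_id: str) -> None:
        self.collected.append(multimedia_id)


class JudgingUnit:
    """Compares the two audio inputs and triggers the execution unit on a match."""

    def __init__(self, threshold: float, execution_unit: ExecutionUnit,
                 similarity: Callable[..., float] = fraction_equal) -> None:
        self.threshold = threshold
        self.execution_unit = execution_unit
        self.similarity = similarity

    def judge(self, multimedia_id: str, first_audio, second_audio) -> bool:
        matched = self.similarity(first_audio, second_audio) > self.threshold
        if matched:
            self.execution_unit.execute(multimedia_id)
        return matched


class DetectingUnit:
    """Forwards microphone input to the judging unit only during playback."""

    def __init__(self, judging_unit: JudgingUnit) -> None:
        self.judging_unit = judging_unit

    def on_audio_input(self, is_playing: bool, multimedia_id: str,
                       first_audio, second_audio) -> bool:
        if not is_playing:
            return False
        return self.judging_unit.judge(multimedia_id, first_audio, second_audio)
```

For example, wiring `DetectingUnit(JudgingUnit(0.7, ExecutionUnit()))` gives a pipeline in which input received while nothing is playing is ignored, and only matching input leads to the item being collected.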

In the second embodiment of the electronic equipment provided by the present invention, the electronic equipment comprises an audio input unit and an audio output unit. The audio input unit may be, for example, a microphone or an external headset; the audio output unit may be, for example, an earpiece or an external headset.

Fig. 6 is a schematic structural diagram of the second embodiment of the electronic equipment of the present invention. As shown in Fig. 6, the electronic equipment comprises a first detecting unit 601, a first judging unit 602 and a first execution unit 603, wherein:

The first detecting unit 601 is configured to detect second audio data input through the audio input unit while the audio output unit plays multimedia data, wherein the multimedia data comprises at least first audio data.

The second audio data may be music hummed by the user; the first detecting unit 601 detects the audio that the user inputs through the audio input unit.

The first judging unit 602 is configured to parse the first audio data from the multimedia data; compare the second audio data with the first audio data to obtain a first similarity; determine, when the first similarity is greater than a preset similarity threshold, that the second audio data matches the first audio data; and, when a match is determined, trigger the first execution unit 603.

Here, the above description of the functions of the first judging unit 602 serves as a further description of how the first judging unit 602 judges whether the second audio data matches the first audio data in the multimedia data.

In this embodiment, the first judging unit 602 can carry out the comparison between the second audio data and the first audio data in the following three modes.

Mode one: the first judging unit 602 parses the audio waveform in the second audio data and compares it with the audio waveform in the first audio data, generating a first sub-similarity of the first similarity; when the first sub-similarity is greater than a first sub-threshold of the similarity threshold, the second audio data is determined to match the first audio data.

Mode two: the first judging unit 602 performs speech recognition on the input second audio data to obtain semantic information; the semantic information obtained is compared with the semantic information contained in the first audio data, generating a second sub-similarity of the first similarity; when the second sub-similarity is greater than a second sub-threshold of the similarity threshold, the second audio data is determined to match the first audio data.

Mode three: the first judging unit 602 uses mode one and mode two together; when the first sub-similarity is greater than the first sub-threshold and the second sub-similarity is greater than the second sub-threshold, the second audio data is determined to match the first audio data.

Described the first performance element 603, for carrying out scheduled operation to described multi-medium data.

Here, the first performance element 603 can mark or is collected described multi-medium data, and for example described in mark, multi-medium data is the favorite music of user, while carrying out music, preferentially plays this music follow-up; Or, collect described multi-medium data to multimedia as in music collection folder, so that the easy-to-look-up described multi-medium data of subsequent user.

The first performance element 603 can also determine that described multi-medium data is the output audio of predetermined function, and for example described multi-medium data is set to incoming ring tone, ring of alarm clock etc.

The first performance element 603 can also be recommended described multi-medium data.

Wherein, described the first performance element 603 can: recommend described multi-medium data to the network that is associated with described electronic equipment, for example, described in the network of described electronic equipment being registered is considered as, be associated network time, by this network, described multi-medium data is uploaded to microblogging, QQ space by described electronic equipment, and using this multi-medium data as playing music, so that beautifying of the space page carried out in microblogging, QQ space.

Described the first performance element 603 is all right: recommend described multi-medium data to other electronic equipment being associated with described electronic equipment, for example, when described electronic equipment and described other electronic equipment are emerged good friend and are related to by network focus, described electronic equipment can be sent to described other electronic equipment by this multi-medium data by the mode such as QQ, micro-letter, uses for described other electronic equipment.

As can be seen from the above, the second embodiment of the electronic device of the present invention compares the similarity between the audio waveform in the second audio data and the audio waveform in the first audio data, and/or the similarity between the semantic information in the second audio data and the semantic information in the first audio data. When the first sub-similarity is greater than the first sub-threshold, and/or the second sub-similarity is greater than the second sub-threshold, it determines that the second audio data matches the first audio data, and then performs a predetermined operation on the multimedia data, such as collecting it, setting it for a specific function, or recommending it. Multimedia data can thus be collected automatically without user involvement, which makes the functions of the electronic device more diverse and user-friendly and improves the user experience.

The present invention provides a third embodiment of an electronic device. The electronic device comprises an audio input unit and an audio output unit, wherein the audio input unit may specifically be a microphone, an external earphone, or the like, and the audio output unit may specifically be an earpiece, an external earphone, or the like.

Fig. 7 is a schematic structural diagram of the third embodiment of the electronic device of the present invention. As shown in Fig. 7, the electronic device comprises: a first detection unit 701, a first judging unit 702, a first execution unit 703, a first recommendation unit 704, a first determining unit 705, and a second determining unit 706, wherein:

The first detection unit 701 is configured to detect, while the audio output unit is playing multimedia data, second audio data input through the audio input unit, wherein the multimedia data comprises at least first audio data.

The second audio data may be music hummed by the user; the first detection unit 701 detects the audio that the user inputs through the audio input unit.

The first judging unit 702 is configured to judge whether the second audio data matches the first audio data in the multimedia data and, when they match, to trigger the first execution unit 703.

Further, the first judging unit 702 parses the first audio data out of the multimedia data, compares the second audio data with the first audio data to obtain a first similarity, and, when the first similarity is greater than a preset similarity threshold, determines that the second audio data matches the first audio data.

Specifically, the first judging unit 702 may compare the second audio data with the first audio data in any of the following three modes.

Mode one: the first judging unit 702 parses the audio waveform in the second audio data, compares it for similarity with the audio waveform in the first audio data, and generates the first sub-similarity of the first similarity; when the first sub-similarity is greater than the first sub-threshold of the similarity threshold, it determines that the second audio data matches the first audio data.

Mode two: the first judging unit 702 performs speech recognition on the input second audio data to obtain semantic information, compares the obtained semantic information for similarity with the semantic information contained in the first audio data, and generates the second sub-similarity of the first similarity; when the second sub-similarity is greater than the second sub-threshold of the similarity threshold, it determines that the second audio data matches the first audio data.

Mode three: the first judging unit 702 uses mode one and mode two together; when the first sub-similarity is greater than the first sub-threshold and the second sub-similarity is greater than the second sub-threshold, it determines that the second audio data matches the first audio data.
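The three comparison modes can be illustrated with a short sketch. The patent text does not specify how either similarity is computed, so the measures here are stand-ins chosen purely for illustration: cosine similarity over raw sample values for the waveform comparison, and the `difflib.SequenceMatcher` ratio between recognized text and lyrics for the semantic comparison. The function names and the default threshold values 0.8 and 0.6 are likewise assumptions.

```python
import math
from difflib import SequenceMatcher
from typing import Optional, Sequence


def waveform_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Stand-in first sub-similarity: cosine similarity of two sample sequences."""
    n = min(len(a), len(b))
    if n == 0:
        return 0.0
    a, b = a[:n], b[:n]
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def semantic_similarity(recognized_text: str, lyrics: str) -> float:
    """Stand-in second sub-similarity: text ratio between recognized words and lyrics."""
    return SequenceMatcher(None, recognized_text, lyrics).ratio()


def is_match(mode: int,
             first_sub: Optional[float] = None,
             second_sub: Optional[float] = None,
             first_threshold: float = 0.8,     # assumed first sub-threshold
             second_threshold: float = 0.6) -> bool:  # assumed second sub-threshold
    """Decide whether the second audio data matches the first audio data."""
    wave_ok = first_sub is not None and first_sub > first_threshold
    text_ok = second_sub is not None and second_sub > second_threshold
    if mode == 1:
        return wave_ok
    if mode == 2:
        return text_ok
    if mode == 3:
        return wave_ok and text_ok
    raise ValueError("mode must be 1, 2, or 3")
```

A real implementation would likely need a pitch- or melody-based comparison for hummed input; raw-sample cosine similarity is used here only to keep the sketch self-contained.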

The first execution unit 703 is configured to perform a predetermined operation on the multimedia data.

Here, the first execution unit 703 may mark or collect the multimedia data. For example, it may mark the multimedia data as the user's favorite music, so that this music is played preferentially during subsequent playback; or it may add the multimedia data to a multimedia favorites folder, such as a music favorites folder, so that the user can easily find it later.

The first execution unit 703 may also set the multimedia data as the output audio of a predetermined function, for example as the incoming-call ringtone or the alarm tone.

The first execution unit 703 may also recommend the multimedia data.

Specifically, the first execution unit 703 may recommend the multimedia data to a network associated with the electronic device. For example, when a network on which the electronic device is registered is regarded as the associated network, the electronic device uploads the multimedia data through this network to Weibo (microblog) or QQ Zone and sets it as background music to decorate the Weibo or QQ Zone page.

The first execution unit 703 may also recommend the multimedia data to other electronic devices associated with the electronic device. For example, when the electronic device and another electronic device have established a friend relationship through a network hotspot, the electronic device may send the multimedia data to that device via QQ, WeChat, or a similar channel, for use by that device.

The first recommendation unit 704 is configured to recommend, according to a preset rule, third audio data associated with the second audio data.

Here, before the first recommendation unit 704 recommends the third audio data, the first determining unit 705 and/or the second determining unit 706 may search for the third audio data in the local resources of the electronic device, for example in the songs that the electronic device has already collected.

The first determining unit 705 and/or the second determining unit 706 may also search for the third audio data in the resources of other electronic devices associated with the electronic device, for example in the resources of a computer or personal digital assistant (PDA) that works together with the electronic device, or in the resources of other electronic devices that form a group with the electronic device through a network hotspot.

The first determining unit 705 and/or the second determining unit 706 may further search for the third audio data in the resources of a network associated with the electronic device, for example a network on which the electronic device is registered.

In the above scheme, the first determining unit 705 decides which of the searched data can serve as the third audio data mainly as follows:

It searches for audio data whose similarity to the second audio data is greater than a preset second threshold, and determines the audio data found to be the third audio data associated with the second audio data. This mode mainly takes into account that the user's singing may be slightly out of tune: audio similar to the out-of-tune audio is searched for, and the audio found is taken as the third audio data.

In the above scheme, the second determining unit 706 decides which of the searched data can serve as the third audio data mainly as follows:

It obtains a first feature of the second audio data, the first feature characterizing the prosodic (rhythm) features of the second audio data; it then searches for audio data that satisfies the first feature, and determines the audio data found to be the third audio data associated with the second audio data. This mode mainly aims to find, for the user, music with the same rhythm attribute as the second audio data: if the rhythm attribute of the second audio data is brisk or sad, songs of the brisk or sad type are searched for in the local resources of the electronic device, and the songs found are taken as the third audio data.
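The two ways of determining the third audio data, together with the three search scopes (local resources, associated devices, associated network), can be sketched as two filter functions over a set of named sources. This is an illustrative sketch only: the `Track` type, the single `rhythm` label per track, the dictionary of sources, and the caller-supplied `similarity` function are hypothetical; the patent text does not define how similarity or rhythm attributes are computed.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Track:
    title: str
    rhythm: str  # hypothetical rhythm attribute, e.g. "brisk" or "sad"


# Hypothetical mapping of source name ("local", "device", "network") to its tracks.
Sources = Dict[str, List[Track]]


def search_by_similarity(sources: Sources,
                         similarity: Callable[[Track], float],
                         second_threshold: float) -> List[Track]:
    """First determining unit: keep tracks whose similarity to the
    second audio data exceeds the preset second threshold."""
    return [t for tracks in sources.values() for t in tracks
            if similarity(t) > second_threshold]


def search_by_rhythm(sources: Sources, first_feature: str) -> List[Track]:
    """Second determining unit: keep tracks whose rhythm attribute
    equals the first feature extracted from the second audio data."""
    return [t for tracks in sources.values() for t in tracks
            if t.rhythm == first_feature]
```

Passing only some of the sources in the dictionary corresponds to the "and/or" choice among local, device, and network resources.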

The specific way in which the first recommendation unit 704 recommends the third audio data is the same as the way in which the first execution unit 703 recommends the multimedia data, as described above, and is not repeated here.

As can be seen from the above, the third embodiment of the electronic device of the present invention can not only perform a predetermined operation on the multimedia data, but can also recommend, to a network or to other electronic devices, third audio data that has a certain similarity to the second audio data or the same rhythm attribute. Operations on audio data such as automatic collection, setting for a specific function, and recommendation can thus be carried out without user involvement, which makes the functions of the electronic device more diverse and user-friendly and improves the user experience.

The present invention provides a fourth embodiment of an electronic device. The electronic device comprises an audio input unit and an audio output unit, wherein the audio input unit may specifically be a microphone, an external earphone, or the like, and the audio output unit may specifically be an earpiece, an external earphone, or the like.

Fig. 8 is a schematic structural diagram of the fourth embodiment of the electronic device of the present invention. As shown in Fig. 8, the electronic device comprises: a first detection unit 801, a first judging unit 802, a first execution unit 803, and a first replacement unit 804, wherein:

The first detection unit 801 is configured to detect, while the audio output unit is playing multimedia data, second audio data input through the audio input unit, wherein the multimedia data comprises at least first audio data.

The second audio data may be music hummed by the user; the first detection unit 801 detects the audio that the user inputs through the audio input unit.

The first judging unit 802 is configured to judge whether the second audio data matches the first audio data in the multimedia data and, when they match, to trigger the first execution unit 803.

Further, the first judging unit 802 parses the first audio data out of the multimedia data, compares the second audio data with the first audio data to obtain a first similarity, and, when the first similarity is greater than a preset similarity threshold, determines that the second audio data matches the first audio data.

Specifically, the first judging unit 802 may compare the second audio data with the first audio data in any of the following three modes.

Mode one: the first judging unit 802 parses the audio waveform in the second audio data, compares it for similarity with the audio waveform in the first audio data, and generates the first sub-similarity of the first similarity; when the first sub-similarity is greater than the first sub-threshold of the similarity threshold, it determines that the second audio data matches the first audio data.

Mode two: the first judging unit 802 performs speech recognition on the input second audio data to obtain semantic information, compares the obtained semantic information for similarity with the semantic information contained in the first audio data, and generates the second sub-similarity of the first similarity; when the second sub-similarity is greater than the second sub-threshold of the similarity threshold, it determines that the second audio data matches the first audio data.

Mode three: the first judging unit 802 uses mode one and mode two together; when the first sub-similarity is greater than the first sub-threshold and the second sub-similarity is greater than the second sub-threshold, it determines that the second audio data matches the first audio data.

The first execution unit 803 is configured to perform a predetermined operation on the multimedia data.

Here, the first execution unit 803 may mark or collect the multimedia data. For example, it may mark the multimedia data as the user's favorite music, so that this music is played preferentially during subsequent playback; or it may add the multimedia data to a multimedia favorites folder, such as a music favorites folder, so that the user can easily find it later.

The first execution unit 803 may also set the multimedia data as the output audio of a predetermined function, for example as the incoming-call ringtone or the alarm tone.

The first execution unit 803 may also recommend the multimedia data.

Specifically, the first execution unit 803 may recommend the multimedia data to a network associated with the electronic device. For example, when a network on which the electronic device is registered is regarded as the associated network, the electronic device uploads the multimedia data through this network to Weibo (microblog) or QQ Zone and sets it as background music to decorate the Weibo or QQ Zone page.

The first execution unit 803 may also recommend the multimedia data to other electronic devices associated with the electronic device. For example, when the electronic device and another electronic device have established a friend relationship through a network hotspot, the electronic device may send the multimedia data to that device via QQ, WeChat, or a similar channel, for use by that device.

The first replacement unit 804 is configured to replace the first audio data in the multimedia data that matches the second audio data with the second audio data, forming fourth audio data.

Here, consider the following situation: the multimedia data is audio that combines rapping and singing, the first audio data in the multimedia data is the lyrics to be sung, and the remaining parts are the lyrics to be spoken. The first replacement unit 804 then uses the part hummed by the user, i.e. the second audio data, to replace the first audio data in the multimedia data. The resulting multimedia data, i.e. the fourth audio data, comprises two parts: the lyrics hummed by the user and the lyrics to be spoken.
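The replacement step can be sketched by modeling the multimedia data as a list of segments and swapping in the user's audio for the matched segment. This is a hedged illustration only: the `Segment` type, the `"sung"`/`"spoken"` labels, the `matched` flag, and the representation of audio as tuples of sample values are all assumptions made for the example, not details from the patent text.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass(frozen=True)
class Segment:
    kind: str                    # hypothetical label: "sung" or "spoken"
    samples: Tuple[float, ...]   # audio samples of this segment
    matched: bool = False        # True if this is the first audio data matched by the hum


def build_fourth_audio(multimedia: List[Segment],
                       second_audio: Sequence[float]) -> List[Segment]:
    """Return a new segment list in which each matched segment carries the
    user's second audio data; unmatched segments are kept unchanged."""
    replacement = tuple(second_audio)
    return [
        Segment(seg.kind, replacement, matched=True) if seg.matched else seg
        for seg in multimedia
    ]
```

The function returns a new list rather than modifying its input, so the original multimedia data remains available alongside the fourth audio data.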

As can be seen from the above, the fourth embodiment of the electronic device of the present invention can not only perform a predetermined operation on the multimedia data when the second audio data is judged to match the first audio data in the multimedia data, but can also replace the first audio data in the multimedia data with the second audio data to form new audio data, which improves the user experience.

It should be noted that the electronic device mentioned in the embodiments of the present invention includes, but is not limited to: various types of computers such as industrial control computers and personal computers, all-in-one computers, tablet computers, mobile phones, e-readers, and the like. The preferred electronic device in the embodiments of the present invention is a mobile phone.

In the several embodiments provided in this application, it should be understood that the disclosed device and method may be implemented in other ways. The device embodiments described above are merely illustrative. For example, the division into units is only a division by logical function; other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another system, or some features may be omitted or not performed. In addition, the coupling, direct coupling, or communication connection between the components shown or discussed may be through some interfaces, and the indirect coupling or communication connection between devices or units may be electrical, mechanical, or in other forms.

The units described above as separate components may or may not be physically separate, and the components shown as units may or may not be physical units; they may be located in one place or distributed over multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the scheme of the embodiment.

In addition, the functional units in the embodiments of the present invention may all be integrated into one processing unit, each unit may serve separately as a unit, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware, or in the form of hardware plus software functional units.

A person of ordinary skill in the art will understand that all or part of the steps of the above method embodiments may be carried out by hardware under the control of program instructions. The program may be stored in a computer-readable storage medium and, when executed, performs the steps of the above method embodiments. The storage medium includes any medium capable of storing program code, such as a removable storage device, a read-only memory (ROM), a random access memory (RAM), a magnetic disk, or an optical disc.

The above is only a specific embodiment of the present invention, but the protection scope of the present invention is not limited to it. Any change or replacement that a person skilled in the art could readily conceive within the technical scope disclosed by the present invention shall fall within the protection scope of the present invention. Therefore, the protection scope of the present invention shall be determined by the protection scope of the claims.

Claims

1. An information processing method, applied to an electronic device having an audio input unit and an audio output unit, the method comprising:

detecting second audio data input through the audio input unit;

2. The method according to claim 1, wherein judging whether the input second audio data matches the first audio data in the multimedia data being played comprises:

parsing the first audio data out of the multimedia data;

comparing the second audio data with the first audio data to obtain a first similarity;

when the first similarity is greater than a preset similarity threshold, determining that the second audio data matches the first audio data.

3. The method according to claim 2, wherein comparing the second audio data with the first audio data comprises:

parsing the audio waveform in the second audio data;

comparing the similarity between the audio waveform in the second audio data and the audio waveform in the first audio data, and generating a first sub-similarity of the first similarity;

correspondingly, determining that the second audio data matches the first audio data when the first similarity is greater than the preset similarity threshold comprises:

when the first sub-similarity is greater than a first sub-threshold of the similarity threshold, determining that the second audio data matches the first audio data.

4. The method according to claim 2 or 3, wherein comparing the second audio data with the first audio data comprises:

performing speech recognition on the input second audio data to obtain semantic information;

comparing the similarity between the obtained semantic information and the semantic information contained in the first audio data, and generating a second sub-similarity of the first similarity;

when the second sub-similarity is greater than a second sub-threshold of the similarity threshold, determining that the second audio data matches the first audio data.

5. The method according to claim 1, wherein performing the predetermined operation on the multimedia data comprises:

marking or collecting the multimedia data; and/or,

setting the multimedia data as the output audio of a predetermined function; and/or,

recommending the multimedia data.

6. The method according to claim 5, wherein recommending the multimedia data comprises:

recommending the multimedia data to a network associated with the electronic device; and/or,

recommending the multimedia data to other electronic devices associated with the electronic device.

7. The method according to any one of claims 1 to 6, wherein the method further comprises:

recommending, according to a preset rule, third audio data associated with the second audio data.

8. The method according to claim 7, wherein the method further comprises:

searching for audio data whose similarity to the second audio data is greater than a preset second threshold, and determining the audio data found to be the third audio data associated with the second audio data.

9. The method according to claim 7, wherein the method further comprises:

obtaining a first feature of the second audio data, the first feature characterizing the prosodic features of the second audio data;

searching for audio data that satisfies the first feature, and determining the audio data found to be the third audio data associated with the second audio data.

10. The method according to claim 8 or 9, wherein the method further comprises:

searching in the local resources of the electronic device; and/or,

searching in the resources of other electronic devices associated with the electronic device; and/or,

searching in the resources of a network associated with the electronic device.

11. The method according to any one of claims 1 to 6, wherein the method further comprises:

replacing the first audio data in the multimedia data that matches the second audio data with the second audio data, forming fourth audio data.

12. An electronic device, comprising an audio input unit and an audio output unit, the electronic device further comprising:

13. The electronic device according to claim 12, wherein the first judging unit is configured to:

parse the first audio data out of the multimedia data;

14. The electronic device according to claim 13, wherein the first judging unit is configured to:

parse the audio waveform in the second audio data;

15. The electronic device according to claim 13 or 14, wherein the first judging unit is configured to:

16. The electronic device according to claim 12, wherein the first execution unit is configured to:

mark or collect the multimedia data; and/or,

recommend the multimedia data.

17. The electronic device according to claim 16, wherein the first execution unit is configured to:

18. The electronic device according to any one of claims 12 to 17, wherein the electronic device further comprises:

a first recommendation unit, configured to recommend, according to a preset rule, third audio data associated with the second audio data.

19. The electronic device according to claim 18, wherein the electronic device further comprises:

a first determining unit, configured to search for audio data whose similarity to the second audio data is greater than a preset second threshold, and to determine the audio data found to be the third audio data associated with the second audio data.

20. The electronic device according to claim 18 or 19, wherein the electronic device further comprises:

a second determining unit, configured to obtain a first feature of the second audio data, the first feature characterizing the prosodic features of the second audio data;

21. The electronic device according to claim 19 or 20, wherein

searching is performed in the local resources of the electronic device; and/or,

22. The electronic device according to any one of claims 12 to 17, wherein the electronic device further comprises:

a first replacement unit, configured to replace the first audio data in the multimedia data that matches the second audio data with the second audio data, forming fourth audio data.