CN105808780B - Song recognition method and apparatus - Google Patents

Song recognition method and apparatus

Info

Publication number
CN105808780B
CN105808780B CN201610194530.3A CN201610194530A CN105808780B CN 105808780 B CN105808780 B CN 105808780B CN 201610194530 A CN201610194530 A CN 201610194530A CN 105808780 B CN105808780 B CN 105808780B
Authority
CN
China
Prior art keywords
information
audio
song
target
equipment
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201610194530.3A
Other languages
Chinese (zh)
Other versions
CN105808780A (en)
Inventor
王发靖
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangzhou Kugou Computer Technology Co Ltd
Original Assignee
Guangzhou Kugou Computer Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangzhou Kugou Computer Technology Co Ltd
Priority to CN201610194530.3A
Publication of CN105808780A
Application granted
Publication of CN105808780B
Legal status: Active
Anticipated expiration

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/63 Querying
    • G06F16/638 Presentation of query results
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00 Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60 Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683 Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser

Abstract

The invention discloses a song recognition method and apparatus, belonging to the field of Internet technology. The method includes: triggering a song recognition instruction when a designated character is received; after the song recognition instruction is triggered, obtaining target audio information, where the target audio information is the audio information currently being played, and the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device; and, based on the target audio information, identifying a target song through a server, where the target song is the song corresponding to the target audio information. Because the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device, the invention improves the accuracy and efficiency of obtaining the target audio information, and in turn the accuracy and efficiency of song recognition.

Description

Song recognition method and apparatus
Technical field
The present invention relates to the field of Internet technology, and in particular to a song recognition method and apparatus.
Background
With the development of technology, devices such as mobile phones and computers have more and more functions, and such devices can play audio information such as songs. Taking a device playing a song as an example, while the device plays the song the user is often interested in the song currently being played and wants to obtain related information about it, such as the song title, author, song style, and album name. A song recognition method is therefore needed.
In the related art, when a song currently being played on a first device needs to be identified, the user has to manually open a related application on a second device, record the song currently being played on the first device through that application to obtain target audio information, and send the target audio information to a server. When the server receives the target audio information, it identifies the target audio information against a stored audio information library and sends the recognition result to the second device.
In the course of implementing the present invention, the inventor found that the prior art has at least the following problems. First, the user has to manually open the related application on the second device and use the second device to record the song currently being played on the first device, which is complicated and cumbersome. Second, opening the related application takes time, so by the time the user has opened it on the second device, the song played on the first device may already have been missed; it is therefore difficult to record the song currently being played on the first device accurately, which reduces the accuracy of the recorded target audio information. Finally, when the second device records the song currently being played on the first device, interference such as ambient noise may be recorded as well, which makes it harder for the server to identify the target audio information and reduces recognition efficiency.
Summary of the invention
In order to solve the problems in the prior art, embodiments of the present invention provide a song recognition method and apparatus. The technical solution is as follows:
In a first aspect, a song recognition method is provided, the method comprising:
triggering a song recognition instruction when a designated character is received;
after the song recognition instruction is triggered, obtaining target audio information, where the target audio information is the audio information currently being played, and the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device;
based on the target audio information, identifying a target song through a server, where the target song is the song corresponding to the target audio information.
Optionally, the triggering a song recognition instruction when a designated character is received comprises:
triggering the song recognition instruction when the designated character is received in a target text input box, where the target text input box is any text input box in the page currently playing the target audio information.
Optionally, after the identifying a target song through a server based on the target audio information, the method further comprises:
displaying target song information, where the target song information is obtained by the server identifying the target song based on the target audio information.
Optionally, after the identifying a target song through a server based on the target audio information, the method further comprises:
calling an installed designated audio playback application;
playing the target song through the designated audio playback application.
Optionally, the obtaining target audio information after the song recognition instruction is triggered comprises:
if multiple pieces of audio information are currently being played, obtaining, from among them, the audio information currently being played in the foreground;
determining the obtained audio information as the target audio information.
Optionally, the obtaining target audio information after the song recognition instruction is triggered comprises:
if multiple pieces of audio information are currently being played, obtaining the program identifiers of the programs in which the multiple pieces of audio information are being played;
displaying the program identifiers of the programs in which the multiple pieces of audio information are being played;
when a selection instruction is received based on the displayed program identifiers, determining the audio information selected by the selection instruction as the target audio information.
Optionally, the obtaining target audio information after the song recognition instruction is triggered comprises:
obtaining the audio information currently being played;
decomposing the audio information into background music information and voiceless sound information, where the voiceless sound information is the information in the audio information other than the background music information;
selecting one piece of information from the background music information and the voiceless sound information, and determining the selected information as the target audio information.
In a second aspect, a song recognition apparatus is provided, the apparatus comprising:
a receiving module, configured to trigger a song recognition instruction when a designated character is received;
an obtaining module, configured to obtain target audio information after the song recognition instruction is triggered, where the target audio information is the audio information currently being played, and the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device;
an identification module, configured to identify a target song through a server based on the target audio information, where the target song is the song corresponding to the target audio information.
Optionally, the receiving module comprises:
a trigger unit, configured to trigger the song recognition instruction when the designated character is received in a target text input box, where the target text input box is any text input box in the page currently playing the target audio information.
Optionally, the apparatus further comprises:
a display module, configured to display target song information, where the target song information is obtained by the server identifying the target song based on the target audio information.
Optionally, the apparatus further comprises:
a calling module, configured to call an installed designated audio playback application;
a playing module, configured to play the target song through the designated audio playback application.
Optionally, the obtaining module comprises:
a first obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain from among them the audio information currently being played in the foreground;
a first determination unit, configured to determine the obtained audio information as the target audio information.
Optionally, the obtaining module comprises:
a second obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain the program identifiers of the programs in which the multiple pieces of audio information are being played;
a display unit, configured to display the program identifiers of the programs in which the multiple pieces of audio information are being played;
a second determination unit, configured to, when a selection instruction is received based on the displayed program identifiers, determine the audio information selected by the selection instruction as the target audio information.
Optionally, the obtaining module comprises:
a third obtaining unit, configured to obtain the audio information currently being played;
a decomposition unit, configured to decompose the audio information into background music information and voiceless sound information, where the voiceless sound information is the information in the audio information other than the background music information;
a third determination unit, configured to select one piece of information from the background music information and the voiceless sound information and determine the selected information as the target audio information.
The technical solution provided by the embodiments of the present invention has the following beneficial effects. In the embodiments of the present invention, the device triggers a song recognition instruction when it receives a designated character; that is, the user can make the device trigger the song recognition instruction quickly by inputting the designated character, which improves the speed of responding to the song recognition instruction. After the song recognition instruction is triggered, the device obtains target audio information. Because the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device, the target audio information does not need to be obtained through another device, which improves the accuracy and efficiency of obtaining the target audio information, and in turn the accuracy and efficiency of song recognition. After the target audio information is obtained, the target song is identified through a server based on the target audio information, which further improves the efficiency of identifying the target song.
Brief description of the drawings
To describe the technical solutions in the embodiments of the present invention more clearly, the drawings needed for describing the embodiments are briefly introduced below. Obviously, the drawings described below show only some embodiments of the present invention, and a person of ordinary skill in the art can derive other drawings from them without creative effort.
Fig. 1 is a flowchart of a song recognition method provided by an embodiment of the present invention;
Fig. 2 is a flowchart of another song recognition method provided by an embodiment of the present invention;
Fig. 3 is a schematic diagram of a page playing audio information provided by an embodiment of the present invention;
Fig. 4 is a schematic diagram of another page playing audio information provided by an embodiment of the present invention;
Fig. 5 is a schematic structural diagram of a song recognition apparatus provided by an embodiment of the present invention.
Detailed description
To make the objectives, technical solutions, and advantages of the present invention clearer, the embodiments of the present invention are described in further detail below with reference to the drawings.
Before the embodiments of the present invention are explained in detail, their application scenario is introduced. When a user plays a song on a device such as a mobile phone or computer, the user is often interested in the song currently being played and wants to obtain related information about it. In the related art, when a song currently being played on a first device needs to be identified, the user has to manually open a related application on a second device, record the song currently being played on the first device through that application to obtain target audio information, and then send the target audio information to a server, which identifies it against a stored audio information library. However, because the user has to manually open the related application on the second device and record the song currently being played on the first device with the second device, the operation is complicated and cumbersome, and it is also difficult to record the song currently being played on the first device accurately. In addition, interference is easily recorded along with the song, so the accuracy of identifying the target audio information is very low. The embodiments of the present invention therefore provide a song recognition method that can improve the accuracy of identifying the target audio information.
Fig. 1 is a flowchart of a song recognition method provided by an embodiment of the present invention. Referring to Fig. 1, the interacting parties of the method are a device and a server, and the method includes:
Step 101: when a designated character is received, trigger a song recognition instruction.
Step 102: after the song recognition instruction is triggered, obtain target audio information, where the target audio information is the audio information currently being played, and the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device.
Step 103: based on the target audio information, identify a target song through a server, where the target song is the song corresponding to the target audio information.
In the embodiments of the present invention, the device triggers a song recognition instruction when it receives a designated character; that is, the user can make the device trigger the song recognition instruction quickly by inputting the designated character, which improves the speed of responding to the song recognition instruction. After the song recognition instruction is triggered, the device obtains target audio information. Because the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device, the target audio information does not need to be obtained through another device, which improves the accuracy and efficiency of obtaining the target audio information, and in turn the accuracy and efficiency of song recognition. After the target audio information is obtained, the target song is identified through a server based on the target audio information, which further improves the efficiency of identifying the target song.
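The three steps above can be sketched roughly as follows. This is only an illustration under stated assumptions: the source does not specify how audio is captured or how the server is contacted, so `get_playing_audio` and `recognize_on_server` are hypothetical callables supplied by the caller, and the designated string "@shibie" is borrowed from the examples given later in the description.

```python
from dataclasses import dataclass
from typing import Callable, Optional

# Example designated string taken from the description; any character or
# string could be configured instead.
DESIGNATED = "@shibie"


@dataclass
class Device:
    # Hypothetical hooks (not defined by the source): one returns the audio
    # this device is currently playing, the other asks a server to identify it.
    get_playing_audio: Callable[[], bytes]
    recognize_on_server: Callable[[bytes], str]

    def on_text_received(self, text: str) -> Optional[str]:
        # Step 101: only the designated character triggers recognition.
        if text != DESIGNATED:
            return None
        # Step 102: obtain the audio that this same device is playing.
        target_audio = self.get_playing_audio()
        # Step 103: the server identifies the target song from that audio.
        return self.recognize_on_server(target_audio)
```

Because both hooks live on the same `Device` object, no second recording device is involved, which is the point the embodiment relies on.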
Optionally, triggering the song recognition instruction when a designated character is received includes:
triggering the song recognition instruction when the designated character is received in a target text input box, where the target text input box is any text input box in the page currently playing the target audio information.
Optionally, after the target song is identified through the server based on the target audio information, the method further includes:
displaying target song information, where the target song information is obtained by the server identifying the target song based on the target audio information.
Optionally, after the target song is identified through the server based on the target audio information, the method further includes:
calling an installed designated audio playback application;
playing the target song through the designated audio playback application.
Optionally, obtaining the target audio information after the song recognition instruction is triggered includes:
if multiple pieces of audio information are currently being played, obtaining, from among them, the audio information currently being played in the foreground;
determining the obtained audio information as the target audio information.
Optionally, obtaining the target audio information after the song recognition instruction is triggered includes:
if multiple pieces of audio information are currently being played, obtaining the program identifiers of the programs in which the multiple pieces of audio information are being played;
displaying the program identifiers of the programs in which the multiple pieces of audio information are being played;
when a selection instruction is received based on the displayed program identifiers, determining the audio information selected by the selection instruction as the target audio information.
Optionally, obtaining the target audio information after the song recognition instruction is triggered includes:
obtaining the audio information currently being played;
decomposing the audio information into background music information and voiceless sound information, where the voiceless sound information is the information in the audio information other than the background music information;
selecting one piece of information from the background music information and the voiceless sound information, and determining the selected information as the target audio information.
All of the optional technical solutions above can be combined in any manner to form optional embodiments of the present invention, which are not described one by one here.
Fig. 2 is a flowchart of another song recognition method provided by an embodiment of the present invention. Referring to Fig. 2, the interacting parties of the method are a device and a server, and the method includes:
Step 201: when the device receives a designated character, it triggers a song recognition instruction.
When a user plays a song on a device such as a mobile phone or computer, the user is often interested in the song currently being played and wants to obtain related information about it. The device can therefore trigger the song recognition instruction when it receives the designated character, and then identify the song it is currently playing.
The device may be any device that can play audio information such as songs, for example a mobile phone or a computer. The audio information may be the audio played from an audio file in a format such as MP3 (Moving Picture Experts Group Audio Layer III) or WMA (Windows Media Audio), or the audio played from a video file in a format such as AVI (Audio Video Interleaved) or MP4 (MPEG-4 Part 14). Of course, in practical applications the audio information may also be the audio played from other types of sources, such as a radio broadcast, which is not specifically limited in the embodiments of the present invention.
It should be noted that the designated character may be a single character or a character string. For example, when the designated character is a single character, it may be "s", "f", and so on; when the designated character is a character string, the string may be "@shibie", "ctrl"+"alt"+"s", and so on, which is not specifically limited in the embodiments of the present invention.
It should also be noted that the designated character may be input to the device through a designated operation without being displayed in the page currently playing the audio information. For example, when the device is a computer, the designated operation may be pressing the Enter key first and then inputting the designated character; in that case the device can receive the designated character and trigger the song recognition instruction, while the designated character is not displayed in the page currently playing the audio information. Of course, in practical applications the designated operation may take other forms, which is not specifically limited in the embodiments of the present invention.
In addition to being triggered in the above manner, the song recognition instruction may also be triggered by the user performing a preset operation, such as a click operation or a slide operation, which is not specifically limited in the embodiments of the present invention.
It should be noted that, because a user can normally input a designated character faster than the user can perform a preset operation, triggering the song recognition instruction when the device receives the designated character can improve the speed of responding to the song recognition instruction.
Further, if the user triggers the song recognition instruction by inputting a designated character, the device triggers the song recognition instruction when it receives the designated character in a target text input box, where the target text input box is any text input box in the page currently playing the target audio information.
The page in which the device plays audio information usually also includes one or more text input boxes. For example, when the device plays the audio information of an audio file, a text input box may be used to search for audio files; when the device plays the audio information of a video file, a text input box may be used to enter bullet-screen comments. The user can therefore select a target text input box from the text input boxes in the page currently playing the audio information and input the designated character, and the device triggers the song recognition instruction when it receives the designated character in the target text input box.
For example, as shown in Fig. 3, when the device plays a film, the page currently playing the film also includes a text input box for entering bullet-screen comments. The user can input "shibie" in that text input box, and the device triggers the song recognition instruction when it receives "shibie" in the text input box.
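A minimal sketch of this trigger, assuming a comment box whose submit handler checks for the designated string before posting. The `CommentBox` class and its `on_trigger` callback are hypothetical names used for illustration, and suppressing the post when the designated string is matched is an assumption; the source only says the instruction is triggered.

```python
from typing import Callable, List


class CommentBox:
    """Hypothetical text input box on the page that is playing the film."""

    def __init__(self, designated: str, on_trigger: Callable[[], None]) -> None:
        self.designated = designated
        self.on_trigger = on_trigger
        self.posted: List[str] = []  # comments that were posted normally

    def submit(self, text: str) -> bool:
        """Return True if the text triggered recognition instead of posting."""
        if text.strip() == self.designated:
            # Designated character received: trigger the recognition instruction.
            self.on_trigger()
            return True
        # Any other text is treated as an ordinary comment (assumption).
        self.posted.append(text)
        return False
```

For instance, constructing `CommentBox("shibie", start_recognition)` with some caller-provided `start_recognition` function would let the same box serve both for ordinary comments and for the trigger.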
Step 202: after the song recognition instruction is triggered, the device obtains target audio information and sends the target audio information to the server, where the target audio information is the audio information currently being played, and the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device.
Because different songs have different audio information, different songs can be identified from their audio information. After the song recognition instruction is triggered, the device can therefore determine the audio information currently being played as the target audio information. In addition, because local storage on the device is limited, the amount of audio information it can store is also limited. To improve the efficiency of song recognition, the device can therefore send the target audio information to the server once it has been obtained, so that the server performs the identification.
When the device plays audio information, it normally sends the audio information to the audio output module in the device, which then plays it. Therefore, when the device needs to obtain the target audio information, it can take the audio information that is being sent to the audio output module and determine it as the target audio information. Because, in the embodiments of the present invention, the device that receives the song recognition instruction and the device currently playing the target audio information can be the same device, there is no need to wait until the audio output module has played the audio information and then record it with another device. The user therefore does not have to obtain the target audio information manually, and obtaining it is not affected by interference such as ambient noise, which improves the accuracy and efficiency of obtaining the target audio information, and in turn the accuracy and efficiency of song recognition.
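One way to picture taking the audio on its way to the audio output module is a wrapper that keeps a copy of the most recent chunks as they are written. This is a sketch under assumptions, not the patent's implementation: `TappedAudioOutput`, the chunk-based interface, and the `keep_chunks` window are all hypothetical, and the `played` list merely stands in for real playback hardware.

```python
from collections import deque
from typing import Deque, List


class TappedAudioOutput:
    """Hypothetical audio output module that also retains recent audio."""

    def __init__(self, keep_chunks: int = 4) -> None:
        # Bounded buffer: older chunks are dropped automatically.
        self._recent: Deque[bytes] = deque(maxlen=keep_chunks)
        self.played: List[bytes] = []  # stand-in for the real output path

    def write(self, chunk: bytes) -> None:
        self._recent.append(chunk)  # copy kept for song recognition
        self.played.append(chunk)   # normal playback path is unchanged

    def target_audio(self) -> bytes:
        # The digital signal itself, so no microphone or ambient noise is involved.
        return b"".join(self._recent)
```

Because the copy is taken before playback, the captured audio is the digital signal itself rather than a re-recording, which matches the noise-free property described above.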
Further, the device may currently be playing multiple pieces of audio information, for example a film and a song at the same time. When the device is playing multiple pieces of audio information and needs to obtain the target audio information, it can select one of them and determine the selected audio information as the target audio information. This can be done in either of the following two ways.
In the first way, if multiple pieces of audio information are currently being played, the device obtains, from among them, the audio information currently being played in the foreground, and determines the obtained audio information as the target audio information.
When the device plays multiple pieces of audio information at the same time, the audio information currently being played in the foreground is very likely the target audio information the user wants. For example, the audio currently being played in the foreground may be the audio of the film the user is watching. The device can therefore obtain the audio information currently being played in the foreground and determine it as the target audio information.
It should be noted that, when the device can display multiple pages at the same time and the pages are stacked, the audio information played in the foreground is the audio information currently played by the page at the top of the stack; when the device can display only one page at a time, the audio information played in the foreground is the audio information played in the page the device is currently displaying.
For example, when the device can display multiple pages at the same time, as shown in Fig. 4, the device displays three stacked pages, "News Hookup", "Harry Potter", and "The Sound of Music", with the "News Hookup" page at the top of the three. The device can therefore obtain audio 1, which the "News Hookup" page is currently playing, and determine audio 1 as the target audio information.
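The foreground rule can be illustrated with a small sketch that picks the audio of the topmost page in a stack. The `Page` type and its `z_index` field are hypothetical, as are the labels "audio 2" and "audio 3" for the two lower pages; only "audio 1" for the "News Hookup" page comes from the example.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Page:
    title: str
    audio: str
    z_index: int  # assumption: a larger value means closer to the top


def foreground_audio(pages: List[Page]) -> Optional[str]:
    """Return the audio played by the topmost page, or None if no page plays."""
    if not pages:
        return None
    return max(pages, key=lambda p: p.z_index).audio
```

With only one page displayed, the list has a single element and that page's audio is returned, which covers the single-page case described above.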
Second way: if multiple pieces of audio information are currently being played, the device obtains the program identifier of each piece of audio information and displays these program identifiers; when a selection instruction is received based on the displayed program identifiers, the audio information selected by the selection instruction is determined as the target audio information.
Because the multiple pieces of audio information are played simultaneously, any one of them may be the target audio information that the user wants. To improve the accuracy of obtaining the target audio information, the device can therefore obtain the program identifier of each currently playing piece of audio information, display these program identifiers, and, when a selection instruction is received, determine the audio information selected by the selection instruction as the target audio information.
For example, when the program identifiers obtained by the device for the currently playing pieces of audio information are "News Hookup", "Harry Potter" and "The Sound of Music", the device displays these three program identifiers. When selection instruction 1 is received and the program identifier it selects is "News Hookup", the device determines "audio 1", the audio information corresponding to "News Hookup", as the target audio information.
It should be noted that the device may display the program identifiers in various ways, such as in a window or in a pop-up; the embodiment of the present invention does not specifically limit this.
It should also be noted that the program identifier may be the page name of the page playing the audio information, or the page attribute of that page. Of course, in practical applications, the program identifier may also be any other identifier that can distinguish the multiple pieces of audio information from one another; the embodiment of the present invention does not specifically limit this.
The page name may be the program name corresponding to the currently playing audio information, such as a song title or a film title; the page attribute may be the name of the application playing the audio information, such as a music player or a video player. Of course, in practical applications, the page name and page attribute may also have other meanings; the embodiment of the present invention does not specifically limit this.
For example, as shown in Fig. 4, the device plays three pieces of audio information simultaneously. The page attribute of page 1 is network radio station, its page name is "News Hookup", and it plays audio 1; the page attribute of page 2 is video player, its page name is "Harry Potter", and it plays audio 2; the page attribute of page 3 is music player, its page name is "The Sound of Music", and it plays audio 3. When the program identifier is the page name, the program identifiers obtained by the device are "News Hookup", "Harry Potter" and "The Sound of Music"; when the program identifier is the page attribute, the program identifiers obtained by the device are "network radio station", "video player" and "music player".
In addition, in practical applications, the device may also select one piece of audio information from the multiple pieces in other ways and determine the selected audio information as the target audio information; the embodiment of the present invention does not specifically limit this.
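The second way can be illustrated with a short sketch. It assumes a hypothetical in-memory description of the three pages in the Fig. 4 example; the dictionary keys `page_name`, `page_attribute` and `audio`, and both function names, are invented for illustration only.

```python
from typing import Dict, List

# Hypothetical description of the three pages in the Fig. 4 example.
PAGES: List[Dict[str, str]] = [
    {"page_name": "News Hookup", "page_attribute": "network radio station", "audio": "audio 1"},
    {"page_name": "Harry Potter", "page_attribute": "video player", "audio": "audio 2"},
    {"page_name": "The Sound of Music", "page_attribute": "music player", "audio": "audio 3"},
]


def program_identifiers(pages: List[Dict[str, str]], kind: str = "page_name") -> List[str]:
    """List the identifiers to display; kind is 'page_name' or 'page_attribute'."""
    return [page[kind] for page in pages]


def select_target_audio(pages: List[Dict[str, str]], chosen: str,
                        kind: str = "page_name") -> str:
    """Map the identifier chosen by the selection instruction to its audio."""
    for page in pages:
        if page[kind] == chosen:
            return page["audio"]
    raise ValueError(f"no playing audio has the identifier {chosen!r}")


shown = program_identifiers(PAGES)                 # what the device displays
target = select_target_audio(PAGES, "News Hookup")  # the user picks one identifier
```

Passing `kind="page_attribute"` to both functions gives the variant in which the displayed identifiers are the application names.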
Further, in the second way, the device may directly determine the audio information selected by the selection instruction as the target audio information, or it may decompose the selected audio information into background music information and vocal (a cappella) information, select one of the two, and determine the selected information as the target audio information; the embodiment of the present invention does not specifically limit this.
Further, the device may obtain the target audio information not only by the above methods but also in the following way: obtain the currently playing audio information; decompose the audio information into background music information and vocal information, the vocal information being the information in the audio information other than the background music information; select one of the background music information and the vocal information; and determine the selected information as the target audio information.
The audio information of a song generally comprises two parts: background music information played by instruments such as piano and flute, and vocal information sung by the singer. The user may be interested only in the background music and want to find songs that are sung by other singers over the same background music, or may be interested only in the singer's vocals and want to find the songs that singer performs. Therefore, to further improve the accuracy and efficiency of song recognition, the device can obtain the currently playing audio information, decompose it into background music information and vocal information, select one of the two, and determine the selected information as the target audio information, so that only the background music information or only the vocal information is identified.
It should be noted that background music information and vocal information correspond to different tracks of a song, so when the audio information is decomposed, the background music information and the vocal information can be determined from the different tracks. Of course, in practical applications, the audio information may also be decomposed into background music information and vocal information by referring to the prior art, which the embodiment of the present invention does not describe in detail.
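The track-based decomposition can be sketched as below. This is a toy model under explicit assumptions: the audio is already available as named tracks of integer samples, the caller knows which track names are background music, and tracks are mixed by sample-wise addition. None of these details, nor the function names, come from the patent, which leaves the decomposition method to the prior art.

```python
from typing import Dict, List, Set, Tuple

Samples = List[int]


def mix(tracks: List[Samples]) -> Samples:
    """Mix tracks by adding samples position by position (empty input gives [])."""
    return [sum(frame) for frame in zip(*tracks)]


def decompose(tracks: Dict[str, Samples],
              background_names: Set[str]) -> Tuple[Samples, Samples]:
    """Split audio into (background music, vocal); vocal is everything else."""
    background = mix([s for name, s in tracks.items() if name in background_names])
    vocal = mix([s for name, s in tracks.items() if name not in background_names])
    return background, vocal


def choose_target(tracks: Dict[str, Samples], background_names: Set[str],
                  want: str) -> Samples:
    """Pick one of the two parts as the target audio information."""
    background, vocal = decompose(tracks, background_names)
    if want == "background":
        return background
    if want == "vocal":
        return vocal
    raise ValueError("want must be 'background' or 'vocal'")


song = {"piano": [1, 2, 3], "flute": [3, 1, 0], "singer": [5, 5, 5]}
target = choose_target(song, {"piano", "flute"}, "background")
```

Defining the vocal part as "all tracks that are not background" follows the text's definition of the vocal information as the remainder of the audio information.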
Step 203: the server identifies the target song based on the target audio information and, when identification succeeds, sends the target song information to the device; the target song is the song corresponding to the target audio information.
Specifically, when the server receives the target audio information sent by the device, it can identify the target audio information against a stored audio information library, and thereby identify the target song. When identification succeeds, the server sends the target song information to the device.
When the server identifies the target audio information against the stored audio information library, it can match the target audio information against the audio information stored in the library; when the library contains audio information that matches the target audio information, the server determines that identification has succeeded.
It should be noted that the method of matching the target audio information against the audio information stored in the audio information library can refer to the prior art, and the embodiment of the present invention does not describe it in detail.
The audio information library may include audio information and the song information corresponding to that audio information. Before identifying the target song based on the target audio information, the server can store audio information together with its corresponding song information in the audio information library, as shown in Table 1.
Table 1
It should be noted that the embodiment of the present invention uses the audio information and song information shown in Table 1 only as an example; Table 1 does not limit the embodiment of the present invention.
It should also be noted that the song information may include the song title, singer, album name, title, era, style, link address and similar information. Of course, in practical applications, the song information may also include other information; the embodiment of the present invention does not specifically limit this.
In addition, when the audio information library contains no audio information that matches the target audio information, the server determines that identification has failed and sends recognition failure prompt information to the device; the failure prompt information informs the user that recognition of the target song has failed.
Because the amount of audio information the server can store is limited, the target audio information may not be present in the audio information library. When it is not present, the server determines that identification has failed and sends recognition failure prompt information to the device.
For example, when the target audio information received by the server is audio 1 and audio 1 is present in the audio information library stored by the server, the server determines that identification has succeeded and sends song information 1, which corresponds to audio 1, to the device. When the target audio information received by the server is audio 4 and audio 4 is not present in the audio information library, the server determines that identification has failed and sends recognition failure prompt information to the device.
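The server-side behaviour in this example can be sketched as a lookup. The sketch makes a strong simplification: matching is modelled as an exact dictionary key lookup, whereas the patent defers the actual matching method to the prior art. The library contents, the response fields and the prompt wording are all hypothetical.

```python
from typing import Dict

# Hypothetical audio information library: audio information -> song information.
AUDIO_LIBRARY: Dict[str, str] = {
    "audio 1": "song information 1",
    "audio 2": "song information 2",
    "audio 3": "song information 3",
}


def recognize(target_audio: str, library: Dict[str, str]) -> Dict[str, str]:
    """Return the response the server would send to the device."""
    song_info = library.get(target_audio)
    if song_info is None:
        # No matching audio information: identification fails.
        return {"status": "failure", "prompt": "Target song recognition failed"}
    return {"status": "success", "song_info": song_info}


ok = recognize("audio 1", AUDIO_LIBRARY)      # present in the library
failed = recognize("audio 4", AUDIO_LIBRARY)  # absent from the library
```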
Step 204: when the device receives the target song information sent by the server, it displays the target song information; the target song information is obtained by the server identifying the target song based on the target audio information.
When the device receives the target song information, it can display it, so that the user can quickly learn information related to the target song.
It should be noted that the device may display the target song information in various ways, such as in a window or in a pop-up; the embodiment of the present invention does not specifically limit this.
In addition, as described above, the target song information includes the song title, link address and similar information. Therefore, when the device receives the target song information, it can call an installed designated audio playback application according to the target song information and play the target song through that application.
When the device receives the target song information and plays the target song, the user can judge intuitively from the played song whether the song identified by the device is the correct one, which further improves the accuracy of song recognition.
It should be noted that, when the target song is played through the designated audio playback application, the network resource corresponding to the target song can be obtained from the link address included in the target song information, and the target song can then be played from that network resource; alternatively, the target song can be obtained from the song server of the designated audio playback application according to the target song information and then played. Of course, in practical applications, the device may also play the target song through the designated audio playback application according to other information; the embodiment of the present invention does not specifically limit this.
Further, as described above, when the server determines that identification has failed, it can send recognition failure prompt information to the device. Therefore, when the device receives the recognition failure prompt information, it can also display that prompt information.
It should be noted that the device may display the recognition failure prompt information in various ways, such as in a window or in a pop-up; the embodiment of the present invention does not specifically limit this.
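The two playback paths just described can be sketched as a small decision function. The field names `song_title` and `link_address`, the returned tuples and the example URL are hypothetical; the sketch only decides where the song would be fetched from and does not play anything.

```python
from typing import Dict, Tuple


def playback_source(song_info: Dict[str, str]) -> Tuple[str, str]:
    """Decide how the designated audio playback application fetches the song.

    Returns ("link", address) when a link address is available, otherwise
    ("song_server", title) to look the song up on the application's song server.
    """
    link = song_info.get("link_address")
    if link:
        return ("link", link)
    return ("song_server", song_info["song_title"])


with_link = playback_source(
    {"song_title": "song 1", "link_address": "https://example.com/song1"}
)
without_link = playback_source({"song_title": "song 1"})
```

Preferring the link address when it exists is a design choice of this sketch; the text presents the two paths as alternatives without ranking them.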
In the embodiment of the present invention, the user can enter a designated character in any text input box of the page that is currently playing audio information; when the device receives the designated character, it can quickly trigger the song recognition instruction, which improves the speed of responding to the song recognition instruction. In addition, after the song recognition instruction is triggered, the device obtains the target audio information and sends it to the server. Because the target audio information is the currently playing audio information, and the device that receives the song recognition instruction and the device that is playing the audio information are the same device, the device obtains the target audio information at the point where the audio information is sent to the device's audio output module, rather than by recording what the audio output module plays. This improves the accuracy and efficiency of obtaining the target audio information, and in turn the accuracy and efficiency of song recognition. Furthermore, after the server successfully identifies the target song from the target audio information, the device can display the target song information or play the target song, so that the user can quickly get the target song information, or can judge intuitively from the played song whether the identified song is the correct one, which further improves the accuracy of song recognition.
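The trigger described in this summary can be sketched as a text-input handler. The designated character string `#song`, the handler name and the callback for reading the playing audio are assumptions made for illustration; the section does not fix a concrete character or API.

```python
from typing import Callable, Optional

DESIGNATED_CHARACTER = "#song"  # hypothetical; the patent does not fix a value


def handle_text_input(typed_text: str,
                      get_playing_audio: Callable[[], str],
                      designated: str = DESIGNATED_CHARACTER) -> Optional[str]:
    """Trigger song recognition when the designated character is typed.

    Returns the target audio information taken from the device's own playback
    path (not a microphone recording), or None when nothing is triggered.
    """
    if designated not in typed_text:
        return None
    return get_playing_audio()


not_triggered = handle_text_input("hello", lambda: "audio 1")
triggered = handle_text_input("what is this #song", lambda: "audio 1")
```

Because the same device both receives the input and plays the audio, the callback can read the audio before it reaches the output module, which is the point the paragraph above makes about avoiding re-recording.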
Fig. 5 shows a song recognition device provided by an embodiment of the present invention. Referring to Fig. 5, the device includes: a receiving module 501, an obtaining module 502 and an identification module 503.
The receiving module 501 is configured to trigger a song recognition instruction when a designated character is received.
The obtaining module 502 is configured to obtain target audio information after the song recognition instruction is triggered, the target audio information being the currently playing audio information, where the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device.
The identification module 503 is configured to identify a target song through a server based on the target audio information, the target song being the song corresponding to the target audio information.
Optionally, the receiving module 501 includes:
A trigger unit, configured to trigger the song recognition instruction when the designated character is received in a target text input box, the target text input box being any text input box in the page currently playing the target audio information.
Optionally, the device further includes:
A display module, configured to display target song information, the target song information being obtained by the server identifying the target song based on the target audio information.
Optionally, the device further includes:
A calling module, configured to call an installed designated audio playback application;
A playing module, configured to play the target song through the designated audio playback application.
Optionally, the obtaining module 502 includes:
A first obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain from among them the audio information currently being played in the foreground;
A first determination unit, configured to determine the obtained audio information as the target audio information.
Optionally, the obtaining module 502 includes:
A second obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain the program identifier of each currently playing piece of audio information;
A display unit, configured to display the program identifiers of the multiple pieces of audio information;
A second determination unit, configured to, when a selection instruction is received based on the displayed program identifiers, determine the audio information selected by the selection instruction as the target audio information.
Optionally, the obtaining module 502 includes:
A third obtaining unit, configured to obtain the currently playing audio information;
A decomposition unit, configured to decompose the audio information into background music information and vocal information, the vocal information being the information in the audio information other than the background music information;
A third determination unit, configured to select one of the background music information and the vocal information and determine the selected information as the target audio information.
In conclusion in embodiments of the present invention, when the equipment receives designated character, triggering song recognition instruction, I.e. user, to make the equipment quickly trigger song recognition instruction, can improve by inputting designated character and respond the song The speed for identifying instruction, after triggering song recognition instruction, which obtains target audio information, due to for receiving this The equipment of song recognition instruction and the equipment of the currently playing target audio information are the same equipment, therefore, there is no need to pass through Other equipment obtain the target audio information, improve the accuracy rate and efficiency for obtaining the target audio information, and then improve The accuracy rate and efficiency of identification song.After obtaining the target audio information, it is based on the target audio information, passes through service Device identifies target song, further improves the efficiency for identifying the target song.
About the equipment in above-described embodiment, wherein modules execute the concrete mode of operation in related this method Embodiment in be described in detail, no detailed explanation will be given here.
Those of ordinary skill in the art will appreciate that realizing that all or part of the steps of above-described embodiment can pass through hardware It completes, relevant hardware can also be instructed to complete by program, the program can store in a kind of computer-readable In storage medium, storage medium mentioned above can be read-only memory, disk or CD etc..
The foregoing is merely presently preferred embodiments of the present invention, is not intended to limit the invention, it is all in spirit of the invention and Within principle, any modification, equivalent replacement, improvement and so on be should all be included in the protection scope of the present invention.

Claims (10)

1. A song recognition method, characterized in that the method comprises:
triggering a song recognition instruction when a designated character is received;
obtaining target audio information after the song recognition instruction is triggered, the target audio information being currently playing audio information, wherein the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device;
identifying a target song through a server based on the target audio information, the target song being the song corresponding to the target audio information;
wherein the triggering a song recognition instruction when a designated character is received comprises:
triggering the song recognition instruction when the designated character is received in a target text input box, the target text input box being any text input box in the page currently playing the target audio information;
wherein the obtaining target audio information after the song recognition instruction is triggered comprises:
obtaining the currently playing audio information;
decomposing the audio information into background music information and vocal information, the vocal information being the information in the audio information other than the background music information;
selecting one of the background music information and the vocal information, and determining the selected information as the target audio information.
2. The method according to claim 1, characterized in that, after the identifying a target song through a server based on the target audio information, the method further comprises:
displaying target song information, the target song information being obtained by the server identifying the target song based on the target audio information.
3. The method according to claim 1, characterized in that, after the identifying a target song through a server based on the target audio information, the method further comprises:
calling an installed designated audio playback application;
playing the target song through the designated audio playback application.
4. The method according to claim 1, characterized in that the obtaining target audio information after the song recognition instruction is triggered comprises:
if multiple pieces of audio information are currently being played, obtaining, from among the currently playing pieces of audio information, the audio information currently being played in the foreground;
determining the obtained audio information as the target audio information.
5. The method according to claim 1, characterized in that the obtaining target audio information after the song recognition instruction is triggered comprises:
if multiple pieces of audio information are currently being played, obtaining the program identifier of each currently playing piece of audio information;
displaying the program identifiers of the multiple pieces of audio information;
when a selection instruction is received based on the displayed program identifiers, determining the audio information selected by the selection instruction as the target audio information.
6. A song recognition device, characterized in that the device comprises:
a receiving module, configured to trigger a song recognition instruction when a designated character is received;
an obtaining module, configured to obtain target audio information after the song recognition instruction is triggered, the target audio information being currently playing audio information, wherein the device that receives the song recognition instruction and the device that is currently playing the target audio information are the same device;
an identification module, configured to identify a target song through a server based on the target audio information, the target song being the song corresponding to the target audio information;
wherein the receiving module comprises:
a trigger unit, configured to trigger the song recognition instruction when the designated character is received in a target text input box, the target text input box being any text input box in the page currently playing the target audio information;
the obtaining module comprises:
a third obtaining unit, configured to obtain the currently playing audio information;
a decomposition unit, configured to decompose the audio information into background music information and vocal information, the vocal information being the information in the audio information other than the background music information;
a third determination unit, configured to select one of the background music information and the vocal information and determine the selected information as the target audio information.
7. The device according to claim 6, characterized in that the device further comprises:
a display module, configured to display target song information, the target song information being obtained by the server identifying the target song based on the target audio information.
8. The device according to claim 6, characterized in that the device further comprises:
a calling module, configured to call an installed designated audio playback application;
a playing module, configured to play the target song through the designated audio playback application.
9. The device according to claim 6, characterized in that the obtaining module comprises:
a first obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain, from among the currently playing pieces of audio information, the audio information currently being played in the foreground;
a first determination unit, configured to determine the obtained audio information as the target audio information.
10. The device according to claim 6, characterized in that the obtaining module comprises:
a second obtaining unit, configured to, if multiple pieces of audio information are currently being played, obtain the program identifier of each currently playing piece of audio information;
a display unit, configured to display the program identifiers of the multiple pieces of audio information;
a second determination unit, configured to, when a selection instruction is received based on the displayed program identifiers, determine the audio information selected by the selection instruction as the target audio information.
CN201610194530.3A 2016-03-31 2016-03-31 Song recognition method and apparatus Active CN105808780B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610194530.3A CN105808780B (en) 2016-03-31 2016-03-31 Song recognition method and apparatus

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610194530.3A CN105808780B (en) 2016-03-31 2016-03-31 Song recognition method and apparatus

Publications (2)

Publication Number Publication Date
CN105808780A CN105808780A (en) 2016-07-27
CN105808780B true CN105808780B (en) 2019-11-22

Family

ID=56460541

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610194530.3A Active CN105808780B (en) 2016-03-31 2016-03-31 Song recognition method and apparatus

Country Status (1)

Country Link
CN (1) CN105808780B (en)

Also Published As

Publication number Publication date
CN105808780A (en) 2016-07-27

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
CB02 Change of applicant information

Address after: 510660 Guangzhou City, Guangzhou, Guangdong, Whampoa Avenue, No. 315, self - made 1-17

Applicant after: Guangzhou KuGou Networks Co., Ltd.

Address before: 510000 B1, building, No. 16, rhyme Road, Guangzhou, Guangdong, China 13F

Applicant before: Guangzhou KuGou Networks Co., Ltd.

CB02 Change of applicant information
GR01 Patent grant
GR01 Patent grant