CN109947979A - Song recognition method, apparatus, terminal and storage medium - Google Patents

Song recognition method, apparatus, terminal and storage medium

Info

Publication number
CN109947979A
Authority
CN
China
Prior art keywords
icon
audio
user interface
song
terminal
Prior art date
Application number
CN201810962656.XA
Other languages
Chinese (zh)
Inventor
宋方
Original Assignee
Oppo广东移动通信有限公司
Application filed by Oppo广东移动通信有限公司
Priority to CN201810962656.XA
Publication of CN109947979A

Abstract

This application discloses a song recognition method, apparatus, terminal and storage medium, belonging to the field of terminal technology. The method includes: displaying a first user interface of a first application program, in which audio-video content is being played; displaying an audio identification icon in the first user interface; and, when a first operation signal on the audio identification icon is received, displaying a recognition result icon in the first user interface, where the recognition result icon indicates whether the target song in the audio-video content has been successfully identified. In the embodiments of the present application, the terminal identifies the target song in the audio-video content that it is itself currently playing. This avoids the problem in the related art that a song in audio-video content played by a first terminal can only be identified with the help of a second terminal, which makes song recognition inefficient, and it improves the accuracy and efficiency of song recognition.

Description

Song recognition method, apparatus, terminal and storage medium

Technical field

The present application relates to the field of terminal technology, and in particular to a song recognition method, apparatus, terminal and storage medium.

Background

Multimedia application programs are installed in most current terminals. A terminal can play audio-video content through a multimedia application program.

While a first terminal is playing audio-video content, the user may become interested in a song in the content being played and wish to obtain information about the song, such as its song title, album name and singer name. A song recognition method provided by the related art includes: a second terminal collects, through a microphone, the audio-video content played by the first terminal, recognizes the collected audio-video content, and identifies the song in it. Here, the first terminal and the second terminal are different terminals.

Summary of the invention

The embodiments of the present application provide a song recognition method, apparatus, terminal and storage medium, which can be used to solve the problem in the related art that a song in audio-video content played by a first terminal can only be identified with the help of a second terminal, which makes song recognition inefficient. The technical solution is as follows:

According to a first aspect of the embodiments of the present application, a song recognition method is provided, applied to a terminal, the method including:

displaying a first user interface of a first application program, audio-video content being played in the first user interface;

displaying an audio identification icon in the first user interface, the audio identification icon being an entry for triggering identification of a target song in the audio-video content;

when a first operation signal on the audio identification icon is received, displaying a floating window in the first user interface, the floating window being used to display introductory information of the target song.

According to a second aspect of the embodiments of the present application, a song recognition apparatus is provided, applied to a terminal, the apparatus including:

a first display module, configured to display a first user interface of a first application program, audio-video content being played in the first user interface;

a second display module, configured to display an audio identification icon in the first user interface, the audio identification icon being an entry for triggering identification of a target song in the audio-video content;

a third display module, configured to display a floating window in the first user interface when a first operation signal on the audio identification icon is received, the floating window being used to display introductory information of the target song.

According to a third aspect of the embodiments of the present application, a terminal is provided. The terminal includes a processor and a memory, the memory stores at least one instruction, and the instruction is loaded and executed by the processor to implement the song recognition method according to the first aspect of the present application or any of its optional embodiments.

According to a fourth aspect of the embodiments of the present application, a computer-readable storage medium is provided. The storage medium stores at least one instruction, and the instruction is loaded and executed by a processor to implement the song recognition method according to the first aspect of the present application or any of its optional embodiments.

The beneficial effects of the technical solution provided by the embodiments of the present application include at least the following:

A first user interface of a first application program is displayed, with audio-video content being played in the first user interface; an audio identification icon is displayed in the first user interface; and, when a first operation signal on the audio identification icon is received, a recognition result icon is displayed in the first user interface, the recognition result icon indicating whether the target song in the audio-video content has been successfully identified. The terminal is thus able to identify the target song in the audio-video content that it is itself currently playing. This avoids the problem in the related art that a song in audio-video content played by a first terminal can only be identified with the help of a second terminal, which makes song recognition inefficient, and it improves the accuracy and efficiency of song recognition.

Brief description of the drawings

Fig. 1 is a structural schematic diagram of a terminal provided by an exemplary embodiment of the present application;

Fig. 2 is a structural schematic diagram of a terminal provided by another exemplary embodiment of the present application;

Fig. 3A to Fig. 3F are schematic diagrams of the appearance of terminals with different touch display screens provided by exemplary embodiments of the present application;

Fig. 4 is a flowchart of a song recognition method provided by an exemplary embodiment of the present application;

Fig. 5 is a schematic diagram of an interface involved in the song recognition method provided by an exemplary embodiment of the present application;

Fig. 6 is a flowchart of a song recognition method provided by another exemplary embodiment of the present application;

Fig. 7 to Fig. 11 are schematic diagrams of interfaces of the song recognition method provided by the embodiment of Fig. 6 during implementation;

Fig. 12 is a structural schematic diagram of a song recognition apparatus provided by an exemplary embodiment of the present application.

Detailed description of the embodiments

To make the purposes, technical solutions and advantages of the present application clearer, the embodiments of the present application are described in further detail below with reference to the accompanying drawings.

In the following description, when the accompanying drawings are referred to, the same numbers in different drawings indicate the same or similar elements unless otherwise indicated. The implementations described in the following exemplary embodiments do not represent all implementations consistent with the present application. On the contrary, they are merely examples of apparatuses and methods consistent with some aspects of the present application as detailed in the appended claims.

In the description of the present application, it should be understood that the terms "first", "second" and the like are used for descriptive purposes only and cannot be interpreted as indicating or implying relative importance. It should also be noted that, unless otherwise expressly specified and limited, the terms "connected" and "connection" shall be understood in a broad sense. For example, a connection may be a fixed connection, a detachable connection or an integral connection; it may be a mechanical connection or an electrical connection; it may be a direct connection or an indirect connection through an intermediary. A person of ordinary skill in the art can understand the specific meaning of these terms in the present application according to the specific situation. In addition, unless otherwise indicated, "multiple" in the description of the present application means two or more. "And/or" describes an association between associated objects and indicates that three relationships are possible. For example, "A and/or B" can mean: A alone, both A and B, or B alone. The character "/" generally indicates an "or" relationship between the objects before and after it.

Referring to Fig. 1 and Fig. 2, they show structural block diagrams of the terminal 100 provided by an exemplary embodiment of the present application. The terminal 100 may be a mobile phone, a tablet computer, a laptop, an e-book reader and the like. The terminal 100 in the present application may include one or more of the following components: a processor 110, a memory 120 and a touch display screen 130.

The processor 110 may include one or more processing cores. The processor 110 connects the various parts of the entire terminal 100 using various interfaces and lines, and performs the various functions of the terminal 100 and processes data by running or executing the instructions, programs, code sets or instruction sets stored in the memory 120 and by calling the data stored in the memory 120. Optionally, the processor 110 may be implemented in at least one hardware form among digital signal processing (Digital Signal Processing, DSP), field-programmable gate array (Field-Programmable Gate Array, FPGA) and programmable logic array (Programmable Logic Array, PLA). The processor 110 may integrate one or a combination of a central processing unit (Central Processing Unit, CPU), a graphics processing unit (Graphics Processing Unit, GPU), a modem and the like. The CPU mainly handles the operating system, the user interface, application programs and so on; the GPU is responsible for rendering and drawing the content to be displayed on the touch display screen 130; the modem handles wireless communication. It can be understood that the modem may also not be integrated into the processor 110 and may instead be implemented as a separate chip.

The memory 120 may include random access memory (Random Access Memory, RAM) and may also include read-only memory (Read-Only Memory, ROM). Optionally, the memory 120 includes a non-transitory computer-readable storage medium. The memory 120 can be used to store instructions, programs, code, code sets or instruction sets. The memory 120 may include a program storage area and a data storage area. The program storage area can store instructions for implementing the operating system, instructions for at least one function (such as a touch function, a sound playback function or an image playback function), instructions for implementing the method embodiments described below, and so on. The data storage area can store data created through use of the terminal 100 (such as audio data and a phone book).

Taking the Android operating system as an example, the programs and data stored in the memory 120 are as shown in Fig. 1. The memory 120 stores a Linux kernel layer 220, a system runtime library layer 240, an application framework layer 260 and an application layer 280. The Linux kernel layer 220 provides low-level drivers for the various hardware of the terminal 100, such as the display driver, audio driver, camera driver, Bluetooth driver, Wi-Fi driver and power management. The system runtime library layer 240 provides the main feature support for the Android system through a number of C/C++ libraries. For example, the SQLite library provides database support, the OpenGL/ES library provides 3D drawing support, and the Webkit library provides browser kernel support. The system runtime library layer 240 also provides the Android runtime 242 (Android Runtime), which mainly provides a number of core libraries that allow developers to write Android applications in the Java language. The application framework layer 260 provides the various APIs that may be used when building application programs; developers can build their own application programs using these APIs, which cover, for example, activity management, window management, view management, notification management, content providers, package management, call management, resource management and location management. At least one application program runs in the application layer 280. These application programs may be programs that come with the operating system, such as a contacts program, a messaging program, a clock program or a camera application; they may also be application programs developed by third-party developers, such as an instant messaging program or a photo beautification program.

Taking the iOS operating system as an example, the programs and data stored in the memory 120 are as shown in Fig. 2. The iOS system includes: a core operating system layer 320 (Core OS layer), a core services layer 340 (Core Services layer), a media layer 360 (Media layer) and a touchable layer 380 (Cocoa Touch Layer). The core operating system layer 320 includes the operating system kernel, drivers and low-level program frameworks. These low-level program frameworks provide functions closer to the hardware for use by the program frameworks in the core services layer 340. The core services layer 340 provides the system services and/or program frameworks required by application programs, such as the Foundation framework, an account framework, an advertising framework, a data storage framework, a network connection framework, a geographic location framework and a motion framework. The media layer 360 provides application programs with audio-visual interfaces, such as interfaces related to graphics and images, interfaces related to audio technology, interfaces related to video technology, and the wireless playback (AirPlay) interface for audio-video transmission. The touchable layer 380 provides various commonly used interface-related frameworks for application development, such as a local notification service, a remote push service, an advertising framework, a game tool framework, a message user interface (User Interface, UI) framework, the UIKit user interface framework and a map framework. The touchable layer 380 is responsible for the user's touch interactions on the terminal 100.

Among the frameworks shown in Fig. 2, those related to most application programs include, but are not limited to, the Foundation framework in the core services layer 340 and the UIKit framework in the touchable layer 380. The Foundation framework provides many basic object classes and data types and provides the most basic system services for all application programs, independent of the UI. The classes provided by the UIKit framework form the basic UI class library for creating touch-based user interfaces. An iOS application program can provide its UI based on the UIKit framework, so the framework provides the infrastructure of the application program for building the user interface, drawing, handling user interaction events, responding to gestures and so on.

The touch display screen 130 is used to receive touch operations performed on or near it by the user with any suitable object such as a finger or a stylus, and to display the user interface of each application program. The touch display screen 130 is generally arranged on the front panel of the terminal 100. The touch display screen 130 may be designed as a full screen, a curved screen or a special-shaped screen. It may also be designed as a combination of a full screen and a curved screen, or a combination of a special-shaped screen and a curved screen; this embodiment does not limit this. Specifically:

Full screen

A full screen may refer to a screen design in which the screen-to-body ratio of the touch display screen 130 on the front panel of the terminal 100 exceeds a threshold (such as 80%, 90% or 95%). One way of calculating the screen-to-body ratio is: (area of the touch display screen 130 / area of the front panel of the terminal 100) × 100%. Another way is: (area of the actual display region in the touch display screen 130 / area of the front panel of the terminal 100) × 100%. A further way is: (diagonal of the touch display screen 130 / diagonal of the front panel of the terminal 100) × 100%. In the schematic example shown in Fig. 3A, nearly all of the front panel of the terminal 100 is the touch display screen 130: on the front panel 40 of the terminal 100, every region other than the edge produced by the middle frame 41 is the touch display screen 130. The four corners of the touch display screen 130 may be right angles or rounded.
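The three ways of calculating the screen-to-body ratio described above can be sketched as follows. This is a minimal illustration only: the function names and the sample dimensions are assumptions made for the example and do not come from the application. The first two calculations share one function, since they differ only in which area (the whole touch display screen or its actual display region) is used as the numerator.

```python
import math


def ratio_by_area(screen_area: float, front_panel_area: float) -> float:
    """Screen-to-body ratio in percent, computed from two areas.

    Pass the area of the whole touch display screen for the first
    calculation, or the area of its actual display region for the second.
    """
    if front_panel_area <= 0:
        raise ValueError("front panel area must be positive")
    return screen_area / front_panel_area * 100.0


def ratio_by_diagonal(screen_w: float, screen_h: float,
                      panel_w: float, panel_h: float) -> float:
    """Screen-to-body ratio in percent, computed from the two diagonals."""
    panel_diagonal = math.hypot(panel_w, panel_h)
    if panel_diagonal <= 0:
        raise ValueError("front panel diagonal must be positive")
    return math.hypot(screen_w, screen_h) / panel_diagonal * 100.0


def is_full_screen(ratio_percent: float, threshold_percent: float = 80.0) -> bool:
    """A design counts as a full screen when the ratio exceeds the threshold."""
    return ratio_percent > threshold_percent


# Hypothetical dimensions in millimetres: a 66 x 144 screen on a 70 x 150 panel.
area_ratio = ratio_by_area(66 * 144, 70 * 150)
print(round(area_ratio, 2), is_full_screen(area_ratio, 90.0))
```

Note that the three calculations generally give different values for the same device, so a threshold is only meaningful together with the calculation it was chosen for.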

A full screen may also be a screen design in which at least one front-panel component is integrated inside or beneath the touch display screen 130. Optionally, the at least one front-panel component includes a camera, a fingerprint sensor, a proximity light sensor, a distance sensor and the like. In some embodiments, other components on the front panel of a conventional terminal are integrated in all or part of the region of the touch display screen 130. For example, the photosensitive element of the camera is split into multiple photosensitive pixels, and each photosensitive pixel is integrated in the black region of a display pixel in the touch display screen 130. Because at least one front-panel component is integrated inside the touch display screen 130, the full screen has a higher screen-to-body ratio.

Of course, in other embodiments, the front-panel components of a conventional terminal may also be arranged on the side or the back of the terminal 100. For example, an ultrasonic fingerprint sensor is arranged beneath the touch display screen 130, a bone-conduction earpiece is arranged inside the terminal 100, and the camera is arranged as a pluggable structure located on the side of the terminal.

In some optional embodiments, when the terminal 100 uses a full screen, an edge touch sensor 120 is provided on a single side, on two sides (such as the left and right sides) or on four sides (such as the upper, lower, left and right sides) of the middle frame of the terminal 100. The edge touch sensor 120 is used to detect at least one of a touch operation, a click operation, a press operation and a slide operation performed by the user on the middle frame. The edge touch sensor 120 may be any one of a touch sensor, a thermal sensor, a pressure sensor and the like. The user can apply operations on the edge touch sensor 120 to control the application programs in the terminal 100.

Curved screen

A curved screen refers to a screen design in which the cross-section of the touch display screen 130 is curved and its projection along a direction parallel to the cross-section is a plane; the curved shape may be a U shape. Optionally, a curved screen refers to a screen design in which at least one side is curved. Optionally, a curved screen means that at least one side of the touch display screen 130 extends over the middle frame of the terminal 100. Because the side of the touch display screen 130 extends over the middle frame of the terminal 100, the middle frame, which originally had neither a display function nor a touch function, is covered and becomes a displayable and/or operable region, so that the curved screen has a higher screen-to-body ratio. Optionally, in the example shown in Fig. 3B, a curved screen refers to a screen design in which the left and right sides 42 are curved; or a screen design in which the upper and lower sides are curved; or a screen design in which the upper, lower, left and right sides are all curved. In an optional embodiment, the curved screen is made of a touch screen material with a certain flexibility.

Special-shaped screen

A special-shaped screen is a display screen whose outline is an irregular shape, that is, a shape that is neither a rectangle nor a rounded rectangle. Optionally, a special-shaped screen refers to a screen design in which a protrusion, a notch and/or a hole is provided on a rectangular or rounded-rectangular touch display screen 130. Optionally, the protrusion, notch and/or hole may be located at the edge of the touch display screen 130, at the center of the screen, or both. When the protrusion, notch and/or hole is arranged on an edge, it may be placed in the middle or at either end of that edge. When it is arranged at the center of the screen, it may be placed in one or more of the upper region, upper-left region, left region, lower-left region, lower region, lower-right region, right region and upper-right region of the screen. When arranged in multiple regions, the protrusions, notches and holes may be concentrated or dispersed, and may be distributed symmetrically or asymmetrically. Optionally, the number of protrusions, notches and/or holes is not limited.

Because a special-shaped screen covers the upper forehead region and/or lower forehead region of the display screen and turns it into a displayable and/or operable region, the display screen occupies more space on the front panel of the terminal, so a special-shaped screen also has a larger screen-to-body ratio. In some embodiments, the notch and/or hole is used to accommodate at least one front-panel component, which includes at least one of a camera, a fingerprint sensor, a proximity light sensor, a distance sensor, an earpiece, an ambient light sensor and a physical button.

Illustratively, the notch may be arranged on one or more edges, and may be a semicircular notch, a right-angled rectangular notch, a rounded rectangular notch or an irregularly shaped notch. In the schematic example shown in Fig. 3C, the special-shaped screen may be a screen design with a semicircular notch 43 at the center of the upper edge of the touch display screen 130; the space vacated by the semicircular notch 43 accommodates at least one front-panel component among a camera, a distance sensor (also called a proximity sensor), an earpiece and an ambient light sensor. In the schematic example shown in Fig. 3D, the special-shaped screen may be a screen design with a semicircular notch 44 at the center of the lower edge of the touch display screen 130; the space vacated by the semicircular notch 44 accommodates at least one component among a physical button, a fingerprint sensor and a microphone. In the schematic example shown in Fig. 3E, the special-shaped screen may be a screen design with a semi-elliptical notch 45 at the center of the lower edge of the touch display screen 130, while a semi-elliptical notch is also formed on the front panel of the terminal 100; the two semi-elliptical notches enclose an elliptical region that accommodates a physical button or a fingerprint recognition module. In the schematic example shown in Fig. 3F, the special-shaped screen may be a screen design with at least one small hole 46 in the upper half of the touch display screen 130; the space vacated by the small hole 46 accommodates at least one front-panel component among a camera, a distance sensor, an earpiece and an ambient light sensor.

In addition, a person skilled in the art will understand that the structure of the terminal 100 shown in the above drawings does not limit the terminal 100. The terminal may include more or fewer components than illustrated, combine certain components, or use a different arrangement of components. For example, the terminal 100 may further include components such as a radio frequency circuit, an input unit, sensors, an audio circuit, a wireless fidelity (Wireless Fidelity, WiFi) module, a power supply and a Bluetooth module, which are not described in detail here.

Referring to Fig. 4, it shows a flowchart of the song recognition method provided by an exemplary embodiment of the present application. This embodiment is described with the method applied to a terminal. The method includes:

Step 401: display a first user interface of a first application program, audio-video content being played in the first user interface.

The screen state of the terminal includes a portrait state and a landscape state. Optionally, the first user interface of the first application program is displayed when the terminal is in the landscape state.

Optionally, when the display screen of the terminal is a special-shaped screen, the terminal displays the first user interface of the first application program on the main display region and displays no content in the auxiliary display region. Alternatively, the terminal displays the first user interface of the first application program on the entire display region of the display screen. Here, the display screen is also called the touch display screen.

The special-shaped screen is a screen of irregular shape provided with a notch region.

Optionally, the display screen of the terminal includes a main display region and an auxiliary display region, which are different display regions belonging to the same display screen. The first display area of the main display region is larger than the second display area of the auxiliary display region. When the display screen of the terminal is a special-shaped screen, the main display region is the rectangular display region on the special-shaped screen, the auxiliary display region is the irregularly shaped display region on the special-shaped screen, and the union of the main display region and the auxiliary display region equals the entire display region of the special-shaped screen.
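The relationship between the main and auxiliary display regions can be illustrated with a small grid model. The sketch below is only an assumption-laden illustration: it models the screen as a grid of cells, assumes a landscape terminal with the notch on the left edge, and takes the main display region to be the rectangle starting just to the right of the deepest notch column. None of these names or choices come from the application.

```python
from typing import Set, Tuple

Cell = Tuple[int, int]


def split_display(width: int, height: int, notch: Set[Cell]):
    """Split a notched screen into (main, auxiliary, whole) display regions.

    `notch` holds the grid cells that cannot display anything.
    """
    # Every cell of the panel that can actually display content.
    display = {(x, y) for x in range(width) for y in range(height)} - notch
    # Assumption: the main region is the full-height rectangle that begins
    # in the first column lying entirely to the right of the notch.
    first_col = max((x for x, _ in notch), default=-1) + 1
    main = {(x, y) for x in range(first_col, width) for y in range(height)}
    # Whatever displayable cells remain beside the notch form the
    # irregularly shaped auxiliary region.
    auxiliary = display - main
    return main, auxiliary, display


# Hypothetical 10 x 4 grid with a two-cell notch in the middle of the left edge.
main, auxiliary, display = split_display(10, 4, {(0, 1), (0, 2)})
print(len(main), len(auxiliary), len(display))
```

In this model the two regions do not overlap, together they cover the whole displayable area, and the main region is the larger of the two, which matches the properties stated above.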

The first application program is an application program running in the foreground. While the user is using the first application program, the terminal displays the first user interface of the first application program on the entire display region of the display screen. The first user interface is the program interface of the first application program in which the target content is displayed.

The first application program is an application program for playing audio-video content. It is also called a multimedia application program, and may be a video application program or a game application program.

Optionally, the audio-video content includes audio content and/or video content.

Step 402: display an audio identification icon in the first user interface, the audio identification icon being an entry for triggering identification of the target song in the audio-video content.

The terminal displays the audio identification icon in the first user interface of the first application program.

Optionally, the audio identification icon is the icon of a listen-and-identify song function, and is used to trigger identification of the target song in the audio-video content.

The ways of triggering display of the audio identification icon include, but are not limited to, the following possible implementations.

In one possible implementation, when a fourth operation signal is received in the first user interface, the audio identification icon is displayed in the first user interface.

Optionally, when the terminal receives the fourth operation signal in the first user interface, the audio identification icon is displayed superimposed on a first local region of the first user interface.

The fourth operation signal may be at least one of a click signal, a double-click signal, a long-press signal and a slide signal.

Optionally, the fourth operation signal includes a first slide signal acting on the first local region of the first user interface. Schematically, the starting position of the first slide signal is located in the first local region, and the slide direction of the first slide signal points toward the middle of the display screen of the terminal.

Schematically, the first user interface includes a floating marker, which is an operable control for triggering display of the audio identification icon, and the fourth operation signal includes a second slide signal acting on the floating marker. For example, the starting position of the second slide signal is located on the floating marker, and the slide direction of the second slide signal points toward the middle of the display screen.

Optionally, when the display screen of the terminal is a special-shaped screen, the first local region is the region located on the side where the notch region of the special-shaped screen lies. Optionally, when the display screen of the terminal is a special-shaped screen, the first local region is the region whose distance from the notch region of the special-shaped screen is less than a first distance threshold.

The first distance threshold may be a default setting of the terminal or a user-defined setting. This embodiment does not limit this.

Optionally, when the display screen of the terminal is a special-shaped screen and the terminal is in the portrait state, the fourth operation signal is a slide signal corresponding to at least one slide track that slides downward from the curved side of the special-shaped screen where the recessed portion is located; when the display screen of the terminal is a special-shaped screen and the terminal is in the landscape state, the fourth operation signal is a slide signal corresponding to at least one slide track that slides to the right from the curved side of the special-shaped screen where the recessed portion is located.

For example, when the display screen of the terminal is a special-shaped screen and the terminal is in the landscape state, the user slides to the right with one finger from the left edge of the special-shaped screen. Correspondingly, the fourth operation signal received by the terminal is the signal corresponding to one slide track that slides to the right from the left edge of the special-shaped screen.
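The slide-based trigger described above can be sketched as a simple geometric check: the slide must start inside the first local region and move toward the middle of the display screen. The sketch below is one possible reading and rests on stated assumptions: the first local region is approximated as the set of points closer than the first distance threshold to a single notch center point, and "pointing toward the middle" is taken to mean a positive component of motion toward the screen center. All names and coordinates are hypothetical.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Point:
    x: float
    y: float


def in_first_local_region(p: Point, notch_center: Point,
                          distance_threshold: float) -> bool:
    """Assumption: the region is everything closer to the notch center
    than the first distance threshold."""
    return math.hypot(p.x - notch_center.x, p.y - notch_center.y) < distance_threshold


def moves_toward(start: Point, end: Point, target: Point) -> bool:
    """True if the movement start -> end has a positive component in the
    direction from start to target (dot product of the two vectors)."""
    move_x, move_y = end.x - start.x, end.y - start.y
    to_target_x, to_target_y = target.x - start.x, target.y - start.y
    return move_x * to_target_x + move_y * to_target_y > 0


def is_icon_trigger_slide(start: Point, end: Point, notch_center: Point,
                          screen_center: Point, distance_threshold: float) -> bool:
    """Decide whether a slide should trigger display of the icon."""
    return (in_first_local_region(start, notch_center, distance_threshold)
            and moves_toward(start, end, screen_center))


# Hypothetical landscape screen of 2000 x 1000 with the notch on the left edge.
notch, center = Point(0, 500), Point(1000, 500)
print(is_icon_trigger_slide(Point(50, 500), Point(300, 500), notch, center, 100))
```

A real implementation would work with the actual outline of the notch region and the platform's gesture events; the dot-product test is merely one simple way to express the direction condition.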

In another possible implementation, when the duration for which audio-video content has been played in the first user interface exceeds a display duration threshold, the audio identification icon is displayed in the first user interface.

The display duration threshold may be a default setting of the terminal or a user-defined setting. This embodiment does not limit this.

Schematically, when the first application program is a video application program, the start time of the video is recorded; when the time difference between the current time and the start time of the video reaches the display duration threshold, the audio identification icon is displayed in the first user interface.
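The duration-based trigger can be sketched as below. This is a minimal illustration, assuming times are given in seconds and that the icon is shown once the elapsed time reaches the threshold; the class name and the sample threshold are hypothetical.

```python
class PlaybackTimer:
    """Tracks when playback started and decides when to show the icon."""

    def __init__(self, start_time_s: float, display_threshold_s: float):
        # The video start time is recorded when playback begins.
        self.start_time_s = start_time_s
        # Default or user-defined display duration threshold.
        self.display_threshold_s = display_threshold_s

    def should_show_icon(self, now_s: float) -> bool:
        """True once the time since the video started reaches the threshold."""
        return now_s - self.start_time_s >= self.display_threshold_s


# Hypothetical values: playback starts at t = 100 s, threshold is 30 s.
timer = PlaybackTimer(start_time_s=100.0, display_threshold_s=30.0)
print(timer.should_show_icon(120.0), timer.should_show_icon(130.0))
```

Passing the current time in explicitly, rather than reading a clock inside the method, keeps the check easy to test.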

It should be noted that this embodiment does not limit the way in which display of the audio identification icon is triggered. The following description only takes as an example the case in which the audio identification icon is displayed in the first user interface when the fourth operation signal is received in the first user interface.

Step 403, when the first operation signal on the audio identification icon is received, a floating frame is displayed in the first user interface; the floating frame is used to display the introductory information of the target song.

When the terminal receives the first operation signal on the audio identification icon, it displays the introductory information of the target song in the first user interface in the form of a floating frame.

The first operation signal may be at least one of a click signal, a double-click signal, a long-press signal, and a slide signal. The following description only takes the case in which the first operation signal is a click signal as an example.

A floating frame, also called a top-level window or a picture-in-picture window, can be implemented through the WindowManager in the Android operating system. The floating frame avoids blocking the main display elements in the first user interface as far as possible. The user can operate on the information in the floating frame.

Optionally, the terminal receives a drag signal on a border or a corner of the floating frame and, according to the drag signal, changes any one of the length, the width, and the display scale of the floating frame.

Optionally, the introductory information of the target song includes at least one of the song title of the target song, the title of the target album corresponding to the target song, the cover information of the target album, and the creator information of the target song.

It should be noted that the device that plays the audio-video content and the device that identifies the target song in the audio-video content are the same terminal. That is, the terminal identifies the target song in the audio-video content that the terminal itself plays.

In an illustrative example, as shown in FIG. 5, the terminal plays audio-video content through a video application; that is, the terminal displays the first user interface 51 of the video application on the entire display area of the display screen, and the first user interface 51 is used to display the audio-video content (not shown). The terminal displays the audio identification icon 52 on the first partial region of the first user interface 51. When the terminal receives a click signal on the audio identification icon 52, it identifies the target song in the currently playing audio-video content. When the target song is successfully identified, a floating frame 53 is displayed on the first user interface 51. The floating frame 53 is used to display the introductory information of the target song, which includes the song title "AA" of the target song, the title "album 1" of the target album corresponding to the target song, the cover information 54 of the target album, and the singer name "Xiao Zhou" of the target song.

In summary, in this embodiment the first user interface of the first application program is displayed, with audio-video content played in the first user interface; the audio identification icon is displayed in the first user interface; and when the first operation signal on the audio identification icon is received, the recognition result icon is displayed in the first user interface, the recognition result icon indicating whether the target song in the audio-video content has been successfully identified. The terminal can thus identify the target song in the audio-video content that it is itself currently playing. This avoids the problem in the related art that a song in audio-video content played by a first terminal can only be identified by means of a second terminal, which makes song recognition inefficient, and improves the accuracy and efficiency of song recognition.

Referring to FIG. 6, it shows a flowchart of the song recognition method provided by an exemplary embodiment of the present application. This embodiment is described by taking the application of the song recognition method to a terminal as an example. The song recognition method includes:

Step 601, the first user interface of the first application program is displayed, and audio-video content is played in the first user interface.

Optionally, the terminal displays the first user interface of the first application program on the entire display area of the display screen.

The first user interface is a program interface of the first application program in which audio-video content is played. Optionally, the first application program is a video application, and the first user interface is the video playing interface of the target audio-video content.

Step 602, when a fourth operation signal is received in the first user interface, the audio identification icon is displayed in an overlaid manner on the first partial region of the first user interface.

The display mode of the audio identification icon includes but is not limited to the following possible implementations.

In one possible implementation, the terminal overlays a sidebar on the first partial region of the first user interface, and the sidebar is used to display the audio identification icon.

Optionally, the sidebar includes the audio identification icon and basic icons.

The basic icons include at least one of an icon of a fixed function, a program icon of a third application program, and a tool icon of a shortcut tool.

Illustratively, the fixed function includes at least one of a shorthand function, a file transfer function, and a file storage function. The third application program is an application program other than the first application program. The shortcut tool includes at least one of a mute tool, a brightness adjustment tool, a screenshot tool, a screen recording tool, an on-hook tool, a parameter configuration tool, and a background cleaning tool.

For example, when the first application program is a video application, the basic icons include at least one of a play icon, a pause icon, a volume adjustment icon, a brightness adjustment icon, a screenshot icon, and a screen recording icon.

For example, when the first application program is a game application, the basic icons include at least one of a hang-up icon, a virtual backpack icon, a virtual skill icon, a mute icon, a background cleaning icon, a screenshot icon, and a screen recording icon.

In an illustrative example, as shown in FIG. 7, the terminal plays audio-video content through a video application; that is, the terminal displays the first user interface 51 of the video application on the entire display area of the display screen, and the first user interface 51 is used to display the audio-video content (not shown). When the terminal receives, in the first user interface 51, a slide signal that slides rightward from the left edge of the display screen, the terminal overlays a sidebar 71 on the first partial region of the first user interface 51; the sidebar 71 is used to display the audio identification icon 72 of the song recognition function.

Optionally, the terminal displays three partitions in the sidebar: a first partition, a second partition, and a third partition. The first partition is used to display icons of fixed functions, the second partition is used to display tool icons of shortcut tools and/or the audio identification icon, and the third partition is used to display program icons of third application programs.

In another possible implementation, the terminal overlays a sidebar on the first partial region of the first user interface, and the sidebar is used to display first basic icons. When a second slide signal is received in the sidebar, the terminal, following the second slide signal, cancels the display of the first basic icons in the sidebar and adds the display of second basic icons and the audio identification icon in the sidebar.

The basic icons include first basic icons and second basic icons. Because the display area of the sidebar is limited, the terminal may first display the first basic icons in the sidebar; when the terminal receives the second slide signal in the sidebar, it follows the second slide signal, cancels the display of the first basic icons in the sidebar, and adds the display of the second basic icons and the audio identification icon in the sidebar.

That is, the terminal can slide the display of the sidebar following the second slide signal. The sliding display includes: cancelling the display of the first basic icons that move out of the sidebar, displaying the second basic icons and the audio identification icon that move into the sidebar, and changing, following the second slide signal, the display position of the first basic icons that remain in the sidebar.
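The sliding display can be sketched as a fixed-size window moving over an ordered icon list. This is a minimal sketch under assumptions: the function name, the idea of measuring the slide as a whole number of icon slots, and the fixed `capacity` of the sidebar are all introduced here for illustration.

```python
def slide_sidebar(icons, first_visible, capacity, offset):
    """Move the visible window of the sidebar by `offset` icon slots.

    icons:         ordered list of all icons the sidebar can show
    first_visible: index of the first icon currently shown
    capacity:      number of icons the sidebar can show at once
    offset:        slots to scroll (positive reveals later icons)

    Returns (new_first_visible, visible, moved_out, moved_in).
    """
    max_first = max(0, len(icons) - capacity)
    new_first = min(max(first_visible + offset, 0), max_first)
    old_visible = icons[first_visible:first_visible + capacity]
    visible = icons[new_first:new_first + capacity]
    moved_out = [icon for icon in old_visible if icon not in visible]  # display cancelled
    moved_in = [icon for icon in visible if icon not in old_visible]   # display added
    return new_first, visible, moved_out, moved_in
```

Icons that appear in both the old and the new window correspond to the first basic icons that stay in the sidebar and only change display position.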

Optionally, the terminal displays a page indicator in the sidebar; the page indicator is used to prompt that there are icons in the sidebar that are not yet displayed.

It should be noted that the audio identification icon may be displayed directly in the sidebar, or may be moved into the sidebar for display following a slide signal; this embodiment does not limit this.

Optionally, when the terminal displays three partitions in the sidebar, where the first partition is used to display icons of fixed functions, the second partition is used to display tool icons of shortcut tools, and the third partition is used to display program icons of third application programs, the terminal displays a page indicator in the second partition; the page indicator is used to prompt that there are icons in the second partition that are not yet displayed. When the terminal receives a slide signal on the second partition, it follows the slide signal, cancels the display of the tool icons in the second partition, and adds the display of other tool icons and the audio identification icon in the second partition.

In an illustrative example, as shown in FIG. 8, the terminal displays the first user interface 51 of the video application on the entire display area of the display screen, and the first user interface 51 is used to display the audio-video content (not shown). When the terminal receives, in the first user interface 51, a slide signal that slides rightward from the left edge of the display screen, the terminal overlays a sidebar 81 on the first partial region of the first user interface 51. Three partitions are displayed in the sidebar 81: the first partition 82 is used to display icons of fixed functions (such as icon A and icon B), the second partition 83 is used to display tool icons of shortcut tools (such as icon C and icon D), and the third partition 84 is used to display program icons of third application programs (such as icon E to icon J). The terminal displays a page indicator 85 in the second partition 83; the page indicator 85 is used to prompt that there are icons in the second partition 83 that are not yet displayed. When the terminal receives a slide signal on the second partition 83, it follows the slide signal, cancels the display of icon C and icon D in the second partition 83, and adds the display of icon K and the audio identification icon 86 in the second partition 83.

Step 603, when the first operation signal on the audio identification icon is received, a first prompt icon is displayed in the first user interface; the first prompt icon is used to indicate that the target song in the audio-video content is being identified.

Optionally, when the terminal receives the first operation signal on the audio identification icon, it identifies the target song in the audio-video content and displays the first prompt icon in the first user interface.

Optionally, the terminal displays the first prompt icon on a second partial region of the first user interface.

The second partial region may be a region that intersects the first partial region, or a region that does not intersect the first partial region. Optionally, when the display screen of the terminal is a special-shaped screen, the second partial region is a region whose distance from the notch region of the special-shaped screen is greater than a second distance threshold.

The second distance threshold is set by default by the terminal, or may be customized by the user. This embodiment does not limit this.

In one possible implementation, when the terminal receives the first operation signal on the audio identification icon, identifying the target song in the audio-video content includes: when the terminal receives a click signal on the audio identification icon, obtaining the audio-video content being played in the foreground; obtaining, by the terminal, a target matching model, the target matching model being a model obtained by training on sample audio-video content; and inputting, by the terminal, the audio-video content into the target matching model to obtain an output song identifier, the song identifier being used to indicate the target song.

When multiple pages are displayed simultaneously on the display screen of the terminal, the audio-video content played in the foreground is the audio-video content currently played in the first user interface located at the top of the multiple pages; when one page is displayed on the display screen of the terminal, the audio-video content played in the foreground is the audio-video content played in the currently displayed first user interface. In the embodiments of the present application, the device that plays the audio-video content and the device that identifies the target song in the audio-video content are the same terminal. This avoids the situation in the related art in which, after a terminal plays audio-video content, the target song in the audio-video content has to be identified manually through another device. It also avoids environmental noise in the obtained audio-video content, which improves the accuracy and efficiency of obtaining the audio-video content and, in turn, the accuracy and efficiency of identifying the target song.
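Selecting the foreground audio-video content can be sketched as picking the topmost page. The `Page` type, its `z_order` field, and the function name are hypothetical; the text only states that the topmost page's content is used.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class Page:
    """Hypothetical model of one page shown on the display screen."""
    name: str
    z_order: int              # larger value means closer to the top
    playing: Optional[str]    # identifier of the audio-video content being played, if any


def foreground_content(pages: List[Page]) -> Optional[str]:
    """Return the content played by the topmost page, or None if nothing is shown."""
    if not pages:
        return None
    top_page = max(pages, key=lambda page: page.z_order)
    return top_page.playing
```

When only one page is displayed, the same function simply returns that page's content.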

Optionally, the target matching model is a pre-trained model. The terminal obtains the target matching model stored on the terminal itself, or the terminal obtains the trained target matching model from a server.

The target matching model includes but is not limited to at least one of: a Convolutional Neural Network (CNN) model, a Deep Neural Network (DNN) model, a Recurrent Neural Network (RNN) model, an embedding model, a Gradient Boosting Decision Tree (GBDT) model, and a Logistic Regression (LR) model.

Optionally, the process by which the server trains the target matching model includes: the server obtains a training sample set, which includes at least one sample data group; the server trains on the at least one sample data group using the error back-propagation algorithm to obtain the target matching model. Each sample data group includes sample audio-video content and a correct song identifier calibrated in advance.

Optionally, the server training on the at least one sample data group using the error back-propagation algorithm to obtain the target matching model includes but is not limited to the following steps: for each sample data group in the at least one sample data group, input the sample audio-video content into an initial parameter model to obtain a training result; for each sample data group, compare the training result with the correct song identifier to obtain a calculated loss, which indicates the error between the training result and the correct song identifier; and, according to the calculated losses corresponding to the at least one sample data group, train with the error back-propagation algorithm to obtain the target matching model.
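The training steps above can be illustrated with a deliberately small stand-in. The sketch below is not the model of the application: it assumes the sample audio-video content has already been reduced to a numeric feature vector, and it uses a single-layer softmax classifier, for which back-propagation reduces to the gradient of the cross-entropy loss with respect to the weights. All names and hyperparameters are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [v / total for v in exps]


def forward(weights, biases, features):
    """Compute one logit per candidate song identifier."""
    return [sum(w * x for w, x in zip(row, features)) + b
            for row, b in zip(weights, biases)]


def train(sample_groups, n_features, n_songs, epochs=200, learning_rate=0.5):
    """sample_groups: list of (feature_vector, correct_song_index) pairs."""
    weights = [[0.0] * n_features for _ in range(n_songs)]  # initial parameter model
    biases = [0.0] * n_songs
    for _ in range(epochs):
        for features, correct in sample_groups:
            probs = softmax(forward(weights, biases, features))  # training result
            # Calculated loss is -log(probs[correct]); its gradient with respect
            # to each logit is probs[k] - 1 for the correct song, probs[k] otherwise.
            for k in range(n_songs):
                grad = probs[k] - (1.0 if k == correct else 0.0)
                for j in range(n_features):
                    weights[k][j] -= learning_rate * grad * features[j]
                biases[k] -= learning_rate * grad
    return weights, biases


def predict(model, features):
    """Return the index of the best-matching song identifier."""
    weights, biases = model
    logits = forward(weights, biases, features)
    return max(range(len(logits)), key=logits.__getitem__)
```

A deeper model such as the CNN or RNN options listed above would propagate the same output-layer gradient backward through additional layers.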

Optionally, the terminal determines the gradient direction of the target matching model from the calculated loss using the back-propagation algorithm, and updates the model parameters in the target matching model layer by layer, starting from the output layer of the target matching model and moving toward the input.

Optionally, when the terminal receives the first operation signal on the audio identification icon, it identifies the target song in the audio-video content and displays a recognition result icon in the first user interface; the recognition result icon is used to indicate whether the target song in the audio-video content has been successfully identified.

Optionally, the recognition result icon includes one of an identification success icon and a recognition failure icon. The identification success icon is used to indicate that the target song in the audio-video content has been successfully identified. The recognition failure icon is used to indicate that the target song in the audio-video content has not been identified.

Optionally, when the terminal receives the first operation signal on the audio identification icon, it identifies the target song in the audio-video content; if the identification succeeds, step 604 is performed, and if the identification fails, step 607 is performed.

Step 604, if the identification succeeds, the first prompt icon is switched to the identification success icon; the identification success icon is used to indicate that the target song in the audio-video content has been successfully identified.

It should be noted that, when the identification succeeds, the terminal may display the floating frame at the same time as the identification success icon in the first user interface, or display the floating frame after displaying the identification success icon in the first user interface, or display the floating frame directly without displaying the identification success icon.

Optionally, if the identification succeeds, the terminal switches the first prompt icon to the floating frame, or the terminal switches the first prompt icon to the identification success icon. This embodiment is described only by taking as an example the case in which, if the identification succeeds, the terminal switches the first prompt icon to the identification success icon.

Step 605, when the display duration of the identification success icon reaches the first duration threshold, the display of the identification success icon is cancelled.

When the display duration of the identification success icon reaches the first duration threshold, the terminal automatically cancels the display of the identification success icon. The first duration threshold is set by default by the terminal or customized by the user. This embodiment does not limit this. For example, the first duration threshold is 2 seconds.

Step 606, the floating frame is displayed in the first user interface; the floating frame is used to display the introductory information of the target song.

Optionally, while cancelling the display of the identification success icon, or after cancelling the display of the identification success icon, the terminal displays the introductory information of the target song in the first user interface in the form of a floating frame.
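Steps 603 to 606 can be sketched as a small state machine driven by timestamps. The class, method, and state names are assumptions introduced for illustration, timestamps are assumed to be in seconds, and the 2-second default mirrors the example value given in the text.

```python
from enum import Enum


class UiState(Enum):
    FIRST_PROMPT_ICON = "first prompt icon"
    SUCCESS_ICON = "identification success icon"
    FLOATING_FRAME = "floating frame"


class SuccessFlow:
    """Hypothetical tracker of what the first user interface shows after a tap."""

    def __init__(self, first_duration_threshold: float = 2.0):
        self.first_duration_threshold = first_duration_threshold
        self.state = UiState.FIRST_PROMPT_ICON  # step 603: identification in progress
        self._success_shown_at = None

    def on_identification_success(self, now: float) -> None:
        # Step 604: switch the first prompt icon to the identification success icon.
        if self.state is UiState.FIRST_PROMPT_ICON:
            self.state = UiState.SUCCESS_ICON
            self._success_shown_at = now

    def on_tick(self, now: float) -> None:
        # Steps 605 and 606: once the display duration reaches the threshold,
        # cancel the success icon and show the floating frame.
        if (self.state is UiState.SUCCESS_ICON
                and now - self._success_shown_at >= self.first_duration_threshold):
            self.state = UiState.FLOATING_FRAME
```

This sketch shows the floating frame at the moment the success icon is cancelled; showing it afterwards, as the text also allows, would need one more intermediate state.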

In an illustrative example, based on the sidebar 81 provided in FIG. 8 and as shown in FIG. 9, when the terminal receives a click signal on the audio identification icon 86 in the sidebar 81, it displays the first prompt icon 91 and the cancel button 92 corresponding to the first prompt icon 91 in the first user interface 51. The first prompt icon 91 is used to indicate that the terminal is identifying the target song in the audio-video content, and the cancel button 92 is used to cancel the identification of the target song in the audio-video content. If the identification succeeds, the terminal switches the first prompt icon 91 to the identification success icon 93, which is used to indicate that the terminal has successfully identified the target song in the audio-video content. When the display duration of the identification success icon 93 reaches 2 seconds, the display of the identification success icon is cancelled and a floating frame 94 is displayed in the first user interface 51. The floating frame 94 displays the information bar 95 corresponding to the target song; the information bar 95 includes the song title "AA" of the target song, the title "album 1" of the target album corresponding to the target song, the cover information 96 of the target album, and the singer name "Xiao Zhou" of the target song.

Optionally, the floating frame also displays a collection control and/or a jump control.

Illustratively, the terminal displays the introductory information of the target song in the floating frame in the form of an information bar, and the information bar also includes the collection control and/or the jump control corresponding to the target song.

In one possible implementation, the floating frame also displays a collection control. When the terminal receives a second operation signal on the collection control corresponding to the target song, the target song is added to the song collection folder.

The second operation signal may be at least one of a click signal, a double-click signal, a long-press signal, and a slide signal. The following description only takes the case in which the second operation signal is a click signal as an example.

Optionally, the collection control corresponding to the target song is an operable control used to trigger adding the target song to the song collection folder.

Optionally, the song collection folder is a target collection in a target music application. The target music application and the target collection are set by default by the terminal or customized by the user. This embodiment does not limit this.

Optionally, when the terminal receives the second operation signal on the collection control corresponding to the target song, the collection control in the first display form is switched to the collection control in the second display form, and the target song is added to the song collection folder. The first display form indicates that the target song has not been added to the song collection folder, and the second display form indicates that the target song has been added to the song collection folder.

Illustratively, the display form of the collection control includes at least one of color, shape, and animation effect. For example, the collection control in the first display form is a hollow icon, and the collection control in the second display form is a filled icon.
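The behavior of the collection control can be sketched as follows. The class name, the use of a Python set for the song collection folder, and the "hollow" and "filled" strings are assumptions for illustration only.

```python
class CollectionControl:
    """Hypothetical collection control bound to one song and one collection folder."""

    def __init__(self, song_title: str, collection_folder: set):
        self.song_title = song_title
        self.collection_folder = collection_folder  # shared set of collected song titles

    @property
    def display_form(self) -> str:
        # First display form (hollow): song not yet in the folder.
        # Second display form (filled): song already in the folder.
        return "filled" if self.song_title in self.collection_folder else "hollow"

    def on_second_operation_signal(self) -> None:
        """Add the song to the folder; the display form follows from the folder state."""
        self.collection_folder.add(self.song_title)
```

Deriving the display form from the folder contents, rather than storing it separately, keeps the icon and the folder from drifting out of sync.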

In another possible implementation, the floating frame also displays a jump control. When the terminal receives a third operation signal on the jump control corresponding to the target song, the first user interface is switched to the second user interface of a second application program; the second user interface is used to play the target song.

The third operation signal may be at least one of a click signal, a double-click signal, a long-press signal, and a slide signal. The following description only takes the case in which the third operation signal is a click signal as an example.

Optionally, the jump control corresponding to the target song is an operable control used to trigger switching the first user interface to the second user interface of the second application program.

The second application program is a multimedia application other than the first application program. The second application program is an application program for playing songs; it may be a video application or a music application.

Optionally, when the terminal receives the third operation signal on the jump control corresponding to the target song, a pop-up window is displayed on the first user interface, and a confirmation button for confirming the jump is displayed in the pop-up window. When the terminal receives a click signal on the confirmation button, the first user interface is switched to the second user interface of the second application program.

In another possible implementation, the floating frame also displays a similar-song recommendation list. The similar-song recommendation list includes at least one of the introductory information, the collection control, and the jump control corresponding to each of multiple similar songs; a similar song is a song whose similarity to the target song is higher than a similarity threshold.

Optionally, when the terminal inputs the audio-video content into the target matching model as described above, n song identifiers are obtained and sorted in descending order of matching degree. The terminal determines the song corresponding to the first song identifier after sorting as the target song, and determines the songs corresponding to the second to the m-th song identifiers after sorting as similar songs.
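The ranking step can be sketched as a sort followed by a split. The function name and the representation of the model output as a dictionary from song identifier to matching degree are assumptions made for illustration.

```python
def split_target_and_similar(matching_degrees, m):
    """Rank song identifiers by matching degree, highest first.

    matching_degrees: dict mapping song identifier -> matching degree
    m:                rank of the last song identifier treated as a similar song

    Returns (target_song, similar_songs): the top-ranked identifier, and the
    identifiers ranked second to m-th.
    """
    ranked = sorted(matching_degrees, key=matching_degrees.get, reverse=True)
    if not ranked:
        return None, []
    return ranked[0], ranked[1:m]
```

With the hypothetical degrees `{"AA": 0.9, "BB": 0.8, "CC": 0.7, "DD": 0.2}` and `m = 3`, "AA" becomes the target song and "BB" and "CC" the similar songs.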

Optionally, the similar-song recommendation list includes an information bar corresponding to each of multiple similar songs, and each information bar includes at least one of the introductory information, the collection control, and the jump control of that similar song.

Illustratively, the introductory information of a similar song includes at least one of the song title of the similar song, the title of the album corresponding to the similar song, the cover information of the album corresponding to the similar song, and the creator information of the similar song.

Optionally, the collection control corresponding to a similar song is an operable control used to trigger adding the similar song to the song collection folder. When the terminal receives the second operation signal on the collection control corresponding to the similar song, the similar song is added to the song collection folder.

Optionally, the jump control corresponding to a similar song is an operable control used to trigger switching the first user interface to a third user interface of the second application program. When the terminal receives the third operation signal on the jump control corresponding to the similar song, the first user interface is switched to the third user interface of the second application program; the third user interface is used to play the similar song.

In an illustrative example, based on the floating frame 94 provided in FIG. 9 and as shown in FIG. 10, the information bar 95 of the target song also displays a hollow collection control 97, and the floating frame 94 also displays a similar-song recommendation list 98. The similar-song recommendation list 98 displays the information bars corresponding to two similar songs. The information bar of the first similar song includes the song title "BB" of the similar song, the title "album 2" of the album corresponding to the similar song, the cover information of the album, the singer name "Xiao Zhou" of the similar song, and a collection control. The information bar of the second similar song includes the song title "CC" of the similar song, the title "album 2" of the album corresponding to the similar song, the cover information of the album, the singer name "Xiao Zhou" of the similar song, and a collection control. When the terminal receives a click signal on the collection control 97 corresponding to the target song, the collection control 97 in the first display form is switched to the collection control 101 in the second display form, and the target song is added to the song collection folder.

Step 607, if the identification fails, the first prompt icon is switched to the recognition failure icon; the recognition failure icon is used to indicate that the target song in the audio-video content has not been identified.

If the identification fails, the terminal switches the first prompt icon to the recognition failure icon.

Step 608, when the display duration of the recognition failure icon reaches the second duration threshold, the display of the recognition failure icon is cancelled.

When the display duration of the recognition failure icon reaches the second duration threshold, the terminal automatically cancels the display of the recognition failure icon.

The second duration threshold is set by default by the terminal or customized by the user. This embodiment does not limit this. For example, the second duration threshold is 2 seconds.

Step 609, prompt information indicating the failure cause is displayed in the first user interface; the failure cause includes the terminal not being connected to a network or the terminal not having identified the target song.

Optionally, while cancelling the display of the recognition failure icon, or after cancelling the display of the recognition failure icon, the terminal displays the prompt information indicating the failure cause. The failure cause includes the terminal not being connected to a network or the terminal not having identified the target song.
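Choosing the prompt information from the failure cause can be sketched as below. The function name and both message strings are hypothetical wording chosen for illustration; the text only names the two failure causes.

```python
def failure_prompt(network_connected: bool) -> str:
    """Return prompt information for the failure cause.

    Only the two causes named in the text are distinguished: the terminal is
    not connected to a network, or the target song was not identified.
    """
    if not network_connected:
        return "Please connect to the network and retry"
    return "No target song was identified"
```

Checking connectivity first reflects the assumption that an offline terminal cannot attempt identification at all, so the network message is the more useful one to show.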

In an illustrative example, based on the first prompt icon 91 and the cancel button 92 corresponding to the first prompt icon 91 provided in FIG. 9, the first prompt icon 91 is used to indicate that the terminal is identifying the target song in the audio-video content. As shown in FIG. 11, if the identification fails, the terminal switches the first prompt icon 91 to the recognition failure icon 111, which is used to indicate that the terminal has not identified the target song in the audio-video content. When the display duration of the recognition failure icon 111 reaches 2 seconds, the display of the recognition failure icon 111 is cancelled and a floating frame 112 is displayed in the first user interface 51. The floating frame 112 displays the prompt information "please connect to the network and retry", which indicates that the failure cause is that the terminal is not connected to a network.

The first prompt icon switching is shown as being identified as if identifying successfully in conclusion the embodiment of the present application also passes through Function icon identifies that successfully icon is used to indicate the target song successfully identified in audio-video frequency content, when identifying successful icon Display duration cancels display when reaching the first duration threshold value and identifies successfully icon, shows floating frame in the first user interface; So that first being shown as the first prompt icon switching to identify successfully icon when terminal recognition success, then it will identify that successfully icon is cut It changes and is shown as floating frame, further enrich the prompt effect of terminal.

The embodiment of the present application is also by inciting somebody to action when receiving the second operation signal on the corresponding collection control of target song Target song is added in song collection folder;So that target song can be added by terminal by single stepping, simplifies and use Operating procedure when family switches between different applications improves human-computer interaction efficiency.

Following is the application Installation practice, can be used for executing the application embodiment of the method.It is real for the application device Undisclosed details in example is applied, the application embodiment of the method is please referred to.

Referring to FIG. 5, it shows a schematic structural diagram of the song recognition apparatus provided by an embodiment of the present application. The song recognition apparatus may be implemented as all or part of the terminal in FIG. 1 by a dedicated hardware circuit or by a combination of software and hardware. The song recognition apparatus includes: a first display module 1210, a second display module 1220, and a third display module 1230.

First display module 1210 is broadcast in the first user interface for showing the first user interface of the first application program It is placed with audio-video frequency content;

Second display module 1220, for showing that audio identification icon, audio identification icon are in the first user interface Trigger the entrance identified to the target song in audio-video frequency content;

Third display module 1230, for being used first when receiving the first operation signal on audio identification icon Show that floating frame, floating frame are used for the introductory information of displaying target song on the interface of family.

Optionally, the third display module 1230 is further configured to display a first prompt icon in the first user interface when the first operation signal is received on the audio recognition icon, the first prompt icon indicating that the target song in the audio-video content is being recognized;

and, if recognition succeeds, to switch the first prompt icon to the floating window.

Optionally, the third display module 1230 is further configured to switch the first prompt icon to a recognition-success icon if recognition succeeds, the recognition-success icon indicating that the target song in the audio-video content has been successfully recognized;

to dismiss the recognition-success icon when its display duration reaches a first duration threshold;

and to display the floating window in the first user interface.
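
The switching sequence described above (first prompt icon, then recognition-success icon, then floating window) can be read as a small state machine. The following is only an illustrative sketch under stated assumptions: the names `Overlay`, `next_overlay`, and `FIRST_DURATION_THRESHOLD_S`, as well as the threshold value of one second, are hypothetical and do not come from this application.

```python
from enum import Enum, auto


class Overlay(Enum):
    """What is currently drawn over the first user interface."""
    PROMPT_ICON = auto()      # recognition in progress
    SUCCESS_ICON = auto()     # target song recognized
    FLOATING_WINDOW = auto()  # introductory information shown


# Hypothetical value; the source does not state the first duration threshold.
FIRST_DURATION_THRESHOLD_S = 1.0


def next_overlay(current, recognized, shown_for_s):
    """Return the overlay to display next.

    current      -- the Overlay displayed now
    recognized   -- True once recognition has succeeded
    shown_for_s  -- seconds the current overlay has been on screen
    """
    if current is Overlay.PROMPT_ICON and recognized:
        return Overlay.SUCCESS_ICON
    if (current is Overlay.SUCCESS_ICON
            and shown_for_s >= FIRST_DURATION_THRESHOLD_S):
        return Overlay.FLOATING_WINDOW
    return current
```

Keeping the transition in a pure function separates the timing rule from any drawing code, so a terminal could call it on each timer tick and redraw only when the returned overlay changes.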

Optionally, the floating window also displays a collection control, and the apparatus further includes a fourth display module configured to add the target song to the song collection folder when a second operation signal is received on the collection control corresponding to the target song.

Optionally, the floating window also displays a jump control, and the apparatus further includes a fifth display module configured to switch the first user interface to a second user interface of a second application program when a third operation signal is received on the jump control corresponding to the target song, the second user interface being used to play the target song.

Optionally, the floating window also displays a similar-song recommendation list, which includes, for each of multiple similar songs, at least one of introductory information, a collection control, and a jump control; a similar song is a song whose similarity to the target song is higher than a similarity threshold.
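
As a minimal sketch of how such a recommendation list might be selected, assuming similarity scores against the target song are already available as numbers (the application does not say how similarity is computed), the function name `recommend_similar` and the dictionary layout below are hypothetical:

```python
def recommend_similar(similarities, threshold):
    """Return ids of songs whose similarity exceeds the threshold.

    similarities -- dict mapping a song id to its similarity with the
                    target song (hypothetical precomputed scores)
    threshold    -- the similarity threshold; songs at or below it are dropped
    The result is ordered from most to least similar.
    """
    picked = [(song_id, score) for song_id, score in similarities.items()
              if score > threshold]
    picked.sort(key=lambda pair: pair[1], reverse=True)
    return [song_id for song_id, _ in picked]
```

Ordering by similarity is a design choice of this sketch only; the source requires just that each listed song exceed the threshold.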

Optionally, the second display module 1220 is further configured to overlay the audio recognition icon on a first partial region of the first user interface when a fourth operation signal is received in the first user interface.

Optionally, the second display module 1220 is further configured to overlay a sidebar on the first partial region of the first user interface when the fourth operation signal is received in the first user interface, the sidebar including the audio recognition icon and basic icons;

The basic icons include at least one of an icon of a fixed function, a program icon of a third application program, and a tool icon of a shortcut tool.

Optionally, the third display module 1230 includes a recognition unit and a display unit. The recognition unit is configured to recognize the target song in the audio-video content when the first operation signal is received on the audio recognition icon; the display unit is configured to display the first prompt icon in the first user interface.

Optionally, the recognition unit is further configured to obtain the audio-video content playing in the foreground when a click signal is received on the audio recognition icon; to obtain a target matching model, the target matching model being a model obtained by training on sample audio-video content; and to input the audio-video content into the target matching model, which outputs a song identifier, the song identifier indicating the target song.

Optionally, the recognition unit is further configured to obtain a training sample set, the training sample set including at least one sample data group, each sample data group including sample audio-video content and a correct song identifier labeled in advance;

for each sample data group of the at least one sample data group, to input the sample audio-video content into an initial parameter model to obtain a training result;

for each sample data group, to compare the training result with the correct song identifier to obtain a computed loss, the computed loss indicating the error between the training result and the correct song identifier;

and to train the target matching model using an error backpropagation algorithm according to the computed loss corresponding to each of the at least one sample data group.
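
The training steps above can be sketched as follows. This is a deliberately small stand-in, not the model of this application: it assumes each piece of audio-video content has already been reduced to a short numeric feature vector (the application does not describe feature extraction), it uses a single linear layer with softmax as the "initial parameter model", and it uses cross-entropy as the computed loss. With only one layer, error backpropagation reduces to the gradient step shown. All function names and hyperparameter values are hypothetical.

```python
import math


def softmax(scores):
    """Convert raw scores to probabilities (numerically stable)."""
    top = max(scores)
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def forward(weights, features):
    """Training result: one probability per candidate song identifier."""
    scores = [sum(w * x for w, x in zip(row, features)) for row in weights]
    return softmax(scores)


def train(samples, n_songs, n_features, epochs=200, lr=0.5):
    """Fit the model to (features, correct_song_id) sample data groups."""
    # Initial parameter model: all weights start at zero.
    weights = [[0.0] * n_features for _ in range(n_songs)]
    for _ in range(epochs):
        for features, correct_id in samples:
            probs = forward(weights, features)
            # The cross-entropy loss is -log(probs[correct_id]); its gradient
            # with respect to each score is (probability - one-hot target).
            for song in range(n_songs):
                error = probs[song] - (1.0 if song == correct_id else 0.0)
                for j in range(n_features):
                    weights[song][j] -= lr * error * features[j]
    return weights


def recognize(weights, features):
    """Return the song identifier with the highest predicted probability."""
    probs = forward(weights, features)
    return max(range(len(probs)), key=probs.__getitem__)
```

For example, training on two sample data groups such as `([1.0, 0.0], 0)` and `([0.0, 1.0], 1)` and then calling `recognize` on a new feature vector returns the identifier of the best-matching song. A practical model would have more layers and real audio features, which is where a full backpropagation pass becomes necessary.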

Optionally, the apparatus further includes a sixth display module. The sixth display module is configured to switch the first prompt icon to a recognition-failure icon if recognition fails, the recognition-failure icon indicating that the target song in the audio-video content was not recognized.

Optionally, the sixth display module is further configured to dismiss the recognition-failure icon when the display duration of the recognition-failure icon reaches a second duration threshold;

and to display, in the first user interface, prompt information indicating the failure cause, the failure cause including that the terminal is not connected to a network or that the terminal did not recognize the target song.
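
The failure path can be sketched in the same spirit. In the snippet below, the function name `failure_overlay`, the constant `SECOND_DURATION_THRESHOLD_S` and its value, and the wording of the two messages are all hypothetical; only the two failure causes come from the text above.

```python
# Hypothetical value; the source does not state the second duration threshold.
SECOND_DURATION_THRESHOLD_S = 1.0


def failure_overlay(shown_for_s, network_connected):
    """Return (kind, text) describing what to display after a failure.

    shown_for_s       -- seconds the recognition-failure icon has been shown
    network_connected -- whether the terminal currently has a network link
    """
    if shown_for_s < SECOND_DURATION_THRESHOLD_S:
        # Keep showing the recognition-failure icon until the threshold.
        return ("failure_icon", "")
    if not network_connected:
        return ("prompt", "Terminal is not connected to a network")
    return ("prompt", "Target song was not recognized")
```

Checking the network state first means the more actionable cause is reported when both could apply; that ordering is an assumption of this sketch.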

For related details, refer to the method embodiments shown in Fig. 4 to Fig. 11. The first display module 1210, the second display module 1220, and the third display module 1230 are also used to implement any other display-related function implied or disclosed in the above method embodiments.

It should be noted that, when the apparatus provided by the above embodiment implements its functions, the division into the functional modules described above is only an example. In practical applications, the above functions can be assigned to different functional modules as needed; that is, the internal structure of the device is divided into different functional modules to complete all or part of the functions described above. In addition, the apparatus embodiments and the method embodiments provided above belong to the same concept; for the specific implementation process, see the method embodiments, which are not repeated here.

This application also provides a computer-readable medium on which program instructions are stored; when executed by a processor, the program instructions implement the song recognition method provided by each of the above method embodiments.

This application also provides a computer program product containing instructions which, when run on a computer, cause the computer to execute the song recognition method described in each of the above embodiments.

The serial numbers of the above embodiments of this application are for description only and do not represent the relative merits of the embodiments.

A person of ordinary skill in the art will understand that all or some of the steps of the song recognition method in the above embodiments may be implemented by hardware, or by a program instructing the relevant hardware; the program may be stored in a computer-readable storage medium, and the storage medium mentioned above may be a read-only memory, a magnetic disk, an optical disc, or the like. The above are only preferred embodiments of this application and are not intended to limit this application; any modification, equivalent replacement, or improvement made within the spirit and principles of this application shall fall within the scope of protection of this application.

Claims (16)

1. A song recognition method, characterized in that it is applied in a terminal, the method comprising:
displaying a first user interface of a first application program, with audio-video content playing in the first user interface;
displaying an audio recognition icon in the first user interface, the audio recognition icon being an entry for triggering recognition of a target song in the audio-video content;
when a first operation signal is received on the audio recognition icon, displaying a floating window in the first user interface, the floating window being used to display introductory information about the target song.
2. The method according to claim 1, characterized in that displaying the floating window in the first user interface when the first operation signal is received on the audio recognition icon comprises:
when the first operation signal is received on the audio recognition icon, displaying a first prompt icon in the first user interface, the first prompt icon indicating that the target song in the audio-video content is being recognized;
if recognition succeeds, switching the first prompt icon to the floating window.
3. The method according to claim 2, characterized in that switching the first prompt icon to the floating window if recognition succeeds comprises:
if recognition succeeds, switching the first prompt icon to a recognition-success icon, the recognition-success icon indicating that the target song in the audio-video content has been successfully recognized;
when the display duration of the recognition-success icon reaches a first duration threshold, dismissing the recognition-success icon;
displaying the floating window in the first user interface.
4. The method according to claim 2, characterized in that the floating window also displays a collection control, the method further comprising:
when a second operation signal is received on the collection control corresponding to the target song, adding the target song to a song collection folder.
5. The method according to claim 2, characterized in that the floating window also displays a jump control, the method further comprising:
when a third operation signal is received on the jump control corresponding to the target song, switching the first user interface to a second user interface of a second application program, the second user interface being used to play the target song.
6. The method according to claim 2, characterized in that the floating window also displays a similar-song recommendation list, the similar-song recommendation list including, for each of multiple similar songs, at least one of introductory information, a collection control, and a jump control, a similar song being a song whose similarity to the target song is higher than a similarity threshold.
7. The method according to claim 1, characterized in that displaying the audio recognition icon in the first user interface comprises:
when a fourth operation signal is received in the first user interface, overlaying the audio recognition icon on a first partial region of the first user interface.
8. The method according to claim 7, characterized in that overlaying the audio recognition icon on the first partial region of the first user interface when the fourth operation signal is received in the first user interface comprises:
when the fourth operation signal is received in the first user interface, overlaying a sidebar on the first partial region of the first user interface, the sidebar including the audio recognition icon and basic icons;
wherein the basic icons include at least one of an icon of a fixed function, a program icon of a third application program, and a tool icon of a shortcut tool.
9. The method according to claim 2, characterized in that displaying the first prompt icon in the first user interface when the first operation signal is received on the audio recognition icon comprises:
when the first operation signal is received on the audio recognition icon, recognizing the target song in the audio-video content;
displaying the first prompt icon in the first user interface.
10. The method according to claim 9, characterized in that recognizing the target song in the audio-video content when the first operation signal is received on the audio recognition icon comprises:
when a click signal is received on the audio recognition icon, obtaining the audio-video content playing in the foreground;
obtaining a target matching model, the target matching model being a model obtained by training on sample audio-video content;
inputting the audio-video content into the target matching model, which outputs a song identifier, the song identifier indicating the target song.
11. The method according to claim 10, characterized in that obtaining the target matching model comprises:
obtaining a training sample set, the training sample set including at least one sample data group, each sample data group including the sample audio-video content and a correct song identifier labeled in advance;
for each sample data group of the at least one sample data group, inputting the sample audio-video content into an initial parameter model to obtain a training result;
for each sample data group, comparing the training result with the correct song identifier to obtain a computed loss, the computed loss indicating the error between the training result and the correct song identifier;
training the target matching model using an error backpropagation algorithm according to the computed loss corresponding to each of the at least one sample data group.
12. The method according to claim 2, characterized in that the method further comprises:
if recognition fails, switching the first prompt icon to a recognition-failure icon, the recognition-failure icon indicating that the target song in the audio-video content was not recognized.
13. The method according to claim 12, characterized in that, after switching the first prompt icon to the recognition-failure icon if recognition fails, the method further comprises:
when the display duration of the recognition-failure icon reaches a second duration threshold, dismissing the recognition-failure icon;
displaying, in the first user interface, prompt information indicating the failure cause, the failure cause including that the terminal is not connected to a network or that the terminal did not recognize the target song.
14. A song recognition apparatus, characterized in that it is applied in a terminal, the apparatus comprising:
a first display module, configured to display a first user interface of a first application program, with audio-video content playing in the first user interface;
a second display module, configured to display an audio recognition icon in the first user interface, the audio recognition icon being an entry for triggering recognition of a target song in the audio-video content;
a third display module, configured to display a floating window in the first user interface when a first operation signal is received on the audio recognition icon, the floating window being used to display introductory information about the target song.
15. A terminal, characterized in that the terminal includes a processor, a memory connected to the processor, and program instructions stored in the memory, the processor implementing the song recognition method according to any one of claims 1 to 13 when executing the program instructions.
16. A computer-readable storage medium, characterized in that program instructions are stored thereon, the program instructions implementing the song recognition method according to any one of claims 1 to 13 when executed by a processor.
CN201810962656.XA 2018-08-22 2018-08-22 Song recognition method, apparatus, terminal and storage medium CN109947979A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810962656.XA CN109947979A (en) 2018-08-22 2018-08-22 Song recognition method, apparatus, terminal and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810962656.XA CN109947979A (en) 2018-08-22 2018-08-22 Song recognition method, apparatus, terminal and storage medium

Publications (1)

Publication Number Publication Date
CN109947979A true CN109947979A (en) 2019-06-28

Family

ID=67005927

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810962656.XA CN109947979A (en) 2018-08-22 2018-08-22 Song recognition method, apparatus, terminal and storage medium

Country Status (1)

Country Link
CN (1) CN109947979A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110324645A (en) * 2019-07-05 2019-10-11 广州酷狗计算机科技有限公司 Song display methods, device, equipment and storage medium

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105808780A (en) * 2016-03-31 2016-07-27 广州酷狗计算机科技有限公司 Song recognition method and device
US20180041623A1 (en) * 2016-08-05 2018-02-08 Alibaba Group Holding Limited Method and device for displaying application information
CN108089786A (en) * 2017-12-14 2018-05-29 广东欧珀移动通信有限公司 Method for displaying user interface, device, equipment and storage medium
CN108334272A (en) * 2018-01-23 2018-07-27 维沃移动通信有限公司 A kind of control method and mobile terminal
CN108415752A (en) * 2018-03-12 2018-08-17 广东欧珀移动通信有限公司 Method for displaying user interface, device, equipment and storage medium

Similar Documents

Publication Publication Date Title
US9940003B2 (en) Method and device for executing object on display
CN103208283B (en) The method and device of user function is executed using speech recognition
US8464180B1 (en) Organizing graphical representations on computing devices
JP2017537412A (en) System and method for tracking events and providing virtual meeting feedback
CN107870716B (en) Method and device for calling background application program
US20190174191A1 (en) System and Method for Integrating Interactive Call-To-Action, Contextual Applications with Videos
US8966614B2 (en) Systems, methods, and computer program products for providing video-passwords for user authentication
CN107688422B (en) Notification message display method and device
CN101523392B (en) Personalized slide show generation
US8930818B2 (en) Visualization of website analytics
US9161238B2 (en) Mobile device monitoring and testing
JP6496848B2 (en) Method and system for extracting and providing highlight video of video content
EP3454193A1 (en) Control method and apparatus of terminal device, and storage medium
US8977678B2 (en) System and method for conducting surveys on devices without requiring persistent network connectivity
US20150382147A1 (en) Leveraging user signals for improved interactions with digital personal assistant
CN107077292A (en) Clip and paste information providing method and device
US20130130216A1 (en) Custom narration of electronic books
CN107341018B (en) Method and device for continuously displaying view after page switching
US20110161818A1 (en) Method and apparatus for video chapter utilization in video player ui
CN101651779B (en) Information processing apparatus, and method
CN104205854A (en) Method and system for providing a display of social messages on a second screen which is synched to content on a first screen
CN107704177A (en) interface display method, device and terminal
CN104918095A (en) Multimedia stream data preview display method and device
US9824477B1 (en) Photo and video collaboration platform
CN107491683B (en) Application decryption method and device, terminal and computer readable storage medium

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination