CN110300323B - Music recognition method and device - Google Patents

Music recognition method and device Download PDF

Info

Publication number
CN110300323B
CN110300323B CN201810245218.1A CN201810245218A CN110300323B CN 110300323 B CN110300323 B CN 110300323B CN 201810245218 A CN201810245218 A CN 201810245218A CN 110300323 B CN110300323 B CN 110300323B
Authority
CN
China
Prior art keywords
information
music
multimedia
music information
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810245218.1A
Other languages
Chinese (zh)
Other versions
CN110300323A (en
Inventor
豆成权
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Alibaba China Co Ltd
Original Assignee
Alibaba China Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Alibaba China Co Ltd filed Critical Alibaba China Co Ltd
Priority to CN201810245218.1A priority Critical patent/CN110300323B/en
Publication of CN110300323A publication Critical patent/CN110300323A/en
Application granted granted Critical
Publication of CN110300323B publication Critical patent/CN110300323B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/232Content retrieval operation locally within server, e.g. reading video streams from disk arrays
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/235Processing of additional data, e.g. scrambling of additional data or processing content descriptors
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/239Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests
    • H04N21/2393Interfacing the upstream path of the transmission network, e.g. prioritizing client content requests involving handling client requests
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/435Processing of additional data, e.g. decrypting of additional data, reconstructing software from modules extracted from the transport stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/442Monitoring of processes or resources, e.g. detecting the failure of a recording device, monitoring the downstream bandwidth, the number of times a movie has been viewed, the storage space available from the internal hard disk
    • H04N21/44213Monitoring of end-user related data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8126Monomedia components thereof involving additional data, e.g. news, sports, stocks, weather forecasts
    • H04N21/8133Monomedia components thereof involving additional data, e.g. news, sports, stocks, weather forecasts specifically related to the content, e.g. biography of the actors in a movie, detailed information about an article seen in a video program

Abstract

The present disclosure relates to a method and apparatus for identifying music, including collecting multimedia information of a multimedia resource currently played when a music identification instruction is received, the multimedia information including one or more of position information, subtitle information, and spectrum information, the position information including episode identification and progress information; acquiring music information corresponding to the multimedia information from a server; and if the music information corresponding to the multimedia information is acquired, displaying the music information. By determining music information according to the multimedia information, the method and the device for identifying music according to the embodiment of the disclosure can display the music information corresponding to the currently played multimedia resource, and facilitate a user to search for music.

Description

Music recognition method and device
Technical Field
The present disclosure relates to the field of multimedia technologies, and in particular, to a method and an apparatus for recognizing music.
Background
Background music is typically placed in the video to render the atmosphere. During the viewing of a video, a user may be interested in the background music and may want to know information about the background music. Without knowing the name of the music, it is cumbersome to find the music.
Disclosure of Invention
In view of this, the present disclosure provides a method and an apparatus for identifying music, which are convenient for a user to search for music.
According to a first aspect of the present disclosure, a method for identifying music is provided, the method being applied to a terminal, and including acquiring multimedia information of a currently played multimedia resource when a music identification instruction is received, the multimedia information including one or more of position information, subtitle information, and spectrum information, the position information including episode identification and progress information; acquiring music information corresponding to the multimedia information from a server; and if the music information corresponding to the multimedia information is acquired, displaying the music information.
In one possible implementation, the method further includes: and if the music information corresponding to the multimedia information is not acquired, prompting that the acquisition fails and continuing to play the multimedia resource.
In one possible implementation, the method further includes: when a music information editing instruction is received, acquiring edited music information and multimedia information of a currently played multimedia resource; establishing a corresponding relation between multimedia information of the currently played multimedia resource and edited music information; and sending the corresponding relation to the server.
In one possible implementation, the method further includes: and pausing the playing of the multimedia resources and playing the music corresponding to the music information.
According to a second aspect of the present disclosure, a method for identifying music is provided, where the method is applied to a server and includes receiving a first request for obtaining music information sent by a terminal, where the first request carries multimedia information; searching music information corresponding to the multimedia information according to the corresponding relation between the music information and the multimedia information, wherein the multimedia information comprises one or more of position information, subtitle information and spectrum information, and the position information comprises episode identification and progress information; and if the music information corresponding to the multimedia information is found, returning the music information to the terminal.
In one possible implementation, the method further includes: receiving a second request for updating the music information sent by the terminal, wherein the second request carries the corresponding relation between the multimedia information and the music information; and updating the corresponding relation between the music information and the multimedia information by adopting the corresponding relation in the second request.
In one possible implementation, the method further includes: and returning the music corresponding to the music information to the terminal.
According to a third aspect of the present disclosure, there is provided an apparatus for identifying music, the apparatus being applied to a terminal, the apparatus including an acquisition module configured to acquire multimedia information of a currently played multimedia resource when a music identification instruction is received, the multimedia information including one or more of location information, subtitle information, and spectrum information, the location information including episode identification and progress information; the first acquisition module is used for acquiring music information corresponding to the multimedia information from a server; and the display module is used for displaying the music information if the music information corresponding to the multimedia information is acquired.
In one possible implementation, the apparatus further includes: and the first playing module is used for prompting that the acquisition fails and continuing to play the multimedia resource if the music information corresponding to the multimedia information is not acquired.
In one possible implementation, the apparatus further includes: the second acquisition module is used for acquiring the edited music information and the multimedia information of the currently played multimedia resource when receiving the music information editing instruction; the establishing module is used for establishing the corresponding relation between the multimedia information of the currently played multimedia resource and the edited music information; and the sending module is used for sending the corresponding relation to the server.
In one possible implementation, the apparatus further includes: and the second playing module is used for pausing the playing of the multimedia resource and playing the music corresponding to the music information.
According to a fourth aspect of the present disclosure, a device for identifying music is provided, where the device is applied to a server and includes a first receiving module, configured to receive a first request for obtaining music information sent by a terminal, where the first request carries multimedia information; the searching module is used for searching music information corresponding to the multimedia information according to the corresponding relation between the music information and the multimedia information, wherein the multimedia information comprises one or more of position information, subtitle information and spectrum information, and the position information comprises episode identification and progress information; and the first returning module is used for returning the music information to the terminal if the music information corresponding to the multimedia information is found.
In one possible implementation, the apparatus further includes: the second receiving module is used for receiving a second request for updating the music information sent by the terminal, wherein the second request carries the corresponding relation between the multimedia information and the music information;
and the updating module is used for updating the corresponding relation between the music information and the multimedia information by adopting the corresponding relation in the second request.
In one possible implementation, the apparatus further includes: and the second returning module is used for returning the music corresponding to the music information to the terminal.
According to a fifth aspect of the present disclosure, there is provided an apparatus for recognizing music, comprising: a processor; a memory for storing processor-executable instructions; wherein the processor is configured to perform the method of the first aspect.
According to a sixth aspect of the present disclosure, there is provided a non-transitory computer readable storage medium having stored thereon computer program instructions, wherein the computer program instructions, when executed by a processor, implement the method of the first aspect described above.
According to a seventh aspect of the present disclosure, there is provided an apparatus for recognizing music, comprising: a processor; a memory for storing processor-executable instructions; wherein the processor is configured to perform the method of the second aspect described above.
According to an eighth aspect of the present disclosure, there is provided a non-transitory computer readable storage medium having stored thereon computer program instructions, wherein the computer program instructions, when executed by a processor, implement the method of the second aspect described above.
The method and the device for identifying the music according to the embodiments of the disclosure can display the music information corresponding to the current multimedia resource, and facilitate a user to search the music.
Other features and aspects of the present disclosure will become apparent from the following detailed description of exemplary embodiments, which proceeds with reference to the accompanying drawings.
Drawings
The accompanying drawings, which are incorporated in and constitute a part of this specification, illustrate exemplary embodiments, features, and aspects of the disclosure and, together with the description, serve to explain the principles of the disclosure.
Fig. 1 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure.
Fig. 2 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure.
Fig. 3 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure.
Fig. 4 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure.
Fig. 5 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Fig. 6 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Fig. 7 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Fig. 8 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Fig. 9 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Fig. 10 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure.
Detailed Description
Various exemplary embodiments, features and aspects of the present disclosure will be described in detail below with reference to the accompanying drawings. In the drawings, like reference numbers can indicate functionally identical or similar elements. While the various aspects of the embodiments are presented in drawings, the drawings are not necessarily drawn to scale unless specifically indicated.
The word "exemplary" is used exclusively herein to mean "serving as an example, embodiment, or illustration. Any embodiment described herein as "exemplary" is not necessarily to be construed as preferred or advantageous over other embodiments.
Furthermore, in the following detailed description, numerous specific details are set forth in order to provide a better understanding of the present disclosure. It will be understood by those skilled in the art that the present disclosure may be practiced without some of these specific details. In some instances, methods, means, elements and circuits that are well known to those skilled in the art have not been described in detail so as not to obscure the present disclosure.
Fig. 1 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure. The method can be applied to terminals such as mobile phones, tablet computers, computers and the like. As shown in fig. 1, the method may include:
step S11, when receiving the music identification instruction, collecting multimedia information of the multimedia resource played currently, wherein the multimedia information includes one or more of position information, caption information and spectrum information, and the position information includes episode identification and progress information.
The music recognition instruction may be an instruction generated by a user by triggering a music recognition function. In one example, the music recognition function may be triggered by a button, which may be a virtual button trigger or a physical button trigger. The virtual button can be arranged on a playing page of the multimedia resource, and the virtual button can be hidden when the multimedia resource is played in a full screen mode. The entity button may be a key of the terminal, such as a combination of a volume key and a screen locking key of the mobile phone. In one example, the music recognition function may also be triggered by speech, which is triggered when the speech includes a preset word (e.g., "perform music recognition"). The music recognition function may also be triggered by other means, as this disclosure is not limited thereto.
The multimedia resource may be a video resource such as a movie, a tv show, and a variety, which is not limited by the present disclosure. When receiving the music identification instruction, the terminal may acquire multimedia information of a currently played multimedia resource, where the multimedia information includes one or more of position information, subtitle information, and spectrum information.
The location information may include episode identification and progress information. Wherein the episode identification may be used to identify a currently playing episode, such as movie 1, or a second episode of a tv show 1. The progress information may be used to indicate a point in time, for example, 25 minutes and 46 seconds, to which the current play is made. The location information may be used to indicate the currently playing episode and the point in time of the playing, e.g. the 25 th minute 46 seconds of movie 1.
The subtitle information may be used to indicate a subtitle corresponding to a currently played episode at a currently played time point. For example, the subtitle for movie 1 at 25 minutes and 46 seconds.
The spectrum information may be a spectrum of audio corresponding to a currently played episode at a currently played time point. For example, movie 1, 25 th minute 46 second corresponds to the spectrum of audio. In one possible implementation manner, the frequency spectrum of the audio corresponding to the currently played episode at the currently played time point may be a frequency spectrum of the audio in the currently played episode for a period of time with the currently played time point as a center point/a start point/an end point.
Step S12, music information corresponding to the multimedia information is acquired from the server.
The music information may be related information of music, such as a title, an album to which the music belongs, a singer, a play link address, and lyrics.
In a possible implementation manner, the terminal may send a request carrying multimedia information to the server to obtain music information corresponding to the multimedia information from the server. After receiving the request, the server can search the music information corresponding to the multimedia information carried in the request according to the corresponding relation between the multimedia information and the music information.
In one possible implementation manner, the server stores music information, time intervals corresponding to the music information, caption information corresponding to each time interval, and frequency spectrum information corresponding to the music information. The time interval corresponding to the music information comprises an episode identification and a start-stop time, the episode identification of the time interval is used for identifying the episode of the music corresponding to the music information, and the start-stop time of the time interval represents the time period of the music corresponding to the music information in the episode. The caption information corresponding to the time interval may indicate captions played during the occurrence of music corresponding to the music information in the episode. The spectrum information corresponding to the music information may indicate a spectrum of audio of the music corresponding to the music information. Table 1 shows one example of music information, time intervals corresponding to the music information, caption information corresponding to each time interval, and spectrum information corresponding to the music information stored in the server.
TABLE 1
Figure BDA0001606334220000071
In one example, the multimedia information includes position information, the server may obtain an episode identification and progress information from the position information, the server obtains start and stop times in a time interval in which the episode identification is located, and for each start and stop time, it is determined whether a time point corresponding to the progress information is within the start and stop time. And after determining the starting and ending time of the time point corresponding to the progress information, the server returns the music information corresponding to the time interval corresponding to the starting and ending time to the terminal.
In one example, the multimedia information includes subtitle information that the server may look up in locally stored subtitle information. If the server finds the subtitle information in the multimedia information locally, the server can determine that the music information corresponding to the found subtitle information is returned to the terminal. The server may search the subtitle information of the multimedia information in the locally stored subtitle information by using a text comparison method in the related art, which is not limited in this disclosure.
In one example, the multimedia information includes spectrum information, and the server may match the spectrum information with locally stored spectrum information and return music information corresponding to the matched spectrum information to the terminal. The server may match the spectrum information in the multimedia information with the locally stored spectrum information by using a spectrum information comparison method in the related art, which is not limited in this disclosure.
In one possible implementation, the server may set location information, caption information, and spectrum information priorities. And if music information corresponding to the position information exists, the music information is returned to the terminal, and the process is ended. If the music information corresponding to the position information does not exist, the server can determine the music information corresponding to the caption information, and if the music information corresponding to the caption information exists, the server returns the music information to the terminal and ends the process. If the music information corresponding to the caption information does not exist, the server can determine the music information corresponding to the frequency spectrum information, and if the music information corresponding to the frequency spectrum information exists, the server returns the music information to the terminal and ends the process. If music information corresponding to the frequency spectrum information does not exist, the server can send a notification of acquisition failure to the terminal. Because the workload of comparing time is small, the workload of comparing frequency spectrum is large, and the workload of comparing characters is between the two, the priority is set to be position information, caption information and frequency spectrum information from high to low, and the workload can be reduced. It should be noted that the above setting manner of the priority level is only an example, and the present disclosure does not limit the priority levels of the position information, the subtitle information, and the spectrum information.
And step S13, if the music information corresponding to the multimedia information is acquired, displaying the music information.
In a possible implementation manner, the terminal may display the music information in a small window manner on the multimedia resource playing picture, or display the music information in a semitransparent character manner, which is not limited in this disclosure.
The music identification method comprises the steps of collecting multimedia information of a currently played multimedia resource when a music identification instruction is received, obtaining music information corresponding to the multimedia information from a server, and displaying the music information if the music information corresponding to the multimedia information is obtained.
In a possible implementation manner, after the terminal acquires the music information corresponding to the multimedia information, the terminal may pause playing the multimedia resource and play the music corresponding to the music information.
In a possible implementation manner, if music information corresponding to the multimedia information is not acquired, the terminal may prompt that acquisition fails and continue to play the multimedia resource.
Fig. 2 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure. As shown in fig. 2, the method for identifying music further includes:
step S14, when receiving the music information editing instruction, acquiring the edited music information and the multimedia information of the currently played multimedia resource.
The music information editing instruction may be an instruction generated by the user by triggering the music information editing function, and the triggering of the music information editing function may refer to the triggering of the music identification function in step S11, which is not described herein again.
In one possible implementation manner, upon receiving the music information editing instruction, the terminal may provide a music information editing control, such as a music information editing box. If the corresponding music information exists when the music information editing instruction is received, the terminal can display the music information in an editable state, and a user can update the existing music information. If the corresponding music information does not exist when the music information editing instruction is received, the terminal can display a blank frame, and a user can add new music information.
Step S15, establishing a corresponding relationship between the multimedia information of the currently played multimedia resource and the edited music information.
The terminal can acquire the position information, the caption information and the frequency spectrum information of the multimedia resource which is played currently, establish the corresponding relation between the position information and the music information, establish the corresponding relation between the caption information and the music information, establish the corresponding relation between the frequency spectrum information and the music information, and associate the position information, the caption information and the frequency spectrum information through the music information.
And step S16, sending the corresponding relation to the server.
Therefore, when the user watches the multimedia resource, if the music information is not acquired, the user can add the music information by himself. After the user sees the music information displayed by the terminal, if the user finds that the music information is wrong or wants to add the content of the music information, the user can edit the music information.
Fig. 3 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure. The method may be applied to a server. As shown in fig. 3, the method of identifying music may include:
step S21, receiving a first request for obtaining music information sent by a terminal, where the first request carries multimedia information, the multimedia information includes one or more of location information, subtitle information, and spectrum information, and the location information includes episode identification and progress information.
Wherein the first request may be used to represent a request for obtaining music information. The multimedia information of step S21 can refer to the multimedia information of step S11, and will not be described herein.
And step S22, searching music information corresponding to the multimedia information according to the corresponding relation between the music information and the multimedia information.
Step S22 can refer to step S12, which is not described here.
And step S23, if the music information corresponding to the multimedia information is found, returning the music information to the terminal.
Therefore, the method and the device for identifying music according to the embodiments of the disclosure can return music information corresponding to the current multimedia resource to the terminal, thereby facilitating the user to search for music.
In a possible implementation manner, the server may further return music corresponding to the music information to the terminal.
In a possible implementation manner, if music information corresponding to the multimedia information is not found, the server may send a prompt of acquisition failure to the terminal.
Fig. 4 illustrates a flow chart of a method of identifying music according to an embodiment of the present disclosure. As shown in fig. 4, the method of recognizing music may further include:
and S24, receiving a second request for updating the music information sent by the terminal, wherein the second request carries the corresponding relation between the multimedia information and the music information.
The second request may be for indicating a request for updating music information. The step S15 can be referred to as the corresponding relationship between the multimedia information and the music information in the step S24, and the description thereof is omitted here.
And S25, updating the corresponding relation between the music information and the multimedia information by adopting the corresponding relation in the second request.
In a possible implementation manner, if the music information is newly added music information, the server may store a correspondence between the music information and the multimedia information.
In a possible implementation manner, if the music information is updated on the basis of existing music information, the server may search for local music information corresponding to the multimedia information, and update the music information using the received music information.
Therefore, when the user watches the multimedia resource, if the music information is not acquired, the user can add the music information by himself. After the user sees the music information displayed by the terminal, if the user finds that the music information is wrong or wants to add the content of the music information, the user can edit the music information.
Fig. 5 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure. The apparatus for recognizing music may be applied to a terminal. As shown in fig. 5, the apparatus 50 for recognizing music may include:
the collection module 51 is configured to collect multimedia information of a currently played multimedia resource when a music recognition instruction is received, where the multimedia information includes one or more of position information, subtitle information, and spectrum information, and the position information includes episode identification and progress information;
a first obtaining module 52, configured to obtain music information corresponding to the multimedia information from a server;
and a displaying module 53, configured to display the music information if the music information corresponding to the multimedia information is obtained.
Fig. 6 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure. As shown in fig. 6, in a possible implementation, the apparatus 50 for identifying music may further include:
the first playing module 54 is configured to prompt that the obtaining fails and continue to play the multimedia resource if the music information corresponding to the multimedia information is not obtained.
In one possible implementation, the apparatus 50 may further include:
a second obtaining module 55, configured to obtain edited music information and multimedia information of a currently played multimedia resource when receiving a music information editing instruction;
an establishing module 56, configured to establish a corresponding relationship between multimedia information of a currently played multimedia resource and edited music information;
a sending module 57, configured to send the correspondence to the server.
In one possible implementation, the apparatus 50 may further include:
the second playing module 58 is configured to pause playing the multimedia resource and play the music corresponding to the music information.
Fig. 7 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure. The apparatus for recognizing music may be applied to a server. As shown in fig. 7, the apparatus 60 for recognizing music may include:
a first receiving module 61, configured to receive a first request for obtaining music information sent by a terminal, where the first request carries multimedia information;
the searching module 62 is configured to search music information corresponding to the multimedia information according to a corresponding relationship between the music information and the multimedia information, where the multimedia information includes one or more of location information, subtitle information, and spectrum information, and the location information includes episode identification and progress information;
a first returning module 63, configured to return the music information to the terminal if the music information corresponding to the multimedia information is found.
Fig. 8 illustrates a block diagram of an apparatus for recognizing music according to an embodiment of the present disclosure. As shown in fig. 8, in a possible implementation, the apparatus 60 for identifying music may further include:
a second receiving module 64, configured to receive a second request for updating music information sent by the terminal, where the second request carries a corresponding relationship between multimedia information and music information;
and an updating module 65, configured to update the correspondence between the music information and the multimedia information by using the correspondence in the second request.
In one possible implementation, the apparatus 60 may further include:
and a second returning module 66, configured to return the music corresponding to the music information to the terminal.
The music identification device comprises a server, a music identification device and a music display device, wherein the music identification device comprises a music display device, a music display device and a music display device, the music display device comprises a music display device, a music display device and a music display device, the music display device comprises a music display device and a music display device, the music display device comprises a music display device, the music display device and a music display device.
Fig. 9 is a block diagram illustrating an apparatus 800 for identifying music according to an example embodiment. For example, the apparatus 800 may be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a game console, a tablet device, a medical device, an exercise device, a personal digital assistant, and the like.
Referring to fig. 9, the apparatus 800 may include one or more of the following components: processing component 802, memory 804, power component 806, multimedia component 808, audio component 810, input/output (I/O) interface 812, sensor component 814, and communication component 816.
The processing component 802 generally controls overall operation of the device 800, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations. The processing components 802 may include one or more processors 820 to execute instructions to perform all or a portion of the steps of the methods described above. Further, the processing component 802 can include one or more modules that facilitate interaction between the processing component 802 and other components. For example, the processing component 802 can include a multimedia module to facilitate interaction between the multimedia component 808 and the processing component 802.
The memory 804 is configured to store various types of data to support operations at the apparatus 800. Examples of such data include instructions for any application or method operating on device 800, contact data, phonebook data, messages, pictures, videos, and so forth. The memory 804 may be implemented by any type or combination of volatile or non-volatile memory devices such as Static Random Access Memory (SRAM), electrically erasable programmable read-only memory (EEPROM), erasable programmable read-only memory (EPROM), programmable read-only memory (PROM), read-only memory (ROM), magnetic memory, flash memory, magnetic or optical disks.
Power components 806 provide power to the various components of device 800. The power components 806 may include a power management system, one or more power supplies, and other components associated with generating, managing, and distributing power for the apparatus 800.
The multimedia component 808 includes a screen that provides an output interface between the device 800 and a user. In some embodiments, the screen may include a Liquid Crystal Display (LCD) and a Touch Panel (TP). If the screen includes a touch panel, the screen may be implemented as a touch screen to receive an input signal from a user. The touch panel includes one or more touch sensors to sense touch, slide, and gestures on the touch panel. The touch sensor may not only sense the boundary of a touch or slide action, but also detect the duration and pressure associated with the touch or slide operation. In some embodiments, the multimedia component 808 includes a front facing camera and/or a rear facing camera. The front camera and/or the rear camera may receive external multimedia data when the device 800 is in an operating mode, such as a shooting mode or a video mode. Each front camera and rear camera may be a fixed optical lens system or have a focal length and optical zoom capability.
The audio component 810 is configured to output and/or input audio signals. For example, the audio component 810 includes a Microphone (MIC) configured to receive external audio signals when the apparatus 800 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode. The received audio signals may further be stored in the memory 804 or transmitted via the communication component 816. In some embodiments, audio component 810 also includes a speaker for outputting audio signals.
The I/O interface 812 provides an interface between the processing component 802 and peripheral interface modules, which may be keyboards, click wheels, buttons, etc. These buttons may include, but are not limited to: a home button, a volume button, a start button, and a lock button.
The sensor assembly 814 includes one or more sensors for providing various aspects of state assessment for the device 800. For example, the sensor assembly 814 may detect the open/closed status of the device 800, the relative positioning of components, such as a display and keypad of the device 800, the sensor assembly 814 may also detect a change in the position of the device 800 or a component of the device 800, the presence or absence of user contact with the device 800, the orientation or acceleration/deceleration of the device 800, and a change in the temperature of the device 800. Sensor assembly 814 may include a proximity sensor configured to detect the presence of a nearby object without any physical contact. The sensor assembly 814 may also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications. In some embodiments, the sensor assembly 814 may also include an acceleration sensor, a gyroscope sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
The communication component 816 is configured to facilitate communications between the apparatus 800 and other devices in a wired or wireless manner. The device 800 may access a wireless network based on a communication standard, such as WiFi, 2G or 3G, or a combination thereof. In an exemplary embodiment, the communication component 816 receives a broadcast signal or broadcast related information from an external broadcast management system via a broadcast channel. In an exemplary embodiment, the communication component 816 further includes a Near Field Communication (NFC) module to facilitate short-range communications. For example, the NFC module may be implemented based on Radio Frequency Identification (RFID) technology, infrared data association (IrDA) technology, Ultra Wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
In an exemplary embodiment, the apparatus 800 may be implemented by one or more Application Specific Integrated Circuits (ASICs), Digital Signal Processors (DSPs), Digital Signal Processing Devices (DSPDs), Programmable Logic Devices (PLDs), Field Programmable Gate Arrays (FPGAs), controllers, micro-controllers, microprocessors or other electronic components for performing the above-described methods.
In an exemplary embodiment, a non-transitory computer readable storage medium, such as the memory 804, is also provided that includes computer program instructions executable by the processor 820 of the apparatus 800 to perform the method performed by the terminal.
Fig. 10 is a block diagram illustrating an apparatus 1900 for recognizing music according to an example embodiment. For example, the apparatus 1900 may be provided as a server. Referring to FIG. 10, the device 1900 includes a processing component 1922 further including one or more processors and memory resources, represented by memory 1932, for storing instructions, e.g., applications, executable by the processing component 1922. The application programs stored in memory 1932 may include one or more modules that each correspond to a set of instructions. Further, the processing component 1922 is configured to execute instructions to perform the above-described method.
The device 1900 may also include a power component 1926 configured to perform power management of the device 1900, a wired or wireless network interface 1950 configured to connect the device 1900 to a network, and an input/output (I/O) interface 1958. The device 1900 may operate based on an operating system stored in memory 1932, such as Windows Server, Mac OS XTM, UnixTM, LinuxTM, FreeBSDTM, or the like.
In an exemplary embodiment, a non-transitory computer readable storage medium, such as the memory 1932, is also provided that includes computer program instructions executable by the processing component 1922 of the apparatus 1900 to perform the server-implemented method described above.
The present disclosure may be systems, methods, and/or computer program products. The computer program product may include a computer-readable storage medium having computer-readable program instructions embodied thereon for causing a processor to implement various aspects of the present disclosure.
The computer readable storage medium may be a tangible device that can hold and store the instructions for use by the instruction execution device. The computer readable storage medium may be, for example, but not limited to, an electronic memory device, a magnetic memory device, an optical memory device, an electromagnetic memory device, a semiconductor memory device, or any suitable combination of the foregoing. More specific examples (a non-exhaustive list) of the computer readable storage medium would include the following: a portable computer diskette, a hard disk, a Random Access Memory (RAM), a read-only memory (ROM), an erasable programmable read-only memory (EPROM or flash memory), a Static Random Access Memory (SRAM), a portable compact disc read-only memory (CD-ROM), a Digital Versatile Disc (DVD), a memory stick, a floppy disk, a mechanical coding device, such as punch cards or in-groove projection structures having instructions stored thereon, and any suitable combination of the foregoing. Computer-readable storage media as used herein is not to be construed as transitory signals per se, such as radio waves or other freely propagating electromagnetic waves, electromagnetic waves propagating through a waveguide or other transmission medium (e.g., optical pulses through a fiber optic cable), or electrical signals transmitted through electrical wires.
The computer-readable program instructions described herein may be downloaded from a computer-readable storage medium to a respective computing/processing device, or to an external computer or external storage device via a network, such as the internet, a local area network, a wide area network, and/or a wireless network. The network may include copper transmission cables, fiber optic transmission, wireless transmission, routers, firewalls, switches, gateway computers and/or edge servers. The network adapter card or network interface in each computing/processing device receives computer-readable program instructions from the network and forwards the computer-readable program instructions for storage in a computer-readable storage medium in the respective computing/processing device.
The computer program instructions for carrying out operations of the present disclosure may be assembler instructions, Instruction Set Architecture (ISA) instructions, machine-related instructions, microcode, firmware instructions, state setting data, or source or object code written in any combination of one or more programming languages, including an object oriented programming language such as Smalltalk, C + + or the like and conventional procedural programming languages, such as the "C" programming language or similar programming languages. The computer-readable program instructions may execute entirely on the user's computer, partly on the user's computer, as a stand-alone software package, partly on the user's computer and partly on a remote computer or entirely on the remote computer or server. In the case of a remote computer, the remote computer may be connected to the user's computer through any type of network, including a Local Area Network (LAN) or a Wide Area Network (WAN), or the connection may be made to an external computer (for example, through the Internet using an Internet service provider). In some embodiments, the electronic circuitry that can execute the computer-readable program instructions implements aspects of the present disclosure by utilizing the state information of the computer-readable program instructions to personalize the electronic circuitry, such as a programmable logic circuit, a Field Programmable Gate Array (FPGA), or a Programmable Logic Array (PLA).
Various aspects of the present disclosure are described herein with reference to flowchart illustrations and/or block diagrams of methods, apparatus (systems) and computer program products according to embodiments of the disclosure. It will be understood that each block of the flowchart illustrations and/or block diagrams, and combinations of blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer-readable program instructions.
These computer-readable program instructions may be provided to a processor of a general purpose computer, special purpose computer, or other programmable data processing apparatus to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing apparatus, create means for implementing the functions/acts specified in the flowchart and/or block diagram block or blocks. These computer-readable program instructions may also be stored in a computer-readable storage medium that can direct a computer, programmable data processing apparatus, and/or other devices to function in a particular manner, such that the computer-readable medium storing the instructions comprises an article of manufacture including instructions which implement the function/act specified in the flowchart and/or block diagram block or blocks.
The computer readable program instructions may also be loaded onto a computer, other programmable data processing apparatus, or other devices to cause a series of operational steps to be performed on the computer, other programmable apparatus or other devices to produce a computer implemented process such that the instructions which execute on the computer, other programmable apparatus or other devices implement the functions/acts specified in the flowchart and/or block diagram block or blocks.
The flowchart and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of systems, methods and computer program products according to various embodiments of the present disclosure. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of instructions, which comprises one or more executable instructions for implementing the specified logical function(s). In some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
Having described embodiments of the present disclosure, the foregoing description is intended to be exemplary, not exhaustive, and not limited to the disclosed embodiments. Many modifications and variations will be apparent to those of ordinary skill in the art without departing from the scope and spirit of the described embodiments. The terms used herein were chosen in order to best explain the principles of the embodiments, the practical application, or technical improvements to the techniques in the marketplace, or to enable others of ordinary skill in the art to understand the embodiments disclosed herein.

Claims (18)

1. A method for identifying music, the method being applied to a terminal, the method comprising:
when a music identification instruction is received, acquiring multimedia information of a multimedia resource which is played currently, wherein the multimedia information comprises position information, subtitle information and spectrum information, and the position information comprises an episode identification and progress information;
acquiring music information corresponding to the multimedia information from a server according to the preset priority of the multimedia information;
and if the music information corresponding to the multimedia information is acquired, displaying the music information.
2. The method of claim 1, further comprising:
and if the music information corresponding to the multimedia information is not acquired, prompting that the acquisition fails and continuing to play the multimedia resource.
3. The method of claim 1, further comprising:
when a music information editing instruction is received, acquiring edited music information and multimedia information of a currently played multimedia resource;
establishing a corresponding relation between multimedia information of the currently played multimedia resource and edited music information;
and sending the corresponding relation to the server.
4. The method of claim 1, further comprising:
and pausing the playing of the multimedia resources and playing the music corresponding to the music information.
5. A method for identifying music, the method being applied to a server, the method comprising:
receiving a first request for acquiring music information sent by a terminal, wherein the first request carries multimedia information, the multimedia information comprises position information, subtitle information and spectrum information, and the position information comprises an episode identification and progress information;
searching music information corresponding to the multimedia information according to the preset priority of the multimedia information and the corresponding relation between the music information and the multimedia information;
and if the music information corresponding to the multimedia information is found, returning the music information to the terminal.
6. The method of claim 5, further comprising:
receiving a second request for updating the music information sent by the terminal, wherein the second request carries the corresponding relation between the multimedia information and the music information;
and updating the corresponding relation between the music information and the multimedia information by adopting the corresponding relation in the second request.
7. The method of claim 5, further comprising:
and returning the music corresponding to the music information to the terminal.
8. An apparatus for recognizing music, the apparatus being applied to a terminal, the apparatus comprising:
the music identification module is used for receiving a music identification instruction and acquiring multimedia information of a multimedia resource played currently, wherein the multimedia information comprises position information, subtitle information and spectrum information, and the position information comprises an episode identification and progress information;
the first acquisition module is used for acquiring music information corresponding to the multimedia information from a server according to the preset priority of the multimedia information;
and the display module is used for displaying the music information if the music information corresponding to the multimedia information is acquired.
9. The apparatus of claim 8, further comprising:
and the first playing module is used for prompting that the acquisition fails and continuing to play the multimedia resource if the music information corresponding to the multimedia information is not acquired.
10. The apparatus of claim 8, further comprising:
the second acquisition module is used for acquiring the edited music information and the multimedia information of the currently played multimedia resource when receiving the music information editing instruction;
the establishing module is used for establishing the corresponding relation between the multimedia information of the currently played multimedia resource and the edited music information;
and the sending module is used for sending the corresponding relation to the server.
11. The apparatus of claim 8, further comprising:
and the second playing module is used for pausing the playing of the multimedia resource and playing the music corresponding to the music information.
12. An apparatus for recognizing music, the apparatus being applied to a server, the apparatus comprising:
the first receiving module is used for receiving a first request for acquiring music information sent by a terminal, wherein the first request carries multimedia information;
the searching module is used for searching music information corresponding to the multimedia information according to the preset priority of the multimedia information and the corresponding relation between the music information and the multimedia information, wherein the multimedia information comprises position information, subtitle information and spectrum information, and the position information comprises episode identification and progress information;
and the first returning module is used for returning the music information to the terminal if the music information corresponding to the multimedia information is found.
13. The apparatus of claim 12, further comprising:
the second receiving module is used for receiving a second request for updating the music information sent by the terminal, wherein the second request carries the corresponding relation between the multimedia information and the music information;
and the updating module is used for updating the corresponding relation between the music information and the multimedia information by adopting the corresponding relation in the second request.
14. The apparatus of claim 12, further comprising:
and the second returning module is used for returning the music corresponding to the music information to the terminal.
15. An apparatus for recognizing music, comprising:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to perform the method of any one of claims 1 to 4.
16. A non-transitory computer readable storage medium having computer program instructions stored thereon, wherein the computer program instructions, when executed by a processor, implement the method of any of claims 1 to 4.
17. An apparatus for recognizing music, comprising:
a processor;
a memory for storing instructions executable by the processor;
wherein the processor is configured to perform the method of any one of claims 5 to 7.
18. A non-transitory computer readable storage medium having computer program instructions stored thereon, wherein the computer program instructions, when executed by a processor, implement the method of any of claims 5 to 7.
CN201810245218.1A 2018-03-23 2018-03-23 Music recognition method and device Active CN110300323B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810245218.1A CN110300323B (en) 2018-03-23 2018-03-23 Music recognition method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810245218.1A CN110300323B (en) 2018-03-23 2018-03-23 Music recognition method and device

Publications (2)

Publication Number Publication Date
CN110300323A CN110300323A (en) 2019-10-01
CN110300323B true CN110300323B (en) 2021-08-03

Family

ID=68026010

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810245218.1A Active CN110300323B (en) 2018-03-23 2018-03-23 Music recognition method and device

Country Status (1)

Country Link
CN (1) CN110300323B (en)

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009117974A (en) * 2007-11-02 2009-05-28 Fujifilm Corp Interest information creation method, apparatus, and system
CN105657535B (en) * 2015-12-29 2018-10-30 北京搜狗科技发展有限公司 A kind of audio identification methods and device

Also Published As

Publication number Publication date
CN110300323A (en) 2019-10-01

Similar Documents

Publication Publication Date Title
CN108668176B (en) Bullet screen display method and device
EP3174053A1 (en) Method, apparatus and system for playing multimedia data, computer program and recording medium
CN108259991B (en) Video processing method and device
CN107948708B (en) Bullet screen display method and device
CN110519655B (en) Video editing method, device and storage medium
CN108495168B (en) Bullet screen information display method and device
CN110996121A (en) Information processing method and device, electronic equipment and storage medium
CN107122430B (en) Search result display method and device
CN109947981B (en) Video sharing method and device
CN108600818B (en) Method and device for displaying multimedia resources
EP3147802B1 (en) Method and apparatus for processing information
CN109063101B (en) Video cover generation method and device
CN110234030B (en) Bullet screen information display method and device
CN108174269B (en) Visual audio playing method and device
CN108833952B (en) Video advertisement putting method and device
CN112291631A (en) Information acquisition method, device, terminal and storage medium
CN108521579B (en) Bullet screen information display method and device
CN109992754B (en) Document processing method and device
CN109756783B (en) Poster generation method and device
CN110955800A (en) Video retrieval method and device
CN110213062B (en) Method and device for processing message
CN110121115B (en) Method and device for determining wonderful video clip
WO2020020222A1 (en) Play framework, display method, apparatus and storage medium for media content
CN110460873B (en) Method and device for generating video on demand
CN110858921A (en) Program video processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
TA01 Transfer of patent application right
TA01 Transfer of patent application right

Effective date of registration: 20200430

Address after: 310052 room 508, floor 5, building 4, No. 699, Wangshang Road, Changhe street, Binjiang District, Hangzhou City, Zhejiang Province

Applicant after: Alibaba (China) Co.,Ltd.

Address before: 100080 Beijing Haidian District city Haidian street A Sinosteel International Plaza No. 8 block 5 layer A, C

Applicant before: Youku network technology (Beijing) Co., Ltd

GR01 Patent grant
GR01 Patent grant