WO2016165325A1 - 音频信息识别方法及装置 - Google Patents
音频信息识别方法及装置 Download PDFInfo
- Publication number
- WO2016165325A1 WO2016165325A1 PCT/CN2015/095034 CN2015095034W WO2016165325A1 WO 2016165325 A1 WO2016165325 A1 WO 2016165325A1 CN 2015095034 W CN2015095034 W CN 2015095034W WO 2016165325 A1 WO2016165325 A1 WO 2016165325A1
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- audio
- information
- keyword
- audio information
- link
- Prior art date
Links
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F16/00—Information retrieval; Database structures therefor; File system structures therefor
- G06F16/60—Information retrieval; Database structures therefor; File system structures therefor of audio data
- G06F16/68—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
- G06F16/683—Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F3/00—Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
- G06F3/16—Sound input; Sound output
- G06F3/165—Management of the audio stream, e.g. setting of volume, audio stream path
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06F—ELECTRIC DIGITAL DATA PROCESSING
- G06F40/00—Handling natural language data
- G06F40/10—Text processing
- G06F40/12—Use of codes for handling textual entities
- G06F40/134—Hyperlinking
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/29—Arrangements for monitoring broadcast services or broadcast-related services
- H04H60/33—Arrangements for monitoring the users' behaviour or opinions
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/35—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
- H04H60/37—Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/61—Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
- H04H60/65—Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 for using the result on users' side
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04H—BROADCAST COMMUNICATION
- H04H60/00—Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
- H04H60/68—Systems specially adapted for using specific information, e.g. geographical or meteorological information
- H04H60/73—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
- H04H60/74—Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information using programme related information, e.g. title, composer or interpreter
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L65/00—Network arrangements, protocols or services for supporting real-time applications in data packet communication
- H04L65/40—Support for services or applications
- H04L65/402—Support for services or applications wherein the services involve a main real-time session and one or more additional parallel non-real time sessions, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services
- H04L65/4025—Support for services or applications wherein the services involve a main real-time session and one or more additional parallel non-real time sessions, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services where none of the additional parallel sessions is real time or time sensitive, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services
Definitions
- the present disclosure relates to the field of audio recognition technologies, and in particular, to an audio information identification method and apparatus.
- some applications can recognize the song name, singer and lyrics of the listened song, and display the recognized song name, singer and lyrics to the user.
- the present disclosure provides an audio information identification method and apparatus.
- the technical solution is as follows:
- an audio information identifying method comprising:
- an audio information identifying apparatus comprising:
- An identification module configured to identify the audio being played to obtain audio information of the audio
- a first display module configured to display, on the information display interface, a jump link set for a keyword in the audio information identified by the identification module;
- a second display module configured to be triggered when the jump link displayed by the first display module is triggered At the time, pre-stored information corresponding to the keyword is displayed.
- an audio information identifying apparatus comprising:
- a memory for storing the processor executable instructions
- processor is configured to:
- FIG. 1 is a flowchart of an audio information identification method according to an exemplary embodiment
- FIG. 2A is a flowchart of an audio information identification method according to another exemplary embodiment
- 2B is a flowchart of a method for acquiring audio information, according to an exemplary embodiment
- FIG. 2C is a schematic diagram showing displaying audio information and a jump link according to an exemplary embodiment
- 2D is a schematic diagram showing a display jump page according to an exemplary embodiment
- FIG. 3A is a flowchart illustrating a method of playing or downloading audio that is listened to, according to an exemplary embodiment
- FIG. 3B is a diagram showing a play link and a download link for displaying audio according to an exemplary embodiment. schematic diagram
- FIG. 4A is a flowchart of a method for searching for keywords in audio information, according to an exemplary embodiment
- FIG. 4B is a schematic diagram showing a search result corresponding to a keyword according to an exemplary embodiment
- FIG. 5 is a block diagram of an audio information identifying apparatus according to an exemplary embodiment
- FIG. 6 is a block diagram of an audio information identifying apparatus according to another exemplary embodiment
- FIG. 7 is a block diagram of an apparatus for identifying audio information, according to an exemplary embodiment.
- FIG. 1 is a flowchart of an audio information identification method according to an exemplary embodiment.
- the audio information identification method may be applied to an electronic device, where the electronic device may be a smart phone, a tablet, or Smart TVs, e-book readers, multimedia players, laptop portable computers and desktop computers, to name a few.
- the audio information identifying method may include the following steps.
- step 101 the audio being played is identified to obtain audio information of the audio.
- step 102 a jump link set for the keyword in the audio information is displayed on the information display interface.
- step 103 when the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
- the audio information identifying method obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
- FIG. 2A is a flowchart of a method for identifying an audio information according to another exemplary embodiment.
- the method for identifying an audio information may be applied to an electronic device, which may be a smart phone or a tablet. , smart TV, e-book reader, multimedia player, laptop portable computer and desktop computer, and so on.
- the audio information identifying method may include the following steps. Before recognizing the audio being played, the electronic device needs to acquire the audio being played. In order to meet different requirements, the manner in which the electronic device acquires the audio being played also needs to be adjusted accordingly. Please refer to step 201 and step 202 below.
- step 201 the audio being played is acquired every predetermined time interval.
- the audio being played here may be the audio played by the electronic device after listening to the radio broadcast, or the audio being played by other devices around the electronic device, and the electronic device may acquire the other device that is playing the audio. Audio.
- the audio can be music audio, language program audio, or book audio.
- the electronic device can acquire the audio being played every predetermined time interval, and the predetermined time interval can be set by the user, for example, the predetermined time interval can be set to 3 minutes, 4 minutes, or 5 minutes, and the like.
- the electronic device may acquire the audio being played after detecting that the current audio tempo changes more than a predetermined threshold.
- a predetermined threshold For example, in the case of music audio, there is usually an interval after the completion of a song and before the next song is played, and the rhythm of the audio is quite different from the rhythm when playing the song. Therefore, when the electronic device detects that the rhythm of the current audio exceeds a predetermined threshold change, it indicates that the played song has been switched, and the audio acquired by the electronic device at this time is the audio of the switched song.
- step 202 an identification command for identifying the audio being played is received, and the audio being played is acquired.
- the electronic device can acquire the audio being played after receiving the user-triggered recognition command for recognizing the audio being played.
- the user when the user is listening to the radio broadcast using the electronic device, it is found that the currently played audio is very nice, and it is desirable to obtain related information of the audio. At this time, the user can trigger generation on the electronic device to generate the pair. An identification instruction that the audio recognizes, and the electronic device receives the After the instruction is recognized, the audio being played is obtained.
- the user when other devices are playing audio, and the user wants to obtain information about the audio that the other device is playing, the user can turn on the held electronic device and trigger a generated pair on the electronic device.
- the recognition command for the audio being played is recognized, and the electronic device acquires the audio being played after receiving the recognition command.
- the identification control on the electronic device may be triggered to generate the identification instruction, or the specified hardware (such as the volume) triggered by the electronic device may be triggered. Key) to generate an identification command.
- step 203 the audio being played is identified to obtain audio information of the audio.
- FIG. 2B is a flowchart of a method for acquiring audio information according to an exemplary embodiment.
- step 203A the audio is identified to obtain an audio feature of the audio, the audio feature being associated with the first or both of the textual and identity information of the audio.
- the electronic device recognizes the acquired audio being played to obtain an audio feature of the audio.
- the audio feature is related to textual information, intonation features, or tonal features that appear in the audio. If the audio is recognized by voiceprint recognition technology, the audio feature is also related to the identity information of the audio. For example, when the acquired audio is music audio, the text information obtained by the recognition is the lyrics corresponding to the acquired audio, and the identity information obtained by the voiceprint recognition is the singer corresponding to the audio; when the acquired audio is a language class When the program audio is obtained, the text information obtained by the recognition is the program content corresponding to the acquired audio, and the identity information obtained by the voiceprint recognition is the performing artist corresponding to the audio.
- step 203B the audio feature is sent to the server, and the audio feature is used to trigger the server to find the audio information that matches the audio feature, and feed back the found audio information.
- the electronic device sends the obtained audio feature to the server, and the server can find the audio information that matches the audio feature according to the pre-stored database, and the server feeds back the audio information to the electronic device after finding the audio information that matches the audio feature.
- the audio information may include owner information of the audio corresponding to the audio feature, an audio name corresponding to the audio, and the like.
- the audio information may include a song name, an album name, a singer, a lyric, etc.; when the audio being played is a language program In audio, the audio information may include a program name, a performer, and the like; when the audio being played is a book audio, the audio information may include a book author, a book name, a chapter directory, and the like.
- step 203C the audio information fed back by the server is received.
- step 204 a jump link set for the keyword in the audio information is displayed on the information display interface.
- the electronic device may set a jump link for the keywords in the audio information, so that the user can obtain more information by jumping.
- the keyword here may be a keyword capable of indicating the main feature of the audio, for example, when the audio being played is music audio, the keyword may be a song name, a singer and an album name, etc.; when the audio being played is a language class In the program audio, the keywords may be the program name and the performer, etc.; when the audio being played is the book audio, the keywords may be the book author and the book name.
- FIG. 2C is a schematic diagram of displaying audio information and a jump link according to an exemplary embodiment.
- 2C takes audio as music audio as an example, and the audio information received by the electronic device is “song name: “song A”, “singer: singer A”, “album: “album A”), and lyrics corresponding to song A. .
- the electronic device has a jump link for "Song A”, “Singer A”, and “Album A”, respectively, and "Song Name: "Song A”", “Singer: Singer A”, “Album: “Album A” and the lyrics corresponding to song A are displayed on the information display interface.
- step 205 when the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
- the electronic device displays the pre-stored information corresponding to the keyword, and the pre-stored information is usually the detailed information of the pre-stored keyword.
- the electronic device jumps to display the details page of the artist; when the audio being played is the language program audio, and the program name jumps
- the transfer link is triggered, the electronic device jumps to display a detailed introduction page of the program; when the audio being played is a book audio, and the jump link of the book author is triggered, the electronic device jumps to display the page of the book author column .
- FIG. 2D is a schematic diagram of displaying a jump page according to an exemplary embodiment. 2D still uses audio as music audio as an example.
- the electronic device jumps to display the details of the singer A.
- the audio information and the jump link may be For row corresponding storage, please refer to step 206 and step 207 below.
- step 206 after the jump link is displayed, the audio information and the jump link are automatically saved to the pre-stored list.
- the electronic device displays the jump link and the audio information set in the audio information on the information display interface
- the audio information and the jump link can be automatically saved to the pre-stored list. Users can find saved audio information by viewing the pre-stored list.
- step 207 a save instruction for instructing to save the audio information and the jump link is received, and the audio information and the jump link are saved in the pre-stored list.
- the user may be inquired whether to save the audio information and the jump link, when receiving the instruction for saving the audio information and jumping
- the transfer instruction of the link is saved, and the audio information and the jump link are saved to the pre-stored list.
- the electronic device may display a save control for saving the audio information and the jump link on the information display interface, and when the electronic device detects that the save control is triggered, save the corresponding audio information and the jump link to the pre-stored List.
- the user when the user's in-vehicle system is listening to the radio station and a song is being played, the user can identify the audio being played by using an in-vehicle system or a handheld smart phone, such as an in-vehicle system or a smart phone.
- the audio information of the audio, and the jump link set by the keyword in the audio information may be displayed on the information display interface, and the user may display the pre-stored information corresponding to the keyword after triggering the jump link.
- the jump link and the audio information can be automatically changed.
- the user can also trigger a save control for saving audio information and a jump link.
- the device such as the in-vehicle system or the handheld smart phone receives the save command generated by the user-triggered save control, the audio information and the jump link are saved. Go to the pre-existing list for easy viewing by the user.
- the audio information identifying method obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the solution can only be solved in a single interface.
- the information of the audio is displayed, and the information that can be displayed is relatively small; the effect of improving the richness of the audio information is achieved.
- FIG. 3A is a flowchart of a method for playing or downloading the listened audio, according to an exemplary embodiment.
- step 301 a play link and a download link of the complete audio corresponding to the audio are displayed on the information display interface.
- the electronic device can acquire a play link and a download link of the complete audio corresponding to the audio according to the acquired audio information, and display the play link and the download link in the information display interface.
- a play link and a download link of the song corresponding to the song name may be displayed; when the obtained audio information has a program name, the language category corresponding to the program name may be displayed.
- the play link and the download link of the program audio; when there is a book name in the obtained audio information, the play link and the download link of the book audio corresponding to the book name may be displayed.
- FIG. 3B is a schematic diagram showing a play link and a download link for displaying audio according to an exemplary embodiment.
- the electronic device displays a play link 311 for playing song A and a download link 322 for downloading song A on the information display interface.
- the electronic device may further display a download link for downloading the book, and a link for reading the book online; when the obtained audio information has a program name, And when there is a program video corresponding to the program name, the electronic device may further display a download link for downloading the program video, and a play link for playing the program video.
- step 302 the full audio is played when the play link is triggered.
- step 303 when the download link is triggered, the complete audio is downloaded.
- the electronic device plays the complete audio corresponding to the acquired audio when detecting that the play link is triggered;
- the electronic device downloads the complete audio corresponding to the acquired audio when it detects that the download link is triggered.
- the embodiment of the present disclosure displays a complete audio play link and a download link corresponding to the audio on the information display interface, and plays the complete audio when the play link is triggered, and downloads when the download link is triggered.
- the complete audio since the play link and the download link can be provided on the information display interface, when the user wants to enjoy or collect the listened audio again, the user needs to open the corresponding program to search and then play or download the audio, and the operation steps are complicated. Problem; achieved the effect of simplifying the operation steps and improving the operation efficiency.
- FIG. 4A it is a flowchart of a method for searching for keywords in audio information according to an exemplary embodiment.
- step 401 a search control corresponding to the keyword in the audio information is displayed in the information display interface.
- the electronic device can display the search control corresponding to the keyword in the audio information in the information display interface.
- step 402 when a search control of a keyword is triggered, a search interface of the keyword is displayed, and a search result corresponding to the keyword is displayed in the search interface.
- the electronic device detects that the search control of a keyword in the information display interface is triggered, the search interface of the keyword is displayed, and the search result corresponding to the keyword is displayed in the search interface.
- FIG. 4B is a schematic diagram showing a search result corresponding to a keyword according to an exemplary embodiment.
- 4B is an example in which the acquired audio is music audio.
- the electronic device detects that the search control 411 of the singer A is triggered, the electronic device displays a search interface corresponding to the singer A, and displays search results corresponding to the singer A in the search interface. .
- the embodiment of the present disclosure displays a search interface of the keyword when the search control of a keyword is triggered, and the search result corresponding to the keyword is displayed in the search interface;
- the search control for searching for keywords is displayed on the display, so that the problem that it is necessary to open other applications for searching and having many operation steps is solved; the effect of improving the operation efficiency is achieved.
- steps in FIG. 2A and FIG. 3A may be combined into one embodiment.
- the steps in FIG. 2A and FIG. 4A may be combined into one embodiment, and the steps in FIG. 2A, FIG. 3A and FIG. Combined into one embodiment.
- FIG. 5 is a block diagram of an audio information identifying apparatus according to an exemplary embodiment.
- the audio information identifying apparatus may be applied to an electronic device, which may be a smart phone, a tablet, or an intelligent device. TV, e-book reader, multimedia player, laptop portable computer and desktop computer, etc.
- the audio information identifying device may include, but is not limited to, an identification module 501, a first display module 502, and a second display module 503.
- the identification module 501 is configured to identify the audio being played to obtain audio information of the audio.
- the first display module 502 is configured to display, on the information display interface, a jump link set by the keyword in the audio information recognized by the recognition module 501.
- the second display module 503 is configured to display pre-stored information corresponding to the keyword when the jump link displayed by the first display module 502 is triggered.
- the audio information identifying apparatus obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
- FIG. 6 is a block diagram of an audio information identifying apparatus, as shown in FIG. 6, the audio information identifying apparatus may be applied to an electronic device, and the electronic device may be a smart phone, a tablet, Smart TVs, e-book readers, multimedia players, laptop portable computers and desktop computers, to name a few.
- the audio information identifying apparatus may include, but is not limited to, an identification module 601, a first display module 602, and a second display module 603.
- the identification module 601 is configured to identify the audio being played to obtain audio of the audio information.
- the first display module 602 is configured to display, on the information display interface, a jump link set by the keyword in the audio information identified by the identification module 601.
- the second display module 603 is configured to display pre-stored information corresponding to the keyword when the jump link displayed by the first display module 602 is triggered.
- the identification module 601 can include: an identification sub-module 601a, a transmission sub-module 601b, and a receiving sub-module 601c.
- the identification sub-module 601a is configured to identify the audio to obtain an audio feature of the audio, the audio feature being related to the first or both of the textual information and the identity information of the audio.
- the sending sub-module 601b is configured to send the audio feature identified by the identifying sub-module 601a to the server, the audio feature is used to trigger the server to find the audio information that matches the audio feature, and feed back the found audio information.
- the receiving submodule 601c is configured to receive audio information fed back by the server.
- the audio information identifying apparatus may further include: a first obtaining module 604 or a second acquiring module 605.
- the first obtaining module 604 is configured to acquire the audio being played every predetermined time interval.
- the second obtaining module 605 is configured to receive an identification instruction for identifying the audio being played, and acquire the audio being played.
- the audio information identifying apparatus may further include: a third display module 606, a playing module 607, and a downloading module 608.
- the third display module 606 is configured to display a play link and a download link of the complete audio corresponding to the audio on the information display interface.
- the play module 607 is configured to play the full audio when the play link displayed by the third display module 606 is triggered.
- the download module 608 is configured to download the full audio when the download link displayed by the third display module 606 is triggered.
- the audio information identifying apparatus may further include: a fourth display module 609 and a fifth display module 610.
- the fourth display module 609 is configured to display a search control corresponding to the keyword in the audio information in the information display interface.
- the fifth display module 610 is configured to display a search interface of the keyword when the search control of a keyword displayed by the fourth display module 609 is triggered, and the search result corresponding to the keyword is displayed in the search interface. .
- the audio information identifying apparatus may further include: a first saving module 611 or a second saving module 612.
- the first saving module 611 is configured to automatically save the audio information and the jump link to the pre-stored list after displaying the jump link.
- the second saving module 612 is configured to receive a save instruction for instructing to save the audio information and the jump link, and save the audio information and the jump link to the pre-stored list.
- the audio information identifying apparatus obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
- the information display interface provides a play link and a download link, so that when the user wants to enjoy or collect the listened audio again, the user needs to open the corresponding program to search and then play or download the audio, and the operation steps are complicated; the simplified operation is achieved. Steps to improve the efficiency of the operation.
- search control when a search control of a keyword is triggered, a search interface of the keyword is displayed, and a search result corresponding to the keyword is displayed in the search interface; since the search keyword can be displayed on the information display interface
- the search control solves the problem that it is necessary to open other applications for searching and has many operation steps; the effect of improving the operation efficiency is achieved.
- An exemplary embodiment of the present disclosure provides an audio information identifying apparatus capable of implementing the audio information identifying method provided by the present disclosure, the audio information identifying apparatus comprising: a processor, a memory for storing processor executable instructions;
- processor is configured to:
- the jump link When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
- FIG. 7 is a block diagram of an apparatus for identifying audio information, according to an exemplary embodiment.
- device 700 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a fitness device, a personal digital assistant, and the like.
- apparatus 700 can include one or more of the following components: processing component 702, memory 704, power component 706, multimedia component 708, audio component 710, input/output (I/O) interface 712, sensor component 714, and Communication component 716.
- Processing component 702 typically controls the overall operation of device 700, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
- Processing component 702 can include one or more processors 718 to execute instructions to perform all or part of the steps of the methods described above.
- processing component 702 can include one or more modules to facilitate interaction between component 702 and other components.
- processing component 702 can include a multimedia module to facilitate interaction between multimedia component 708 and processing component 702.
- Memory 704 is configured to store various types of data to support operation at device 700. Examples of such data include instructions for any application or method operating on device 700, contact data, phone book data, messages, pictures, videos, and the like. Memory 704 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Disk or Optical Disk.
- SRAM static random access memory
- EEPROM electrically erasable programmable read only memory
- EPROM erasable Programmable Read Only Memory
- PROM Programmable Read Only Memory
- ROM Read Only Memory
- Magnetic Memory Flash Memory
- Disk Disk or Optical Disk.
- Power component 706 provides power to various components of device 700.
- Power component 706 can include electricity The source management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 700.
- the multimedia component 708 includes a screen between the device 700 and the user that provides an output interface.
- the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
- the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor can sense not only the boundaries of the touch or sliding action, but also the duration and pressure associated with the touch or slide operation.
- the multimedia component 708 includes a front camera and/or a rear camera. When the device 700 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
- the audio component 710 is configured to output and/or input an audio signal.
- audio component 710 includes a microphone (MIC) that is configured to receive an external audio signal when device 700 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
- the received audio signal may be further stored in memory 704 or transmitted via communication component 716.
- audio component 710 also includes a speaker for outputting an audio signal.
- the I/O interface 712 provides an interface between the processing component 702 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
- Sensor assembly 714 includes one or more sensors for providing device 700 with various aspects of status assessment.
- sensor component 714 can detect an open/closed state of device 700, relative positioning of components, such as a display and a keypad of device 700, and sensor component 714 can also detect a change in position of device 700 or a component of device 700, user The presence or absence of contact with device 700, device 700 orientation or acceleration/deceleration and temperature variation of device 700.
- Sensor assembly 714 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
- Sensor component 714 can also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
- the sensor component 714 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
- Communication component 716 is configured to facilitate wired or wireless communication between device 700 and other devices.
- the device 700 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or The combination.
- communication component 716 receives broadcast signals or broadcast associated information from an external broadcast management system via a broadcast channel.
- communication component 716 also includes a near field communication (NFC) module to facilitate short range communication.
- NFC near field communication
- the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
- RFID radio frequency identification
- IrDA infrared data association
- UWB ultra-wideband
- Bluetooth Bluetooth
- apparatus 700 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the above described audio information identification method.
- ASICs application specific integrated circuits
- DSPs digital signal processors
- DSPDs digital signal processing devices
- PLDs programmable logic devices
- FPGA field programmable A gate array
- controller microcontroller, microprocessor or other electronic component implementation for performing the above described audio information identification method.
- non-transitory computer readable storage medium comprising instructions, such as a memory 704 comprising instructions executable by processor 718 of apparatus 700 to perform the above described audio information identification method.
- the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Signal Processing (AREA)
- General Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- Library & Information Science (AREA)
- Multimedia (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Databases & Information Systems (AREA)
- Data Mining & Analysis (AREA)
- Social Psychology (AREA)
- Artificial Intelligence (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Computer Networks & Wireless Communication (AREA)
- User Interface Of Digital Computer (AREA)
- Information Transfer Between Computers (AREA)
- Stereophonic System (AREA)
- Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
- Reverberation, Karaoke And Other Acoustics (AREA)
Abstract
Description
Claims (13)
- 一种音频信息识别方法,其特征在于,所述方法包括:对正在播放的音频进行识别,得到所述音频的音频信息;在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
- 根据权利要求1所述的方法,其特征在于,所述对正在播放的音频进行识别,得到所述音频的音频信息,包括:对所述音频进行识别,得到所述音频的音频特征,所述音频特征与所述音频的文本信息和身份信息中前一种或全部两种相关;将所述音频特征发送给服务器,所述音频特征用于触发所述服务器查找到与所述音频特征匹配的音频信息,并反馈查找到的所述音频信息;接收所述服务器反馈的所述音频信息。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:每隔预定时间间隔获取所述正在播放的音频;或,接收对正在播放的音频进行识别的识别指令,获取所述正在播放的音频。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:在所述信息展示界面上显示与所述音频对应的完整音频的播放链接和下载链接;当所述播放链接被触发时,播放所述完整音频;当所述下载链接被触发时,下载所述完整音频。
- 根据权利要求1所述的方法,其特征在于,所述方法还包括:在所述信息展示界面内显示与所述音频信息中的关键词对应的搜索控件;当一个关键词的搜索控件被触发时,显示所述关键词的搜索界面,所述搜索界面内显示有与所述关键词对应的搜索结果。
- 根据权利要求1至5任一所述的方法,其特征在于,所述方法还包括:在显示所述跳转链接后,自动将所述音频信息和所述跳转链接保存至预存列表中;或,接收用于指示保存所述音频信息和所述跳转链接的保存指令,将所述音频信息和所述跳转链接保存至预存列表中。
- 一种音频信息识别装置,其特征在于,所述装置包括:识别模块,被配置为对正在播放的音频进行识别,得到所述音频的音频信息;第一显示模块,被配置为在信息展示界面上显示为所述识别模块识别得到的所述音频信息中的关键词设置的跳转链接;第二显示模块,被配置为当所述第一显示模块显示的所述跳转链接被触发时,显示与所述关键词对应的预存信息。
- 根据权利要求7所述的装置,其特征在于,所述识别模块,包括:识别子模块,被配置为对所述音频进行识别,得到所述音频的音频特征,所述音频特征与所述音频的文本信息和身份信息中前一种或全部两种相关;发送子模块,被配置为将所述识别子模块识别得到的所述音频特征发送给服务器,所述音频特征用于触发所述服务器查找到与所述音频特征匹配的音频信息,并反馈查找到的所述音频信息;接收子模块,被配置为接收所述服务器反馈的所述音频信息。
- 根据权利要求7所述的装置,其特征在于,所述装置还包括:第一获取模块,被配置为每隔预定时间间隔获取所述正在播放的音频;或,第二获取模块,被配置为接收对正在播放的音频进行识别的识别指令,获取所述正在播放的音频。
- 根据权利要求7所述的装置,其特征在于,所述装置还包括:第三显示模块,被配置为在所述信息展示界面上显示与所述音频对应的完整音频的播放链接和下载链接;播放模块,被配置为当所述第三显示模块显示的所述播放链接被触发时,播放所述完整音频;下载模块,被配置为当所述第三显示模块显示的所述下载链接被触发时,下载所述完整音频。
- 根据权利要求7所述的装置,其特征在于,所述装置还包括:第四显示模块,被配置为在所述信息展示界面内显示与所述音频信息中的关键词对应的搜索控件;第五显示模块,被配置为当所述第四显示模块显示的一个关键词的搜索控件被触发时,显示所述关键词的搜索界面,所述搜索界面内显示有与所述关键词对应的搜索结果。
- 根据权利要求7至11任一所述的装置,其特征在于,所述装置还包括:第一保存模块,被配置为在显示所述跳转链接后,自动将所述音频信息和所述跳转链接保存至预存列表中;或,第二保存模块,被配置为接收用于指示保存所述音频信息和所述跳转链接的保存指令,将所述音频信息和所述跳转链接保存至预存列表中。
- 一种音频信息识别装置,其特征在于,所述装置包括:处理器;用于存储所述处理器可执行指令的存储器;其中,所述处理器被配置为:对正在播放的音频进行识别,得到所述音频的音频信息;在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
Priority Applications (4)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2017512096A JP6236189B2 (ja) | 2015-04-15 | 2015-11-19 | オーディオ情報識別方法及び装置 |
MX2016002658A MX359479B (es) | 2015-04-15 | 2015-11-19 | Método y aparato para identificar información de audio. |
KR1020167001534A KR20160132808A (ko) | 2015-04-15 | 2015-11-19 | 오디오정보식별방법 및 장치 |
RU2016108039A RU2634696C2 (ru) | 2015-04-15 | 2015-11-19 | Способ и устройство для идентификации аудиоинформации |
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201510178987.0A CN104820678B (zh) | 2015-04-15 | 2015-04-15 | 音频信息识别方法及装置 |
CN201510178987.0 | 2015-04-15 |
Publications (1)
Publication Number | Publication Date |
---|---|
WO2016165325A1 true WO2016165325A1 (zh) | 2016-10-20 |
Family
ID=53730975
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
PCT/CN2015/095034 WO2016165325A1 (zh) | 2015-04-15 | 2015-11-19 | 音频信息识别方法及装置 |
Country Status (8)
Country | Link |
---|---|
US (1) | US20160306880A1 (zh) |
EP (1) | EP3082280B1 (zh) |
JP (1) | JP6236189B2 (zh) |
KR (1) | KR20160132808A (zh) |
CN (1) | CN104820678B (zh) |
MX (1) | MX359479B (zh) |
RU (1) | RU2634696C2 (zh) |
WO (1) | WO2016165325A1 (zh) |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110489573A (zh) * | 2019-07-30 | 2019-11-22 | 维沃移动通信有限公司 | 界面显示方法及电子设备 |
Families Citing this family (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104820678B (zh) * | 2015-04-15 | 2018-10-19 | 小米科技有限责任公司 | 音频信息识别方法及装置 |
CN105005631A (zh) * | 2015-08-24 | 2015-10-28 | 安徽味唯网络科技有限公司 | 一种高精度搜索的方法 |
CN105357588A (zh) * | 2015-11-03 | 2016-02-24 | 腾讯科技(深圳)有限公司 | 数据显示方法及终端 |
CN114464186A (zh) * | 2016-07-28 | 2022-05-10 | 北京小米移动软件有限公司 | 关键词确定方法及装置 |
CN106341728A (zh) * | 2016-10-21 | 2017-01-18 | 北京巡声巡影科技服务有限公司 | 一种视频中的产品信息展示方法、装置和系统 |
CN106851362A (zh) * | 2016-12-15 | 2017-06-13 | 咪咕音乐有限公司 | 一种多媒体内容的播放方法及装置 |
CN106599274A (zh) * | 2016-12-23 | 2017-04-26 | 珠海市魅族科技有限公司 | 识别播放音源的装置及方法 |
CN106897435A (zh) * | 2017-02-28 | 2017-06-27 | 深圳天珑无线科技有限公司 | 终端控制方法及装置 |
CN107040587A (zh) * | 2017-03-02 | 2017-08-11 | 广州小鹏汽车科技有限公司 | 一种车载电台音乐内容获取方法及装置 |
CN107959751A (zh) * | 2017-11-14 | 2018-04-24 | 优酷网络技术(北京)有限公司 | 音频播放方法及装置 |
US20190206102A1 (en) * | 2017-12-29 | 2019-07-04 | Facebook, Inc. | Systems and methods for enhancing content |
CN111723235B (zh) * | 2019-03-19 | 2023-09-26 | 百度在线网络技术(北京)有限公司 | 音乐内容识别方法、装置及设备 |
CN112148754A (zh) * | 2020-09-01 | 2020-12-29 | 腾讯音乐娱乐科技(深圳)有限公司 | 一种歌曲识别方法和装置 |
US20230142904A1 (en) * | 2021-11-09 | 2023-05-11 | Honda Motor Co., Ltd. | Creation of notes for items of interest mentioned in audio content |
EP4213145A1 (en) * | 2022-01-14 | 2023-07-19 | Vestel Elektronik Sanayi ve Ticaret A.S. | Device and method for triggering a music identification application |
Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20040133558A1 (en) * | 2003-01-06 | 2004-07-08 | Masterwriter, Inc. | Information management system plus |
US7752546B2 (en) * | 2001-06-29 | 2010-07-06 | Thomson Licensing | Method and system for providing an acoustic interface |
CN102868822A (zh) * | 2012-09-24 | 2013-01-09 | 广东欧珀移动通信有限公司 | 一种移动终端实施的歌词显示方法 |
CN103096249A (zh) * | 2011-10-28 | 2013-05-08 | M&Service株式会社 | 内容同时播放终端、其系统以及同时播放方法 |
CN103442083A (zh) * | 2013-09-10 | 2013-12-11 | 百度在线网络技术(北京)有限公司 | 音频文件传输关联内容的方法、系统、客户端和服务器 |
CN104820678A (zh) * | 2015-04-15 | 2015-08-05 | 小米科技有限责任公司 | 音频信息识别方法及装置 |
Family Cites Families (18)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3919479A (en) * | 1972-09-21 | 1975-11-11 | First National Bank Of Boston | Broadcast signal identification system |
US7171018B2 (en) * | 1995-07-27 | 2007-01-30 | Digimarc Corporation | Portable devices and methods employing digital watermarking |
US6317784B1 (en) * | 1998-09-29 | 2001-11-13 | Radiowave.Com, Inc. | Presenting supplemental information for material currently and previously broadcast by a radio station |
US7028082B1 (en) * | 2001-03-08 | 2006-04-11 | Music Choice | Personalized audio system and method |
US6964023B2 (en) * | 2001-02-05 | 2005-11-08 | International Business Machines Corporation | System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input |
US20060184960A1 (en) * | 2005-02-14 | 2006-08-17 | Universal Music Group, Inc. | Method and system for enabling commerce from broadcast content |
CN1983253A (zh) * | 2005-12-15 | 2007-06-20 | 北京中科信利技术有限公司 | 一种提供音乐搜索服务的方法、设备和系统 |
US7787697B2 (en) * | 2006-06-09 | 2010-08-31 | Sony Ericsson Mobile Communications Ab | Identification of an object in media and of related media objects |
ES2433966T3 (es) * | 2006-10-03 | 2013-12-13 | Shazam Entertainment, Ltd. | Método para caudal alto de identificación de contenido de radiodifusión distribuido |
WO2009042697A2 (en) * | 2007-09-24 | 2009-04-02 | Skyclix, Inc. | Phone-based broadcast audio identification |
US20100057781A1 (en) * | 2008-08-27 | 2010-03-04 | Alpine Electronics, Inc. | Media identification system and method |
CN101635002A (zh) * | 2009-08-21 | 2010-01-27 | 深圳市五巨科技有限公司 | 一种移动终端音乐搜索的方法和装置 |
US9264785B2 (en) * | 2010-04-01 | 2016-02-16 | Sony Computer Entertainment Inc. | Media fingerprinting for content determination and retrieval |
US8694533B2 (en) * | 2010-05-19 | 2014-04-08 | Google Inc. | Presenting mobile content based on programming context |
US8158870B2 (en) * | 2010-06-29 | 2012-04-17 | Google Inc. | Intervalgram representation of audio for melody recognition |
KR20120069908A (ko) * | 2010-12-21 | 2012-06-29 | 삼성전자주식회사 | 휴대단말기의 정보제공 장치 및 방법 |
US20150286873A1 (en) * | 2014-04-03 | 2015-10-08 | Bruce L. Davis | Smartphone-based methods and systems |
CN103685520A (zh) * | 2013-12-13 | 2014-03-26 | 深圳Tcl新技术有限公司 | 基于语音识别的歌曲推送的方法和装置 |
-
2015
- 2015-04-15 CN CN201510178987.0A patent/CN104820678B/zh active Active
- 2015-11-19 JP JP2017512096A patent/JP6236189B2/ja active Active
- 2015-11-19 MX MX2016002658A patent/MX359479B/es active IP Right Grant
- 2015-11-19 WO PCT/CN2015/095034 patent/WO2016165325A1/zh active Application Filing
- 2015-11-19 KR KR1020167001534A patent/KR20160132808A/ko not_active Application Discontinuation
- 2015-11-19 RU RU2016108039A patent/RU2634696C2/ru active
-
2016
- 2016-03-24 US US15/080,329 patent/US20160306880A1/en not_active Abandoned
- 2016-04-08 EP EP16164556.9A patent/EP3082280B1/en active Active
Patent Citations (6)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7752546B2 (en) * | 2001-06-29 | 2010-07-06 | Thomson Licensing | Method and system for providing an acoustic interface |
US20040133558A1 (en) * | 2003-01-06 | 2004-07-08 | Masterwriter, Inc. | Information management system plus |
CN103096249A (zh) * | 2011-10-28 | 2013-05-08 | M&Service株式会社 | 内容同时播放终端、其系统以及同时播放方法 |
CN102868822A (zh) * | 2012-09-24 | 2013-01-09 | 广东欧珀移动通信有限公司 | 一种移动终端实施的歌词显示方法 |
CN103442083A (zh) * | 2013-09-10 | 2013-12-11 | 百度在线网络技术(北京)有限公司 | 音频文件传输关联内容的方法、系统、客户端和服务器 |
CN104820678A (zh) * | 2015-04-15 | 2015-08-05 | 小米科技有限责任公司 | 音频信息识别方法及装置 |
Cited By (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110489573A (zh) * | 2019-07-30 | 2019-11-22 | 维沃移动通信有限公司 | 界面显示方法及电子设备 |
Also Published As
Publication number | Publication date |
---|---|
MX2016002658A (es) | 2017-04-27 |
US20160306880A1 (en) | 2016-10-20 |
CN104820678A (zh) | 2015-08-05 |
KR20160132808A (ko) | 2016-11-21 |
CN104820678B (zh) | 2018-10-19 |
MX359479B (es) | 2018-09-28 |
EP3082280A1 (en) | 2016-10-19 |
EP3082280B1 (en) | 2018-07-25 |
JP2017517828A (ja) | 2017-06-29 |
RU2634696C2 (ru) | 2017-11-03 |
RU2016108039A (ru) | 2017-09-07 |
JP6236189B2 (ja) | 2017-11-22 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
WO2016165325A1 (zh) | 音频信息识别方法及装置 | |
US11206448B2 (en) | Method and apparatus for selecting background music for video shooting, terminal device and medium | |
TWI667917B (zh) | Multimedia search result display method and device | |
CN104166689B (zh) | 电子书籍的呈现方法及装置 | |
US20220391060A1 (en) | Methods for displaying and providing multimedia resources | |
WO2015196709A1 (zh) | 信息获取方法及装置 | |
CN109657236B (zh) | 引导信息获取方法、装置、电子装置及存储介质 | |
CN105095427A (zh) | 搜索推荐方法和装置 | |
CN108334623B (zh) | 歌曲的显示方法、装置和系统 | |
CN105068976A (zh) | 票务信息展示方法及装置 | |
CN107229403B (zh) | 一种信息内容选择方法及装置 | |
CN105139848B (zh) | 数据转换方法和装置 | |
CN107342082A (zh) | 基于语音交互的音频处理方法、装置及音频播放设备 | |
WO2018188414A1 (zh) | 搜索结果显示方法及装置 | |
CN111061906A (zh) | 音乐信息处理方法、装置、电子设备及计算机可读存储介质 | |
CN112068711A (zh) | 一种输入法的信息推荐方法、装置和电子设备 | |
CN109862421A (zh) | 一种视频信息识别方法、装置、电子设备及存储介质 | |
US20220208156A1 (en) | Method for generating song melody and electronic device | |
CN106020766A (zh) | 音乐播放的方法及装置 | |
WO2016197549A1 (zh) | 一种进行搜索的方法和装置 | |
CN111540361B (zh) | 一种语音处理方法、装置和介质 | |
WO2020224570A1 (zh) | 交互方法及装置、音箱、电子设备和存储介质 | |
CN112988956B (zh) | 自动生成对话的方法及装置、信息推荐效果检测方法及装置 | |
WO2019196527A1 (zh) | 一种数据处理方法、装置和电子设备 | |
CN106060253B (zh) | 信息呈现的方法及装置 |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
ENP | Entry into the national phase |
Ref document number: 20167001534 Country of ref document: KR Kind code of ref document: A Ref document number: 2017512096 Country of ref document: JP Kind code of ref document: A |
|
WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2016/002658 Country of ref document: MX |
|
ENP | Entry into the national phase |
Ref document number: 2016108039 Country of ref document: RU Kind code of ref document: A |
|
REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112016004835 Country of ref document: BR |
|
121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 15889021 Country of ref document: EP Kind code of ref document: A1 |
|
ENP | Entry into the national phase |
Ref document number: 112016004835 Country of ref document: BR Kind code of ref document: A2 Effective date: 20160304 |
|
NENP | Non-entry into the national phase |
Ref country code: DE |
|
122 | Ep: pct application non-entry in european phase |
Ref document number: 15889021 Country of ref document: EP Kind code of ref document: A1 |