WO2016165325A1 - 音频信息识别方法及装置 - Google Patents

音频信息识别方法及装置 Download PDF

Info

Publication number
WO2016165325A1
WO2016165325A1 PCT/CN2015/095034 CN2015095034W WO2016165325A1 WO 2016165325 A1 WO2016165325 A1 WO 2016165325A1 CN 2015095034 W CN2015095034 W CN 2015095034W WO 2016165325 A1 WO2016165325 A1 WO 2016165325A1
Authority
WO
WIPO (PCT)
Prior art keywords
audio
information
keyword
audio information
link
Prior art date
Application number
PCT/CN2015/095034
Other languages
English (en)
French (fr)
Inventor
吕露
李棽
郭涛
Original Assignee
小米科技有限责任公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 小米科技有限责任公司 filed Critical 小米科技有限责任公司
Priority to JP2017512096A priority Critical patent/JP6236189B2/ja
Priority to MX2016002658A priority patent/MX359479B/es
Priority to KR1020167001534A priority patent/KR20160132808A/ko
Priority to RU2016108039A priority patent/RU2634696C2/ru
Publication of WO2016165325A1 publication Critical patent/WO2016165325A1/zh

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/683Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using metadata automatically derived from the content
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/16Sound input; Sound output
    • G06F3/165Management of the audio stream, e.g. setting of volume, audio stream path
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F40/00Handling natural language data
    • G06F40/10Text processing
    • G06F40/12Use of codes for handling textual entities
    • G06F40/134Hyperlinking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/29Arrangements for monitoring broadcast services or broadcast-related services
    • H04H60/33Arrangements for monitoring the users' behaviour or opinions
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/35Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users
    • H04H60/37Arrangements for identifying or recognising characteristics with a direct linkage to broadcast information or to broadcast space-time, e.g. for identifying broadcast stations or for identifying users for identifying segments of broadcast information, e.g. scenes or extracting programme ID
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/61Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54
    • H04H60/65Arrangements for services using the result of monitoring, identification or recognition covered by groups H04H60/29-H04H60/54 for using the result on users' side
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04HBROADCAST COMMUNICATION
    • H04H60/00Arrangements for broadcast applications with a direct linking to broadcast information or broadcast space-time; Broadcast-related systems
    • H04H60/68Systems specially adapted for using specific information, e.g. geographical or meteorological information
    • H04H60/73Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information
    • H04H60/74Systems specially adapted for using specific information, e.g. geographical or meteorological information using meta-information using programme related information, e.g. title, composer or interpreter
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/40Support for services or applications
    • H04L65/402Support for services or applications wherein the services involve a main real-time session and one or more additional parallel non-real time sessions, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services
    • H04L65/4025Support for services or applications wherein the services involve a main real-time session and one or more additional parallel non-real time sessions, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services where none of the additional parallel sessions is real time or time sensitive, e.g. downloading a file in a parallel FTP session, initiating an email or combinational services

Definitions

  • the present disclosure relates to the field of audio recognition technologies, and in particular, to an audio information identification method and apparatus.
  • some applications can recognize the song name, singer and lyrics of the listened song, and display the recognized song name, singer and lyrics to the user.
  • the present disclosure provides an audio information identification method and apparatus.
  • the technical solution is as follows:
  • an audio information identifying method comprising:
  • an audio information identifying apparatus comprising:
  • An identification module configured to identify the audio being played to obtain audio information of the audio
  • a first display module configured to display, on the information display interface, a jump link set for a keyword in the audio information identified by the identification module;
  • a second display module configured to be triggered when the jump link displayed by the first display module is triggered At the time, pre-stored information corresponding to the keyword is displayed.
  • an audio information identifying apparatus comprising:
  • a memory for storing the processor executable instructions
  • processor is configured to:
  • FIG. 1 is a flowchart of an audio information identification method according to an exemplary embodiment
  • FIG. 2A is a flowchart of an audio information identification method according to another exemplary embodiment
  • 2B is a flowchart of a method for acquiring audio information, according to an exemplary embodiment
  • FIG. 2C is a schematic diagram showing displaying audio information and a jump link according to an exemplary embodiment
  • 2D is a schematic diagram showing a display jump page according to an exemplary embodiment
  • FIG. 3A is a flowchart illustrating a method of playing or downloading audio that is listened to, according to an exemplary embodiment
  • FIG. 3B is a diagram showing a play link and a download link for displaying audio according to an exemplary embodiment. schematic diagram
  • FIG. 4A is a flowchart of a method for searching for keywords in audio information, according to an exemplary embodiment
  • FIG. 4B is a schematic diagram showing a search result corresponding to a keyword according to an exemplary embodiment
  • FIG. 5 is a block diagram of an audio information identifying apparatus according to an exemplary embodiment
  • FIG. 6 is a block diagram of an audio information identifying apparatus according to another exemplary embodiment
  • FIG. 7 is a block diagram of an apparatus for identifying audio information, according to an exemplary embodiment.
  • FIG. 1 is a flowchart of an audio information identification method according to an exemplary embodiment.
  • the audio information identification method may be applied to an electronic device, where the electronic device may be a smart phone, a tablet, or Smart TVs, e-book readers, multimedia players, laptop portable computers and desktop computers, to name a few.
  • the audio information identifying method may include the following steps.
  • step 101 the audio being played is identified to obtain audio information of the audio.
  • step 102 a jump link set for the keyword in the audio information is displayed on the information display interface.
  • step 103 when the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
  • the audio information identifying method obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
  • FIG. 2A is a flowchart of a method for identifying an audio information according to another exemplary embodiment.
  • the method for identifying an audio information may be applied to an electronic device, which may be a smart phone or a tablet. , smart TV, e-book reader, multimedia player, laptop portable computer and desktop computer, and so on.
  • the audio information identifying method may include the following steps. Before recognizing the audio being played, the electronic device needs to acquire the audio being played. In order to meet different requirements, the manner in which the electronic device acquires the audio being played also needs to be adjusted accordingly. Please refer to step 201 and step 202 below.
  • step 201 the audio being played is acquired every predetermined time interval.
  • the audio being played here may be the audio played by the electronic device after listening to the radio broadcast, or the audio being played by other devices around the electronic device, and the electronic device may acquire the other device that is playing the audio. Audio.
  • the audio can be music audio, language program audio, or book audio.
  • the electronic device can acquire the audio being played every predetermined time interval, and the predetermined time interval can be set by the user, for example, the predetermined time interval can be set to 3 minutes, 4 minutes, or 5 minutes, and the like.
  • the electronic device may acquire the audio being played after detecting that the current audio tempo changes more than a predetermined threshold.
  • a predetermined threshold For example, in the case of music audio, there is usually an interval after the completion of a song and before the next song is played, and the rhythm of the audio is quite different from the rhythm when playing the song. Therefore, when the electronic device detects that the rhythm of the current audio exceeds a predetermined threshold change, it indicates that the played song has been switched, and the audio acquired by the electronic device at this time is the audio of the switched song.
  • step 202 an identification command for identifying the audio being played is received, and the audio being played is acquired.
  • the electronic device can acquire the audio being played after receiving the user-triggered recognition command for recognizing the audio being played.
  • the user when the user is listening to the radio broadcast using the electronic device, it is found that the currently played audio is very nice, and it is desirable to obtain related information of the audio. At this time, the user can trigger generation on the electronic device to generate the pair. An identification instruction that the audio recognizes, and the electronic device receives the After the instruction is recognized, the audio being played is obtained.
  • the user when other devices are playing audio, and the user wants to obtain information about the audio that the other device is playing, the user can turn on the held electronic device and trigger a generated pair on the electronic device.
  • the recognition command for the audio being played is recognized, and the electronic device acquires the audio being played after receiving the recognition command.
  • the identification control on the electronic device may be triggered to generate the identification instruction, or the specified hardware (such as the volume) triggered by the electronic device may be triggered. Key) to generate an identification command.
  • step 203 the audio being played is identified to obtain audio information of the audio.
  • FIG. 2B is a flowchart of a method for acquiring audio information according to an exemplary embodiment.
  • step 203A the audio is identified to obtain an audio feature of the audio, the audio feature being associated with the first or both of the textual and identity information of the audio.
  • the electronic device recognizes the acquired audio being played to obtain an audio feature of the audio.
  • the audio feature is related to textual information, intonation features, or tonal features that appear in the audio. If the audio is recognized by voiceprint recognition technology, the audio feature is also related to the identity information of the audio. For example, when the acquired audio is music audio, the text information obtained by the recognition is the lyrics corresponding to the acquired audio, and the identity information obtained by the voiceprint recognition is the singer corresponding to the audio; when the acquired audio is a language class When the program audio is obtained, the text information obtained by the recognition is the program content corresponding to the acquired audio, and the identity information obtained by the voiceprint recognition is the performing artist corresponding to the audio.
  • step 203B the audio feature is sent to the server, and the audio feature is used to trigger the server to find the audio information that matches the audio feature, and feed back the found audio information.
  • the electronic device sends the obtained audio feature to the server, and the server can find the audio information that matches the audio feature according to the pre-stored database, and the server feeds back the audio information to the electronic device after finding the audio information that matches the audio feature.
  • the audio information may include owner information of the audio corresponding to the audio feature, an audio name corresponding to the audio, and the like.
  • the audio information may include a song name, an album name, a singer, a lyric, etc.; when the audio being played is a language program In audio, the audio information may include a program name, a performer, and the like; when the audio being played is a book audio, the audio information may include a book author, a book name, a chapter directory, and the like.
  • step 203C the audio information fed back by the server is received.
  • step 204 a jump link set for the keyword in the audio information is displayed on the information display interface.
  • the electronic device may set a jump link for the keywords in the audio information, so that the user can obtain more information by jumping.
  • the keyword here may be a keyword capable of indicating the main feature of the audio, for example, when the audio being played is music audio, the keyword may be a song name, a singer and an album name, etc.; when the audio being played is a language class In the program audio, the keywords may be the program name and the performer, etc.; when the audio being played is the book audio, the keywords may be the book author and the book name.
  • FIG. 2C is a schematic diagram of displaying audio information and a jump link according to an exemplary embodiment.
  • 2C takes audio as music audio as an example, and the audio information received by the electronic device is “song name: “song A”, “singer: singer A”, “album: “album A”), and lyrics corresponding to song A. .
  • the electronic device has a jump link for "Song A”, “Singer A”, and “Album A”, respectively, and "Song Name: "Song A”", “Singer: Singer A”, “Album: “Album A” and the lyrics corresponding to song A are displayed on the information display interface.
  • step 205 when the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
  • the electronic device displays the pre-stored information corresponding to the keyword, and the pre-stored information is usually the detailed information of the pre-stored keyword.
  • the electronic device jumps to display the details page of the artist; when the audio being played is the language program audio, and the program name jumps
  • the transfer link is triggered, the electronic device jumps to display a detailed introduction page of the program; when the audio being played is a book audio, and the jump link of the book author is triggered, the electronic device jumps to display the page of the book author column .
  • FIG. 2D is a schematic diagram of displaying a jump page according to an exemplary embodiment. 2D still uses audio as music audio as an example.
  • the electronic device jumps to display the details of the singer A.
  • the audio information and the jump link may be For row corresponding storage, please refer to step 206 and step 207 below.
  • step 206 after the jump link is displayed, the audio information and the jump link are automatically saved to the pre-stored list.
  • the electronic device displays the jump link and the audio information set in the audio information on the information display interface
  • the audio information and the jump link can be automatically saved to the pre-stored list. Users can find saved audio information by viewing the pre-stored list.
  • step 207 a save instruction for instructing to save the audio information and the jump link is received, and the audio information and the jump link are saved in the pre-stored list.
  • the user may be inquired whether to save the audio information and the jump link, when receiving the instruction for saving the audio information and jumping
  • the transfer instruction of the link is saved, and the audio information and the jump link are saved to the pre-stored list.
  • the electronic device may display a save control for saving the audio information and the jump link on the information display interface, and when the electronic device detects that the save control is triggered, save the corresponding audio information and the jump link to the pre-stored List.
  • the user when the user's in-vehicle system is listening to the radio station and a song is being played, the user can identify the audio being played by using an in-vehicle system or a handheld smart phone, such as an in-vehicle system or a smart phone.
  • the audio information of the audio, and the jump link set by the keyword in the audio information may be displayed on the information display interface, and the user may display the pre-stored information corresponding to the keyword after triggering the jump link.
  • the jump link and the audio information can be automatically changed.
  • the user can also trigger a save control for saving audio information and a jump link.
  • the device such as the in-vehicle system or the handheld smart phone receives the save command generated by the user-triggered save control, the audio information and the jump link are saved. Go to the pre-existing list for easy viewing by the user.
  • the audio information identifying method obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the solution can only be solved in a single interface.
  • the information of the audio is displayed, and the information that can be displayed is relatively small; the effect of improving the richness of the audio information is achieved.
  • FIG. 3A is a flowchart of a method for playing or downloading the listened audio, according to an exemplary embodiment.
  • step 301 a play link and a download link of the complete audio corresponding to the audio are displayed on the information display interface.
  • the electronic device can acquire a play link and a download link of the complete audio corresponding to the audio according to the acquired audio information, and display the play link and the download link in the information display interface.
  • a play link and a download link of the song corresponding to the song name may be displayed; when the obtained audio information has a program name, the language category corresponding to the program name may be displayed.
  • the play link and the download link of the program audio; when there is a book name in the obtained audio information, the play link and the download link of the book audio corresponding to the book name may be displayed.
  • FIG. 3B is a schematic diagram showing a play link and a download link for displaying audio according to an exemplary embodiment.
  • the electronic device displays a play link 311 for playing song A and a download link 322 for downloading song A on the information display interface.
  • the electronic device may further display a download link for downloading the book, and a link for reading the book online; when the obtained audio information has a program name, And when there is a program video corresponding to the program name, the electronic device may further display a download link for downloading the program video, and a play link for playing the program video.
  • step 302 the full audio is played when the play link is triggered.
  • step 303 when the download link is triggered, the complete audio is downloaded.
  • the electronic device plays the complete audio corresponding to the acquired audio when detecting that the play link is triggered;
  • the electronic device downloads the complete audio corresponding to the acquired audio when it detects that the download link is triggered.
  • the embodiment of the present disclosure displays a complete audio play link and a download link corresponding to the audio on the information display interface, and plays the complete audio when the play link is triggered, and downloads when the download link is triggered.
  • the complete audio since the play link and the download link can be provided on the information display interface, when the user wants to enjoy or collect the listened audio again, the user needs to open the corresponding program to search and then play or download the audio, and the operation steps are complicated. Problem; achieved the effect of simplifying the operation steps and improving the operation efficiency.
  • FIG. 4A it is a flowchart of a method for searching for keywords in audio information according to an exemplary embodiment.
  • step 401 a search control corresponding to the keyword in the audio information is displayed in the information display interface.
  • the electronic device can display the search control corresponding to the keyword in the audio information in the information display interface.
  • step 402 when a search control of a keyword is triggered, a search interface of the keyword is displayed, and a search result corresponding to the keyword is displayed in the search interface.
  • the electronic device detects that the search control of a keyword in the information display interface is triggered, the search interface of the keyword is displayed, and the search result corresponding to the keyword is displayed in the search interface.
  • FIG. 4B is a schematic diagram showing a search result corresponding to a keyword according to an exemplary embodiment.
  • 4B is an example in which the acquired audio is music audio.
  • the electronic device detects that the search control 411 of the singer A is triggered, the electronic device displays a search interface corresponding to the singer A, and displays search results corresponding to the singer A in the search interface. .
  • the embodiment of the present disclosure displays a search interface of the keyword when the search control of a keyword is triggered, and the search result corresponding to the keyword is displayed in the search interface;
  • the search control for searching for keywords is displayed on the display, so that the problem that it is necessary to open other applications for searching and having many operation steps is solved; the effect of improving the operation efficiency is achieved.
  • steps in FIG. 2A and FIG. 3A may be combined into one embodiment.
  • the steps in FIG. 2A and FIG. 4A may be combined into one embodiment, and the steps in FIG. 2A, FIG. 3A and FIG. Combined into one embodiment.
  • FIG. 5 is a block diagram of an audio information identifying apparatus according to an exemplary embodiment.
  • the audio information identifying apparatus may be applied to an electronic device, which may be a smart phone, a tablet, or an intelligent device. TV, e-book reader, multimedia player, laptop portable computer and desktop computer, etc.
  • the audio information identifying device may include, but is not limited to, an identification module 501, a first display module 502, and a second display module 503.
  • the identification module 501 is configured to identify the audio being played to obtain audio information of the audio.
  • the first display module 502 is configured to display, on the information display interface, a jump link set by the keyword in the audio information recognized by the recognition module 501.
  • the second display module 503 is configured to display pre-stored information corresponding to the keyword when the jump link displayed by the first display module 502 is triggered.
  • the audio information identifying apparatus obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
  • FIG. 6 is a block diagram of an audio information identifying apparatus, as shown in FIG. 6, the audio information identifying apparatus may be applied to an electronic device, and the electronic device may be a smart phone, a tablet, Smart TVs, e-book readers, multimedia players, laptop portable computers and desktop computers, to name a few.
  • the audio information identifying apparatus may include, but is not limited to, an identification module 601, a first display module 602, and a second display module 603.
  • the identification module 601 is configured to identify the audio being played to obtain audio of the audio information.
  • the first display module 602 is configured to display, on the information display interface, a jump link set by the keyword in the audio information identified by the identification module 601.
  • the second display module 603 is configured to display pre-stored information corresponding to the keyword when the jump link displayed by the first display module 602 is triggered.
  • the identification module 601 can include: an identification sub-module 601a, a transmission sub-module 601b, and a receiving sub-module 601c.
  • the identification sub-module 601a is configured to identify the audio to obtain an audio feature of the audio, the audio feature being related to the first or both of the textual information and the identity information of the audio.
  • the sending sub-module 601b is configured to send the audio feature identified by the identifying sub-module 601a to the server, the audio feature is used to trigger the server to find the audio information that matches the audio feature, and feed back the found audio information.
  • the receiving submodule 601c is configured to receive audio information fed back by the server.
  • the audio information identifying apparatus may further include: a first obtaining module 604 or a second acquiring module 605.
  • the first obtaining module 604 is configured to acquire the audio being played every predetermined time interval.
  • the second obtaining module 605 is configured to receive an identification instruction for identifying the audio being played, and acquire the audio being played.
  • the audio information identifying apparatus may further include: a third display module 606, a playing module 607, and a downloading module 608.
  • the third display module 606 is configured to display a play link and a download link of the complete audio corresponding to the audio on the information display interface.
  • the play module 607 is configured to play the full audio when the play link displayed by the third display module 606 is triggered.
  • the download module 608 is configured to download the full audio when the download link displayed by the third display module 606 is triggered.
  • the audio information identifying apparatus may further include: a fourth display module 609 and a fifth display module 610.
  • the fourth display module 609 is configured to display a search control corresponding to the keyword in the audio information in the information display interface.
  • the fifth display module 610 is configured to display a search interface of the keyword when the search control of a keyword displayed by the fourth display module 609 is triggered, and the search result corresponding to the keyword is displayed in the search interface. .
  • the audio information identifying apparatus may further include: a first saving module 611 or a second saving module 612.
  • the first saving module 611 is configured to automatically save the audio information and the jump link to the pre-stored list after displaying the jump link.
  • the second saving module 612 is configured to receive a save instruction for instructing to save the audio information and the jump link, and save the audio information and the jump link to the pre-stored list.
  • the audio information identifying apparatus obtains the audio information of the audio by identifying the audio being played, and displays the jump link set by the keyword in the audio information, and When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed; since more information corresponding to the audio can be displayed by providing the jump link, the information that can only be displayed in a single interface and the information that can be displayed is solved. Relatively few problems; achieving the effect of increasing the richness of audio information.
  • the information display interface provides a play link and a download link, so that when the user wants to enjoy or collect the listened audio again, the user needs to open the corresponding program to search and then play or download the audio, and the operation steps are complicated; the simplified operation is achieved. Steps to improve the efficiency of the operation.
  • search control when a search control of a keyword is triggered, a search interface of the keyword is displayed, and a search result corresponding to the keyword is displayed in the search interface; since the search keyword can be displayed on the information display interface
  • the search control solves the problem that it is necessary to open other applications for searching and has many operation steps; the effect of improving the operation efficiency is achieved.
  • An exemplary embodiment of the present disclosure provides an audio information identifying apparatus capable of implementing the audio information identifying method provided by the present disclosure, the audio information identifying apparatus comprising: a processor, a memory for storing processor executable instructions;
  • processor is configured to:
  • the jump link When the jump link is triggered, the pre-stored information corresponding to the keyword is displayed.
  • FIG. 7 is a block diagram of an apparatus for identifying audio information, according to an exemplary embodiment.
  • device 700 can be a mobile phone, a computer, a digital broadcast terminal, a messaging device, a gaming console, a tablet device, a fitness device, a personal digital assistant, and the like.
  • apparatus 700 can include one or more of the following components: processing component 702, memory 704, power component 706, multimedia component 708, audio component 710, input/output (I/O) interface 712, sensor component 714, and Communication component 716.
  • Processing component 702 typically controls the overall operation of device 700, such as operations associated with display, telephone calls, data communications, camera operations, and recording operations.
  • Processing component 702 can include one or more processors 718 to execute instructions to perform all or part of the steps of the methods described above.
  • processing component 702 can include one or more modules to facilitate interaction between component 702 and other components.
  • processing component 702 can include a multimedia module to facilitate interaction between multimedia component 708 and processing component 702.
  • Memory 704 is configured to store various types of data to support operation at device 700. Examples of such data include instructions for any application or method operating on device 700, contact data, phone book data, messages, pictures, videos, and the like. Memory 704 can be implemented by any type of volatile or non-volatile storage device, or a combination thereof, such as static random access memory (SRAM), electrically erasable programmable read only memory (EEPROM), erasable Programmable Read Only Memory (EPROM), Programmable Read Only Memory (PROM), Read Only Memory (ROM), Magnetic Memory, Flash Memory, Disk or Optical Disk.
  • SRAM static random access memory
  • EEPROM electrically erasable programmable read only memory
  • EPROM erasable Programmable Read Only Memory
  • PROM Programmable Read Only Memory
  • ROM Read Only Memory
  • Magnetic Memory Flash Memory
  • Disk Disk or Optical Disk.
  • Power component 706 provides power to various components of device 700.
  • Power component 706 can include electricity The source management system, one or more power sources, and other components associated with generating, managing, and distributing power for the device 700.
  • the multimedia component 708 includes a screen between the device 700 and the user that provides an output interface.
  • the screen can include a liquid crystal display (LCD) and a touch panel (TP). If the screen includes a touch panel, the screen can be implemented as a touch screen to receive input signals from the user.
  • the touch panel includes one or more touch sensors to sense touches, slides, and gestures on the touch panel. The touch sensor can sense not only the boundaries of the touch or sliding action, but also the duration and pressure associated with the touch or slide operation.
  • the multimedia component 708 includes a front camera and/or a rear camera. When the device 700 is in an operation mode, such as a shooting mode or a video mode, the front camera and/or the rear camera can receive external multimedia data. Each front and rear camera can be a fixed optical lens system or have focal length and optical zoom capabilities.
  • the audio component 710 is configured to output and/or input an audio signal.
  • audio component 710 includes a microphone (MIC) that is configured to receive an external audio signal when device 700 is in an operational mode, such as a call mode, a recording mode, and a voice recognition mode.
  • the received audio signal may be further stored in memory 704 or transmitted via communication component 716.
  • audio component 710 also includes a speaker for outputting an audio signal.
  • the I/O interface 712 provides an interface between the processing component 702 and the peripheral interface module, which may be a keyboard, a click wheel, a button, or the like. These buttons may include, but are not limited to, a home button, a volume button, a start button, and a lock button.
  • Sensor assembly 714 includes one or more sensors for providing device 700 with various aspects of status assessment.
  • sensor component 714 can detect an open/closed state of device 700, relative positioning of components, such as a display and a keypad of device 700, and sensor component 714 can also detect a change in position of device 700 or a component of device 700, user The presence or absence of contact with device 700, device 700 orientation or acceleration/deceleration and temperature variation of device 700.
  • Sensor assembly 714 can include a proximity sensor configured to detect the presence of nearby objects without any physical contact.
  • Sensor component 714 can also include a light sensor, such as a CMOS or CCD image sensor, for use in imaging applications.
  • the sensor component 714 can also include an acceleration sensor, a gyro sensor, a magnetic sensor, a pressure sensor, or a temperature sensor.
  • Communication component 716 is configured to facilitate wired or wireless communication between device 700 and other devices.
  • the device 700 can access a wireless network based on a communication standard, such as Wi-Fi, 2G or 3G, or The combination.
  • communication component 716 receives broadcast signals or broadcast associated information from an external broadcast management system via a broadcast channel.
  • communication component 716 also includes a near field communication (NFC) module to facilitate short range communication.
  • NFC near field communication
  • the NFC module can be implemented based on radio frequency identification (RFID) technology, infrared data association (IrDA) technology, ultra-wideband (UWB) technology, Bluetooth (BT) technology, and other technologies.
  • RFID radio frequency identification
  • IrDA infrared data association
  • UWB ultra-wideband
  • Bluetooth Bluetooth
  • apparatus 700 may be implemented by one or more application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable A gate array (FPGA), controller, microcontroller, microprocessor or other electronic component implementation for performing the above described audio information identification method.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable A gate array
  • controller microcontroller, microprocessor or other electronic component implementation for performing the above described audio information identification method.
  • non-transitory computer readable storage medium comprising instructions, such as a memory 704 comprising instructions executable by processor 718 of apparatus 700 to perform the above described audio information identification method.
  • the non-transitory computer readable storage medium can be a ROM, a random access memory (RAM), a CD-ROM, a magnetic tape, a floppy disk, and an optical data storage device.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Library & Information Science (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Databases & Information Systems (AREA)
  • Data Mining & Analysis (AREA)
  • Social Psychology (AREA)
  • Artificial Intelligence (AREA)
  • Computational Linguistics (AREA)
  • Human Computer Interaction (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • User Interface Of Digital Computer (AREA)
  • Information Transfer Between Computers (AREA)
  • Stereophonic System (AREA)
  • Information Retrieval, Db Structures And Fs Structures Therefor (AREA)
  • Reverberation, Karaoke And Other Acoustics (AREA)

Abstract

一种音频信息识别方法及装置,属于音频识别技术领域。所述音频信息识别方法包括:对正在播放的音频进行识别,得到所述音频的音频信息(101);在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接(102);当所述跳转链接被触发时,显示与所述关键词对应的预存信息(103)。通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。

Description

音频信息识别方法及装置
本申请基于申请号为201510178987.0、申请日为2015年4月15日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此引入本申请作为参考。
技术领域
本公开涉及音频识别技术领域,特别涉及一种音频信息识别方法及装置。
背景技术
用户在收听电台广播时,经常无法得知正在收听到的音频的相关信息。
为了能够让用户了解收听到的音频的相关信息,部分应用程序可以识别出收听到的歌曲的歌曲名称、演唱者以及歌词,并将识别出的歌曲名称、演唱者以及歌词展示给用户。
发明内容
本公开提供一种音频信息识别方法及装置。所述技术方案如下:
根据本公开实施例的第一方面,提供一种音频信息识别方法,所述方法包括:
对正在播放的音频进行识别,得到所述音频的音频信息;
在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;
当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
根据本公开实施例的第二方面,提供一种音频信息识别装置,所述装置包括:
识别模块,被配置为对正在播放的音频进行识别,得到所述音频的音频信息;
第一显示模块,被配置为在信息展示界面上显示为所述识别模块识别得到的所述音频信息中的关键词设置的跳转链接;
第二显示模块,被配置为当所述第一显示模块显示的所述跳转链接被触发 时,显示与所述关键词对应的预存信息。
根据本公开实施例的第三方面,提供一种音频信息识别装置,所述装置包括:
处理器;
用于存储所述处理器可执行指令的存储器;
其中,所述处理器被配置为:
对正在播放的音频进行识别,得到所述音频的音频信息;
在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;
当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
本公开的实施例提供的技术方案可以包括以下有益效果:
通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。
应当理解的是,以上的一般描述和后文的细节描述仅是示例性的,并不能限制本公开。
附图说明
此处的附图被并入说明书中并构成本说明书的一部分,示出了符合本公开的实施例,并于说明书一起用于解释本公开的原理。
图1是根据一示例性实施例示出的一种音频信息识别方法的流程图;
图2A是根据另一示例性实施例示出的一种音频信息识别方法的流程图;
图2B是根据一示例性实施例示出的一种获取音频信息方法的流程图;
图2C是根据一示例性实施例示出的一种显示音频信息以及跳转链接的示意图;
图2D是根据一示例性实施例示出的一种显示跳转页面的示意图;
图3A是根据一示例性实施例示出的一种播放或下载收听到的音频的方法的流程图;
图3B是根据一示例性实施例示出的一种显示音频的播放链接和下载链接的 示意图;
图4A是根据一示例性实施例示出的一种音频信息中关键词的搜索方法的流程图;
图4B是根据一示例性实施例示出的一种显示与关键词对应的搜索结果的示意图;
图5是根据一示例性实施例示出的一种音频信息识别装置的框图;
图6是根据另一示例性实施例示出的一种音频信息识别装置的框图;
图7是根据一示例性实施例示出的一种用于识别音频信息的装置的框图。
具体实施方式
这里将详细地对示例性实施例进行说明,其示例表示在附图中。下面的描述涉及附图时,除非另有表示,不同附图中的相同数字表示相同或相似的要素。以下示例性实施例中所描述的实施方式并不代表与本公开相一致的所有实施方式。相反,它们仅是与如所附权利要求书中所详述的、本公开的一些方面相一致的装置和方法的例子。
图1是根据一示例性实施例示出的一种音频信息识别方法的流程图,如图1所示,该音频信息识别方法可应用于电子设备中,该电子设备可以是智能手机、平板电脑、智能电视、电子书阅读器、多媒体播放器、膝上型便携计算机和台式计算机等等。该音频信息识别方法可以包括以下步骤。
在步骤101中,对正在播放的音频进行识别,得到该音频的音频信息。
在步骤102中,在信息展示界面上显示为音频信息中的关键词设置的跳转链接。
在步骤103中,当跳转链接被触发时,显示与关键词对应的预存信息。
综上所述,本公开实施例中提供的音频信息识别方法,通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。
图2A是根据另一示例性实施例示出的一种音频信息识别方法的流程图,如图2A所示,该音频信息识别方法可应用于电子设备中,该电子设备可以是智能手机、平板电脑、智能电视、电子书阅读器、多媒体播放器、膝上型便携计算机和台式计算机等等。该音频信息识别方法可以包括以下步骤。在对正在播放的音频进行识别之前,电子设备需要获取正在播放的音频。而为了满足不同的需求,电子设备获取正在播放音频的方式也需要进行相应调整,请参见下述步骤201和步骤202。
在步骤201中,每隔预定时间间隔获取正在播放的音频。
这里的正在播放的音频可以是由电子设备通过收听电台广播后播放的音频,也可以是该电子设备周围的其他设备正在播放的音频,此时电子设备可以获取到正在播放音频的其他设备播放的音频。该音频可以是音乐音频,也可以是语言类节目音频,还可以是书籍音频。
电子设备可以每隔预定时间间隔获取正在播放的音频,该预定时间间隔可以由用户进行设置,比如,该预定时间间隔可以被设置为3分钟、4分钟或5分钟等。
可选的,为了降低电子设备的功耗,电子设备可以在检测到当前音频的节奏发生超过预定阀值的变化后,获取正在播放的音频。比如,以音乐音频为例,在一首歌曲播放完成之后和在对下一首歌曲进行播放之前通常会有一段间隔,此时音频的节奏与在播放歌曲时的节奏有很大的区别。因此,当电子设备检测到当前音频的节奏发生超过预定阀值的变化时,说明播放的歌曲发生了切换,电子设备此时获取的音频即为切换后的歌曲的音频。
在步骤202中,接收对正在播放的音频进行识别的识别指令,获取正在播放的音频。
为了更加贴合用户的需求,且降低因频繁识别音频而对电子设备的功耗,电子设备可以在接收到用户触发的对正在播放的音频进行识别的识别指令后获取正在播放的音频。
在一种实现场景中,当用户正在使用电子设备收听电台广播,发现当前播放的音频非常好听,且希望获取该音频的相关信息,此时,用户可以在该电子设备上触发生成对正在播放的音频进行识别的识别指令,电子设备在接收到该 识别指令后,获取正在播放的音频。
在另一种实现场景中,当其他设备正在播放音频,且用户希望获取该其他设备正在播放的音频的相关信息时,用户可以开启所持有的电子设备,并在该电子设备上触发生成对正在播放的音频进行识别的识别指令,电子设备在接收到该识别指令后,获取正在播放的音频。
可选的,用户在该电子设备上触发生成对正在播放的音频进行识别的识别指令时,可以触发电子设备上的识别控件以产生识别指令,也可以通过触发电子设备设置的指定硬件(比如音量键)以产生识别指令。
在步骤203中,对正在播放的音频进行识别,得到该音频的音频信息。
电子设备在对正在播放的音频进行识别时,可以先识别得到该音频的音频特征,并将该音频特征发送给服务器,由服务器进行匹配得到音频信息,具体描述请参见下述步骤203A至步骤203C,另请参见图2B,其是根据一示例性实施例示出的一种获取音频信息方法的流程图。
在步骤203A中,对音频进行识别,得到该音频的音频特征,该音频特征与该音频的文本信息和身份信息中前一种或全部两种相关。
电子设备对获取的正在播放的音频进行识别,得到该音频的音频特征。该音频特征与音频中出现的文本信息、语调特征或音调特征等相关。若通过声纹识别技术对音频进行识别时,该音频特征还与该音频的身份信息相关。比如,当获取的音频为音乐音频时,通过识别得到的文本信息为获取的音频所对应的歌词,且通过声纹识别得到的身份信息为该音频所对应的歌手;当获取的音频为语言类节目音频时,通过识别得到的文本信息为获取的音频所对应的节目内容,且通过声纹识别得到的身份信息为该音频所对应的表演艺人。
在步骤203B中,将音频特征发送给服务器,该音频特征用于触发服务器查找到与该音频特征匹配的音频信息,并反馈查找到的该音频信息。
电子设备将得到的音频特征发送给服务器,服务器可以根据预存的数据库查找到与该音频特征匹配的音频信息,服务器在查找到与音频特征匹配的音频信息后,将该音频信息反馈给电子设备。
该音频信息可以包括与该音频特征对应的音频的所有人信息和该音频所对应的音频名称等。举例来讲,当正在播放的音频为音乐音频时,音频信息则可以包括歌曲名称、专辑名称、歌手和歌词等;当正在播放的音频为语言类节目 音频时,音频信息则可以包括节目名称和表演艺人等;当正在播放的音频为书籍音频时,音频信息则可以包括书籍作者、书籍名称和章节目录等。
在步骤203C中,接收服务器反馈的音频信息。
在步骤204中,在信息展示界面上显示为音频信息中的关键词设置的跳转链接。
电子设备在接收到服务器反馈的音频信息后,可以为音频信息中的关键词设置的跳转链接,以方便用户通过跳转获取更多的信息。
这里的关键词可以是能够指示该音频的主要特征的关键词,比如,当正在播放的音频为音乐音频时,关键词可以是歌曲名称、歌手和专辑名称等;当正在播放的音频为语言类节目音频时,关键词可以是节目名称和表演艺人等;当正在播放的音频为书籍音频时,关键词可以是书籍作者和书籍名称等。
举例来讲,请参见图2C,其是根据一示例性实施例示出的一种显示音频信息以及跳转链接的示意图。图2C以音频为音乐音频为例,电子设备接收到的音频信息为“歌曲名:《歌曲A》”、“歌手:歌手A”、“专辑:《专辑A》”以及与歌曲A对应的歌词。电子设备为“《歌曲A》”、“歌手A”、“《专辑A》”分别配置了跳转链接,并将“歌曲名:《歌曲A》”、“歌手:歌手A”、“专辑:《专辑A》”以及与歌曲A对应的歌词显示在了信息展示界面上。
在步骤205中,当跳转链接被触发时,显示与关键词对应的预存信息。
当信息展示界面上的跳转链接被触发时,电子设备则显示与该关键词对应的预存信息,预存信息通常是预存的关键词的详细信息。比如,当正在播放的音频为音乐音频,且歌手名称的跳转链接被触发时,电子设备跳转显示该歌手的详细资料页面;当正在播放的音频为语言类节目音频,且节目名称的跳转链接被触发时,电子设备跳转显示该节目的详细介绍的页面;当正在播放的音频为书籍音频,且书籍作者的跳转链接被触发时,电子设备跳转显示该书籍作者专栏的页面。
举例来讲,请参见图2D,其是根据一示例性实施例示出的一种显示跳转页面的示意图。图2D仍旧以音频为音乐音频为例,当信息展示界面上的“歌手A”被触发时,电子设备跳转显示了该歌手A的详细资料。
为了方便用户查阅通过识别得到的音频信息,电子设备在信息展示界面上显示为音频信息中的关键词设置的跳转链接后,可以将音频信息和跳转链接进 行对应存储,请参见下述步骤206和步骤207。
在步骤206中,在显示跳转链接后,自动将音频信息和跳转链接保存至预存列表中。
电子设备在信息展示界面上显示为音频信息中的关键词设置的跳转链接以及音频信息后,可以自动将音频信息和跳转链接保存至预存列表中。用户可以通过查看预存列表来寻找保存的音频信息。
在步骤207中,接收用于指示保存音频信息和跳转链接的保存指令,将该音频信息和该跳转链接保存至预存列表中。
电子设备在信息展示界面上显示为音频信息中的关键词设置的跳转链接以及音频信息后,可以向用户询问是否保存该音频信息和跳转链接,当接收到用于指示保存音频信息和跳转链接的保存指令,将该音频信息和该跳转链接保存至预存列表中。
可选的,电子设备可以在信息展示界面上显示用于保存音频信息和跳转链接的保存控件,当电子设备检测到该保存控件被触发时,将对应的音频信息和跳转链接保存至预存列表中。
在一种实现场景中,当用户的车载系统正在收听广播电台且正在播放一首歌曲时,用户可以利用车载系统或手持的智能手机等设备识别正在播放的音频,车载系统或智能手机等设备得到该音频的音频信息,并可以在信息展示界面上显示该音频信息中关键词设置的跳转链接,用户在触发这些跳转链接后,可以显示该关键词对应的预存信息。如果是利用车载系统显示了这些跳转链接,为了避免用户因过多关注车载系统上显示的跳转链接或跳转链接所对应的预存信息而影响用户开车,可以自动将跳转链接和音频信息存储至预存列表,以便于用户在方便的时候浏览预存列表中的跳转链接和音频信息。很显然,用户也可以触发用于保存音频信息和跳转链接的保存控件,车载系统或手持的智能手机等设备接收用户触发的保存控件后产生的保存指令后,将音频信息和跳转链接保存至预存列表中,以便于用户方便时查看。
综上所述,本公开实施例中提供的音频信息识别方法,通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内 显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。
另外,通过将音频信息和跳转链接保存至预存列表中;由于能够通过该预存列表查找到识别过的音频的音频信息,因此解决了用户无法查看最近识别的音频的音频信息的问题;达到了提高查找音频信息的便捷性的效果。
为了方便用户再次欣赏或收藏收听到的音频,电子设备在显示为音频信息中的关键词设置的跳转链接时,还可以显示该音频所对应的完整音频的播放链接和下载链接。请参见图3A,其是根据一示例性实施例示出的一种播放或下载收听到的音频的方法的流程图。
在步骤301中,在信息展示界面上显示与音频对应的完整音频的播放链接和下载链接。
电子设备可以根据获取的音频信息获取与音频对应的完整音频的播放链接和下载链接,并将该播放链接和下载链接显示在信息展示界面中。
比如,当获取的音频信息中有歌曲名称时,可以显示该歌曲名称所对应的歌曲的播放链接和下载链接;当获取的音频信息中有节目名称时,可以显示该节目名称所对应的语言类节目音频的播放链接和下载链接;当获取的音频信息中有书籍名称时,可以显示该书籍名称所对应的书籍音频的播放链接和下载链接。
举例来讲,请参见图3B所示,其是根据一示例性实施例示出的一种显示音频的播放链接和下载链接的示意图。图3B以音频为音乐音频为例,电子设备在信息展示界面上显示了用于播放歌曲A的播放链接311和用于下载歌曲A的下载链接322。
需要说明的是,当获取的音频信息中有书籍名称时,电子设备还可以显示用于下载该书籍的下载链接,以及用于在线阅读该书籍的链接;当获取的音频信息中有节目名称,且存在与该节目名称对应的节目视频时,电子设备还可以显示用于下载该节目视频的下载链接,以及用于播放该节目视频的播放链接。
在步骤302中,当播放链接被触发时,播放该完整音频。
在步骤303中,当下载链接被触发时,下载该完整音频。
电子设备在检测到播放链接被触发时,播放与获取的音频对应的完整音频; 电子设备在检测到下载链接被触发时,下载与获取的音频对应的完整音频。
综上所述,本公开实施例通过在信息展示界面上显示与音频对应的完整音频的播放链接和下载链接,并在播放链接被触发时,播放该完整音频,在下载链接被触发时,下载该完整音频;由于能够在信息展示界面上提供播放链接和下载链接,因此解决了用户想要再次欣赏或收藏收听到的音频时,需要打开相应程序进行搜索后才能播放或下载音频,操作步骤复杂的问题;达到了简化操作步骤,提高操作效率的效果。
为了方便用户进一步了解音频信息中的关键词,电子设备在显示为音频信息中的关键词设置的跳转链接时,还可以显示与音频信息中各个关键词对应的搜索控件。请参见图4A,其是根据一示例性实施例示出的一种音频信息中关键词的搜索方法的流程图。
在步骤401中,在信息展示界面内显示与音频信息中的关键词对应的搜索控件。
为了显示更多与关键词对应的信息,使得用户能够进一步了解与关键词相关的信息,电子设备可以在信息展示界面内显示与音频信息中的关键词对应的搜索控件。
在步骤402中,当一个关键词的搜索控件被触发时,显示该关键词的搜索界面,该搜索界面内显示有与该关键词对应的搜索结果。
电子设备在检测到信息展示界面内某个关键词的搜索控件被触发时,显示该关键词的搜索界面,并在该搜索界面内显示与该关键词对应的搜索结果。
举例来讲,请参见图4B,其是根据一示例性实施例示出的一种显示与关键词对应的搜索结果的示意图。图4B以获取的音频为音乐音频为例,电子设备在检测到歌手A的搜索控件411被触发时,显示与歌手A对应的搜索界面,并在该搜索界面内显示与歌手A对应的搜索结果。
综上所述,本公开实施例通过当一个关键词的搜索控件被触发时,显示该关键词的搜索界面,该搜索界面内显示有与该关键词对应的搜索结果;由于能够在信息展示界面上显示用于搜索关键词的搜索控件,因此解决了需要打开其它应用程序进行搜索,操作步骤较多的问题;达到了提高操作效率的效果。
需要说明的是,上述图2A和图3A中的步骤可以合并为一个实施例,上述图2A和图4A中的步骤可以合并为一个实施例,上述图2A、图3A和图4A中的步骤可以合并为一个实施例。
下述为本公开装置实施例,可以用于执行本公开方法实施例。对于本公开装置实施例中未披露的细节,请参照本公开方法实施例。
图5是根据一示例性实施例示出的一种音频信息识别装置的框图,如图5所示,该音频信息识别装置可应用于电子设备中,该电子设备可以是智能手机、平板电脑、智能电视、电子书阅读器、多媒体播放器、膝上型便携计算机和台式计算机等等。该音频信息识别装置可以包括但不限于:识别模块501、第一显示模块502和第二显示模块503。
该识别模块501,被配置为对正在播放的音频进行识别,得到该音频的音频信息。
该第一显示模块502,被配置为在信息展示界面上显示为识别模块501识别得到的音频信息中的关键词设置的跳转链接。
该第二显示模块503,被配置为当第一显示模块502显示的跳转链接被触发时,显示与关键词对应的预存信息。
综上所述,本公开实施例中提供的音频信息识别装置,通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。
图6是根据另一示例性实施例示出的一种音频信息识别装置的框图,如图6所示,该音频信息识别装置可应用于电子设备中,该电子设备可以是智能手机、平板电脑、智能电视、电子书阅读器、多媒体播放器、膝上型便携计算机和台式计算机等等。该音频信息识别装置可以包括但不限于:识别模块601、第一显示模块602和第二显示模块603。
该识别模块601,被配置为对正在播放的音频进行识别,得到该音频的音频 信息。
该第一显示模块602,被配置为在信息展示界面上显示为识别模块601识别得到的音频信息中的关键词设置的跳转链接。
该第二显示模块603,被配置为当第一显示模块602显示的跳转链接被触发时,显示与关键词对应的预存信息。
在一种可能的实施例中,该识别模块601可以包括:识别子模块601a、发送子模块601b和接收子模块601c。
该识别子模块601a,被配置为对音频进行识别,得到该音频的音频特征,该音频特征与该音频的文本信息和身份信息中前一种或全部两种相关。
该发送子模块601b,被配置为将识别子模块601a识别得到的音频特征发送给服务器,该音频特征用于触发服务器查找到与该音频特征匹配的音频信息,并反馈查找到的该音频信息。
该接收子模块601c,被配置为接收服务器反馈的音频信息。
在一种可能的实施例中,该音频信息识别装置还可以包括:第一获取模块604或第二获取模块605。
该第一获取模块604,被配置为每隔预定时间间隔获取正在播放的音频。
该第二获取模块605,被配置为接收对正在播放的音频进行识别的识别指令,获取正在播放的音频。
在一种可能的实施例中,该音频信息识别装置还可以包括:第三显示模块606、播放模块607和下载模块608。
该第三显示模块606,被配置为在信息展示界面上显示与音频对应的完整音频的播放链接和下载链接。
该播放模块607,被配置为当第三显示模块606显示的播放链接被触发时,播放完整音频。
该下载模块608,被配置为当第三显示模块606显示的下载链接被触发时,下载完整音频。
在一种可能的实施例中,该音频信息识别装置还可以包括:第四显示模块609和第五显示模块610。
该第四显示模块609,被配置为在信息展示界面内显示与音频信息中的关键词对应的搜索控件。
该第五显示模块610,被配置为当第四显示模块609显示的一个关键词的搜索控件被触发时,显示该关键词的搜索界面,该搜索界面内显示有与该关键词对应的搜索结果。
在一种可能的实施例中,该音频信息识别装置还可以包括:第一保存模块611或第二保存模块612。
该第一保存模块611,被配置为在显示跳转链接后,自动将音频信息和跳转链接保存至预存列表中。
该第二保存模块612,被配置为接收用于指示保存音频信息和跳转链接的保存指令,将音频信息和跳转链接保存至预存列表中。
综上所述,本公开实施例中提供的音频信息识别装置,通过对正在播放的音频进行识别,得到该音频的音频信息,显示为该音频信息中的关键词设置的跳转链接,并在跳转链接被触发时显示与关键词对应的预存信息;由于能够通过提供跳转链接来显示更多与音频对应的信息,因此解决了只能在单一界面内显示音频的信息,能够展示的信息相对较少的问题;达到了提高音频信息的丰富性的效果。
另外,通过将音频信息和跳转链接保存至预存列表中;由于能够通过该预存列表查找到识别过的音频的音频信息,因此解决了用户无法查看最近识别的音频的音频信息的问题;达到了提高查找音频信息的便捷性的效果。
另外,通过在信息展示界面上显示与音频对应的完整音频的播放链接和下载链接,并在播放链接被触发时,播放该完整音频,在下载链接被触发时,下载该完整音频;由于能够在信息展示界面上提供播放链接和下载链接,因此解决了用户想要再次欣赏或收藏收听到的音频时,需要打开相应程序进行搜索后才能播放或下载音频,操作步骤复杂的问题;达到了简化操作步骤,提高操作效率的效果。
另外,通过当一个关键词的搜索控件被触发时,显示该关键词的搜索界面,该搜索界面内显示有与该关键词对应的搜索结果;由于能够在信息展示界面上显示用于搜索关键词的搜索控件,因此解决了需要打开其它应用程序进行搜索,操作步骤较多的问题;达到了提高操作效率的效果。
关于上述实施例中的装置,其中各个模块执行操作的具体方式已经在有关 该方法的实施例中进行了详细描述,此处将不做详细阐述说明。
本公开一示例性实施例提供了一种音频信息识别装置,能够实现本公开提供的音频信息识别方法,该音频信息识别装置包括:处理器、用于存储处理器可执行指令的存储器;
其中,处理器被配置为:
对正在播放的音频进行识别,得到该音频的音频信息;
在信息展示界面上显示为音频信息中的关键词设置的跳转链接;
当跳转链接被触发时,显示与关键词对应的预存信息。
图7是根据一示例性实施例示出的一种用于识别音频信息的装置的框图。例如,装置700可以是移动电话,计算机,数字广播终端,消息收发设备,游戏控制台,平板设备,健身设备,个人数字助理等。
参照图7,装置700可以包括以下一个或多个组件:处理组件702,存储器704,电源组件706,多媒体组件708,音频组件710,输入/输出(I/O)接口712,传感器组件714,以及通信组件716。
处理组件702通常控制装置700的整体操作,诸如与显示,电话呼叫,数据通信,相机操作和记录操作相关联的操作。处理组件702可以包括一个或多个处理器718来执行指令,以完成上述的方法的全部或部分步骤。此外,处理组件702可以包括一个或多个模块,便于处理组件702和其他组件之间的交互。例如,处理组件702可以包括多媒体模块,以方便多媒体组件708和处理组件702之间的交互。
存储器704被配置为存储各种类型的数据以支持在装置700的操作。这些数据的示例包括用于在装置700上操作的任何应用程序或方法的指令,联系人数据,电话簿数据,消息,图片,视频等。存储器704可以由任何类型的易失性或非易失性存储设备或者它们的组合实现,如静态随机存取存储器(SRAM),电可擦除可编程只读存储器(EEPROM),可擦除可编程只读存储器(EPROM),可编程只读存储器(PROM),只读存储器(ROM),磁存储器,快闪存储器,磁盘或光盘。
电源组件706为装置700的各种组件提供电力。电源组件706可以包括电 源管理系统,一个或多个电源,及其他与为装置700生成、管理和分配电力相关联的组件。
多媒体组件708包括在装置700和用户之间的提供一个输出接口的屏幕。在一些实施例中,屏幕可以包括液晶显示器(LCD)和触摸面板(TP)。如果屏幕包括触摸面板,屏幕可以被实现为触摸屏,以接收来自用户的输入信号。触摸面板包括一个或多个触摸传感器以感测触摸、滑动和触摸面板上的手势。触摸传感器可以不仅感测触摸或滑动动作的边界,而且还检测与触摸或滑动操作相关的持续时间和压力。在一些实施例中,多媒体组件708包括一个前置摄像头和/或后置摄像头。当装置700处于操作模式,如拍摄模式或视频模式时,前置摄像头和/或后置摄像头可以接收外部的多媒体数据。每个前置摄像头和后置摄像头可以是一个固定的光学透镜系统或具有焦距和光学变焦能力。
音频组件710被配置为输出和/或输入音频信号。例如,音频组件710包括一个麦克风(MIC),当装置700处于操作模式,如呼叫模式、记录模式和语音识别模式时,麦克风被配置为接收外部音频信号。所接收的音频信号可以被进一步存储在存储器704或经由通信组件716发送。在一些实施例中,音频组件710还包括一个扬声器,用于输出音频信号。
I/O接口712为处理组件702和外围接口模块之间提供接口,上述外围接口模块可以是键盘,点击轮,按钮等。这些按钮可包括但不限于:主页按钮、音量按钮、启动按钮和锁定按钮。
传感器组件714包括一个或多个传感器,用于为装置700提供各个方面的状态评估。例如,传感器组件714可以检测到装置700的打开/关闭状态,组件的相对定位,例如组件为装置700的显示器和小键盘,传感器组件714还可以检测装置700或装置700一个组件的位置改变,用户与装置700接触的存在或不存在,装置700方位或加速/减速和装置700的温度变化。传感器组件714可以包括接近传感器,被配置用来在没有任何的物理接触时检测附近物体的存在。传感器组件714还可以包括光传感器,如CMOS或CCD图像传感器,用于在成像应用中使用。在一些实施例中,该传感器组件714还可以包括加速度传感器,陀螺仪传感器,磁传感器,压力传感器或温度传感器。
通信组件716被配置为便于装置700和其他设备之间有线或无线方式的通信。装置700可以接入基于通信标准的无线网络,如Wi-Fi,2G或3G,或它们 的组合。在一个示例性实施例中,通信组件716经由广播信道接收来自外部广播管理系统的广播信号或广播相关信息。在一个示例性实施例中,通信组件716还包括近场通信(NFC)模块,以促进短程通信。例如,在NFC模块可基于射频识别(RFID)技术,红外数据协会(IrDA)技术,超宽带(UWB)技术,蓝牙(BT)技术和其他技术来实现。
在示例性实施例中,装置700可以被一个或多个应用专用集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理设备(DSPD)、可编程逻辑器件(PLD)、现场可编程门阵列(FPGA)、控制器、微控制器、微处理器或其他电子元件实现,用于执行上述音频信息识别方法。
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器704,上述指令可由装置700的处理器718执行以完成上述音频信息识别方法。例如,非临时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。
本领域技术人员在考虑说明书及实践这里公开的发明后,将容易想到本公开的其它实施方案。本申请旨在涵盖本公开的任何变型、用途或者适应性变化,这些变型、用途或者适应性变化遵循本公开的一般性原理并包括本公开未公开的本技术领域中的公知常识或惯用技术手段。说明书和实施例仅被视为示例性的,本公开的真正范围和精神由下面的权利要求指出。
应当理解的是,本公开并不局限于上面已经描述并在附图中示出的精确结构,并且可以在不脱离其范围进行各种修改和改变。本公开的范围仅由所附的权利要求来限制。

Claims (13)

  1. 一种音频信息识别方法,其特征在于,所述方法包括:
    对正在播放的音频进行识别,得到所述音频的音频信息;
    在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;
    当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
  2. 根据权利要求1所述的方法,其特征在于,所述对正在播放的音频进行识别,得到所述音频的音频信息,包括:
    对所述音频进行识别,得到所述音频的音频特征,所述音频特征与所述音频的文本信息和身份信息中前一种或全部两种相关;
    将所述音频特征发送给服务器,所述音频特征用于触发所述服务器查找到与所述音频特征匹配的音频信息,并反馈查找到的所述音频信息;
    接收所述服务器反馈的所述音频信息。
  3. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    每隔预定时间间隔获取所述正在播放的音频;
    或,
    接收对正在播放的音频进行识别的识别指令,获取所述正在播放的音频。
  4. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    在所述信息展示界面上显示与所述音频对应的完整音频的播放链接和下载链接;
    当所述播放链接被触发时,播放所述完整音频;
    当所述下载链接被触发时,下载所述完整音频。
  5. 根据权利要求1所述的方法,其特征在于,所述方法还包括:
    在所述信息展示界面内显示与所述音频信息中的关键词对应的搜索控件;
    当一个关键词的搜索控件被触发时,显示所述关键词的搜索界面,所述搜索界面内显示有与所述关键词对应的搜索结果。
  6. 根据权利要求1至5任一所述的方法,其特征在于,所述方法还包括:
    在显示所述跳转链接后,自动将所述音频信息和所述跳转链接保存至预存列表中;
    或,
    接收用于指示保存所述音频信息和所述跳转链接的保存指令,将所述音频信息和所述跳转链接保存至预存列表中。
  7. 一种音频信息识别装置,其特征在于,所述装置包括:
    识别模块,被配置为对正在播放的音频进行识别,得到所述音频的音频信息;
    第一显示模块,被配置为在信息展示界面上显示为所述识别模块识别得到的所述音频信息中的关键词设置的跳转链接;
    第二显示模块,被配置为当所述第一显示模块显示的所述跳转链接被触发时,显示与所述关键词对应的预存信息。
  8. 根据权利要求7所述的装置,其特征在于,所述识别模块,包括:
    识别子模块,被配置为对所述音频进行识别,得到所述音频的音频特征,所述音频特征与所述音频的文本信息和身份信息中前一种或全部两种相关;
    发送子模块,被配置为将所述识别子模块识别得到的所述音频特征发送给服务器,所述音频特征用于触发所述服务器查找到与所述音频特征匹配的音频信息,并反馈查找到的所述音频信息;
    接收子模块,被配置为接收所述服务器反馈的所述音频信息。
  9. 根据权利要求7所述的装置,其特征在于,所述装置还包括:
    第一获取模块,被配置为每隔预定时间间隔获取所述正在播放的音频;
    或,
    第二获取模块,被配置为接收对正在播放的音频进行识别的识别指令,获取所述正在播放的音频。
  10. 根据权利要求7所述的装置,其特征在于,所述装置还包括:
    第三显示模块,被配置为在所述信息展示界面上显示与所述音频对应的完整音频的播放链接和下载链接;
    播放模块,被配置为当所述第三显示模块显示的所述播放链接被触发时,播放所述完整音频;
    下载模块,被配置为当所述第三显示模块显示的所述下载链接被触发时,下载所述完整音频。
  11. 根据权利要求7所述的装置,其特征在于,所述装置还包括:
    第四显示模块,被配置为在所述信息展示界面内显示与所述音频信息中的关键词对应的搜索控件;
    第五显示模块,被配置为当所述第四显示模块显示的一个关键词的搜索控件被触发时,显示所述关键词的搜索界面,所述搜索界面内显示有与所述关键词对应的搜索结果。
  12. 根据权利要求7至11任一所述的装置,其特征在于,所述装置还包括:
    第一保存模块,被配置为在显示所述跳转链接后,自动将所述音频信息和所述跳转链接保存至预存列表中;
    或,
    第二保存模块,被配置为接收用于指示保存所述音频信息和所述跳转链接的保存指令,将所述音频信息和所述跳转链接保存至预存列表中。
  13. 一种音频信息识别装置,其特征在于,所述装置包括:
    处理器;
    用于存储所述处理器可执行指令的存储器;
    其中,所述处理器被配置为:
    对正在播放的音频进行识别,得到所述音频的音频信息;
    在信息展示界面上显示为所述音频信息中的关键词设置的跳转链接;
    当所述跳转链接被触发时,显示与所述关键词对应的预存信息。
PCT/CN2015/095034 2015-04-15 2015-11-19 音频信息识别方法及装置 WO2016165325A1 (zh)

Priority Applications (4)

Application Number Priority Date Filing Date Title
JP2017512096A JP6236189B2 (ja) 2015-04-15 2015-11-19 オーディオ情報識別方法及び装置
MX2016002658A MX359479B (es) 2015-04-15 2015-11-19 Método y aparato para identificar información de audio.
KR1020167001534A KR20160132808A (ko) 2015-04-15 2015-11-19 오디오정보식별방법 및 장치
RU2016108039A RU2634696C2 (ru) 2015-04-15 2015-11-19 Способ и устройство для идентификации аудиоинформации

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201510178987.0A CN104820678B (zh) 2015-04-15 2015-04-15 音频信息识别方法及装置
CN201510178987.0 2015-04-15

Publications (1)

Publication Number Publication Date
WO2016165325A1 true WO2016165325A1 (zh) 2016-10-20

Family

ID=53730975

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2015/095034 WO2016165325A1 (zh) 2015-04-15 2015-11-19 音频信息识别方法及装置

Country Status (8)

Country Link
US (1) US20160306880A1 (zh)
EP (1) EP3082280B1 (zh)
JP (1) JP6236189B2 (zh)
KR (1) KR20160132808A (zh)
CN (1) CN104820678B (zh)
MX (1) MX359479B (zh)
RU (1) RU2634696C2 (zh)
WO (1) WO2016165325A1 (zh)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489573A (zh) * 2019-07-30 2019-11-22 维沃移动通信有限公司 界面显示方法及电子设备

Families Citing this family (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104820678B (zh) * 2015-04-15 2018-10-19 小米科技有限责任公司 音频信息识别方法及装置
CN105005631A (zh) * 2015-08-24 2015-10-28 安徽味唯网络科技有限公司 一种高精度搜索的方法
CN105357588A (zh) * 2015-11-03 2016-02-24 腾讯科技(深圳)有限公司 数据显示方法及终端
CN114464186A (zh) * 2016-07-28 2022-05-10 北京小米移动软件有限公司 关键词确定方法及装置
CN106341728A (zh) * 2016-10-21 2017-01-18 北京巡声巡影科技服务有限公司 一种视频中的产品信息展示方法、装置和系统
CN106851362A (zh) * 2016-12-15 2017-06-13 咪咕音乐有限公司 一种多媒体内容的播放方法及装置
CN106599274A (zh) * 2016-12-23 2017-04-26 珠海市魅族科技有限公司 识别播放音源的装置及方法
CN106897435A (zh) * 2017-02-28 2017-06-27 深圳天珑无线科技有限公司 终端控制方法及装置
CN107040587A (zh) * 2017-03-02 2017-08-11 广州小鹏汽车科技有限公司 一种车载电台音乐内容获取方法及装置
CN107959751A (zh) * 2017-11-14 2018-04-24 优酷网络技术(北京)有限公司 音频播放方法及装置
US20190206102A1 (en) * 2017-12-29 2019-07-04 Facebook, Inc. Systems and methods for enhancing content
CN111723235B (zh) * 2019-03-19 2023-09-26 百度在线网络技术(北京)有限公司 音乐内容识别方法、装置及设备
CN112148754A (zh) * 2020-09-01 2020-12-29 腾讯音乐娱乐科技(深圳)有限公司 一种歌曲识别方法和装置
US20230142904A1 (en) * 2021-11-09 2023-05-11 Honda Motor Co., Ltd. Creation of notes for items of interest mentioned in audio content
EP4213145A1 (en) * 2022-01-14 2023-07-19 Vestel Elektronik Sanayi ve Ticaret A.S. Device and method for triggering a music identification application

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20040133558A1 (en) * 2003-01-06 2004-07-08 Masterwriter, Inc. Information management system plus
US7752546B2 (en) * 2001-06-29 2010-07-06 Thomson Licensing Method and system for providing an acoustic interface
CN102868822A (zh) * 2012-09-24 2013-01-09 广东欧珀移动通信有限公司 一种移动终端实施的歌词显示方法
CN103096249A (zh) * 2011-10-28 2013-05-08 M&Service株式会社 内容同时播放终端、其系统以及同时播放方法
CN103442083A (zh) * 2013-09-10 2013-12-11 百度在线网络技术(北京)有限公司 音频文件传输关联内容的方法、系统、客户端和服务器
CN104820678A (zh) * 2015-04-15 2015-08-05 小米科技有限责任公司 音频信息识别方法及装置

Family Cites Families (18)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US3919479A (en) * 1972-09-21 1975-11-11 First National Bank Of Boston Broadcast signal identification system
US7171018B2 (en) * 1995-07-27 2007-01-30 Digimarc Corporation Portable devices and methods employing digital watermarking
US6317784B1 (en) * 1998-09-29 2001-11-13 Radiowave.Com, Inc. Presenting supplemental information for material currently and previously broadcast by a radio station
US7028082B1 (en) * 2001-03-08 2006-04-11 Music Choice Personalized audio system and method
US6964023B2 (en) * 2001-02-05 2005-11-08 International Business Machines Corporation System and method for multi-modal focus detection, referential ambiguity resolution and mood classification using multi-modal input
US20060184960A1 (en) * 2005-02-14 2006-08-17 Universal Music Group, Inc. Method and system for enabling commerce from broadcast content
CN1983253A (zh) * 2005-12-15 2007-06-20 北京中科信利技术有限公司 一种提供音乐搜索服务的方法、设备和系统
US7787697B2 (en) * 2006-06-09 2010-08-31 Sony Ericsson Mobile Communications Ab Identification of an object in media and of related media objects
ES2433966T3 (es) * 2006-10-03 2013-12-13 Shazam Entertainment, Ltd. Método para caudal alto de identificación de contenido de radiodifusión distribuido
WO2009042697A2 (en) * 2007-09-24 2009-04-02 Skyclix, Inc. Phone-based broadcast audio identification
US20100057781A1 (en) * 2008-08-27 2010-03-04 Alpine Electronics, Inc. Media identification system and method
CN101635002A (zh) * 2009-08-21 2010-01-27 深圳市五巨科技有限公司 一种移动终端音乐搜索的方法和装置
US9264785B2 (en) * 2010-04-01 2016-02-16 Sony Computer Entertainment Inc. Media fingerprinting for content determination and retrieval
US8694533B2 (en) * 2010-05-19 2014-04-08 Google Inc. Presenting mobile content based on programming context
US8158870B2 (en) * 2010-06-29 2012-04-17 Google Inc. Intervalgram representation of audio for melody recognition
KR20120069908A (ko) * 2010-12-21 2012-06-29 삼성전자주식회사 휴대단말기의 정보제공 장치 및 방법
US20150286873A1 (en) * 2014-04-03 2015-10-08 Bruce L. Davis Smartphone-based methods and systems
CN103685520A (zh) * 2013-12-13 2014-03-26 深圳Tcl新技术有限公司 基于语音识别的歌曲推送的方法和装置

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7752546B2 (en) * 2001-06-29 2010-07-06 Thomson Licensing Method and system for providing an acoustic interface
US20040133558A1 (en) * 2003-01-06 2004-07-08 Masterwriter, Inc. Information management system plus
CN103096249A (zh) * 2011-10-28 2013-05-08 M&Service株式会社 内容同时播放终端、其系统以及同时播放方法
CN102868822A (zh) * 2012-09-24 2013-01-09 广东欧珀移动通信有限公司 一种移动终端实施的歌词显示方法
CN103442083A (zh) * 2013-09-10 2013-12-11 百度在线网络技术(北京)有限公司 音频文件传输关联内容的方法、系统、客户端和服务器
CN104820678A (zh) * 2015-04-15 2015-08-05 小米科技有限责任公司 音频信息识别方法及装置

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110489573A (zh) * 2019-07-30 2019-11-22 维沃移动通信有限公司 界面显示方法及电子设备

Also Published As

Publication number Publication date
MX2016002658A (es) 2017-04-27
US20160306880A1 (en) 2016-10-20
CN104820678A (zh) 2015-08-05
KR20160132808A (ko) 2016-11-21
CN104820678B (zh) 2018-10-19
MX359479B (es) 2018-09-28
EP3082280A1 (en) 2016-10-19
EP3082280B1 (en) 2018-07-25
JP2017517828A (ja) 2017-06-29
RU2634696C2 (ru) 2017-11-03
RU2016108039A (ru) 2017-09-07
JP6236189B2 (ja) 2017-11-22

Similar Documents

Publication Publication Date Title
WO2016165325A1 (zh) 音频信息识别方法及装置
US11206448B2 (en) Method and apparatus for selecting background music for video shooting, terminal device and medium
TWI667917B (zh) Multimedia search result display method and device
CN104166689B (zh) 电子书籍的呈现方法及装置
US20220391060A1 (en) Methods for displaying and providing multimedia resources
WO2015196709A1 (zh) 信息获取方法及装置
CN109657236B (zh) 引导信息获取方法、装置、电子装置及存储介质
CN105095427A (zh) 搜索推荐方法和装置
CN108334623B (zh) 歌曲的显示方法、装置和系统
CN105068976A (zh) 票务信息展示方法及装置
CN107229403B (zh) 一种信息内容选择方法及装置
CN105139848B (zh) 数据转换方法和装置
CN107342082A (zh) 基于语音交互的音频处理方法、装置及音频播放设备
WO2018188414A1 (zh) 搜索结果显示方法及装置
CN111061906A (zh) 音乐信息处理方法、装置、电子设备及计算机可读存储介质
CN112068711A (zh) 一种输入法的信息推荐方法、装置和电子设备
CN109862421A (zh) 一种视频信息识别方法、装置、电子设备及存储介质
US20220208156A1 (en) Method for generating song melody and electronic device
CN106020766A (zh) 音乐播放的方法及装置
WO2016197549A1 (zh) 一种进行搜索的方法和装置
CN111540361B (zh) 一种语音处理方法、装置和介质
WO2020224570A1 (zh) 交互方法及装置、音箱、电子设备和存储介质
CN112988956B (zh) 自动生成对话的方法及装置、信息推荐效果检测方法及装置
WO2019196527A1 (zh) 一种数据处理方法、装置和电子设备
CN106060253B (zh) 信息呈现的方法及装置

Legal Events

Date Code Title Description
ENP Entry into the national phase

Ref document number: 20167001534

Country of ref document: KR

Kind code of ref document: A

Ref document number: 2017512096

Country of ref document: JP

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: MX/A/2016/002658

Country of ref document: MX

ENP Entry into the national phase

Ref document number: 2016108039

Country of ref document: RU

Kind code of ref document: A

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112016004835

Country of ref document: BR

121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15889021

Country of ref document: EP

Kind code of ref document: A1

ENP Entry into the national phase

Ref document number: 112016004835

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20160304

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15889021

Country of ref document: EP

Kind code of ref document: A1