WO2017219925A1 - Information sending method, apparatus, and computer storage medium - Google Patents

Information sending method, apparatus, and computer storage medium

Info

Publication number
WO2017219925A1
Authority
WO
WIPO (PCT)
Prior art keywords
information
feature
user
recognized
identified
Prior art date
Application number
PCT/CN2017/088744
Other languages
English (en)
French (fr)
Inventor
马晓龙
Original Assignee
深圳市中兴微电子技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市中兴微电子技术有限公司
Publication of WO2017219925A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00 Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30 Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31 User authentication
    • G06F21/32 User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M1/00 Substation equipment, e.g. for use by subscribers
    • H04M1/72 Mobile telephones; Cordless telephones, i.e. devices for establishing wireless links to base stations without route selection
    • H04M1/724 User interfaces specially adapted for cordless or mobile telephones
    • H04M1/72448 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions
    • H04M1/72454 User interfaces specially adapted for cordless or mobile telephones with means for adapting the functionality of the device according to specific conditions according to context-related or environment-related conditions

Definitions

  • The present invention relates to information management technologies in the field of mobile communications, and in particular to an information sending method, an apparatus, and a computer storage medium.
  • Speech recognition takes the speech signal as its object of study and is an important research direction in artificial intelligence; its ultimate goal is natural-language communication between humans and machines. In general, there are three approaches to speech recognition: methods based on a vocal tract model and speech knowledge, template matching methods, and methods using artificial neural networks.
  • The generation of human speech is a complex physiological and physical process involving the brain's language center and the vocal organs.
  • The vocal organs that each person uses in speech differ greatly in size and shape, so the voiceprints of any two people differ.
  • Each person's acoustic characteristics are relatively stable yet also variable. The variation may come from physiology, pathology, psychology, imitation, or disguise, and may also be related to environmental interference.
  • A voiceprint is the spectrum of sound waves carrying speech information, as displayed by electroacoustic instruments.
  • Voiceprint recognition, also known as speaker recognition, is a biometric technology that extracts voiceprint information from the speech signal produced by a speaker. Speaker recognition falls into two categories: speaker identification and speaker verification. Speaker identification determines which of several people uttered a given speech segment, a "one-of-many" selection problem; speaker verification confirms whether a given speech segment was uttered by a specified person, a "one-to-one" discrimination problem. Voiceprint recognition mainly includes speech signal processing, voiceprint feature extraction, voiceprint modeling, voiceprint comparison, and decision making.
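  • The difference between speaker identification and speaker verification can be sketched as follows. This is a minimal illustration, not the method of the embodiment: the fixed-length feature vectors, the cosine-similarity score, the `enrolled` dictionary, and the `threshold` value are all assumptions introduced for the example.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length feature vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def identify_speaker(sample, enrolled):
    """Speaker identification (one-of-many): pick the closest enrolled speaker."""
    return max(enrolled, key=lambda name: cosine_similarity(sample, enrolled[name]))


def verify_speaker(sample, claimed_voiceprint, threshold=0.9):
    """Speaker verification (one-to-one): accept or reject a claimed identity."""
    return cosine_similarity(sample, claimed_voiceprint) >= threshold


# Hypothetical enrolled voiceprints and one incoming sample.
enrolled = {"alice": [0.9, 0.1, 0.3], "bob": [0.1, 0.8, 0.5]}
sample = [0.85, 0.15, 0.25]

who = identify_speaker(sample, enrolled)                 # closest enrolled name
is_alice = verify_speaker(sample, enrolled["alice"])     # claim: "I am alice"
is_bob = verify_speaker(sample, enrolled["bob"])         # claim: "I am bob"
```

  • Identification always returns some enrolled speaker, whereas verification can reject the claim outright, which is why the authentication described later in this document corresponds to the verification case.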
  • A mobile terminal can use existing voice recognition methods to recognize the voice information produced by the first user during a call, and then directly send the notification information for that voice information to the second user. However, in some special scenarios the mobile terminal cannot directly send the first user's notification information to the second user, and existing information sending methods offer no way to send the first user's notification information to the second user indirectly.
  • The embodiments of the present invention provide an information sending method, an apparatus, and a computer storage medium that enable the mobile terminal to send the first user's notification information to the second user, thereby ensuring that the second user can receive the notification information sent by the first user in time.
  • an embodiment of the present invention provides a method for sending information, where the method includes:
  • Before the acquiring of the to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user, the method further includes:
  • the speech feature to be recognized is acquired.
  • the determining whether the to-be-identified voice feature meets a preset trigger condition comprises:
  • the acquiring the notification information corresponding to the to-be-identified voice feature includes:
  • the static information includes: text information of the first user;
  • the dynamic information includes: picture information and location information of the first user.
  • The embodiment of the present invention further provides an information sending apparatus, where the apparatus includes: an acquiring unit, configured to acquire a to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user, and to send the to-be-recognized voice feature to the determining unit;
  • the determining unit is configured to determine whether the to-be-recognized voice feature meets a preset trigger condition, and when the to-be-recognized voice feature satisfies a preset trigger condition, send an acquisition instruction to the acquiring unit;
  • the acquiring unit is further configured to: after receiving the obtaining instruction, acquire notification information corresponding to the to-be-recognized voice feature, and send the notification information to the sending unit;
  • the sending unit is configured to send the notification information to a second user.
  • the apparatus further includes: an authentication unit;
  • the acquiring unit is further configured to acquire a voiceprint feature to be identified corresponding to the to-be-identified voice information, and send the to-be-identified voiceprint feature to the authentication unit;
  • the authentication unit is configured to perform identity authentication on the first user according to the voiceprint feature to be identified; when the authentication is passed, send an instruction to pass the authentication to the acquiring unit;
  • the acquiring unit is further configured to acquire the to-be-recognized voice feature after receiving the instruction that the authentication passes.
  • The determining unit is configured to determine whether the to-be-recognized speech feature matches a pre-saved speech feature; when the matching succeeds, determine that the to-be-recognized speech feature satisfies the trigger condition; when the matching fails, determine that the to-be-recognized speech feature does not satisfy the trigger condition.
  • the acquiring unit is configured to acquire static information corresponding to the to-be-recognized speech feature, or acquire dynamic information corresponding to the to-be-recognized speech feature.
  • the static information includes: text information of the first user;
  • the dynamic information includes: picture information and location information of the first user.
  • an embodiment of the present invention further provides a computer storage medium, where the computer storage medium stores computer executable instructions, and the computer executable instructions are used to execute the information sending method according to the embodiment of the present invention.
  • The to-be-recognized speech feature corresponding to the to-be-identified speech information of the first user is first acquired; when the to-be-recognized speech feature satisfies the preset trigger condition, the notification information corresponding to the to-be-recognized speech feature is acquired and finally sent to the second user. That is, when the mobile terminal cannot directly send the notification information of the first user to the second user, setting the to-be-recognized voice feature as a trigger condition allows the corresponding notification information to be sent to the second user, unlike the prior art, in which the mobile terminal can only directly send the notification information of the first user to the second user.
  • The information sending method and apparatus provided by the embodiment of the present invention therefore enable the mobile terminal to send the notification information of the first user to the second user, ensuring that the second user can receive the information in time.
  • FIG. 1 is a schematic diagram of the hardware structure of a mobile terminal that implements various embodiments of the present invention;
  • FIG. 2 is a schematic flowchart of an information sending method according to an embodiment of the present invention;
  • FIG. 3 is a schematic flowchart of a method for performing identity authentication on a first user according to an embodiment of the present invention;
  • FIG. 4 is a schematic flowchart of a method for determining whether a trigger condition is met according to an embodiment of the present invention;
  • FIG. 5 is a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention.
  • The mobile terminal can be implemented in various forms.
  • The terminal described in the present invention may include, for example, mobile terminals such as a mobile phone, a smart phone, a notebook computer, a digital broadcast receiver, a personal digital assistant (PDA), a tablet (PAD, Portable Android Device), a portable multimedia player (PMP), and a navigation device, as well as fixed terminals such as a digital television (TV) and a desktop computer.
  • In the description below, the terminal is assumed to be a mobile terminal.
  • However, those skilled in the art will appreciate that, apart from components intended specifically for mobile use, configurations in accordance with embodiments of the present invention can also be applied to fixed terminals.
  • FIG. 1 is a schematic diagram of a hardware structure of a mobile terminal for implementing various embodiments of the present invention.
  • The mobile terminal 100 may include a communication unit 110, a storage unit 120, an identification unit 130, an output unit 140, an input unit 150, a power supply unit 160, an interface unit 170, a controller 180, and the like.
  • Communication unit 110 typically includes one or more modules that allow mobile terminal 100 to communicate wirelessly with a wireless communication system or a wireless communication network.
  • the communication unit 110 may include at least one of the mobile communication module 111, the Internet communication module 112, and the location information module 113.
  • the mobile communication module 111 transmits a radio signal to and/or receives a radio signal from at least one of a base station, an external terminal, and a server.
  • the radio signal may include: a voice call signal, a video call signal, or various types of data transmitted and/or received according to text and/or multimedia messages.
  • the internet communication module 112 supports wireless internet access of the mobile terminal 100.
  • the internet communication module 112 can be internally or externally coupled to the terminal.
  • the wireless Internet access technology involved in the Internet communication module 112 may include: Wireless Local Area Networks (WLAN), Wireless Fidelity (Wi-Fi), Wireless Broadband Access Service (Wibro), Worldwide Interoperability for Microwave Access (WiMAX), High Speed Downlink Packet Access (HSDPA), and the like.
  • the location information module 113 is a module configured to check or acquire location information of the mobile terminal 100.
  • A typical example of the location information module 113 is the Global Positioning System (GPS). GPS can calculate distance information and accurate time information from three or more satellites, so that the location information of the mobile terminal 100 can be accurately calculated.
  • The location information module 113 may include, but is not limited to, GPS, Assisted GPS (A-GPS), the BeiDou positioning system, the Global Navigation Satellite System (GLONASS), and the Galileo satellite navigation system (Galileo).
  • the storage unit 120 may store a software program or the like that performs processing and control operations performed by the controller 180, or may temporarily store data (for example, a phone book, a message, a still image, a video, and the like) that has been output or is to be output. Moreover, the storage unit 120 can store data regarding various manners of vibration and audio signals that are output when a touch is applied to the touch screen.
  • The storage unit 120 may include at least one type of storage medium, and the storage medium may include: a flash memory, a hard disk, a multimedia card, a card-type memory (for example, SD or DX memory), a Random Access Memory (RAM), a Static Random Access Memory (SRAM), a Read-Only Memory (ROM), an Electrically Erasable Programmable Read-Only Memory (EEPROM), a Programmable Read-Only Memory (PROM), a magnetic memory, a magnetic disk, and an optical disk.
  • the mobile terminal 100 can cooperate with a network storage device that performs a storage function of the storage unit 120 through a network connection.
  • the identification unit 130 includes: a voice recognition module 131 and a voiceprint recognition module 132;
  • The voice recognition module 131 is configured to acquire voice information during the user's call, recognize that voice information using vocal tract model, template matching, or artificial neural network techniques, and convert it into computer-readable input information.
  • The voiceprint recognition module 132 is configured to use voiceprint recognition technology to convert the user's voiceprint feature into a format recognizable by the device, for example a message digest algorithm (MD5) identifier, so that it can be matched against the pre-saved voiceprint feature to confirm whether the user is a legitimate user.
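  • As a rough illustration of turning a voiceprint feature into an MD5 identifier, the sketch below quantizes a feature vector and hashes its text form with Python's standard `hashlib`. The quantization step, the `decimals` parameter, and the function name `voiceprint_identifier` are assumptions made for the example; the source does not specify how the identifier is derived.

```python
import hashlib


def voiceprint_identifier(features, decimals=2):
    """Return an MD5 hex digest of a quantized feature vector (hypothetical encoding)."""
    # Rounding makes slightly different measurements map to the same string;
    # this quantization step is an assumption, not something the source describes.
    quantized = ",".join(f"{value:.{decimals}f}" for value in features)
    return hashlib.md5(quantized.encode("utf-8")).hexdigest()


stored_id = voiceprint_identifier([0.12, 0.87, 0.45])         # pre-saved voiceprint
candidate_id = voiceprint_identifier([0.121, 0.869, 0.452])   # same speaker, small jitter
other_id = voiceprint_identifier([0.50, 0.10, 0.90])          # different speaker

is_legitimate = candidate_id == stored_id
```

  • Comparing digests only detects identical quantized features, so a practical matcher would more likely score similarity between the features themselves and use the digest merely as a compact identifier.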
  • The output unit 140 can provide output signals (e.g., acoustic, optical, electrical, and vibration signals) in visual, audio, somatosensory, and similar forms.
  • The output unit 140 may include a display module 141 and an audio output module 142.
  • The display module 141 can display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in the standby mode, the display module 141 may display a user interface (UI) and a graphical user interface (GUI) related to the standby mode of the mobile terminal 100. When the mobile terminal 100 is in the play mode, the display module 141 can present the video stream and the UI and GUI used for operation to the user. When the mobile terminal 100 is in the call mode, the display module 141 can provide the user with the dial pad and a UI or GUI showing call information for the user's reference and operation. When the display module 141 and the touch panel are superposed on each other in the form of a layer to form a touch screen, the display module 141 can function as both an input device and an output device.
  • The display module 141 may include at least one of a Liquid Crystal Display (LCD), a Thin Film Transistor-Liquid Crystal Display (TFT-LCD), an Organic Light Emitting Diode (OLED) display, and a flexible display.
  • The mobile terminal 100 may include two or more display modules 141.
  • When the mobile terminal 100 is in a call signal receiving mode, a call mode, a recording mode, a voice recognition mode, or a broadcast receiving mode, the audio output module 142 may convert the audio data received by the communication unit 110 or stored in the storage unit 120 into an audio signal and output it as sound. The audio output module 142 can also provide audio output related to a specific function performed by the mobile terminal 100 (e.g., a call signal reception sound or a message reception sound), and may include a speaker, a buzzer, and the like.
  • The input unit 150 typically includes one or more modules that allow the mobile terminal 100 to receive various forms of signal input, including audio, video, and touch operations.
  • The input unit 150 includes, but is not limited to: a microphone 151, a user input module 152, a camera 153, and a sensor module 154.
  • The microphone 151 can receive sound when the mobile terminal 100 operates in a specific mode (for example, a call mode, a recording mode, or a voice recognition mode) and process the received sound into audio data; the processed audio data can be transmitted to the communication base station via the mobile communication module 111.
  • The microphone 151 can implement various types of noise canceling (or suppression) algorithms to cancel (or suppress) noise or interference generated in the process of receiving and transmitting audio signals.
  • The user input module 152 can generate various operations for controlling the mobile terminal 100 according to commands input by the user.
  • The user input module 152 allows the user to input various types of information and may include a keyboard, a mouse, a trackball, a joystick, a touchpad, and the like.
  • When the touchpad is superposed on the display module 141 in the form of a layer, a touch screen can be formed.
  • The camera 153 can process the image data of still pictures or video obtained by the image capturing device in the video capturing mode or the image capturing mode. The processed image frames can be displayed on the display module 141, and can also be stored in the storage unit 120 (or another storage medium) or transmitted via the communication unit 110.
  • The sensor module 154 is responsible for detecting the surrounding environment of the mobile terminal 100 and its own conditions, and includes but is not limited to: an accelerometer, a magnetic sensor, a direction sensor, a gyroscope, a light sensor, a pressure sensor, a temperature sensor, a proximity sensor, a gravity sensor, a linear acceleration sensor, and a rotation vector sensor.
  • The sensor module 154 can detect information such as acceleration, rotation, light intensity, and magnetic field, and generate commands or signals accordingly.
  • The power supply unit 160 receives external power or internal power under the control of the controller 180 and provides the appropriate power required to operate the various elements and components.
  • The interface unit 170 serves as the interface through which external devices connect to the mobile terminal 100.
  • The external device interfaces may include: a wired (or wireless) headphone port, an audio input/output (I/O) port, a memory card port, a charger interface, a video I/O port, a wearable device interface, and mobile communication card interfaces such as a SIM card interface.
  • The interface unit 170 can be used to receive input (e.g., data streams, current) from an external device and transmit the received input to the mobile terminal 100.
  • The controller 180 generally controls the overall operations of the mobile terminal 100. For example, the controller 180 performs control and processing related to voice calls, data communications, video calls, and the like. The controller 180 can also perform pattern recognition processing to recognize handwriting input or picture drawing input performed on the touch screen as characters or images.
  • the various embodiments described herein can be implemented in a computer readable medium using, for example, computer software, hardware, or any combination thereof.
  • The embodiments described herein may be implemented using at least one of an Application Specific Integrated Circuit (ASIC), a Digital Signal Processor (DSP), a Digital Signal Processing Device (DSPD), a Programmable Logic Device (PLD), a Field Programmable Gate Array (FPGA), a processor, a controller, a microcontroller, a microprocessor, and an electronic unit designed to perform the functions described herein.
  • the mobile terminal 100 has been described in terms of its function.
  • FIG. 2 is a schematic flowchart of an implementation manner of an information sending method according to an embodiment of the present invention. As shown in FIG. 2, the information sending method may include the following steps:
  • Step 201: Acquire a to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user.
  • When the first user inputs the to-be-identified voice information into the mobile terminal, the mobile terminal may first collect the to-be-identified voice information entered by the first user, and then acquire the to-be-recognized voice feature corresponding to that voice information.
  • The mobile terminal can acquire the to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user by using existing voice recognition technology.
  • The mobile terminal may perform identity authentication on the first user before acquiring the to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user.
  • The mobile terminal may first acquire the to-be-recognized voiceprint feature corresponding to the to-be-identified voice information, and then authenticate the first user according to that voiceprint feature.
  • FIG. 3 is a schematic flowchart of a method for performing identity authentication on the first user according to an embodiment of the present invention. As shown in FIG. 3, the method by which the mobile terminal performs identity authentication on the first user may include the following steps:
  • Step 201a: Determine whether the voiceprint feature to be recognized matches the pre-saved voiceprint feature; when the matching succeeds, perform step 201b; when the matching fails, perform step 201c.
  • The mobile terminal can determine whether the voiceprint feature to be recognized matches the pre-saved voiceprint feature.
  • The mobile terminal can use an existing voiceprint recognition method to match the voiceprint feature to be recognized against the pre-saved voiceprint feature.
  • For example, the mobile terminal can use the template matching method, the nearest neighbor method, the neural network method, the Hidden Markov Model (HMM) method, the vector quantization (VQ) clustering method, or the polynomial classifier method from existing voiceprint recognition to match the voiceprint feature to be recognized against the pre-saved voiceprint feature.
  • When the voiceprint feature to be recognized matches the pre-saved voiceprint feature, step 201b is performed; when the matching fails, step 201c is performed.
  • Step 201b: Determine that the first user passes the identity authentication, and end the current judgment process.
  • The mobile terminal may determine that the first user passes the identity authentication, that is, the mobile terminal may acquire the to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user, and end the current judgment process.
  • Step 201c: Determine that the first user does not pass the identity authentication.
  • The mobile terminal may determine that the first user does not pass the identity authentication, that is, the mobile terminal does not acquire the to-be-recognized voice feature corresponding to the to-be-identified voice information of the first user.
  • The mobile terminal can thus perform identity authentication on the first user according to the first user's voiceprint feature: when the authentication passes, the mobile terminal acquires the to-be-recognized voice feature corresponding to the to-be-identified voice information; when the authentication fails, the mobile terminal does not acquire it.
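  • Steps 201a to 201c can be sketched with one of the matching methods listed above, vector quantization. The sketch below is only an illustration under stated assumptions: the voiceprint is represented as a list of per-frame feature vectors, the pre-saved voiceprint as a small VQ codebook, and the `max_distortion` threshold and all function names are hypothetical.

```python
def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def average_distortion(frames, codebook):
    """Mean distance from each frame to its nearest codeword (VQ distortion)."""
    return sum(
        min(squared_distance(frame, codeword) for codeword in codebook)
        for frame in frames
    ) / len(frames)


def authenticate_first_user(frames, saved_codebook, max_distortion=0.05):
    """Steps 201a to 201c: True if the first user passes identity authentication."""
    if not frames or not saved_codebook:
        return False  # Nothing to compare: treat as a failed match (step 201c).
    # Step 201a: match the to-be-recognized voiceprint against the saved one.
    matched = average_distortion(frames, saved_codebook) <= max_distortion
    # Step 201b (True) lets the terminal go on to acquire the voice feature;
    # step 201c (False) means the voice feature is not acquired.
    return matched


saved_codebook = [[0.0, 0.0], [1.0, 1.0]]        # hypothetical enrolled codebook
owner_frames = [[0.1, 0.0], [0.9, 1.0]]          # close to the codewords
stranger_frames = [[0.5, 0.5], [0.4, 0.6]]       # far from every codeword

owner_passes = authenticate_first_user(owner_frames, saved_codebook)
stranger_passes = authenticate_first_user(stranger_frames, saved_codebook)
```

  • Only when the function returns True would the terminal continue with step 201 proper and acquire the to-be-recognized voice feature.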
  • Step 202: Determine whether the to-be-recognized speech feature satisfies a preset trigger condition; when the trigger condition is met, perform step 203; when the trigger condition is not met, perform step 205.
  • The mobile terminal can determine whether the to-be-recognized speech feature satisfies the preset trigger condition; when the to-be-recognized speech feature satisfies the preset trigger condition, step 203 is performed; when the to-be-recognized speech feature does not satisfy the preset trigger condition, step 205 is performed.
  • FIG. 4 is a schematic flowchart of a method for determining whether a trigger condition is met according to an embodiment of the present invention. As shown in FIG. 4, the method for determining whether the to-be-recognized speech feature meets the trigger condition may include the following steps:
  • Step 202a: Determine whether the to-be-recognized speech feature matches the pre-saved speech feature; when the matching succeeds, perform step 202b; when the matching fails, perform step 202c.
  • The mobile terminal can determine whether the to-be-recognized speech feature matches the pre-saved speech feature.
  • The mobile terminal may use existing voice recognition methods to match the to-be-recognized speech feature against the pre-saved speech feature.
  • Step 202b: Determine that the to-be-recognized speech feature satisfies the trigger condition, and end the current judgment process.
  • The mobile terminal may determine that the to-be-recognized speech feature satisfies the trigger condition. That is, the to-be-recognized speech feature triggers the mobile terminal to acquire the notification information corresponding to it, and the current judgment process ends.
  • Step 202c: Determine that the to-be-recognized speech feature does not satisfy the trigger condition.
  • The mobile terminal may determine that the to-be-recognized speech feature does not satisfy the trigger condition. That is, the to-be-recognized speech feature does not trigger the mobile terminal to acquire the notification information corresponding to it.
  • Through steps 202a to 202c, the mobile terminal can determine whether the to-be-recognized speech feature satisfies the preset trigger condition; when the to-be-recognized speech feature satisfies the trigger condition, the mobile terminal acquires the notification information corresponding to it; when it does not satisfy the trigger condition, the mobile terminal does not acquire that notification information.
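  • A minimal sketch of steps 202a to 202c follows. It assumes, purely for illustration, that the speech feature is the recognized text of the utterance and that matching means equality after simple normalization; the source leaves both the feature representation and the matching method open, and the names `normalize` and `check_trigger` are hypothetical.

```python
def normalize(text):
    """Lower-case and collapse whitespace so trivially different inputs compare equal."""
    return " ".join(text.lower().split())


def check_trigger(recognized_feature, saved_features):
    """Return True if the recognized feature matches any pre-saved feature."""
    saved = {normalize(feature) for feature in saved_features}
    # Step 202a: compare the to-be-recognized feature with the pre-saved ones.
    if normalize(recognized_feature) in saved:
        return True   # Step 202b: the trigger condition is satisfied.
    return False      # Step 202c: the trigger condition is not satisfied.


saved_features = ["help me", "I am running late"]   # hypothetical pre-saved phrases

triggered = check_trigger("  Help   ME ", saved_features)
not_triggered = check_trigger("good morning", saved_features)
```

  • A True result corresponds to proceeding to step 203, and a False result to ending the flow in step 205.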
  • Step 203: Acquire notification information corresponding to the to-be-recognized voice feature.
  • The mobile terminal may acquire notification information corresponding to the to-be-recognized voice feature.
  • The mobile terminal may acquire static information corresponding to the to-be-recognized voice feature, or it may acquire dynamic information corresponding to the to-be-recognized voice feature.
  • The static information may include: text information of the first user; the dynamic information may include: picture information and location information of the first user.
  • When the mobile terminal acquires the text information of the first user corresponding to the to-be-recognized voice feature, it may acquire text information of the first user that was saved in advance; when the mobile terminal acquires the picture information of the first user corresponding to the to-be-recognized voice feature, it may acquire the picture information by using a preset camera module; when the mobile terminal acquires the location information of the first user corresponding to the to-be-recognized voice feature, it may obtain the location information by using a preset positioning module.
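  • The split between static and dynamic notification information might look like the sketch below. The `Notification` container, the `kind` argument, and the `capture_picture` and `read_location` callables (standing in for the camera module and the positioning module) are all hypothetical names introduced for the example.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class Notification:
    text: Optional[str] = None                      # static: pre-saved text
    picture: Optional[bytes] = None                 # dynamic: captured on demand
    location: Optional[Tuple[float, float]] = None  # dynamic: (latitude, longitude)


def acquire_notification(
    kind: str,
    saved_text: str,
    capture_picture: Callable[[], bytes],
    read_location: Callable[[], Tuple[float, float]],
) -> Notification:
    """Build the notification information for a triggered voice feature."""
    if kind == "static":
        # Static information is simply read back from what was saved in advance.
        return Notification(text=saved_text)
    if kind == "dynamic":
        # Dynamic information is gathered at trigger time from device modules.
        return Notification(picture=capture_picture(), location=read_location())
    raise ValueError(f"unknown notification kind: {kind!r}")


# Stub modules standing in for the camera and the positioning module.
static_note = acquire_notification(
    "static", "I will call you back.", lambda: b"", lambda: (0.0, 0.0)
)
dynamic_note = acquire_notification(
    "dynamic", "", lambda: b"\x89PNG", lambda: (22.54, 114.06)
)
```

  • Passing the camera and positioning modules in as callables keeps the sketch testable without real hardware.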
  • Step 204: Send the notification information to the second user.
  • The mobile terminal may send the notification information to the second user.
  • The first user may set, in advance in the mobile terminal, the second user who is to receive the notification information. Therefore, after the mobile terminal obtains the notification information, it can send the notification information to the second user.
  • Step 205: End the information sending flow.
  • The mobile terminal may end the information sending flow.
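  • Taken together, steps 201 to 205 form a short pipeline, sketched below. Each stage is passed in as a callable so that no particular recognition or messaging technology is implied; the function name `send_information`, its parameters, and the stub data are assumptions for illustration only.

```python
def send_information(voice_info, extract_feature, meets_trigger,
                     acquire_notification, send_to_second_user):
    """Run steps 201 to 205 once; return True if a notification was sent."""
    # Step 201: acquire the to-be-recognized voice feature.
    feature = extract_feature(voice_info)
    # Step 202: check the preset trigger condition.
    if not meets_trigger(feature):
        # Step 205: end the flow without sending anything.
        return False
    # Step 203: acquire the notification information for this feature.
    notification = acquire_notification(feature)
    # Step 204: send it to the second user configured in advance.
    send_to_second_user(notification)
    return True


# Hypothetical stand-ins for the terminal's modules.
sent = []
saved_notifications = {"help me": "I need assistance, please call back."}


def run(voice_info):
    return send_information(
        voice_info,
        extract_feature=lambda info: info.strip().lower(),
        meets_trigger=lambda feature: feature in saved_notifications,
        acquire_notification=lambda feature: saved_notifications[feature],
        send_to_second_user=sent.append,
    )


triggered = run("  Help me ")
ignored = run("good morning")
```

  • In this sketch the second user only ever receives something when the spoken input matches a pre-saved feature, which mirrors the trigger-based sending the method describes.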
  • When the mobile terminal cannot directly send the first user's notification information to the second user, it can, by setting the to-be-recognized speech feature as a trigger condition, send the notification information corresponding to that speech feature to the second user. In the prior art, by contrast, the mobile terminal can only directly send the notification information of the first user to the second user. The information sending method provided by the embodiment of the present invention therefore enables the mobile terminal to send the first user's notification information to the second user, ensuring that the second user can receive the notification information sent by the first user in time; it is also simple and convenient to implement, easy to popularize, and widely applicable.
  • FIG. 5 is a schematic structural diagram of an information sending apparatus according to an embodiment of the present invention.
  • The information sending apparatus includes: an acquiring unit 501, a determining unit 502, and a sending unit 503;
  • the acquiring unit 501 is configured to acquire a to-be-recognized voice feature corresponding to the first user's to-be-identified voice information, and send the to-be-identified voice feature to the determining unit 502;
  • the determining unit 502 is configured to determine whether the to-be-recognized voice feature meets a preset triggering condition, and when the to-be-recognized voice feature satisfies a preset triggering condition, send an acquiring instruction to the acquiring unit 501;
  • the acquiring unit 501 is further configured to: after receiving the acquisition instruction, acquire notification information corresponding to the to-be-recognized voice feature, and send the notification information to the sending unit 503;
  • the sending unit 503 is configured to send the notification information to the second user.
  • the device further includes: an authentication unit 504;
  • the acquiring unit 501 is further configured to acquire a voiceprint feature to be recognized corresponding to the to-be-identified voice information, and send the to-be-identified voiceprint feature to the authentication unit 504;
  • the authentication unit 504 is configured to perform identity authentication on the first user according to the voiceprint feature to be identified; when the authentication is passed, send an instruction to pass the authentication to the acquiring unit 501;
  • the acquiring unit 501 is further configured to acquire the to-be-recognized voice feature after receiving the instruction that the authentication is passed.
  • the determining unit 502 is configured to determine whether the to-be-recognized speech feature matches a pre-saved speech feature; when the matching is successful, determine that the to-be-recognized speech feature satisfies the trigger condition; When the matching fails, it is determined that the to-be-identified speech feature does not satisfy the trigger condition.
  • the acquiring unit 501 is configured to acquire static information corresponding to the to-be-identified speech feature, or acquire dynamic information corresponding to the to-be-recognized speech feature.
  • the static information includes: text information of the first user;
  • the dynamic information includes: picture information and location information of the first user.
  • In practical applications, the acquiring unit 501, the determining unit 502, the sending unit 503, and the authentication unit 504 may each be implemented by a central processing unit (CPU), a microprocessor unit (MPU), a digital signal processor (DSP), or a field-programmable gate array (FPGA) located in the mobile terminal.
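  • The cooperation of the four units can be sketched in software as follows. This is a minimal sketch under stated assumptions: the class and method names are hypothetical, the voice information is modelled as a plain dictionary, and exact string equality stands in for real voiceprint and voice-feature matching.

```python
class AuthenticationUnit:  # corresponds to unit 504
    def __init__(self, enrolled_voiceprint: str):
        self.enrolled_voiceprint = enrolled_voiceprint

    def authenticate(self, voiceprint: str) -> bool:
        # Placeholder for real voiceprint matching.
        return voiceprint == self.enrolled_voiceprint


class DeterminingUnit:  # corresponds to unit 502
    def __init__(self, saved_feature: str):
        self.saved_feature = saved_feature

    def satisfies_trigger(self, feature: str) -> bool:
        # Placeholder for real voice-feature matching.
        return feature == self.saved_feature


class SendingUnit:  # corresponds to unit 503
    def __init__(self):
        self.outbox = []  # records (recipient, notification) pairs

    def send(self, notification: str, second_user: str) -> None:
        self.outbox.append((second_user, notification))


class AcquiringUnit:  # corresponds to unit 501
    def __init__(self, notification: str):
        self.notification = notification

    def voiceprint_of(self, voice_info: dict) -> str:
        return voice_info["voiceprint"]

    def feature_of(self, voice_info: dict) -> str:
        return voice_info["feature"]

    def notification_for(self, feature: str) -> str:
        return self.notification


class InformationSendingApparatus:
    def __init__(self, acquiring, determining, sending, authentication, second_user):
        self.acquiring = acquiring
        self.determining = determining
        self.sending = sending
        self.authentication = authentication
        self.second_user = second_user

    def handle(self, voice_info: dict) -> bool:
        """Return True only if a notification was sent."""
        voiceprint = self.acquiring.voiceprint_of(voice_info)
        if not self.authentication.authenticate(voiceprint):
            return False  # authentication failed: no voice feature is acquired
        feature = self.acquiring.feature_of(voice_info)
        if not self.determining.satisfies_trigger(feature):
            return False  # trigger condition not met: flow ends
        notification = self.acquiring.notification_for(feature)
        self.sending.send(notification, self.second_user)
        return True
```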
  • With the information sending apparatus proposed in the embodiment of the present invention, when the mobile terminal cannot send the notification information of the first user directly to the second user, it can send the notification information corresponding to the to-be-recognized voice feature to the second user by using that voice feature as a trigger condition. In the prior art, by contrast, the mobile terminal can only send the notification information of the first user to the second user directly. Compared with the prior art, the apparatus therefore enables the mobile terminal to send the notification information of the first user to the second user, ensuring that the second user receives it in time; moreover, it is simple and convenient to implement, easy to popularize, and widely applicable.
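  • An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the information sending method described in the foregoing embodiments. That is, when the computer-executable instructions are executed by a processor, the information sending method provided by any one of the foregoing technical solutions can be implemented.
  • As an implementation, an embodiment of the present invention further provides a computer storage medium storing one or more programs, where the one or more programs can be executed by one or more processors to implement the following steps: acquiring a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user; determining whether the to-be-recognized voice feature satisfies a preset trigger condition; when it does, acquiring notification information corresponding to the to-be-recognized voice feature; and sending the notification information to a second user.
  • As an implementation, before the step of acquiring the to-be-recognized voice feature is performed, the one or more programs can further be executed by the one or more processors to implement the following steps: acquiring a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information; performing identity authentication on the first user according to the to-be-recognized voiceprint feature; and, when the authentication passes, acquiring the to-be-recognized voice feature.
  • As an implementation, when the step of determining whether the to-be-recognized voice feature satisfies the preset trigger condition is performed, the one or more programs can further be executed by the one or more processors to implement the following steps: determining whether the to-be-recognized voice feature matches a pre-saved voice feature; when the match succeeds, determining that the to-be-recognized voice feature satisfies the trigger condition; and when the match fails, determining that it does not.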
  • Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage and optical storage) containing computer-usable program code.
  • These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction apparatus, the instruction apparatus implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
  • These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
  • The technical solution of the embodiment of the present invention first acquires a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user; when the to-be-recognized voice feature satisfies a preset trigger condition, it acquires notification information corresponding to that voice feature; and finally it sends the notification information to a second user. In this way, the mobile terminal can send the notification information of the first user to the second user, so that the second user receives it in time; moreover, the solution is simple and convenient to implement, easy to popularize, and widely applicable.

Abstract

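An information sending method and apparatus, and a computer storage medium. The method includes: acquiring a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user (201); determining whether the to-be-recognized voice feature satisfies a preset trigger condition (202); when the to-be-recognized voice feature satisfies the preset trigger condition, acquiring notification information corresponding to the to-be-recognized voice feature (203); and sending the notification information to a second user (204).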

Description

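Information sending method and apparatus, and computer storage medium
Cross-Reference to Related Applications
This application is based on, and claims priority to, Chinese patent application No. 201610450969.8, filed on June 21, 2016, the entire contents of which are incorporated herein by reference.
Technical Field
The present invention relates to information management technology in the field of mobile communications, and in particular to an information sending method and apparatus, and a computer storage medium.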
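Background
With the continuous development of science and technology, artificial intelligence, as a branch of computer science, is devoted to producing a new kind of intelligent machine that can respond in a manner similar to human intelligence. Research in this field includes robotics, speech recognition, voiceprint recognition (VPR), image recognition, and natural language processing.
Speech recognition takes speech signals as its research object and is an important research direction of artificial intelligence; its ultimate goal is natural-language communication between humans and machines. Generally speaking, there are three kinds of speech recognition methods: methods based on vocal tract models and speech knowledge, template matching methods, and methods using artificial neural networks.
The production of human speech is a complex physiological and physical process between the language centers of the human body and the vocal organs. The vocal organs that each person uses when speaking differ greatly in size and shape, so the voiceprint spectrograms of any two people differ. Each person's acoustic characteristics are both relatively stable and variable. Such variation may come from physiology, pathology, psychology, imitation, or disguise, and may also be related to environmental interference. Nevertheless, because everyone's vocal organs differ, it is generally still possible to distinguish the voices of different people or to determine whether a given voice comes from the same person. A voiceprint is the sound-wave spectrum, carrying speech information, that is displayed by an electroacoustic instrument. Voiceprint recognition is a kind of biometric technology, also known as speaker recognition, that extracts voiceprint information from the speech signal uttered by a speaker. Speaker recognition comprises two categories: speaker identification and speaker verification. Speaker identification determines which of several people spoke a given segment of speech, which is a "one-out-of-many" problem; speaker verification confirms whether a given segment of speech was spoken by a specified person, which is a "one-to-one decision" problem. Voiceprint recognition mainly includes speech signal processing, voiceprint feature extraction, voiceprint modeling, voiceprint comparison, and decision making.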
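The difference between speaker identification and speaker verification can be illustrated with a small sketch. It assumes, purely for illustration, that each voiceprint has already been reduced to a fixed-length numeric vector and that cosine similarity is an acceptable comparison; the function names and the 0.9 threshold are hypothetical and are not taken from this document.

```python
import math
from typing import Dict, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def identify_speaker(probe: Sequence[float],
                     enrolled: Dict[str, Sequence[float]]) -> str:
    """One-out-of-many: return the enrolled speaker most similar to the probe."""
    return max(enrolled, key=lambda name: cosine_similarity(probe, enrolled[name]))


def verify_speaker(probe: Sequence[float],
                   claimed: Sequence[float],
                   threshold: float = 0.9) -> bool:
    """One-to-one: accept only if the probe is close enough to the claimed speaker."""
    return cosine_similarity(probe, claimed) >= threshold
```

Identification always returns one of the enrolled speakers, whereas verification can reject the probe outright, which is why the identity authentication described later in this document is a verification problem.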
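In the course of implementing the present invention, the inventor found that the prior art has at least the following problem:
A mobile terminal can use an existing speech recognition method to recognize the voice information that a first user produces during a call, and then directly send the notification information obtained by speech recognition of that voice information to a second user. However, in certain special scenarios, the mobile terminal cannot send the first user's notification information directly to the second user, and existing information sending methods offer no way of sending the first user's notification information to the second user indirectly.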
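Summary
To solve the existing technical problem, embodiments of the present invention provide an information sending method and apparatus, and a computer storage medium, which enable a mobile terminal to send notification information of a first user to a second user, thereby ensuring that the second user can receive the notification information sent by the first user in time.
To achieve the above objective, the technical solutions of the embodiments of the present invention are implemented as follows:
In a first aspect, an embodiment of the present invention provides an information sending method, the method including:
acquiring a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user;
determining whether the to-be-recognized voice feature satisfies a preset trigger condition;
when the to-be-recognized voice feature satisfies the preset trigger condition, acquiring notification information corresponding to the to-be-recognized voice feature;
sending the notification information to a second user.
In an embodiment, before the acquiring of the to-be-recognized voice feature corresponding to the to-be-recognized voice information of the first user, the method further includes:
acquiring a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information;
performing identity authentication on the first user according to the to-be-recognized voiceprint feature;
when the authentication passes, acquiring the to-be-recognized voice feature.
In an embodiment, the determining of whether the to-be-recognized voice feature satisfies the preset trigger condition includes:
determining whether the to-be-recognized voice feature matches a pre-saved voice feature;
when the match succeeds, determining that the to-be-recognized voice feature satisfies the trigger condition;
when the match fails, determining that the to-be-recognized voice feature does not satisfy the trigger condition.
In an embodiment, the acquiring of the notification information corresponding to the to-be-recognized voice feature includes:
acquiring static information corresponding to the to-be-recognized voice feature;
or, acquiring dynamic information corresponding to the to-be-recognized voice feature.
In an embodiment, the static information includes: text information of the first user; and the dynamic information includes: picture information and location information of the first user.
In a second aspect, an embodiment of the present invention further provides an information sending apparatus, the apparatus including: an acquiring unit, configured to acquire a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user, and to send the to-be-recognized voice feature to a determining unit;
the determining unit, configured to determine whether the to-be-recognized voice feature satisfies a preset trigger condition and, when it does, to send an acquisition instruction to the acquiring unit;
the acquiring unit, further configured to acquire, after receiving the acquisition instruction, notification information corresponding to the to-be-recognized voice feature, and to send the notification information to a sending unit;
the sending unit, configured to send the notification information to a second user.
In an embodiment, the apparatus further includes: an authentication unit;
the acquiring unit is further configured to acquire a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information, and to send the to-be-recognized voiceprint feature to the authentication unit;
the authentication unit is configured to perform identity authentication on the first user according to the to-be-recognized voiceprint feature and, when the authentication passes, to send an authentication-passed instruction to the acquiring unit;
the acquiring unit is further configured to acquire the to-be-recognized voice feature after receiving the authentication-passed instruction.
In an embodiment, the determining unit is specifically configured to determine whether the to-be-recognized voice feature matches a pre-saved voice feature; when the match succeeds, to determine that the to-be-recognized voice feature satisfies the trigger condition; and when the match fails, to determine that the to-be-recognized voice feature does not satisfy the trigger condition.
In an embodiment, the acquiring unit is specifically configured to acquire static information corresponding to the to-be-recognized voice feature, or to acquire dynamic information corresponding to the to-be-recognized voice feature.
In an embodiment, the static information includes: text information of the first user; and the dynamic information includes: picture information and location information of the first user.
In a third aspect, an embodiment of the present invention further provides a computer storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the information sending method described in the embodiments of the present invention.
It can be seen that, in the technical solutions of the embodiments of the present invention, a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user is acquired first; when the to-be-recognized voice feature satisfies a preset trigger condition, notification information corresponding to that voice feature is acquired; and finally the notification information is sent to a second user. That is, when the mobile terminal cannot send the first user's notification information directly to the second user, it can send the notification information corresponding to the to-be-recognized voice feature to the second user by using that voice feature as a trigger condition, whereas in the prior art the mobile terminal can only send the first user's notification information to the second user directly. Compared with the prior art, the information sending method and apparatus proposed in the embodiments of the present invention therefore enable the mobile terminal to send the first user's notification information to the second user, ensuring that the second user receives it in time; moreover, the solution is simple and convenient to implement, easy to popularize, and widely applicable.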
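Brief Description of the Drawings
FIG. 1 is a schematic diagram of the hardware structure of a mobile terminal implementing the embodiments of the present invention;
FIG. 2 is a schematic flowchart of an information sending method in an embodiment of the present invention;
FIG. 3 is a schematic flowchart of a method for performing identity authentication on a first user in an embodiment of the present invention;
FIG. 4 is a schematic flowchart of a method for determining whether a trigger condition is satisfied in an embodiment of the present invention;
FIG. 5 is a schematic structural diagram of an information sending apparatus in an embodiment of the present invention.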
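Detailed Description
It should be understood that the specific embodiments described here serve only to explain the technical solutions of the present invention and are not intended to limit its scope of protection.
A mobile terminal implementing the embodiments of the present invention will now be described with reference to the accompanying drawings. In the following description, suffixes such as "module", "component", or "unit" used to denote elements serve only to facilitate the description of the present invention and have no specific meaning in themselves. Therefore, "module" and "component" may be used interchangeably.
A mobile terminal may be implemented in various forms. For example, the terminals described in the present invention may include mobile terminals such as mobile phones, smartphones, notebook computers, digital broadcast receivers, personal digital assistants (PDA, Personal Digital Assistant), tablet computers (PAD, Portable Android Device), portable multimedia players (PMP, Portable Media Player), and navigation devices, as well as fixed terminals such as digital televisions (TV, Television) and desktop computers. In the following, the terminal is assumed to be a mobile terminal. However, those skilled in the art will understand that, apart from elements used specifically for mobile purposes, the configurations according to the embodiments of the present invention can also be applied to fixed terminals.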
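The implementation of the technical solution is described in further detail below with reference to the accompanying drawings. FIG. 1 is a schematic diagram of the hardware structure of a mobile terminal implementing the embodiments of the present invention. As shown in FIG. 1, the mobile terminal 100 may include: a communication unit 110, a storage unit 120, a recognition unit 130, an output unit 140, an input unit 150, a power supply unit 160, an interface unit 170, a controller 180, and so on.
The communication unit 110 generally includes one or more modules that allow the mobile terminal 100 to communicate wirelessly with a wireless communication system or a wireless communication network. For example, the communication unit 110 may include at least one of a mobile communication module 111, an Internet communication module 112, and a location information module 113.
The mobile communication module 111 sends radio signals to, and/or receives radio signals from, at least one of a base station, an external terminal, and a server. The radio signals may include voice call signals, video call signals, or various types of data sent and/or received in the form of text and/or multimedia messages.
The Internet communication module 112 supports wireless Internet access for the mobile terminal 100 and may be coupled to the terminal internally or externally. The wireless Internet access technologies involved may include: wireless local area network (WLAN, Wireless Local Area Networks), Wi-Fi (Wireless Fidelity), wireless broadband (WiBro, Wireless Broadband access service), worldwide interoperability for microwave access (WiMAX, Worldwide Interoperability for Microwave Access), high speed downlink packet access (HSDPA, High Speed Downlink Packet Access), and the like.
The location information module 113 is a module configured to check or acquire the location information of the mobile terminal 100. A typical example of the location information module 113 is the Global Positioning System (GPS). GPS can compute distance information and accurate time information from three or more satellites, and can thereby accurately compute the location of the mobile terminal 100. It is worth noting that the location information module 113 may include, but is not limited to: GPS, assisted GPS (A-GPS), the BeiDou positioning system, the Global Navigation Satellite System (GLONASS), and the Galileo satellite navigation system (Galileo).
The storage unit 120 may store software programs for the processing and control operations executed by the controller 180, or may temporarily store data that has been output or is to be output (for example, a phone book, messages, still images, and videos). Moreover, the storage unit 120 may store data on the various patterns of vibration and audio signals that are output when a touch is applied to the touch screen.
The storage unit 120 may include at least one type of storage medium, which may include: flash memory, a hard disk, a multimedia card, card-type memory (for example, SD or DX memory), random access memory (RAM), static random access memory (SRAM), read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM), programmable read-only memory (PROM), magnetic memory, a magnetic disk, an optical disc, and the like. Moreover, the mobile terminal 100 may cooperate with a network storage device that performs the storage function of the storage unit 120 over a network connection.
The recognition unit 130 includes a speech recognition module 131 and a voiceprint recognition module 132, where:
The speech recognition module 131 is configured to acquire voice information during the user's call, to recognize the user's voice information using techniques based on vocal tract models, template matching, and artificial neural networks, and to convert it into computer-readable input information.
The voiceprint recognition module 132 is configured to convert the user's voice characteristics, by means of voiceprint recognition technology, into a format that the apparatus can recognize, for example a Message Digest Algorithm 5 (MD5) identification code. During a call, the user's voice characteristics can be acquired and converted into this format so that they can be matched against a pre-saved voiceprint feature, in order to confirm whether the user is the genuine, legitimate user.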
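One way such an MD5 identification code might be derived is sketched below. This is an assumption-laden illustration, not the method of this document: it supposes that the voiceprint is available as a list of floating-point features and quantizes them on a hypothetical grid (`step`) before hashing, because a digest changes completely when its input changes even slightly.

```python
import hashlib
import struct
from typing import Sequence


def voiceprint_identifier(features: Sequence[float], step: float = 0.1) -> str:
    """Quantize the features, then return their MD5 digest as a hex string."""
    # Map each feature to an integer grid index so that small measurement
    # differences fall into the same bucket.
    quantized = [int(round(value / step)) for value in features]
    # Pack as little-endian 32-bit signed integers for a stable byte layout.
    packed = struct.pack(f"<{len(quantized)}i", *quantized)
    return hashlib.md5(packed).hexdigest()
```

Quantization only tolerates small differences that stay inside one bucket, so a value near a bucket boundary can still yield a different code. MD5 is used here only because the text names it; it serves as a compact identifier, not as a collision-resistant hash.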
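The output unit 140 may provide output signals (for example, sound, light, electrical, and vibration signals) in visual, audio, tactile, and other forms. The output unit 140 may include a display module 141 and an audio output module 142.
The display module 141 may display information processed in the mobile terminal 100. For example, when the mobile terminal 100 is in standby mode, the display module 141 may display the user interface (UI) and graphical user interface (GUI) related to standby mode. When the mobile terminal 100 is in playback mode, the display module 141 may present the video stream together with the UI and GUI for operating it; when the mobile terminal 100 is in call mode, the display module 141 may provide a dial pad and the UI and GUI for call information, for the user's reference and operation. When the display module 141 and a touch pad are superimposed on each other as layers to form a touch screen, the display module 141 can serve as both an input device and an output device. The display module 141 may include at least one of a liquid crystal display (LCD), a thin film transistor LCD (TFT-LCD), an organic light-emitting diode (OLED) display, a flexible display, and the like. Depending on the particular implementation, the mobile terminal 100 may include two or more display modules 141.
The audio output module 142 may, when the mobile terminal 100 is in a mode such as call signal reception mode, call mode, recording mode, speech recognition mode, or broadcast reception mode, convert audio data received by the communication unit 110 or stored in the storage unit 120 into an audio signal and output it as sound. Moreover, the audio output module 142 may provide audio output related to specific functions performed by the mobile terminal 100 (for example, a call signal reception sound or a message reception sound). The audio output module 142 may include a speaker, a buzzer, and the like.
The input unit 150 generally includes one or more modules that allow the mobile terminal 100 to receive signal input in various forms, including audio, video, and touch operations. The input unit 150 includes, but is not limited to:
A microphone 151, which can receive sound recorded while the mobile terminal 100 runs in a specific mode (for example, call mode, recording mode, or speech recognition mode) and can process the received sound into audio data; the processed audio data can be sent to a communication base station via the mobile communication module 111. The microphone 151 may implement various types of noise cancellation (or suppression) algorithms to cancel (or suppress) noise or interference generated while receiving and sending audio signals.
A user input module 152, which can generate, according to commands entered by the user, data for controlling the various operations of the mobile terminal 100. The user input module 152 allows the user to enter various types of information and may include a keyboard, a mouse, a trackball, a joystick, a touch pad, and the like. In particular, when the touch pad is superimposed as a layer on the display module 141, a touch screen can be formed.
A camera 153, which can process the image data of still pictures or video obtained by an image capture device in video capture mode or image capture mode. The processed image frames can be displayed on the display module 141, and can be stored in the storage unit 120 (or another storage medium) or sent via the communication unit 110.
A sensor module 154, which is responsible for detecting the surroundings and the state of the mobile terminal 100, and which includes, but is not limited to: an accelerometer, a magnetic sensor, an orientation sensor, a gyroscope, a light sensor, a pressure sensor, a temperature sensor, a proximity sensor, a gravity sensor, a linear acceleration sensor, and a rotation vector sensor. This module can detect information such as acceleration, rotation, light intensity, and magnetic field, and generate instructions or signals.
The power supply unit 160 receives external or internal power under the control of the controller 180 and provides the appropriate power required to operate the elements and components.
The interface unit 170 provides the means by which external devices connect to the mobile terminal 100. For example, the external devices may include: wired (or wireless) headset ports, audio input/output (I/O) ports, memory card ports, charger interfaces, video I/O ports, wearable devices, and interfaces for mobile communication cards such as SIM cards. The interface unit 170 may be used to receive input (for example, data streams or electric current) from an external device and to transfer the received data into the mobile terminal 100.
The controller 180 generally controls the overall operation of the mobile terminal 100. For example, the controller 180 performs the control and processing related to voice calls, data communication, video calls, and the like. The controller 180 may perform pattern recognition processing to recognize handwriting input or picture drawing input performed on the touch screen as characters or images.
The various implementations described here may be implemented in a computer-readable medium using, for example, computer software, hardware, or any combination thereof. For hardware implementation, the implementations described here may be realized using at least one of an application-specific integrated circuit (ASIC), a digital signal processor (DSP), a digital signal processing device (DSPD), a programmable logic device (PLD), a field-programmable gate array (FPGA), a processor, a controller, a microcontroller, a microprocessor, and an electronic unit designed to perform the functions described here; in some cases, such implementations may be realized in the controller 180. For software implementation, implementations such as procedures or functions may be realized with separate software modules, each of which performs at least one function or operation. The software code may be implemented by a software application (or program) written in any suitable programming language, and may be stored in the storage unit 120 and executed by the controller 180.
Thus far, the mobile terminal 100 has been described in terms of its functions.
Based on the above hardware structure of the mobile terminal, the embodiments of the information sending method of the present invention are proposed.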
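FIG. 2 is a schematic flowchart of the information sending method in an embodiment of the present invention. As shown in FIG. 2, the information sending method may include the following steps:
Step 201: Acquire a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user.
In a specific embodiment of the present invention, when the first user enters to-be-recognized voice information into the mobile terminal, the mobile terminal may first collect that voice information and then acquire the to-be-recognized voice feature corresponding to it. As an implementation, the mobile terminal may use existing speech recognition technology to acquire the to-be-recognized voice feature corresponding to the first user's to-be-recognized voice information.
Preferably, in a specific embodiment of the present invention, the mobile terminal may also perform identity authentication on the first user before acquiring the to-be-recognized voice feature. As an implementation, before acquiring the to-be-recognized voice feature, the mobile terminal may first acquire the to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information and then authenticate the first user according to that voiceprint feature. FIG. 3 is a schematic flowchart of the method for performing identity authentication on the first user in an embodiment of the present invention. As shown in FIG. 3, the method may include the following steps:
Step 201a: Determine whether the to-be-recognized voiceprint feature matches a pre-saved voiceprint feature; when the match succeeds, perform step 201b; when the match fails, perform step 201c.
In this step, the mobile terminal may determine whether the to-be-recognized voiceprint feature matches the pre-saved voiceprint feature. As an implementation, the mobile terminal may use an existing voiceprint recognition method to perform the match; for example, it may use the template matching method, the nearest neighbor method, the neural network method, the hidden Markov model (HMM) method, the vector quantization (VQ) clustering method, or the polynomial classifier method from existing voiceprint recognition. When the match succeeds, step 201b is performed; when the match fails, step 201c is performed.
Step 201b: Determine that the first user passes identity authentication, and end this determination flow.
In this step, when the match succeeds, the mobile terminal may determine that the first user passes identity authentication; that is, the mobile terminal may acquire the to-be-recognized voice feature corresponding to the first user's to-be-recognized voice information. This determination flow then ends.
Step 201c: Determine that the first user does not pass identity authentication.
In this step, when the match fails, the mobile terminal may determine that the first user does not pass identity authentication; that is, the mobile terminal will not acquire the to-be-recognized voice feature corresponding to the first user's to-be-recognized voice information.
As can be seen from the above description, through steps 201a to 201c the mobile terminal can authenticate the first user according to the first user's voiceprint feature. Only when the authentication passes does the mobile terminal acquire the to-be-recognized voice feature corresponding to the to-be-recognized voice information; when the authentication fails, it does not.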
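Steps 201a to 201c can be sketched as a gate in front of feature extraction. The sketch below is illustrative only: the function name and the three callables (voiceprint extraction, voice-feature extraction, and voiceprint matching) are hypothetical placeholders for whatever recognition method the terminal actually uses.

```python
from typing import Callable, Optional, TypeVar

V = TypeVar("V")  # type of the raw voice information
P = TypeVar("P")  # type of a voiceprint feature
F = TypeVar("F")  # type of a voice feature


def authenticate_then_extract(
    voice_info: V,
    extract_voiceprint: Callable[[V], P],
    extract_voice_feature: Callable[[V], F],
    voiceprint_matches: Callable[[P], bool],
) -> Optional[F]:
    """Return the voice feature if authentication passes, otherwise None."""
    voiceprint = extract_voiceprint(voice_info)
    if voiceprint_matches(voiceprint):
        # Steps 201a -> 201b: match succeeded, so the feature is acquired.
        return extract_voice_feature(voice_info)
    # Steps 201a -> 201c: match failed, so the feature is never extracted.
    return None
```

Returning `None` without calling `extract_voice_feature` mirrors the statement that the terminal does not acquire the voice feature when authentication fails.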
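Step 202: Determine whether the to-be-recognized voice feature satisfies a preset trigger condition; when the trigger condition is satisfied, perform step 203; when it is not satisfied, perform step 205.
In this step, the mobile terminal may determine whether the to-be-recognized voice feature satisfies the preset trigger condition. When it does, step 203 is performed; when it does not, step 205 is performed.
FIG. 4 is a schematic flowchart of the method for determining whether the trigger condition is satisfied in an embodiment of the present invention. As shown in FIG. 4, the method for determining whether the to-be-recognized voice feature satisfies the trigger condition may include the following steps:
Step 202a: Determine whether the to-be-recognized voice feature matches a pre-saved voice feature; when the match succeeds, perform step 202b; when the match fails, perform step 202c.
In this step, the mobile terminal may determine whether the to-be-recognized voice feature matches the pre-saved voice feature. As an implementation, the mobile terminal may use an existing speech recognition method to match the to-be-recognized voice feature against the pre-saved voice feature. When the match succeeds, step 202b is performed; when the match fails, step 202c is performed.
Step 202b: Determine that the to-be-recognized voice feature satisfies the trigger condition, and end this determination flow.
In this step, when the match succeeds, the mobile terminal may determine that the to-be-recognized voice feature satisfies the trigger condition. That is, the to-be-recognized voice feature can trigger the mobile terminal to acquire the notification information corresponding to it. This determination flow then ends.
Step 202c: Determine that the to-be-recognized voice feature does not satisfy the trigger condition.
In this step, when the match fails, the mobile terminal may determine that the to-be-recognized voice feature does not satisfy the trigger condition. That is, the to-be-recognized voice feature will not trigger the mobile terminal to acquire the notification information corresponding to it.
As can be seen from the above description, through steps 202a to 202c the mobile terminal can determine whether the to-be-recognized voice feature satisfies the preset trigger condition. Only when the trigger condition is satisfied does the mobile terminal acquire the notification information corresponding to the to-be-recognized voice feature; when it is not satisfied, the mobile terminal does not acquire that notification information.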
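The matching decision of steps 202a to 202c can be illustrated with a toy distance test. This sketch assumes, for illustration only, that voice features are fixed-length numeric vectors and that a Euclidean distance within a hypothetical `tolerance` counts as a match; the document itself leaves the matching method to existing speech recognition techniques.

```python
import math
from typing import Iterable, Sequence


def satisfies_trigger(
    feature: Sequence[float],
    saved_features: Iterable[Sequence[float]],
    tolerance: float = 0.5,
) -> bool:
    """Return True if the feature is close enough to any pre-saved feature."""
    for saved in saved_features:
        # Vectors of different lengths cannot be compared, so they never match.
        if len(saved) == len(feature) and math.dist(feature, saved) <= tolerance:
            return True   # step 202b: trigger condition satisfied
    return False          # step 202c: trigger condition not satisfied
```

`math.dist` requires Python 3.8 or later.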
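Step 203: Acquire the notification information corresponding to the to-be-recognized voice feature.
In a specific embodiment of the present invention, when the to-be-recognized voice feature satisfies the preset trigger condition, the mobile terminal may acquire the notification information corresponding to it. As an implementation, the mobile terminal may acquire static information corresponding to the to-be-recognized voice feature, or it may acquire dynamic information corresponding to the to-be-recognized voice feature. The static information may include: text information of the first user; the dynamic information may include: picture information and location information of the first user.
As an implementation, when the mobile terminal acquires the text information of the first user corresponding to the to-be-recognized voice feature, it may retrieve the text information of the first user saved in advance; when it acquires the picture information of the first user corresponding to the to-be-recognized voice feature, it may capture the picture information through a preset camera module; and when it acquires the location information of the first user corresponding to the to-be-recognized voice feature, it may obtain the location information through a preset positioning module.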
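Step 204: Send the notification information to the second user.
In a specific embodiment of the present invention, after acquiring the notification information corresponding to the to-be-recognized voice feature, the mobile terminal may send the notification information to the second user. As an implementation, the first user may designate in advance, in the mobile terminal, the second user who is to receive the notification information; therefore, once the mobile terminal has acquired the notification information, it can send it to that second user.
Step 205: End the information sending flow.
In a specific embodiment of the present invention, when the to-be-recognized voice feature does not satisfy the preset trigger condition, the mobile terminal may end the information sending flow.
With the information sending method proposed in the embodiment of the present invention, when the mobile terminal cannot send the first user's notification information directly to the second user, it can send the notification information corresponding to the to-be-recognized voice feature to the second user by using that voice feature as a trigger condition, whereas in the prior art the mobile terminal can only send the first user's notification information to the second user directly. Compared with the prior art, the method therefore enables the mobile terminal to send the first user's notification information to the second user, ensuring that the second user receives it in time; moreover, it is simple and convenient to implement, easy to popularize, and widely applicable.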
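Taken together, steps 201 to 205 form a short pipeline, sketched below. The function name `send_information` and all of its keyword parameters are hypothetical; each callable stands in for one step of FIG. 2, and the optional authentication of FIG. 3 is omitted for brevity.

```python
from typing import Any, Callable


def send_information(
    voice_info: Any,
    *,
    extract_feature: Callable[[Any], Any],
    satisfies_trigger: Callable[[Any], bool],
    acquire_notification: Callable[[Any], Any],
    send: Callable[[str, Any], None],
    second_user: str,
) -> bool:
    """Run steps 201-205; return True only if a notification was sent."""
    feature = extract_feature(voice_info)         # step 201
    if not satisfies_trigger(feature):            # step 202
        return False                              # step 205: end the flow
    notification = acquire_notification(feature)  # step 203
    send(second_user, notification)               # step 204
    return True
```

Because the recipient is fixed in advance (`second_user`), the first user only has to speak the trigger phrase; nothing in the utterance needs to name the recipient or the message.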
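FIG. 5 is a schematic structural diagram of the information sending apparatus in an embodiment of the present invention. As shown in FIG. 5, the information sending apparatus includes: an acquiring unit 501, a determining unit 502, and a sending unit 503, where:
the acquiring unit 501 is configured to acquire a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user, and to send the to-be-recognized voice feature to the determining unit 502;
the determining unit 502 is configured to determine whether the to-be-recognized voice feature satisfies a preset trigger condition and, when it does, to send an acquisition instruction to the acquiring unit 501;
the acquiring unit 501 is further configured to acquire, after receiving the acquisition instruction, notification information corresponding to the to-be-recognized voice feature, and to send the notification information to the sending unit 503;
the sending unit 503 is configured to send the notification information to a second user.
As an implementation, the apparatus further includes: an authentication unit 504;
the acquiring unit 501 is further configured to acquire a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information, and to send the to-be-recognized voiceprint feature to the authentication unit 504;
the authentication unit 504 is configured to perform identity authentication on the first user according to the to-be-recognized voiceprint feature and, when the authentication passes, to send an authentication-passed instruction to the acquiring unit 501;
the acquiring unit 501 is further configured to acquire the to-be-recognized voice feature after receiving the authentication-passed instruction.
As an implementation, the determining unit 502 is specifically configured to determine whether the to-be-recognized voice feature matches a pre-saved voice feature; when the match succeeds, to determine that the to-be-recognized voice feature satisfies the trigger condition; and when the match fails, to determine that the to-be-recognized voice feature does not satisfy the trigger condition.
As an implementation, the acquiring unit 501 is specifically configured to acquire static information corresponding to the to-be-recognized voice feature, or to acquire dynamic information corresponding to the to-be-recognized voice feature.
As an implementation, the static information includes: text information of the first user; and the dynamic information includes: picture information and location information of the first user.
In practical applications, the acquiring unit 501, the determining unit 502, the sending unit 503, and the authentication unit 504 may each be implemented by a central processing unit (CPU), a microprocessor unit (MPU), a DSP, or an FPGA located in the mobile terminal.
With the information sending apparatus proposed in the embodiment of the present invention, when the mobile terminal cannot send the first user's notification information directly to the second user, it can send the notification information corresponding to the to-be-recognized voice feature to the second user by using that voice feature as a trigger condition, whereas in the prior art the mobile terminal can only send the first user's notification information to the second user directly. Compared with the prior art, the apparatus therefore enables the mobile terminal to send the first user's notification information to the second user, ensuring that the second user receives it in time; moreover, it is simple and convenient to implement, easy to popularize, and widely applicable.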
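An embodiment of the present invention further provides a computer storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the information sending method described in the foregoing embodiments. That is, when the computer-executable instructions are executed by a processor, the information sending method provided by any one of the foregoing technical solutions can be implemented.
As an implementation, an embodiment of the present invention further provides a computer storage medium storing one or more programs, where the one or more programs can be executed by one or more processors to implement the following steps:
acquiring a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user;
determining whether the to-be-recognized voice feature satisfies a preset trigger condition;
when the to-be-recognized voice feature satisfies the preset trigger condition, acquiring notification information corresponding to the to-be-recognized voice feature;
sending the notification information to a second user.
As an implementation, before the step of acquiring the to-be-recognized voice feature corresponding to the to-be-recognized voice information of the first user is performed, the one or more programs can further be executed by the one or more processors to implement the following steps:
acquiring a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information;
performing identity authentication on the first user according to the to-be-recognized voiceprint feature;
when the authentication passes, acquiring the to-be-recognized voice feature.
As an implementation, when the step of determining whether the to-be-recognized voice feature satisfies the preset trigger condition is performed, the one or more programs can further be executed by the one or more processors to implement the following steps:
determining whether the to-be-recognized voice feature matches a pre-saved voice feature;
when the match succeeds, determining that the to-be-recognized voice feature satisfies the trigger condition;
when the match fails, determining that the to-be-recognized voice feature does not satisfy the trigger condition.
Those skilled in the art should understand that the functions of the programs in the computer storage medium of this embodiment can be understood with reference to the description of the information sending method in the foregoing embodiments.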
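Those skilled in the art should understand that embodiments of the present invention may be provided as a method, a system, or a computer program product. Accordingly, the present invention may take the form of a hardware embodiment, a software embodiment, or an embodiment combining software and hardware. Moreover, the present invention may take the form of a computer program product implemented on one or more computer-usable storage media (including but not limited to disk storage and optical storage) containing computer-usable program code.
The present invention is described with reference to flowcharts and/or block diagrams of the method, device (system), and computer program product according to embodiments of the present invention. It should be understood that each flow and/or block in the flowcharts and/or block diagrams, and combinations of flows and/or blocks in the flowcharts and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general-purpose computer, a special-purpose computer, an embedded processor, or another programmable data processing device to produce a machine, such that the instructions executed by the processor of the computer or other programmable data processing device produce an apparatus for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be stored in a computer-readable memory capable of directing a computer or other programmable data processing device to operate in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including an instruction apparatus, the instruction apparatus implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
These computer program instructions may also be loaded onto a computer or other programmable data processing device, such that a series of operational steps is performed on the computer or other programmable device to produce computer-implemented processing, whereby the instructions executed on the computer or other programmable device provide steps for implementing the functions specified in one or more flows of the flowcharts and/or one or more blocks of the block diagrams.
The above are merely preferred embodiments of the present invention and are not intended to limit its scope of protection. Any modification, equivalent replacement, or improvement made within the spirit and scope of the present invention shall fall within the scope of protection of the present invention.
Industrial Applicability
The technical solution of the embodiments of the present invention first acquires a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user; when the to-be-recognized voice feature satisfies a preset trigger condition, it acquires notification information corresponding to that voice feature; and finally it sends the notification information to a second user. In this way, the mobile terminal can send the first user's notification information to the second user, ensuring that the second user receives it in time; moreover, the solution is simple and convenient to implement, easy to popularize, and widely applicable.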

Claims (11)

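  1. An information sending method, the method comprising:
    acquiring a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user;
    determining whether the to-be-recognized voice feature satisfies a preset trigger condition;
    when the to-be-recognized voice feature satisfies the preset trigger condition, acquiring notification information corresponding to the to-be-recognized voice feature;
    sending the notification information to a second user.
  2. The method according to claim 1, wherein before the acquiring of the to-be-recognized voice feature corresponding to the to-be-recognized voice information of the first user, the method further comprises:
    acquiring a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information;
    performing identity authentication on the first user according to the to-be-recognized voiceprint feature;
    when the authentication passes, acquiring the to-be-recognized voice feature.
  3. The method according to claim 1, wherein the determining of whether the to-be-recognized voice feature satisfies the preset trigger condition comprises:
    determining whether the to-be-recognized voice feature matches a pre-saved voice feature;
    when the match succeeds, determining that the to-be-recognized voice feature satisfies the trigger condition;
    when the match fails, determining that the to-be-recognized voice feature does not satisfy the trigger condition.
  4. The method according to claim 1, wherein the acquiring of the notification information corresponding to the to-be-recognized voice feature comprises:
    acquiring static information corresponding to the to-be-recognized voice feature;
    or, acquiring dynamic information corresponding to the to-be-recognized voice feature.
  5. The method according to claim 4, wherein the static information comprises: text information of the first user; and the dynamic information comprises: picture information and location information of the first user.
  6. An information sending apparatus, the apparatus comprising:
    an acquiring unit, configured to acquire a to-be-recognized voice feature corresponding to to-be-recognized voice information of a first user, and to send the to-be-recognized voice feature to a determining unit;
    the determining unit, configured to determine whether the to-be-recognized voice feature satisfies a preset trigger condition and, when the to-be-recognized voice feature satisfies the preset trigger condition, to send an acquisition instruction to the acquiring unit;
    the acquiring unit, further configured to acquire, after receiving the acquisition instruction, notification information corresponding to the to-be-recognized voice feature, and to send the notification information to a sending unit;
    the sending unit, configured to send the notification information to a second user.
  7. The apparatus according to claim 6, wherein the apparatus further comprises: an authentication unit;
    the acquiring unit is further configured to acquire a to-be-recognized voiceprint feature corresponding to the to-be-recognized voice information, and to send the to-be-recognized voiceprint feature to the authentication unit;
    the authentication unit is configured to perform identity authentication on the first user according to the to-be-recognized voiceprint feature and, when the authentication passes, to send an authentication-passed instruction to the acquiring unit;
    the acquiring unit is further configured to acquire the to-be-recognized voice feature after receiving the authentication-passed instruction.
  8. The apparatus according to claim 6, wherein the determining unit is specifically configured to determine whether the to-be-recognized voice feature matches a pre-saved voice feature; when the match succeeds, to determine that the to-be-recognized voice feature satisfies the trigger condition; and when the match fails, to determine that the to-be-recognized voice feature does not satisfy the trigger condition.
  9. The apparatus according to claim 6, wherein the acquiring unit is specifically configured to acquire static information corresponding to the to-be-recognized voice feature, or to acquire dynamic information corresponding to the to-be-recognized voice feature.
  10. The apparatus according to claim 9, wherein the static information comprises: text information of the first user; and the dynamic information comprises: picture information and location information of the first user.
  11. A computer storage medium storing computer-executable instructions, the computer-executable instructions being used to execute the information sending method according to any one of claims 1 to 5.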
PCT/CN2017/088744 2016-06-21 2017-06-16 Information sending method and apparatus, and computer storage medium WO2017219925A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201610450969.8A CN107526951A (zh) 2016-06-21 2016-06-21 Information sending method and apparatus
CN201610450969.8 2016-06-21

Publications (1)

Publication Number Publication Date
WO2017219925A1 true WO2017219925A1 (zh) 2017-12-28

Family

ID=60734997

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2017/088744 WO2017219925A1 (zh) 2016-06-21 2017-06-16 Information sending method and apparatus, and computer storage medium

Country Status (2)

Country Link
CN (1) CN107526951A (zh)
WO (1) WO2017219925A1 (zh)

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
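CN1249480A (zh) * 1998-09-29 2000-04-05 松下电器产业株式会社 Speech recognition system using multiple grammar networks
CN101656069A (zh) * 2009-09-17 2010-02-24 陈拙夫 Chinese voice information communication system and communication method thereof
CN104820921A (zh) * 2015-03-24 2015-08-05 百度在线网络技术(北京)有限公司 Method and apparatus for conducting transactions in user equipment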

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
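CN102457845B (zh) * 2010-10-14 2016-04-13 阿里巴巴集团控股有限公司 Wireless service identity authentication method, device, and system
CN102510426A (zh) * 2011-11-29 2012-06-20 安徽科大讯飞信息科技股份有限公司 Personal assistant application access method and system
CN104078045B (zh) * 2013-03-26 2017-05-24 联想(北京)有限公司 Recognition method and electronic device
CN104681023A (zh) * 2015-02-15 2015-06-03 联想(北京)有限公司 Information processing method and electronic device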

Also Published As

Publication number Publication date
CN107526951A (zh) 2017-12-29

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 17814663

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 17814663

Country of ref document: EP

Kind code of ref document: A1