WO2023071730A1 - Voiceprint registration method and electronic devices

Info

Publication number
WO2023071730A1
Authority
WO
WIPO (PCT)
Prior art keywords
electronic device
voice signal
parameter information
voiceprint
voiceprint model
Prior art date
Application number
PCT/CN2022/123912
Inventor
房英康
Original Assignee
华为技术有限公司
Priority date
Filing date
Publication date
Application filed by 华为技术有限公司
Publication of WO2023071730A1

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00 Speaker identification or verification techniques
    • G10L17/02 Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • G10L17/04 Training, enrolment or model building
    • G10L17/06 Decision making techniques; Pattern matching strategies

Definitions

  • the present application relates to the technical field of voiceprint registration, in particular to a voiceprint registration method and electronic equipment.
  • voiceprint registration is required before the user performs voice interaction with the electronic device. That is to say, the electronic device can collect the voice signal of the user, extract the voiceprint according to the collected voice signal, and perform registration. Subsequently, when the user performs voice interaction with the electronic device, the electronic device can authenticate the user according to the voiceprint. If the authentication is successful, the user can perform voice interaction with the electronic device. If the authentication fails, the user cannot perform voice interaction with the electronic device.
  • The requirements on the accuracy of voiceprint authentication are increasingly high.
  • the hardware of different electronic devices and the environments in which different electronic devices are located may also be quite different, so the voiceprints extracted from the voice signals of the same user collected by different electronic devices also have certain differences. Therefore, in an environment where multiple electronic devices work together, if the voice signal collected by one electronic device needs to be authenticated by other electronic devices, the accuracy of voiceprint authentication will be low.
  • Embodiments of the present application provide a voiceprint registration method and electronic equipment, which can improve the accuracy of voiceprint authentication.
  • In a first aspect, a voiceprint registration method is provided, which is applied to a first electronic device. The method includes: acquiring a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals; adjusting the first voice signal according to the first parameter information to obtain a second voice signal; generating a first voiceprint model based on the second voice signal; and sending the first voiceprint model to the second electronic device, or authenticating the voice signal collected by the second electronic device according to the first voiceprint model.
  • The first electronic device can acquire the first voice signal and the first parameter information corresponding to the second electronic device, adjust the first voice signal according to the first parameter information to obtain a second voice signal suited to the second electronic device (the second voice signal can be treated as equivalent to a voice signal collected by the second electronic device; that is, the first electronic device can simulate, from the first voice signal and the first parameter information, the voice signal that the second electronic device would collect), and generate the first voiceprint model according to the second voice signal.
  • Because the first electronic device simulates the second voice signal according to the parameters with which the second electronic device collects voice signals, the second voice signal is highly similar to a voice signal actually collected by the second electronic device. Performing voiceprint authentication on the voice signal collected by the second electronic device with the first voiceprint model generated from the second voice signal can therefore improve the accuracy of voiceprint authentication.
  • The first parameter information includes at least one of the following: the microphone type of the second electronic device, the sampling rate of the second electronic device, the encoding method of the second electronic device, or information about the environment in which the second electronic device is located. Based on the above method, the first voice signal can be adjusted according to at least one of these parameters to obtain the second voice signal, which improves the flexibility and diversity of adjusting the first voice signal.
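  • As a minimal sketch of how such parameter information might be represented, the following container holds the four items listed above and enforces the "at least one" condition. The class name `CaptureParams`, its field names, and the example values in the comments are hypothetical and are not taken from the application.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class CaptureParams:
    """Hypothetical container for one device's parameter information."""
    microphone_type: Optional[str] = None   # e.g. "mems" (illustrative value)
    sampling_rate_hz: Optional[int] = None  # e.g. 16000 (illustrative value)
    encoding: Optional[str] = None          # e.g. "pcm16" (illustrative value)
    environment: Optional[str] = None       # e.g. "living_room" (illustrative value)

    def __post_init__(self) -> None:
        # The text requires at least one of the four items to be present.
        if all(getattr(self, f.name) is None for f in fields(self)):
            raise ValueError("parameter information must carry at least one item")

    def present(self) -> dict:
        """Return only the items that were actually provided."""
        return {f.name: getattr(self, f.name)
                for f in fields(self)
                if getattr(self, f.name) is not None}
```

  • For example, `CaptureParams(sampling_rate_hz=16000).present()` yields only the sampling rate, so a later adjustment step can act on exactly the items a device reported.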
  • Adjusting the first voice signal according to the first parameter information to obtain the second voice signal includes: using a first algorithm to make the parameters of the first voice signal approach the parameters indicated by the first parameter information, to obtain the second voice signal. Based on the above method, the parameters of the first voice signal can be made to approach the parameters indicated by the first parameter information, so that the second voice signal is more similar to the voice signal actually collected by the second electronic device.
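  • The application does not specify the first algorithm. As one illustration only, the sketch below moves a signal's sampling rate toward a target rate by linear interpolation. The function name `resample_linear` is hypothetical, and a practical implementation would normally apply a low-pass filter before downsampling, which is omitted here.

```python
from typing import List


def resample_linear(samples: List[float], src_rate: int, dst_rate: int) -> List[float]:
    """Resample by linear interpolation so the signal's rate matches dst_rate."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples:
        return []
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    out = []
    for i in range(n_out):
        # Position of output sample i on the input sample axis.
        pos = i * src_rate / dst_rate
        left = min(int(pos), len(samples) - 1)
        right = min(left + 1, len(samples) - 1)
        frac = pos - left
        # Weighted average of the two neighbouring input samples.
        out.append(samples[left] * (1.0 - frac) + samples[right] * frac)
    return out
```

  • Halving the rate of a four-sample ramp keeps every second sample, while doubling the rate of a two-sample signal inserts interpolated values between the originals.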
  • acquiring the first voice signal includes: receiving the first voice signal from a third electronic device; or, collecting the first voice signal. Based on the above method, the first electronic device may obtain the first voice signal from the third electronic device, or collect the first voice signal by itself.
  • Authenticating the voice signal collected by the second electronic device according to the first voiceprint model includes: receiving, from the second electronic device, the voice signal collected by the second electronic device; and inputting the voice signal collected by the second electronic device into the first voiceprint model for voiceprint authentication. Based on the above method, the first voiceprint model corresponding to the second electronic device can be used to perform voiceprint authentication on the voice signal collected by the second electronic device, which improves the accuracy of voiceprint authentication.
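  • The application does not describe the internals of the voiceprint model. A common way to picture "inputting a voice signal into a voiceprint model" is to compare a feature vector of the incoming signal with an enrolled vector. The sketch below assumes cosine similarity and a fixed threshold; `VoiceprintModel`, the threshold value 0.8, and the use of plain feature vectors are all hypothetical.

```python
import math
from typing import Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


class VoiceprintModel:
    """Toy stand-in: stores one enrolled feature vector for one device."""

    def __init__(self, enrolled: Sequence[float], threshold: float = 0.8) -> None:
        self.enrolled = list(enrolled)
        self.threshold = threshold  # hypothetical value; tuned in practice

    def authenticate(self, probe: Sequence[float]) -> bool:
        # Accept when the probe is close enough to the enrolled vector.
        return cosine_similarity(self.enrolled, probe) >= self.threshold
```

  • In this picture, the accuracy gain described above comes from enrolling on a signal that resembles what the second electronic device actually captures, so genuine probes land closer to the enrolled vector.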
  • The method further includes: acquiring second parameter information, where the second parameter information indicates the parameters with which the first electronic device collects voice signals; adjusting the first voice signal according to the second parameter information to obtain a third voice signal; generating a second voiceprint model according to the third voice signal; and authenticating the voice signal collected by the first electronic device according to the second voiceprint model.
  • The first electronic device can obtain the second parameter information corresponding to the first electronic device, adjust the first voice signal according to the second parameter information to obtain a third voice signal suited to the first electronic device (the third voice signal can be treated as equivalent to a voice signal collected by the first electronic device; that is, the first electronic device can simulate, from the first voice signal and the second parameter information, the voice signal that the first electronic device would collect), and generate the second voiceprint model according to the third voice signal. In this way, a voice signal can be collected once, the voice signal collected by the first electronic device can be simulated from it, and voiceprint registration can be performed according to the simulated voice signal (that is, the third voice signal).
  • Because the first electronic device simulates the third voice signal according to the parameters with which the first electronic device collects voice signals, the third voice signal is highly similar to a voice signal actually collected by the first electronic device. Performing voiceprint authentication on the voice signal collected by the first electronic device with the second voiceprint model generated from the third voice signal can therefore improve the accuracy of voiceprint authentication.
  • adjusting the first voice signal through the second parameter information can enrich the voice signal used for voiceprint registration and further improve the accuracy of voiceprint authentication.
  • The second parameter information includes at least one of the following: the microphone type of the first electronic device, the sampling rate of the first electronic device, the encoding method of the first electronic device, or information about the environment in which the first electronic device is located.
  • the first voice signal can be adjusted according to the above at least one parameter to obtain the third voice signal, which improves the flexibility and diversity of adjusting the first voice signal.
  • Adjusting the first voice signal according to the second parameter information to obtain the third voice signal includes: using a second algorithm to make the parameters of the first voice signal approach the parameters indicated by the second parameter information, to obtain the third voice signal, where the second algorithm is the same as or different from the first algorithm. Based on the above method, the parameters of the first voice signal can be made to approach the parameters indicated by the second parameter information, so that the third voice signal is more similar to the voice signal actually collected by the first electronic device.
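  • Because the algorithms may differ, an adjustment toward a device's encoding method could look quite unlike a sampling-rate adjustment. The sketch below assumes mu-law companding purely as an illustration of a codec round trip; the application names no codec, and `mulaw_round_trip` and its `levels` parameter are hypothetical.

```python
import math
from typing import List

MU = 255.0  # standard mu-law constant for 8-bit companding


def mulaw_round_trip(samples: List[float], levels: int = 256) -> List[float]:
    """Compress, quantize, and expand samples in [-1, 1] to mimic a mu-law codec."""
    out = []
    for x in samples:
        x = max(-1.0, min(1.0, x))  # clip to the valid input range
        # Compress: y = sign(x) * ln(1 + MU*|x|) / ln(1 + MU)
        y = math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)
        # Uniformly quantize the companded value to `levels` steps.
        q = round((y + 1.0) / 2.0 * (levels - 1))
        y_hat = q / (levels - 1) * 2.0 - 1.0
        # Expand: x = sign(y) * ((1 + MU)^|y| - 1) / MU
        out.append(math.copysign(math.expm1(abs(y_hat) * math.log1p(MU)) / MU, y_hat))
    return out
```

  • The output carries the quantization error that a device using such a codec would introduce, which is the kind of device-specific distortion the adjusted signal is meant to reproduce.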
  • Authenticating the voice signal collected by the first electronic device according to the second voiceprint model includes: collecting a voice signal; and inputting the voice signal collected by the first electronic device into the second voiceprint model for voiceprint authentication.
  • the second voiceprint model corresponding to the first electronic device can be used to perform voiceprint authentication on the voice signal collected by the first electronic device, which improves the accuracy of voiceprint authentication.
  • The method further includes: acquiring third parameter information, where the third parameter information indicates the parameters with which a fourth electronic device collects voice signals; adjusting the first voice signal according to the third parameter information to obtain a fourth voice signal; generating a third voiceprint model according to the fourth voice signal; and sending the third voiceprint model to the fourth electronic device, or authenticating the voice signal collected by the fourth electronic device according to the third voiceprint model.
  • The first electronic device can obtain the third parameter information corresponding to the fourth electronic device, adjust the first voice signal according to the third parameter information to obtain a fourth voice signal suited to the fourth electronic device (the fourth voice signal can be treated as equivalent to a voice signal collected by the fourth electronic device; that is, the first electronic device can simulate, from the first voice signal and the third parameter information, the voice signal that the fourth electronic device would collect), and generate the third voiceprint model according to the fourth voice signal. In this way, a voice signal can be collected once, the voice signal collected by the fourth electronic device can be simulated from it, and voiceprint registration can be performed according to the simulated voice signal (that is, the fourth voice signal).
  • Because the first electronic device simulates the fourth voice signal according to the parameters with which the fourth electronic device collects voice signals, the fourth voice signal is highly similar to a voice signal actually collected by the fourth electronic device. Performing voiceprint authentication on the voice signal collected by the fourth electronic device with the third voiceprint model generated from the fourth voice signal can therefore improve the accuracy of voiceprint authentication.
  • The third parameter information includes at least one of the following: the microphone type of the fourth electronic device, the sampling rate of the fourth electronic device, the encoding method of the fourth electronic device, or information about the environment in which the fourth electronic device is located.
  • the first speech signal can be adjusted according to the above at least one parameter to obtain the fourth speech signal, which improves the flexibility and diversity of adjusting the first speech signal.
  • Adjusting the first voice signal according to the third parameter information to obtain the fourth voice signal includes: using a third algorithm to make the parameters of the first voice signal approach the parameters indicated by the third parameter information, to obtain the fourth voice signal, where the third algorithm is the same as or different from the first algorithm, and the third algorithm is the same as or different from the second algorithm. Based on the above method, the parameters of the first voice signal can be made to approach the parameters indicated by the third parameter information, so that the fourth voice signal is more similar to the voice signal actually collected by the fourth electronic device.
  • Authenticating the voice signal collected by the fourth electronic device according to the third voiceprint model includes: receiving, from the fourth electronic device, the voice signal collected by the fourth electronic device; and inputting the voice signal collected by the fourth electronic device into the third voiceprint model for voiceprint authentication. Based on the above method, the third voiceprint model corresponding to the fourth electronic device can be used to perform voiceprint authentication on the voice signal collected by the fourth electronic device, which improves the accuracy of voiceprint authentication.
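  • The pattern running through the first aspect is that a single recording yields one voiceprint model per device. A minimal sketch of that loop follows. The function `register_for_devices`, the device identifiers, and the `adjust` and `build_model` callables are hypothetical placeholders for the adjustment algorithms and model generation that the application leaves open.

```python
from typing import Callable, Dict, List

Signal = List[float]


def register_for_devices(
    first_signal: Signal,
    params_by_device: Dict[str, dict],
    adjust: Callable[[Signal, dict], Signal],
    build_model: Callable[[Signal], object],
) -> Dict[str, object]:
    """Build one voiceprint model per device from a single recording."""
    models = {}
    for device_id, params in params_by_device.items():
        # Simulate what this device would have captured
        # (the second, third, or fourth voice signal in the text).
        simulated = adjust(first_signal, params)
        # Generate the model matched to this device.
        models[device_id] = build_model(simulated)
    return models
```

  • With a toy `adjust` that scales the signal by a per-device gain and `build_model=tuple`, two devices with gains 2.0 and 0.5 receive two different models derived from the same recording.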
  • An embodiment of the present application provides an electronic device. The electronic device includes an acquisition module, a processing module, and a sending module. The acquisition module is configured to acquire a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals; the processing module is configured to adjust the first voice signal according to the first parameter information to obtain a second voice signal; the processing module is further configured to generate a first voiceprint model according to the second voice signal; and the sending module is configured to send the first voiceprint model to the second electronic device.
  • The electronic device includes an acquisition module and a processing module. The acquisition module is configured to acquire a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals; the processing module is configured to adjust the first voice signal according to the first parameter information to obtain a second voice signal; the processing module is further configured to generate a first voiceprint model according to the second voice signal; and the processing module is further configured to authenticate the voice signal collected by the second electronic device according to the first voiceprint model.
  • The first parameter information includes at least one of the following: the microphone type of the second electronic device, the sampling rate of the second electronic device, the encoding method of the second electronic device, or information about the environment in which the second electronic device is located.
  • the processing module is specifically configured to use a first algorithm to make a parameter of the first speech signal approach a parameter indicated by the first parameter information to obtain the second speech signal.
  • The acquisition module is specifically configured to receive the first voice signal from a third electronic device; or, the acquisition module is specifically configured to collect the first voice signal.
  • The processing module is specifically configured to receive, from the second electronic device, the voice signal collected by the second electronic device; the processing module is further specifically configured to input the voice signal collected by the second electronic device into the first voiceprint model for voiceprint authentication.
  • The acquisition module is further configured to acquire second parameter information, where the second parameter information indicates the parameters with which the first electronic device collects voice signals; the processing module is further configured to adjust the first voice signal according to the second parameter information to obtain a third voice signal; the processing module is further configured to generate a second voiceprint model according to the third voice signal; and the processing module is further configured to authenticate the voice signal collected by the first electronic device according to the second voiceprint model.
  • The second parameter information includes at least one of the following: the microphone type of the first electronic device, the sampling rate of the first electronic device, the encoding method of the first electronic device, or information about the environment in which the first electronic device is located.
  • The processing module is specifically configured to use a second algorithm to make the parameters of the first voice signal approach the parameters indicated by the second parameter information, to obtain the third voice signal, where the second algorithm is the same as or different from the first algorithm.
  • the processing module is specifically configured to collect voice signals; the processing module is further specifically configured to input the voice signals collected by the first electronic device into the second voiceprint model for voiceprint authentication.
  • The acquisition module is further configured to acquire third parameter information, where the third parameter information indicates the parameters with which a fourth electronic device collects voice signals; the processing module is further configured to adjust the first voice signal according to the third parameter information to obtain a fourth voice signal; the processing module is further configured to generate a third voiceprint model according to the fourth voice signal; and the sending module is further configured to send the third voiceprint model to the fourth electronic device, or the processing module is further configured to authenticate the voice signal collected by the fourth electronic device according to the third voiceprint model.
  • The third parameter information includes at least one of the following: the microphone type of the fourth electronic device, the sampling rate of the fourth electronic device, the encoding method of the fourth electronic device, or information about the environment in which the fourth electronic device is located.
  • The processing module is further configured to use a third algorithm to make the parameters of the first voice signal approach the parameters indicated by the third parameter information, to obtain the fourth voice signal, where the third algorithm is the same as or different from the first algorithm, and the third algorithm is the same as or different from the second algorithm.
  • The processing module is further configured to receive, from the fourth electronic device, the voice signal collected by the fourth electronic device; the processing module is further configured to input the voice signal collected by the fourth electronic device into the third voiceprint model for voiceprint authentication.
  • an electronic device including: a processor; the processor is configured to be coupled with a memory, and after reading an instruction in the memory, execute the method according to any one of the above aspects according to the instruction.
  • the electronic device may be the first electronic device in the above first aspect.
  • the electronic device further includes a memory, where the memory is configured to store necessary program instructions and data.
  • the electronic device is a chip or a chip system.
  • When the electronic device is a chip system, it may consist of chips, or may include chips and other discrete devices.
  • an electronic device including: a processor and an interface circuit; the interface circuit is used to receive computer programs or instructions and transmit them to the processor; the processor is used to execute the computer programs or instructions, so that the electronic The device executes the method described in the first aspect above.
  • the electronic device is a chip or a chip system.
  • When the electronic device is a chip system, it may consist of chips, or may include chips and other discrete devices.
  • A computer-readable storage medium is provided. Instructions are stored in the computer-readable storage medium, and when the instructions are run on a computer, the computer can execute the method described in the first aspect above.
  • a sixth aspect provides a computer program product containing instructions, which when run on a computer enables the computer to execute the method described in the first aspect above.
  • FIG. 1 is a schematic diagram of the architecture of the voiceprint registration system provided by the embodiment of the present application.
  • FIG. 2 is a schematic structural diagram of a mobile phone provided by an embodiment of the present application.
  • FIG. 3 is a first schematic flowchart of the voiceprint registration method provided by the embodiment of the present application.
  • FIG. 4 is a second schematic flowchart of the voiceprint registration method provided by the embodiment of the present application.
  • FIG. 5 is a third schematic flowchart of the voiceprint registration method provided by the embodiment of the present application.
  • FIG. 6 is a schematic diagram of the structure and composition of an electronic device provided by an embodiment of the present application.
  • References to “one embodiment” or “some embodiments” or the like in this specification mean that a particular feature, structure, or characteristic described in connection with the embodiment is included in one or more embodiments of the present application.
  • Appearances of the phrases “in one embodiment,” “in some embodiments,” “in other embodiments,” etc. in various places in this specification do not necessarily all refer to the same embodiment, but mean “one or more but not all embodiments” unless specifically stated otherwise.
  • the terms “including”, “comprising”, “having” and variations thereof mean “including but not limited to”, unless specifically stated otherwise.
  • the term “connected” includes both direct and indirect connections, unless otherwise stated.
  • words such as “first” and “second” may be used to distinguish technical features with the same or similar functions.
  • the words “first” and “second” do not limit the number and execution order, and the words “first” and “second” do not necessarily mean that they must be different.
  • Words such as “exemplary” or “for example” are used to present examples or illustrations. Any embodiment or design described as “exemplary” or “for example” should not be interpreted as being preferred or more advantageous than other embodiments or designs.
  • the use of words such as “exemplary” or “for example” is intended to present related concepts in a specific manner for easy understanding.
  • Where technical features are distinguished by “first”, “second”, “third”, etc., these words describe the technical features in no particular sequence or order of magnitude.
  • the embodiment of this application provides the following three methods:
  • Method 1: A voiceprint registration algorithm can be preset in the electronic device.
  • The voiceprint registration algorithm is trained on voice signals from different environments, different sound-pickup hardware, and different speakers.
  • After the electronic device obtains the voice signal for registration, it can use the voiceprint registration algorithm to perform voiceprint registration on that voice signal and establish a voiceprint model.
  • The electronic device can then authenticate the user according to the voiceprint model. Because the voiceprint registration algorithm is trained on voice signals from different environments, different sound-pickup hardware, and different speakers, the algorithm can extract more comprehensive and deeper voiceprint information and is more robust, which can improve the accuracy of voiceprint authentication performed with the voiceprint model.
  • Method 2: Voiceprint registration can be performed on each of the multiple electronic devices. Subsequently, the user can perform authentication on each electronic device. Because the user registers and authenticates on the same device, the accuracy of voiceprint authentication can be improved.
  • Method 3: The first electronic device can obtain the first voice signal and the first parameter information, adjust the first voice signal according to the first parameter information to obtain the second voice signal, generate the first voiceprint model according to the second voice signal, and either send the first voiceprint model to the second electronic device or authenticate the voice signal collected by the second electronic device according to the first voiceprint model.
  • The first parameter information may indicate the parameters with which the second electronic device collects voice signals.
  • Method 3 does not need to collect voice signals from different environments, different sound-pickup hardware, and different speakers, so the training cost is lower and the model is less complex.
  • In method 1, the voiceprint registration algorithm is preset in the electronic device. When the environment, hardware, or other conditions of the electronic device change, the algorithm is difficult to update, and the user experience is poor.
  • In method 3, the first parameter information can be updated at any time, and the first voice signal can be adjusted according to the updated first parameter information, which is more flexible and provides a better user experience.
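  • One way to picture this flexibility is to keep the original recording and rebuild a device's model whenever that device reports new parameter information. The sketch below is illustrative only: `DeviceRegistration` and the `adjust` and `build_model` callables are hypothetical names, and the application does not say how or when updates are triggered.

```python
from typing import Callable, List, Optional

Signal = List[float]


class DeviceRegistration:
    """Keeps one voiceprint model in step with a device's latest parameters."""

    def __init__(self, first_signal: Signal,
                 adjust: Callable[[Signal, dict], Signal],
                 build_model: Callable[[Signal], object]) -> None:
        self._first_signal = list(first_signal)  # recorded once, reused on every update
        self._adjust = adjust
        self._build_model = build_model
        self._params: Optional[dict] = None
        self._model: object = None

    def model_for(self, params: dict) -> object:
        # Rebuild only when the reported parameter information has changed.
        if params != self._params:
            adjusted = self._adjust(self._first_signal, params)
            self._model = self._build_model(adjusted)
            self._params = dict(params)
        return self._model
```

  • The user does not need to speak again: only the adjustment and model generation are repeated when the parameters change.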
  • method 3 does not need to perform voiceprint registration on each of the multiple electronic devices, and the user experience is better.
  • method 3 can also assist them in authenticating users to improve the security of voice interaction.
  • the voiceprint registration system may at least include: an electronic device 101 and an electronic device 102 .
  • the voiceprint registration system may further include an electronic device 103 and/or an electronic device 104 .
  • The wireless communication protocol adopted can be a wireless fidelity (Wi-Fi) protocol, a protocol of any of various cellular networks (such as a fourth generation (4G) communication network or a fifth generation (5G) communication network), etc., which is not specifically limited here.
  • The electronic devices in FIG. 1 may form a hyper terminal.
  • Identity authentication between the electronic devices in FIG. 1 can be performed based on any authentication mechanism (such as the HiChain mechanism), and electronic devices that pass the authentication can form a hyper terminal.
  • The hyper terminal may include multiple electronic devices. The multiple electronic devices are in a network connection state and are mutually trusted devices.
  • The electronic devices in FIG. 1, for example the electronic device 101, the electronic device 102, the electronic device 103, or the electronic device 104, can each be a mobile phone, a tablet computer, a handheld computer, a personal computer (PC), a cellular phone, a personal digital assistant (PDA), a wearable device (such as a smart watch or a smart bracelet), a game console, an augmented reality (AR)/virtual reality (VR) device, etc.
  • the embodiment of the present application does not specifically limit the specific device form of the electronic device in FIG. 1 .
  • the electronic device in FIG. 1 may also be a smart home device (such as a TV, a smart speaker), a vehicle-mounted computer
  • the device forms of the electronic devices in FIG. 1 may be the same.
  • both the electronic device 101 and the electronic device 102 are mobile phones.
  • the device form of the electronic device in FIG. 1 may also be different.
  • the electronic device 101 is a mobile phone, and the electronic device 102 is a tablet computer.
  • the electronic device 101 is a smart watch, and the electronic device 102 is a PC.
  • The electronic device in FIG. 1 may be a touch-screen device or a non-touch-screen device. A touch-screen device can be controlled by tapping and sliding on the screen with a finger, a stylus, etc. A non-touch-screen device can be connected to input devices such as a mouse, a keyboard, and a touch panel, and can be controlled through those input devices.
  • the electronic devices in FIG. 1 are all electronic devices capable of running an operating system and installing applications.
  • The operating system of the electronic device in FIG. 1 may be HarmonyOS (Hongmeng), Android, iOS, Windows, macOS, Linux, etc., which is not specifically limited in this embodiment of the present application.
  • the operating systems of the electronic devices in FIG. 1 may be the same or different.
  • the electronic devices in FIG. 1 may respectively include a memory and a processor. Wherein, the memory can be used to store the operating system, and the processor can be used to run the operating system stored in the memory.
  • The memory may also be referred to as internal memory. It is used to store data computed by the operating system and the processor, and may also be used to run application programs installed on the electronic device.
  • the memory may be the internal memory 121 in FIG. 2 .
  • a distributed system may be deployed on the electronic device shown in FIG. 1 .
  • The electronic devices deployed with the distributed system can execute the voiceprint registration method provided by the embodiment of the present application. With this method, one electronic device can adjust a voice signal according to the parameters with which another electronic device collects voice signals, and generate a voiceprint model according to the adjusted voice signal. The electronic device can then use the generated voiceprint model to perform voiceprint authentication on voice signals collected by the other electronic device, which can improve the accuracy of voiceprint authentication; or it can send the generated voiceprint model to the other electronic device, so that the other electronic device can perform voiceprint authentication on the voice signals it collects according to the generated voiceprint model.
  • the voiceprint registration system shown in FIG. 1 is for example only, and is not intended to limit the technical solutions of the embodiments of the present application. Those skilled in the art should understand that in the actual implementation process, the voiceprint registration system may also include other devices, and the number of electronic devices may also be determined according to specific needs, without limitation.
  • In the following, a mobile phone is taken as an example of the electronic device.
  • FIG. 2 is a schematic structural diagram of a mobile phone provided by an embodiment of the present application.
  • the methods in the following embodiments can be implemented in a mobile phone with the following hardware structure.
  • the mobile phone can include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (universal serial bus, USB) interface 130, an antenna 1, an antenna 2, a mobile communication module 150, and a wireless communication module 160, audio module 170, speaker 170A, receiver 170B, microphone 170C, earphone interface 170D, sensor module 180, etc.
  • the structure shown in the embodiment of the present application does not constitute a specific limitation on the mobile phone.
  • the mobile phone may include more or fewer components than shown in the figure, or combine certain components, or separate certain components, or arrange different components.
  • the illustrated components can be realized in hardware, software or a combination of software and hardware.
  • The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU), etc. Different processing units may be independent devices, or may be integrated in one or more processors.
  • a memory may also be provided in the processor 110 for storing instructions and data.
  • The memory in the processor 110 is a cache memory. This memory may hold instructions or data that the processor 110 has just used or uses repeatedly. If the processor 110 needs to use the instructions or data again, it can fetch them directly from this memory. This avoids repeated accesses and reduces the waiting time of the processor 110, thereby improving the efficiency of the system.
  • the wireless communication function of the mobile phone can be realized by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor and the baseband processor.
  • Antenna 1 and Antenna 2 are used to transmit and receive electromagnetic wave signals.
  • Each antenna in a mobile phone can be used to cover single or multiple communication bands. Different antennas can also be multiplexed to improve the utilization of the antennas.
  • Antenna 1 can be multiplexed as a diversity antenna of a wireless local area network.
  • the antenna may be used in conjunction with a tuning switch.
  • the mobile communication module 150 can provide wireless communication solutions including 2G/3G/4G/5G applied to mobile phones.
  • the mobile communication module 150 may include at least one filter, switch, power amplifier, low noise amplifier (low noise amplifier, LNA) and the like.
  • the mobile communication module 150 can receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and send them to the modem processor for demodulation.
  • the mobile communication module 150 can also amplify the signals modulated by the modem processor, and convert them into electromagnetic waves through the antenna 1 for radiation.
  • at least part of the functional modules of the mobile communication module 150 may be set in the processor 110 .
  • at least part of the functional modules of the mobile communication module 150 and at least part of the modules of the processor 110 may be set in the same device.
  • The wireless communication module 160 can provide wireless communication solutions applied to mobile phones, including wireless local area network (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), infrared (IR) technology, and the like.
  • the wireless communication module 160 may be one or more devices integrating at least one communication processing module.
  • the wireless communication module 160 receives electromagnetic waves via the antenna 2 , frequency-modulates and filters the electromagnetic wave signals, and sends the processed signals to the processor 110 .
  • the wireless communication module 160 can also receive the signal to be sent from the processor 110 , frequency-modulate it, amplify it, and convert it into electromagnetic waves through the antenna 2 for radiation.
  • the antenna 1 of the mobile phone is coupled to the mobile communication module 150, and the antenna 2 is coupled to the wireless communication module 160, so that the mobile phone can communicate with the network and other devices through wireless communication technology.
  • The wireless communication technology may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division synchronous code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technologies, etc.
  • The GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the BeiDou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS) and/or satellite based augmentation systems (SBAS).
  • the mobile phone realizes the display function through the GPU, the display screen 194, and the application processor.
  • the GPU is a microprocessor for image processing, and is connected to the display screen 194 and the application processor. GPUs are used to perform mathematical and geometric calculations for graphics rendering.
  • Processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
  • the display screen 194 is used to display images, videos and the like.
  • the display screen 194 includes a display panel.
  • The display panel can be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light-emitting diode (AMOLED), a flexible light-emitting diode (FLED), Mini-LED, Micro-LED, Micro-OLED, quantum dot light-emitting diodes (QLED), etc.
  • the mobile phone may include 1 or N display screens 194, where N is a positive integer greater than 1.
  • the mobile phone can realize shooting function through ISP, camera 193 , video codec, GPU, display screen 194 and application processor.
  • the ISP is used for processing the data fed back by the camera 193 .
  • When a photo is taken, light is transmitted through the lens to the photosensitive element of the camera, where the light signal is converted into an electrical signal. The photosensitive element transmits the electrical signal to the ISP, which processes it and converts it into an image visible to the naked eye.
  • ISP can also perform algorithm optimization on image noise, brightness, and skin color.
  • ISP can also optimize the exposure, color temperature and other parameters of the shooting scene.
  • the ISP may be located in the camera 193 .
  • Camera 193 is used to capture still images or video.
  • the object generates an optical image through the lens and projects it to the photosensitive element.
  • the photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor.
  • the photosensitive element converts the light signal into an electrical signal, and then transmits the electrical signal to the ISP to convert it into a digital image signal.
  • the ISP outputs the digital image signal to the DSP for processing.
  • DSP converts digital image signals into standard RGB, YUV and other image signals.
  • the mobile phone may include 1 or N cameras 193, where N is a positive integer greater than 1.
  • Digital signal processors are used to process digital signals. In addition to digital image signals, they can also process other digital signals. For example, when the mobile phone selects the frequency point, the digital signal processor is used to perform Fourier transform on the frequency point energy.
  • Video codecs are used to compress or decompress digital video.
  • a mobile phone can support one or more video codecs.
  • The mobile phone can play or record videos in multiple encoding formats, such as moving picture experts group (MPEG) 1, MPEG2, MPEG3, MPEG4, etc.
  • the external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the mobile phone.
  • The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function, for example, saving music, video and other files in the external memory card.
  • the internal memory 121 may be used to store computer-executable program codes including instructions.
  • the processor 110 executes various functional applications and data processing of the mobile phone by executing instructions stored in the internal memory 121 .
  • the internal memory 121 may include an area for storing programs and an area for storing data.
  • the stored program area can store an operating system, at least one application program required by a function (such as a sound playing function, an image playing function, etc.) and the like.
  • the storage data area can store data (such as audio data, phone book, etc.) created during the use of the mobile phone.
  • the internal memory 121 may include a high-speed random access memory, and may also include a non-volatile memory, such as at least one magnetic disk storage device, flash memory device, universal flash storage (universal flash storage, UFS) and the like.
  • The mobile phone can realize audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, and the application processor.
  • the audio module 170 is used to convert digital audio information into analog audio signal output, and is also used to convert analog audio input into digital audio signal.
  • the audio module 170 may also be used to encode and decode audio signals.
  • the audio module 170 may be set in the processor 110 , or some functional modules of the audio module 170 may be set in the processor 110 .
  • The speaker 170A, also referred to as a "horn", is used to convert audio electrical signals into sound signals.
  • The user can listen to music or to a hands-free call through the speaker 170A of the mobile phone.
  • The receiver 170B, also called an "earpiece", is used to convert audio electrical signals into sound signals.
  • the receiver 170B can be placed close to the human ear to listen to the voice.
  • The microphone 170C, also called a "mike" or "mic", is used to convert sound signals into electrical signals.
  • the user can put his mouth close to the microphone 170C to make a sound, and input the sound signal to the microphone 170C.
  • the mobile phone may be provided with at least one microphone 170C.
  • the mobile phone can be provided with two microphones 170C, which can also implement a noise reduction function in addition to collecting sound signals.
  • the mobile phone can also be equipped with three, four or more microphones 170C to realize the collection of sound signals, noise reduction, identification of sound sources, and realization of directional recording functions, etc.
  • the earphone interface 170D is used for connecting wired earphones.
  • the earphone interface 170D can be a USB interface 130, or a 3.5mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
  • the sensor module 180 may include one or more of the following sensors: pressure sensor, gyroscope sensor, air pressure sensor, magnetic sensor, acceleration sensor, distance sensor, proximity light sensor, fingerprint sensor, temperature sensor, touch sensor, ambient light sensor, bone conduction sensor, etc.
  • the mobile phone may also include a charging management module, a power management module, a battery, buttons, an indicator, and one or more SIM card interfaces, etc., which are not limited in this embodiment of the present application.
  • the first electronic device may perform some or all of the steps in the embodiment of the present application, and these steps are only examples, and the embodiment of the present application may also perform other steps or variations of various steps.
  • Each step may be performed in an order different from that presented in the embodiment of the present application, and it may not be necessary to perform all the steps in the embodiment of the present application.
  • the voiceprint registration method includes S301-S304a or S301-S304b.
  • S301 The first electronic device acquires a first voice signal and first parameter information.
  • the first electronic device may be any electronic device in FIG. 1 .
  • the first electronic device is the electronic device 101 or the electronic device 102 in FIG. 1 .
  • the first voice signal may or may not be collected by the first electronic device.
  • the first electronic device collects the first voice signal through a voice collection module of the first electronic device.
  • the voice collection module may be a chip, a circuit or a chip system in the first electronic device, which is used to collect voice signals, such as recording and storing the words spoken by the user to obtain voice signals.
  • the first electronic device receives the first voice signal from the third electronic device.
  • the third electronic device may be an electronic device other than the first electronic device. Taking the voiceprint registration system shown in FIG. 1 as an example, if the first electronic device is the electronic device 101 in FIG. 1 , then the third electronic device is at least one of the electronic devices 102-104 in FIG. 1 .
  • the third electronic device collects the first voice signal through the voice collection module of the third electronic device, and sends the first voice signal to the first electronic device.
  • the voice collection module of the third electronic device may be a chip, a circuit or a chip system in the third electronic device, and is used for collecting voice signals.
  • The first parameter information may be used to indicate the parameters with which the second electronic device collects voice signals.
  • the second electronic device and the third electronic device may be the same or different.
  • The first parameter information includes at least one of the following: the microphone type of the second electronic device, the sampling rate of the second electronic device, the encoding method of the second electronic device, or the environment information of the second electronic device.
  • the microphone type of the second electronic device includes a dynamic microphone or a condenser microphone.
  • the sampling rate of the second electronic device can be understood as the sampling rate of the voice signal by the second electronic device, such as 8000 Hz or 16000 Hz.
  • the encoding method of the second electronic device may be understood as the encoding method of the second electronic device for the voice signal, such as linear pulse coding, nonlinear pulse coding, or adaptive linear coding.
  • the environment where the second electronic device is located may be an environment where the second electronic device is often located, or an environment where the second electronic device has been located within a period of time (eg, within one month).
  • the environment where the second electronic device is located may be one or more of a living room, a bedroom, a study room, a kitchen, a residential area, a street, a shopping mall, or a car.
  • The environment information of the second electronic device may include n bits, where the n bits are used to indicate the environment where the second electronic device is located, and n is a positive integer. Taking n = 2 as an example: if the value of the environment information is "00", the first parameter information indicates that the environment where the second electronic device is located is the living room; if the value is "01", the environment is a bedroom; if the value is "10", the environment is a residential area; if the value is "11", the environment is a car.
  • The above first parameter information is only exemplary.
  • The first parameter information may also include other parameters, which are not specifically limited in this embodiment of the present application.
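  • The first parameter information described above could be represented, purely as an illustration, by a small record type. The following is a minimal sketch and not the applicant's data format: the class name `ParameterInfo`, its field names, and the `ENVIRONMENT_CODES` table are assumptions introduced here; only the four kinds of parameters and the 2-bit environment example come from the text.

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical 2-bit environment codes, following the n = 2 example in the text.
ENVIRONMENT_CODES = {
    "00": "living room",
    "01": "bedroom",
    "10": "residential area",
    "11": "car",
}


@dataclass
class ParameterInfo:
    """Parameters with which a device collects voice signals (all optional)."""

    microphone_type: Optional[str] = None   # e.g. "dynamic" or "condenser"
    sampling_rate_hz: Optional[int] = None  # e.g. 8000 or 16000
    encoding: Optional[str] = None          # e.g. "linear pulse coding"
    environment_bits: Optional[str] = None  # n-bit string, here n = 2

    def environment(self) -> Optional[str]:
        """Decode the environment bits, or return None if they are absent."""
        if self.environment_bits is None:
            return None
        return ENVIRONMENT_CODES[self.environment_bits]

    def is_valid(self) -> bool:
        """The text requires at least one of the four parameters to be present."""
        fields = (
            self.microphone_type,
            self.sampling_rate_hz,
            self.encoding,
            self.environment_bits,
        )
        return any(value is not None for value in fields)
```

  • For instance, `ParameterInfo(sampling_rate_hz=16000, environment_bits="01").environment()` decodes to the bedroom entry of the assumed table.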
  • the first electronic device may acquire the first voice signal and the first parameter information at the same time, or may acquire the first voice signal and the first parameter information separately.
  • The second electronic device may send the first parameter information to the first electronic device while sending the first voice signal to the first electronic device; that is to say, the first electronic device can acquire the first voice signal and the first parameter information at the same time.
  • Alternatively, the first electronic device may acquire the first parameter information after it acquires the first voice signal. For example, after acquiring the first voice signal, the first electronic device sends instruction information for acquiring the first parameter information to the second electronic device, and the second electronic device sends the first parameter information to the first electronic device after receiving the instruction information. For another example, after the first electronic device establishes a connection with the second electronic device, the second electronic device sends the first parameter information to the first electronic device, and the first electronic device stores the received first parameter information locally. Subsequently, after acquiring the first voice signal, the first electronic device reads the first parameter information from local storage.
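  • The two ways of acquiring the parameter information after the voice signal (requesting it on demand, or reading a copy stored when the connection was established) can be sketched as a small cache. This is only an illustration under stated assumptions: the class `ParameterStore`, its method names, and the `request_from_device` callback are hypothetical and stand in for whatever transport the devices actually use.

```python
from typing import Callable, Dict


class ParameterStore:
    """Keeps parameter information per device on the first electronic device."""

    def __init__(self, request_from_device: Callable[[str], dict]):
        # Hypothetical callback that sends "instruction information" to a
        # device and returns the parameter information it replies with.
        self._request = request_from_device
        self._cache: Dict[str, dict] = {}

    def on_connected(self, device_id: str, params: dict) -> None:
        """Store parameter information pushed right after connection setup."""
        self._cache[device_id] = params

    def get(self, device_id: str) -> dict:
        """Return locally stored parameters, requesting them only if missing."""
        if device_id not in self._cache:
            self._cache[device_id] = self._request(device_id)
        return self._cache[device_id]
```

  • With this arrangement, a device that pushed its parameters at connection time is never queried again, while any other device is queried once, on first use.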
  • S302 The first electronic device adjusts the first voice signal according to the first parameter information to obtain a second voice signal.
  • The first electronic device acquires the parameters with which the first voice signal was collected, that is, one or more of the following: the type of microphone used to collect the first voice signal, the sampling rate at which the first voice signal was collected, the encoding method of the first voice signal, or the environment information of the electronic device that collected the first voice signal.
  • For the microphone type, the sampling rate, the encoding method, and the environment information associated with the first voice signal, reference may be made to the descriptions above of the microphone type, the sampling rate, the encoding method, and the environment information of the second electronic device, which are not repeated here.
  • the first electronic device uses the first algorithm to make the parameters of the first voice signal approach the parameters indicated by the first parameter information to obtain the second voice signal.
  • The first electronic device may use the first algorithm to simulate the effect that a microphone of the indicated microphone type has on a voice signal, and adjust the first voice signal accordingly to obtain the second voice signal.
  • the first electronic device may adjust the sampling rate of the first voice signal to the sampling rate of the second electronic device through an audio processing algorithm to obtain the second voice signal.
  • The first electronic device may re-encode and decode the first voice signal according to the encoding method of the second electronic device to obtain the second voice signal.
  • The first electronic device may superimpose an environmental noise signal and/or a spatial reverberation signal on the first voice signal according to the environment information of the second electronic device to obtain the second voice signal.
  • the environmental noise signal and the spatial reverberation signal may be preconfigured in the first electronic device.
  • The first electronic device may use an algorithm to simulate the effect that a microphone of the indicated microphone type has on a voice signal so as to adjust the first voice signal, and adjust the sampling rate of the first voice signal to the sampling rate of the second electronic device through an audio processing algorithm, to obtain the second voice signal.
  • The first electronic device may use an algorithm to simulate the effect that a microphone of the indicated microphone type has on a voice signal so as to adjust the first voice signal, adjust the sampling rate of the first voice signal to the sampling rate of the second electronic device through an audio processing algorithm, and then superimpose an environmental noise signal and/or a spatial reverberation signal according to the environment information of the second electronic device, to obtain the second voice signal.
  • the environmental noise signal and the spatial reverberation signal may be preconfigured in the first electronic device.
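  • The adjustments described above can be chained into one pipeline. The sketch below is not the "first algorithm" of the application, which the text does not specify; it substitutes simple, well-known stand-ins chosen here for illustration: linear-interpolation resampling for the sampling-rate adjustment, an 8-bit mu-law encode/decode round trip as one instance of nonlinear pulse coding, and additive mixing of a preconfigured noise signal. All function names and the `noise_gain` parameter are assumptions; microphone simulation and reverberation are omitted.

```python
import math
from typing import List

MU = 255.0  # companding constant commonly used with 8-bit mu-law


def resample_linear(signal: List[float], src_rate: int, dst_rate: int) -> List[float]:
    """Resample by linear interpolation (no anti-aliasing filter; sketch only)."""
    if not signal or src_rate == dst_rate:
        return list(signal)
    n_out = max(1, round(len(signal) * dst_rate / src_rate))
    out = []
    for i in range(n_out):
        pos = i * src_rate / dst_rate          # position in the source signal
        lo = min(int(pos), len(signal) - 1)
        hi = min(lo + 1, len(signal) - 1)
        frac = pos - lo
        out.append(signal[lo] * (1 - frac) + signal[hi] * frac)
    return out


def mu_law_round_trip(signal: List[float], levels: int = 256) -> List[float]:
    """Encode to quantized mu-law and decode again, keeping the coding loss."""
    scale = math.log1p(MU)
    out = []
    for x in signal:
        x = max(-1.0, min(1.0, x))                         # clip to [-1, 1]
        y = math.copysign(math.log1p(MU * abs(x)) / scale, x)
        q = round((y + 1) / 2 * (levels - 1))              # quantize
        y_hat = q / (levels - 1) * 2 - 1                   # dequantize
        out.append(math.copysign(math.expm1(abs(y_hat) * scale) / MU, y_hat))
    return out


def add_noise(signal: List[float], noise: List[float], gain: float) -> List[float]:
    """Superimpose a (looped) preconfigured noise signal; noise must be non-empty."""
    return [s + gain * noise[i % len(noise)] for i, s in enumerate(signal)]


def adjust(signal: List[float], src_rate: int, dst_rate: int,
           noise: List[float], noise_gain: float = 0.1) -> List[float]:
    """First voice signal -> simulated second voice signal."""
    resampled = resample_linear(signal, src_rate, dst_rate)
    coded = mu_law_round_trip(resampled)
    return add_noise(coded, noise, noise_gain)
```

  • For example, `adjust(first_signal, 16000, 8000, living_room_noise)` would halve the number of samples, apply the coding loss, and mix in the stored noise, where `first_signal` and `living_room_noise` are hypothetical lists of samples in the range [-1, 1].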
  • the electronic device that adjusts the first voice signal may also be an electronic device other than the first electronic device.
  • the first electronic device may send the first voice signal and the first parameter information to the fifth electronic device.
  • the fifth electronic device may adjust the first voice signal according to the first parameter information to obtain a second voice signal, and send the second voice signal to the first electronic device.
  • the fifth electronic device is different from the first electronic device.
  • S303 The first electronic device generates a first voiceprint model according to the second voice signal.
  • the first electronic device extracts features from the second voice signal, and generates the first voiceprint model according to the extracted features. It can be understood that the voiceprint registration is completed after the first voiceprint model is generated. Subsequently, the user may be authenticated through the first voiceprint model. For example, a voice signal may be input into the first voiceprint model, and the first voiceprint model may output whether the voice signal and the first voice signal are from the same user.
  • the electronic device that generates the first voiceprint model may also be an electronic device other than the first electronic device.
  • the first electronic device may send the second voice signal to the sixth electronic device.
  • the sixth electronic device may generate the first voiceprint model according to the second voice signal, and send the first voiceprint model to the first electronic device.
  • the sixth electronic device and the fifth electronic device may be the same or different.
  • The fifth electronic device may also not send the second voice signal to the first electronic device, but instead send the second voice signal to the sixth electronic device, so that the sixth electronic device generates the first voiceprint model according to the second voice signal and sends the first voiceprint model to the first electronic device.
  • S304a The first electronic device authenticates the voice signal collected by the second electronic device according to the first voiceprint model.
  • the first electronic device may receive the voice signal collected by the second electronic device from the second electronic device, and input the voice signal collected by the second electronic device into the first voiceprint model for voiceprint authentication.
  • the second electronic device collects the voice signal 1 through the voice sampling module of the second electronic device, and sends the voice signal 1 to the first electronic device.
  • After receiving the voice signal 1, the first electronic device inputs the voice signal 1 into the first voiceprint model for voiceprint authentication. If the output of the first voiceprint model is 0, it means that the voice signal 1 and the first voice signal are not from the same user, and the authentication fails. If the output of the first voiceprint model is 1, it means that the voice signal 1 and the first voice signal are from the same user, and the authentication succeeds.
  • Because the first voiceprint model is generated according to the second voice signal (that is, the voice signal collected by the second electronic device as simulated by the first electronic device based on the first voice signal and the first parameter information), using the first voiceprint model to authenticate the voice signal collected by the second electronic device can improve the accuracy of voiceprint authentication.
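  • The registration and authentication behaviour described above (extract features from the second voice signal, then output 1 or 0 for a later voice signal) can be illustrated with a toy model. The text does not say which features or which classifier are used, so everything below is an assumption made for illustration: the features are DFT magnitudes at a few probe frequencies, and the decision is a cosine-similarity threshold against the enrolled template. A real voiceprint model would use far richer speaker features.

```python
import math
from typing import List, Sequence


def spectral_features(signal: Sequence[float], n_bins: int = 8) -> List[float]:
    """Toy features: DFT magnitude at n_bins probe frequencies (cycles/sample)."""
    n = max(len(signal), 1)
    feats = []
    for k in range(1, n_bins + 1):
        f = k / (2 * (n_bins + 1))  # probe frequencies strictly below 0.5
        re = sum(s * math.cos(2 * math.pi * f * t) for t, s in enumerate(signal))
        im = sum(s * math.sin(2 * math.pi * f * t) for t, s in enumerate(signal))
        feats.append(math.hypot(re, im) / n)
    return feats


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two vectors; 0.0 if either is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm > 0 else 0.0


class VoiceprintModel:
    """Enrolled from the (simulated) second voice signal; outputs 1 or 0."""

    def __init__(self, enrolled_signal: Sequence[float], threshold: float = 0.8):
        self.template = spectral_features(enrolled_signal)
        self.threshold = threshold  # hypothetical decision threshold

    def authenticate(self, signal: Sequence[float]) -> int:
        """Return 1 if the signal matches the enrolled template, else 0."""
        score = cosine_similarity(self.template, spectral_features(signal))
        return 1 if score >= self.threshold else 0
```

  • In this sketch the output 1 corresponds to "same user, authentication succeeds" and 0 to "not the same user, authentication fails", matching the convention in the text.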
  • S304b The first electronic device sends the first voiceprint model to the second electronic device.
  • the second electronic device receives the first voiceprint model from the first electronic device.
  • the first electronic device may directly send the first voiceprint model to the second electronic device, or may send the first voiceprint model to the second electronic device via one or more electronic devices.
  • the second electronic device may authenticate the voice signal collected by the second electronic device according to the first voiceprint model. For example, the second electronic device inputs the voice signal collected by itself into the first voiceprint model for voiceprint authentication.
  • The first electronic device can also send the first voiceprint model to electronic devices other than the second electronic device, so that those electronic devices can also authenticate the voice signals collected by the second electronic device according to the first voiceprint model.
  • The first electronic device can acquire the first voice signal and the first parameter information corresponding to the second electronic device, adjust the first voice signal according to the first parameter information to obtain a second voice signal suitable for the second electronic device (the second voice signal can be regarded as equivalent to a voice signal collected by the second electronic device; that is to say, the first electronic device can simulate the voice signal collected by the second electronic device according to the first voice signal and the first parameter information), and generate the first voiceprint model according to the second voice signal. In this way, a voice signal needs to be collected only once; the voice signal collected by the second electronic device is simulated from it, and voiceprint registration is performed according to the simulated voice signal (that is, the second voice signal).
  • Because the second voice signal is simulated by the first electronic device according to the parameters with which the second electronic device collects voice signals, the similarity between the second voice signal and a voice signal actually collected by the second electronic device is very high. Therefore, performing voiceprint authentication on the voice signal collected by the second electronic device using the first voiceprint model generated from the second voice signal can improve the accuracy of voiceprint authentication.
  • the first electronic device simulates the voice signal collected by the second electronic device, and registers the voiceprint according to the voice signal.
  • the first electronic device may also simulate a voice signal collected by at least one other electronic device according to the first voice signal, and perform voiceprint registration according to the simulated voice signal.
  • the first electronic device may also simulate the voice signal collected by the first electronic device according to the first voice signal, and perform voiceprint registration according to the simulated voice signal collected by the first electronic device.
  • Specifically, reference may be made to the description of the method shown in FIG. 4 below.
  • the first electronic device may also simulate the voice signal collected by the fourth electronic device according to the first voice signal, and perform voiceprint registration according to the simulated voice signal collected by the fourth electronic device. Specifically, reference may be made to the description in the method shown in FIG. 5 below.
  • the method shown in FIG. 3 further includes S305-S308.
  • S305 The first electronic device acquires second parameter information.
  • The second parameter information may be used to indicate the parameters with which the first electronic device collects voice signals.
  • the second parameter information includes at least one of the following: a microphone type of the first electronic device, a sampling rate of the first electronic device, a coding mode of the first electronic device, or environment information of the first electronic device.
  • the first electronic device acquires the second parameter information locally.
  • S306 The first electronic device adjusts the first voice signal according to the second parameter information to obtain a third voice signal.
  • S307 The first electronic device generates a second voiceprint model according to the third voice signal.
  • S308 The first electronic device authenticates the voice signal collected by the first electronic device according to the second voiceprint model.
  • For example, the first electronic device collects a voice signal through its voice collection module and inputs the collected voice signal into the second voiceprint model for voiceprint authentication.
  • The first electronic device may first generate the first voiceprint model (acquire the first parameter information, adjust the first voice signal according to the first parameter information to obtain the second voice signal, and generate the first voiceprint model according to the second voice signal) and then generate the second voiceprint model (acquire the second parameter information, adjust the first voice signal according to the second parameter information to obtain the third voice signal, and generate the second voiceprint model according to the third voice signal).
  • the first electronic device may also first generate the second voiceprint model, and then generate the first voiceprint model, and may also execute the above two processes at the same time, without limitation.
  • The first electronic device can also send the second voiceprint model to electronic devices other than the first electronic device, so that those devices can also authenticate, according to the second voiceprint model, voice signals collected by the first electronic device.
  • the method shown in FIG. 3 further includes S309-S312a or S309-S312b.
  • S309 The first electronic device acquires third parameter information.
  • The third parameter information is used to indicate the parameters with which the fourth electronic device collects voice signals.
  • The third parameter information includes at least one of the following: a microphone type of the fourth electronic device, a sampling rate of the fourth electronic device, a coding mode of the fourth electronic device, or environment information of the fourth electronic device.
  • the fourth electronic device is different from the first electronic device and the second electronic device.
  • For example, if the first electronic device is the electronic device 101 in FIG. 1 and the second electronic device is the electronic device 102 in FIG. 1, the fourth electronic device is the electronic device 103 or the electronic device 104 in FIG. 1.
  • S310 The first electronic device adjusts the first voice signal according to the third parameter information to obtain a fourth voice signal.
  • S311 The first electronic device generates a third voiceprint model according to the fourth voice signal.
  • S312a The first electronic device authenticates the voice signal collected by the fourth electronic device according to the third voiceprint model.
  • S312a may also be replaced with S312b.
  • S312b The first electronic device sends the third voiceprint model to the fourth electronic device.
  • the fourth electronic device receives the third voiceprint model from the first electronic device.
  • The first electronic device can also send the third voiceprint model to electronic devices other than the fourth electronic device, so that those devices can also authenticate, according to the third voiceprint model, voice signals collected by the fourth electronic device.
  • After the first electronic device acquires the first voice signal, it can first generate the first voiceprint model (acquire the first parameter information, adjust the first voice signal according to the first parameter information to obtain the second voice signal, and generate the first voiceprint model according to the second voice signal) and then generate the third voiceprint model (acquire the third parameter information, adjust the first voice signal according to the third parameter information to obtain the fourth voice signal, and generate the third voiceprint model according to the fourth voice signal).
  • the first electronic device may also first generate the third voiceprint model, and then generate the first voiceprint model, and may also execute the above two processes at the same time, without limitation.
  • S309-S312b can also be performed in the method shown in FIG. 4, for example, after the first electronic device acquires the first voice signal, after S303, after S308, or at the same time as S305-S308, without limitation.
  • the methods and/or steps implemented by the first electronic device may also be implemented by components (such as chips or circuits) that can be used in the first electronic device.
  • the above-mentioned electronic device includes corresponding hardware structures and/or software modules for performing each function.
  • The embodiments of the present application can be implemented in hardware or in a combination of hardware and computer software, in conjunction with the units and algorithm steps of the examples described in the embodiments disclosed herein. Whether a given function is executed by hardware or by computer software driving hardware depends on the specific application and the design constraints of the technical solution. Those skilled in the art may use different methods to implement the described functions for each specific application, but such implementations should not be regarded as exceeding the scope of the embodiments of the present application.
  • the embodiments of the present application may divide the above-mentioned electronic device into functional modules according to the above-mentioned method examples.
  • each functional module may be divided corresponding to each function, or two or more functions may be integrated into one processing module.
  • the above-mentioned integrated modules can be implemented in the form of hardware or in the form of software function modules. It should be noted that the division of modules in the embodiment of the present application is schematic, and is only a logical function division, and there may be other division methods in actual implementation.
  • the embodiment of the present application discloses an electronic device 600 , which may be the first electronic device in the foregoing embodiments.
  • The electronic device 600 may specifically include: an input device 601 (such as a mouse, a keyboard, or a touch screen); one or more processors 602; a memory 603; one or more application programs (not shown); and one or more computer programs 604. The above components may be connected through one or more communication buses 605.
  • The electronic device further includes a voice collection device (such as a recording device) for collecting voice signals.
  • The one or more computer programs 604 are stored in the memory 603 and configured to be executed by the one or more processors 602. The one or more computer programs 604 include instructions, and the instructions can be used to execute the relevant steps in the foregoing embodiments.
  • the electronic device 600 may be the electronic device 101, the electronic device 102, the electronic device 103, or the electronic device 104 in FIG. 1 .
  • The embodiment of the present application also provides a chip system, including at least one processor and an interface. The at least one processor is coupled to a memory through the interface, and when the at least one processor executes the computer program or instructions in the memory, the method in any of the foregoing method embodiments is performed.
  • the chip system further includes a memory.
  • the system-on-a-chip may consist of a chip, or may include a chip and other discrete devices, which is not specifically limited in this embodiment of the present application.
  • the embodiment of the present application also provides a computer-readable storage medium, where computer program code is stored, and when the processor executes the computer program code, the electronic device executes the method in the foregoing embodiments.
  • the embodiment of the present application also provides a computer program product, which causes the computer to execute the method in the foregoing embodiments when the computer program product is run on the computer.
  • The electronic device 600, the computer-readable storage medium, and the computer program product provided in the embodiments of the present application are all used to execute the corresponding methods provided above. For the beneficial effects they can achieve, refer to the beneficial effects of the corresponding methods, which are not repeated here.
  • the disclosed devices and methods may be implemented in other ways.
  • the device embodiments described above are only illustrative.
  • the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods.
  • Multiple units or components may be combined or integrated into another device, and some features may be omitted or not implemented.
  • The mutual coupling, direct coupling, or communication connection shown or discussed may be an indirect coupling or communication connection through some interfaces, devices, or units, and may be in electrical, mechanical, or other forms.
  • each functional unit in each embodiment of the present application may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.
  • If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it can be stored in a readable storage medium.
  • Based on this understanding, the technical solution of the embodiments of the present application, in essence, or the part of it that contributes to the prior art, or all or part of the technical solution, can be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions that cause a device (which may be a single-chip microcomputer, a chip, or the like) or a processor to execute all or part of the steps of the methods described in the embodiments of the present application.
  • The aforementioned storage media include various media that can store program code, such as a USB flash drive, a removable hard disk, a ROM, a magnetic disk, or an optical disc.
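Returning to the ordering remarks for S301-S312b above: each voiceprint model is derived independently from the same first voice signal, which is why the models can be generated in any order or at the same time. The sketch below illustrates only that point. The adjustment (sample decimation), the dictionary "model", and every name in it are hypothetical placeholders, not the algorithms of the application.

```python
from concurrent.futures import ThreadPoolExecutor


def build_model_for(first_signal, source_rate, params):
    """Toy adjust-then-model step for one target device (hypothetical)."""
    # Toy adjustment: keep every k-th sample to approach the target rate.
    step = max(1, source_rate // params["sampling_rate"])
    adjusted = first_signal[::step]
    # Toy "voiceprint model": two summary statistics of the adjusted signal.
    return {
        "samples": len(adjusted),
        "mean_abs": sum(abs(x) for x in adjusted) / len(adjusted),
    }


def build_all_models(first_signal, source_rate, params_by_device):
    """Build one model per device concurrently; no model depends on another."""
    with ThreadPoolExecutor() as pool:
        futures = {
            name: pool.submit(build_model_for, first_signal, source_rate, params)
            for name, params in params_by_device.items()
        }
        return {name: future.result() for name, future in futures.items()}
```

Calling `build_all_models` with parameter sets for the first, second, and fourth devices gives the same result as building the models one after another, because the only shared input is the unmodified first voice signal.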

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Business, Economics & Management (AREA)
  • Game Theory and Decision Science (AREA)
  • Telephone Function (AREA)

Abstract

A voiceprint registration method and electronic devices (101, 102, 103, 104), related to the field of voiceprint registration technology, and capable of improving the accuracy of voiceprint authentication. The method comprises: acquiring a first voice signal and acquiring first parameter information used for instructing a second electronic device to acquire a parameter of the voice signal (S301); adjusting the first voice signal in accordance with the first parameter information, to obtain a second voice signal (S302); generating a first voiceprint model in accordance with the second voice signal (S303); and using the first voiceprint model to authenticate the voice signal acquired by the second electronic device (S304a), or sending the first voiceprint model to the second electronic device (S304b).
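The steps S301 to S304a/S304b in the abstract can be sketched as a short pipeline. This is a minimal illustration under stated assumptions: the decimation in `adjust_signal`, the single-number "model", and the `send` callback are hypothetical stand-ins, since the abstract does not specify the adjustment algorithm, the model format, or the transport.

```python
def adjust_signal(first_signal, source_rate, params):
    """S302 (toy): decimate toward the target device's sampling rate, if given."""
    target = params.get("sampling_rate")
    if target is None or target >= source_rate:
        return list(first_signal)
    step = source_rate // target
    return list(first_signal[::step])


def generate_model(signal):
    """S303 (toy): the 'model' is just the mean absolute amplitude."""
    return {"mean_abs": sum(abs(x) for x in signal) / len(signal)}


def register(first_signal, source_rate, params, send=None):
    """Run S302 and S303, then optionally hand the model over (S304b)."""
    second_signal = adjust_signal(first_signal, source_rate, params)
    model = generate_model(second_signal)
    if send is not None:
        # S304b: deliver the model to the second electronic device.
        send(model)
    # Without a send callback, the caller keeps the model and can use it
    # to authenticate signals from the second device itself (S304a).
    return model
```

Passing a `send` callback corresponds to S304b. Omitting it leaves the model with the first electronic device, which corresponds to S304a.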

Description

Voiceprint registration method and electronic device
This application claims priority to the patent application No. 202111266367.4, entitled "Voiceprint Registration Method and Electronic Device", filed with the State Intellectual Property Office on October 28, 2021, the entire contents of which are incorporated herein by reference.
Technical field
The present application relates to the technical field of voiceprint registration, and in particular to a voiceprint registration method and an electronic device.
Background
In recent years, electronic devices have developed rapidly, and most of them now provide a voice interaction function. Through voice interaction, a user can converse with an electronic device or have it execute the user's commands, which is very convenient. Voice interaction has therefore gradually become an indispensable function of electronic devices.
Usually, voiceprint registration is required before a user interacts with an electronic device by voice. That is, the electronic device collects the user's voice signal, extracts a voiceprint from the collected voice signal, and registers it. Subsequently, when the user interacts with the electronic device by voice, the electronic device can authenticate the user according to the voiceprint. If the authentication succeeds, the user can perform voice interaction with the electronic device. If the authentication fails, the user cannot.
At present, to improve the security of voice interaction, the accuracy required of voiceprint authentication is increasingly high. However, the hardware of different electronic devices differs, and the environments in which they are located may also differ considerably, so the voiceprints extracted from voice signals of the same user collected by different electronic devices also differ to some extent. Therefore, in an environment where multiple electronic devices work together, if a voice signal collected by one electronic device has to be authenticated by another electronic device, the accuracy of voiceprint authentication is low.
Summary of the invention
Embodiments of the present application provide a voiceprint registration method and an electronic device, which can improve the accuracy of voiceprint authentication.
To achieve the above objective, the embodiments of the present application adopt the following technical solutions:
In a first aspect, a voiceprint registration method is provided, applied to a first electronic device. The method includes: acquiring a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals; adjusting the first voice signal according to the first parameter information to obtain a second voice signal; generating a first voiceprint model according to the second voice signal; and sending the first voiceprint model to the second electronic device, or authenticating, according to the first voiceprint model, a voice signal collected by the second electronic device.
Based on the method provided in the first aspect, the first electronic device can acquire the first voice signal and the first parameter information corresponding to the second electronic device, and adjust the first voice signal according to the first parameter information to obtain a second voice signal suitable for the second electronic device (the second voice signal can be regarded as equivalent to a voice signal collected by the second electronic device; that is, the first electronic device can simulate, from the first voice signal and the first parameter information, the voice signal that the second electronic device would collect), and generate the first voiceprint model according to the second voice signal. In this way, a voice signal is collected once, the voice signal collected by the second electronic device is simulated from it, and voiceprint registration is performed according to the simulated voice signal (that is, the second voice signal). Because the first electronic device simulates the second voice signal according to the parameters with which the second electronic device collects voice signals, the second voice signal is highly similar to the voice signal actually collected by the second electronic device, so performing voiceprint authentication on the voice signal collected by the second electronic device with the first voiceprint model generated from the second voice signal can improve the accuracy of voiceprint authentication.
In a possible implementation, the first parameter information includes at least one of the following: a microphone type of the second electronic device, a sampling rate of the second electronic device, a coding mode of the second electronic device, or information about the environment in which the second electronic device is located. Based on the above method, the first voice signal can be adjusted according to at least one of these parameters to obtain the second voice signal, which improves the flexibility and diversity of adjusting the first voice signal.
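One way to picture parameter information that carries "at least one of" the four items is a record whose fields are individually optional but may not all be absent. The sketch below is an assumption for illustration only; the class name, field names, and value types are hypothetical and are not defined by the application.

```python
from dataclasses import dataclass, fields
from typing import Optional


@dataclass(frozen=True)
class ParameterInfo:
    """Capture parameters of one device; each field is optional on its own."""
    mic_type: Optional[str] = None
    sampling_rate_hz: Optional[int] = None
    coding_mode: Optional[str] = None
    environment: Optional[str] = None

    def __post_init__(self):
        # The text requires at least one of the four items to be present.
        if all(getattr(self, f.name) is None for f in fields(self)):
            raise ValueError("parameter information must carry at least one item")

    def present_items(self):
        """Return only the items that were actually supplied."""
        return {
            f.name: getattr(self, f.name)
            for f in fields(self)
            if getattr(self, f.name) is not None
        }
```

An adjustment step could then iterate over `present_items()` and apply only the adjustments for which a target value exists.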
In a possible implementation, adjusting the first voice signal according to the first parameter information to obtain the second voice signal includes: using a first algorithm to make the parameters of the first voice signal approach the parameters indicated by the first parameter information, thereby obtaining the second voice signal. Based on the above method, the parameters of the first voice signal can be made to approach the parameters indicated by the first parameter information, so that the second voice signal is highly similar to the voice signal actually collected by the second electronic device.
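The application does not say what the first algorithm is. As one hedged illustration for a single parameter, the sampling rate, the sketch below resamples the first voice signal by linear interpolation so that its rate matches the rate indicated for the second electronic device. The function name and the choice of linear interpolation are assumptions, and a practical resampler would also low-pass filter before reducing the rate.

```python
def resample_linear(samples, source_rate, target_rate):
    """Resample by linear interpolation (hypothetical stand-in for the 'first algorithm')."""
    if source_rate <= 0 or target_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples:
        return []
    # Number of output samples covering the same duration as the input.
    n_out = max(1, round(len(samples) * target_rate / source_rate))
    last = len(samples) - 1
    out = []
    for i in range(n_out):
        # Position of output sample i on the input sample axis.
        pos = i * source_rate / target_rate
        lo = min(int(pos), last)
        hi = min(lo + 1, last)
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out
```

For example, converting a four-sample signal from 16000 Hz to 8000 Hz keeps every second sample, while converting from 8000 Hz to 16000 Hz inserts interpolated midpoints.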
In a possible implementation, acquiring the first voice signal includes: receiving the first voice signal from a third electronic device; or collecting the first voice signal. Based on the above method, the first electronic device may obtain the first voice signal from the third electronic device, or may collect the first voice signal itself.
In a possible implementation, authenticating, according to the first voiceprint model, the voice signal collected by the second electronic device includes: receiving, from the second electronic device, the voice signal collected by the second electronic device; and inputting the voice signal collected by the second electronic device into the first voiceprint model for voiceprint authentication. Based on the above method, the first voiceprint model corresponding to the second electronic device can be used to perform voiceprint authentication on the voice signal collected by the second electronic device, which improves the accuracy of voiceprint authentication.
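"Inputting the voice signal into the voiceprint model" can be pictured as comparing features of the collected signal with a template stored at registration. The sketch below assumes a toy two-value feature vector and a cosine-similarity threshold. The feature extractor, the threshold value, and the class are hypothetical and far simpler than a real voiceprint model.

```python
import math


def toy_features(samples):
    """Hypothetical feature extractor: [mean absolute amplitude, zero-crossing rate]."""
    if len(samples) < 2:
        raise ValueError("need at least two samples")
    mean_abs = sum(abs(x) for x in samples) / len(samples)
    crossings = sum(1 for a, b in zip(samples, samples[1:]) if (a < 0) != (b < 0))
    return [mean_abs, crossings / (len(samples) - 1)]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


class VoiceprintModel:
    """Toy model: stores a template from the simulated (second) voice signal."""

    def __init__(self, enrolled_signal, threshold=0.95):
        self.template = toy_features(enrolled_signal)
        self.threshold = threshold

    def authenticate(self, collected_signal):
        # Accept when the collected signal's features are close to the template.
        return cosine(self.template, toy_features(collected_signal)) >= self.threshold
```

In this picture, the first electronic device builds the model from the second voice signal and later calls `authenticate` on the signal received from the second electronic device.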
In a possible implementation, the method further includes: acquiring second parameter information, where the second parameter information indicates the parameters with which the first electronic device collects voice signals; adjusting the first voice signal according to the second parameter information to obtain a third voice signal; generating a second voiceprint model according to the third voice signal; and authenticating, according to the second voiceprint model, a voice signal collected by the first electronic device. Based on the above method, the first electronic device can acquire the second parameter information corresponding to the first electronic device and adjust the first voice signal according to the second parameter information to obtain a third voice signal suitable for the first electronic device (the third voice signal can be regarded as equivalent to a voice signal collected by the first electronic device; that is, the first electronic device can simulate, from the first voice signal and the second parameter information, the voice signal that it would itself collect), and generate the second voiceprint model according to the third voice signal. In this way, a voice signal is collected once, the voice signal collected by the first electronic device is simulated from it, and voiceprint registration is performed according to the simulated voice signal (that is, the third voice signal). Because the first electronic device simulates the third voice signal according to the parameters with which it collects voice signals, the third voice signal is highly similar to the voice signal actually collected by the first electronic device, so performing voiceprint authentication on the voice signal collected by the first electronic device with the second voiceprint model generated from the third voice signal can improve the accuracy of voiceprint authentication. In addition, if the first voice signal was collected by the first electronic device, adjusting the first voice signal with the second parameter information can enrich the voice signals used for voiceprint registration and further improve the accuracy of voiceprint authentication.
In a possible implementation, the second parameter information includes at least one of the following: a microphone type of the first electronic device, a sampling rate of the first electronic device, a coding mode of the first electronic device, or information about the environment in which the first electronic device is located. Based on the above method, the first voice signal can be adjusted according to at least one of these parameters to obtain the third voice signal, which improves the flexibility and diversity of adjusting the first voice signal.
In a possible implementation, adjusting the first voice signal according to the second parameter information to obtain the third voice signal includes: using a second algorithm to make the parameters of the first voice signal approach the parameters indicated by the second parameter information, thereby obtaining the third voice signal, where the second algorithm is the same as or different from the first algorithm. Based on the above method, the parameters of the first voice signal can be made to approach the parameters indicated by the second parameter information, so that the third voice signal is highly similar to the voice signal actually collected by the first electronic device.
In a possible implementation, authenticating, according to the second voiceprint model, the voice signal collected by the first electronic device includes: collecting a voice signal; and inputting the voice signal collected by the first electronic device into the second voiceprint model for voiceprint authentication. Based on the above method, the second voiceprint model corresponding to the first electronic device can be used to perform voiceprint authentication on the voice signal collected by the first electronic device, which improves the accuracy of voiceprint authentication.
在一种可能的实现方式中,该方法还包括:获取第三参数信息,该第三参数信息用于指示第四电子设备采集语音信号的参数;根据该第三参数信息调整该第一语音信号,得到第四语音信号;根据该第四语音信号生成第三声纹模型,向该第四电子设备发送该第三声纹模型,或者根据该第三声纹模型对第四电子设备采集的语音信号进行认证。基于上述方法,第一电子设备可以获取第四电子设备对应的第三参数信息,根据第三参数信息调整第一语音信号,得到适用于第四电子设备的第四语音信号(该第四语音信号可以相当于第四电子设备采集的语音信号,也就是说,第一电子设备可以根据第一语音信号和第三参数信息模拟第四电子设备采集的语音信号),并根据第四语音信号生成第三声纹模型。如此,可以实现采集一次语音信号,根据该语音信号模拟出第四电子设备采集的语音信号,根据模拟出的语音信号(即第四语音信号)进行声纹注册。其中,第一电子设备是根据第四电子设备采集语音信号的参数模拟出的第四语音信号,因此,第四语音信号与第四电子设备真实采集的语音信号的相似度非常高,所以根据第四语音信号生成的第三声纹模型对第四电子设备采集的语音信号进行声纹认证,可以提高声纹认证的准确性。In a possible implementation manner, the method further includes: acquiring third parameter information, where the third parameter information is used to instruct the fourth electronic device to collect parameters of the voice signal; adjusting the first voice signal according to the third parameter information , to obtain a fourth voice signal; generate a third voiceprint model according to the fourth voice signal, send the third voiceprint model to the fourth electronic device, or collect the voice of the fourth electronic device according to the third voiceprint model The signal is authenticated. Based on the above method, the first electronic device can obtain the third parameter information corresponding to the fourth electronic device, adjust the first voice signal according to the third parameter information, and obtain the fourth voice signal suitable for the fourth electronic device (the fourth voice signal It can be equivalent to the voice signal collected by the fourth electronic device, that is, the first electronic device can simulate the voice signal collected by the fourth electronic device according to the first voice signal and the third parameter information), and generate the first voice signal according to the fourth voice signal Three voiceprint models. 
In this way, it is possible to collect a voice signal once, simulate the voice signal collected by the fourth electronic device according to the voice signal, and perform voiceprint registration according to the simulated voice signal (that is, the fourth voice signal). Wherein, the first electronic device is the fourth voice signal simulated according to the parameters of the voice signal collected by the fourth electronic device. Therefore, the similarity between the fourth voice signal and the voice signal actually collected by the fourth electronic device is very high, so according to the first The third voiceprint model generated from the four voice signals performs voiceprint authentication on the voice signals collected by the fourth electronic device, which can improve the accuracy of voiceprint authentication.
在一种可能的实现方式中,该第三参数信息包括以下至少一项:该第四电子设备的麦克类型、该第四电子设备的采样率、该第四电子设备的编码方式或该第四电子设备所处的环境信息。基于上述方法,可以根据上述至少一种参数调整第一语音信号,得到第四语音信号,提高了调整第一语音信号的灵活性和多样性。In a possible implementation manner, the third parameter information includes at least one of the following: the microphone type of the fourth electronic device, the sampling rate of the fourth electronic device, the encoding method of the fourth electronic device, or the fourth Information about the environment in which the electronic device is located. Based on the above method, the first speech signal can be adjusted according to the above at least one parameter to obtain the fourth speech signal, which improves the flexibility and diversity of adjusting the first speech signal.
In a possible implementation, adjusting the first voice signal according to the third parameter information to obtain the fourth voice signal includes: using a third algorithm to make the parameters of the first voice signal approach the parameters indicated by the third parameter information, thereby obtaining the fourth voice signal. The third algorithm may be the same as or different from the first algorithm, and the same as or different from the second algorithm. With this method, the parameters of the first voice signal are brought close to the parameters indicated by the third parameter information, so that the fourth voice signal is highly similar to a voice signal actually collected by the fourth electronic device.
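The application does not specify the adjustment algorithm. As one illustration of making a signal parameter approach a target value, the sketch below converts a mono signal to a target sampling rate by linear interpolation. The function name `resample_linear` is hypothetical, and the technique is an assumption chosen for brevity; a practical implementation would normally apply an anti-aliasing filter before downsampling and would also handle microphone response, encoding, and environment.

```python
def resample_linear(samples, src_rate_hz, dst_rate_hz):
    """Resample a mono signal by linear interpolation (illustrative only)."""
    if src_rate_hz <= 0 or dst_rate_hz <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples:
        return []
    # Number of output samples covering the same duration as the input.
    n_out = max(1, round(len(samples) * dst_rate_hz / src_rate_hz))
    step = src_rate_hz / dst_rate_hz  # source samples per output sample
    out = []
    for i in range(n_out):
        pos = i * step                        # fractional position in the source
        lo = min(int(pos), len(samples) - 1)  # left neighbour
        hi = min(lo + 1, len(samples) - 1)    # right neighbour, clamped at the end
        frac = pos - lo
        out.append(samples[lo] * (1.0 - frac) + samples[hi] * frac)
    return out


# Halving the rate keeps every second sample of this ramp.
print(resample_linear([0.0, 1.0, 2.0, 3.0], 4, 2))
```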
In a possible implementation, authenticating, according to the third voiceprint model, the voice signal collected by the fourth electronic device includes: receiving, from the fourth electronic device, the voice signal collected by the fourth electronic device; and inputting that voice signal into the third voiceprint model to perform voiceprint authentication. With this method, the third voiceprint model corresponding to the fourth electronic device is used to perform voiceprint authentication on the voice signal collected by the fourth electronic device, which improves the accuracy of voiceprint authentication.
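The application does not state how a voiceprint model scores an incoming voice signal. A common approach, assumed here purely for illustration, is to represent both the model and the incoming signal as fixed-length feature vectors and to accept the speaker when their cosine similarity reaches a threshold. The function names, the vector representation, and the threshold value 0.8 below are all hypothetical.

```python
import math


def cosine_similarity(a, b):
    """Cosine similarity of two equal-length feature vectors."""
    if len(a) != len(b) or not a:
        raise ValueError("vectors must be non-empty and of equal length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # a zero vector carries no speaker information
    return dot / (norm_a * norm_b)


def authenticate(model_vector, probe_vector, threshold=0.8):
    """Accept the probe if it is close enough to the enrolled model."""
    return cosine_similarity(model_vector, probe_vector) >= threshold


print(authenticate([1.0, 0.0], [2.0, 0.0]))
print(authenticate([1.0, 0.0], [0.0, 1.0]))
```

In this sketch the feature extraction that turns a voice signal into `probe_vector` is left out; it would be part of whatever voiceprint model an implementation actually uses.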
In a second aspect, an embodiment of this application provides an electronic device. The electronic device includes an obtaining module, a processing module, and a sending module. The obtaining module is configured to obtain a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals. The processing module is configured to adjust the first voice signal according to the first parameter information to obtain a second voice signal, and is further configured to generate a first voiceprint model from the second voice signal. The sending module is configured to send the first voiceprint model to the second electronic device. Alternatively, the electronic device includes an obtaining module and a processing module. The obtaining module is configured to obtain a first voice signal and first parameter information, where the first parameter information indicates the parameters with which a second electronic device collects voice signals. The processing module is configured to adjust the first voice signal according to the first parameter information to obtain a second voice signal, to generate a first voiceprint model from the second voice signal, and to authenticate, according to the first voiceprint model, a voice signal collected by the second electronic device.
In a possible implementation, the first parameter information includes at least one of the following: the microphone type of the second electronic device, the sampling rate of the second electronic device, the encoding scheme of the second electronic device, or information about the environment in which the second electronic device is located.
In a possible implementation, the processing module is specifically configured to use a first algorithm to make the parameters of the first voice signal approach the parameters indicated by the first parameter information, thereby obtaining the second voice signal.
In a possible implementation, the obtaining module is specifically configured to receive the first voice signal from a third electronic device; or the obtaining module is specifically configured to collect the first voice signal.
In a possible implementation, the processing module is specifically configured to receive, from the second electronic device, the voice signal collected by the second electronic device, and to input that voice signal into the first voiceprint model to perform voiceprint authentication.
In a possible implementation, the obtaining module is further configured to obtain second parameter information, where the second parameter information indicates the parameters with which the first electronic device collects voice signals. The processing module is further configured to adjust the first voice signal according to the second parameter information to obtain a third voice signal, to generate a second voiceprint model from the third voice signal, and to authenticate, according to the second voiceprint model, a voice signal collected by the first electronic device.
In a possible implementation, the second parameter information includes at least one of the following: the microphone type of the first electronic device, the sampling rate of the first electronic device, the encoding scheme of the first electronic device, or information about the environment in which the first electronic device is located.
In a possible implementation, the processing module is specifically configured to use a second algorithm to make the parameters of the first voice signal approach the parameters indicated by the second parameter information, thereby obtaining the third voice signal. The second algorithm may be the same as or different from the first algorithm.
In a possible implementation, the processing module is specifically configured to collect a voice signal, and to input the voice signal collected by the first electronic device into the second voiceprint model to perform voiceprint authentication.
In a possible implementation, the obtaining module is further configured to obtain third parameter information, where the third parameter information indicates the parameters with which a fourth electronic device collects voice signals. The processing module is further configured to adjust the first voice signal according to the third parameter information to obtain a fourth voice signal, and to generate a third voiceprint model from the fourth voice signal. Either the sending module is further configured to send the third voiceprint model to the fourth electronic device, or the processing module is further configured to authenticate, according to the third voiceprint model, a voice signal collected by the fourth electronic device.
In a possible implementation, the third parameter information includes at least one of the following: the microphone type of the fourth electronic device, the sampling rate of the fourth electronic device, the encoding scheme of the fourth electronic device, or information about the environment in which the fourth electronic device is located.
In a possible implementation, the processing module is further configured to use a third algorithm to make the parameters of the first voice signal approach the parameters indicated by the third parameter information, thereby obtaining the fourth voice signal. The third algorithm may be the same as or different from the first algorithm, and the same as or different from the second algorithm.
In a possible implementation, the processing module is further configured to receive, from the fourth electronic device, the voice signal collected by the fourth electronic device, and to input that voice signal into the third voiceprint model to perform voiceprint authentication.
In a third aspect, an electronic device is provided, including a processor. The processor is configured to be coupled to a memory and, after reading instructions in the memory, to perform the method according to any one of the foregoing aspects according to the instructions. The electronic device may be the first electronic device in the first aspect.
With reference to the third aspect, in a possible implementation, the electronic device further includes a memory configured to store the necessary program instructions and data.
With reference to the third aspect, in a possible implementation, the electronic device is a chip or a chip system. Optionally, when the electronic device is a chip system, it may consist of a chip, or may include a chip and other discrete components.
In a fourth aspect, an electronic device is provided, including a processor and an interface circuit. The interface circuit is configured to receive a computer program or instructions and transmit them to the processor. The processor is configured to execute the computer program or instructions, so that the electronic device performs the method according to the first aspect.
With reference to the fourth aspect, in a possible implementation, the electronic device is a chip or a chip system. Optionally, when the electronic device is a chip system, it may consist of a chip, or may include a chip and other discrete components.
In a fifth aspect, a computer-readable storage medium is provided. The computer-readable storage medium stores instructions that, when run on a computer, enable the computer to perform the method according to the first aspect.
In a sixth aspect, a computer program product containing instructions is provided. When run on a computer, the instructions enable the computer to perform the method according to the first aspect.
For the technical effects of any possible implementation of the second to sixth aspects, refer to the technical effects of the first aspect or of the corresponding possible implementations of the first aspect. Details are not repeated here.
Brief Description of the Drawings
FIG. 1 is a schematic diagram of the architecture of a voiceprint registration system according to an embodiment of this application;
FIG. 2 is a schematic structural diagram of a mobile phone according to an embodiment of this application;
FIG. 3 is a first schematic flowchart of a voiceprint registration method according to an embodiment of this application;
FIG. 4 is a second schematic flowchart of a voiceprint registration method according to an embodiment of this application;
FIG. 5 is a third schematic flowchart of a voiceprint registration method according to an embodiment of this application;
FIG. 6 is a schematic structural diagram of an electronic device according to an embodiment of this application.
Detailed Description
The terms used in the following embodiments are for the purpose of describing particular embodiments only and are not intended to limit this application. As used in the specification and the appended claims of this application, the singular forms "a", "an", "said", "the above", "the", and "this" are intended to also cover expressions such as "one or more", unless the context clearly indicates otherwise. It should also be understood that, in the following embodiments of this application, "at least one" and "one or more" mean one, two, or more than two. The term "and/or" describes an association between associated objects and indicates that three relationships may exist. For example, "A and/or B" may mean that A exists alone, that both A and B exist, or that B exists alone, where A and B may each be singular or plural. The character "/" generally indicates an "or" relationship between the objects before and after it.
References in this specification to "one embodiment", "some embodiments", and the like mean that a particular feature, structure, or characteristic described in connection with that embodiment is included in one or more embodiments of this application. Therefore, the phrases "in one embodiment", "in some embodiments", "in some other embodiments", "in still other embodiments", and the like, appearing in different places in this specification, do not necessarily all refer to the same embodiment; rather, they mean "one or more but not all embodiments", unless specifically emphasized otherwise. The terms "include", "comprise", "have", and their variants all mean "including but not limited to", unless specifically emphasized otherwise. The term "connection" covers both direct and indirect connection, unless otherwise stated.
It should be noted that the names of messages exchanged between electronic devices, and the names of parameters in those messages, in the following embodiments of this application are only examples. Other names may be used in a specific implementation, and the embodiments of this application do not specifically limit them.
To facilitate the description of the technical solutions of the embodiments of this application, words such as "first" and "second" may be used in the embodiments to distinguish technical features that have the same or similar functions. The words "first", "second", and so on do not limit quantity or execution order, and do not require the features they label to be different. In the embodiments of this application, words such as "exemplary" or "for example" are used to indicate an example, illustration, or explanation. Any embodiment or design described as "exemplary" or "for example" should not be construed as being preferred over, or more advantageous than, other embodiments or designs. Words such as "exemplary" or "for example" are intended to present a related concept in a concrete manner for ease of understanding. In addition, where "first", "second", "third", and so on are used in the embodiments of this application to distinguish technical features of the same kind, the features so labeled have no order of precedence or magnitude.
It can be understood that, in the embodiments of this application, descriptions of the same step, or of steps or technical features having the same function, may be cross-referenced between different embodiments.
As described in the Background, different electronic devices differ in hardware, and the environments in which they are located may also differ considerably. As a result, the voiceprints extracted from voice signals of the same user collected by different electronic devices also differ to some extent. Therefore, in an environment where multiple electronic devices work together, if a voice signal collected by one electronic device has to be authenticated by another electronic device, the accuracy of voiceprint authentication is low.
To address the low accuracy of voiceprint authentication, the embodiments of this application provide the following three methods:
Method 1: A voiceprint registration algorithm may be preset in the electronic device. The voiceprint registration algorithm is trained on voice signals from different environments, different sound pickup hardware, and different speakers. After obtaining a voice signal for registration, the electronic device can use the voiceprint registration algorithm to perform voiceprint registration on that signal and establish a voiceprint model. The electronic device can subsequently authenticate the user according to the voiceprint model. Because the voiceprint registration algorithm is trained on voice signals from different environments, different sound pickup hardware, and different speakers, it can extract more comprehensive and deeper voiceprint information and is more robust, which can improve the accuracy of voiceprint authentication performed with the voiceprint model.
Method 2: Voiceprint registration may be performed separately on each of the multiple electronic devices. The user can subsequently be authenticated on each electronic device. Because the user registers and authenticates on the same device, the accuracy of voiceprint authentication can be improved.
Method 3: The first electronic device may obtain a first voice signal and first parameter information, adjust the first voice signal according to the first parameter information to obtain a second voice signal, generate a first voiceprint model from the second voice signal, and either send the first voiceprint model to the second electronic device or authenticate, according to the first voiceprint model, a voice signal collected by the second electronic device. The first parameter information may indicate the parameters with which the second electronic device collects voice signals. The specific process of Method 3 is described in detail in the method shown in FIG. 3 below and is not elaborated here.
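The flow of Method 3 can be sketched as a loop that derives one voiceprint model per target device from a single recording. This is only an orchestration sketch: `register_for_devices` is a hypothetical name, and `toy_adjust`, `toy_enroll`, and the `gain` parameter are toy stand-ins invented for illustration, not the adjustment or enrollment algorithms of the application.

```python
def register_for_devices(first_signal, params_by_device, adjust, enroll):
    """Build one voiceprint model per target device from a single recording."""
    models = {}
    for device_id, params in params_by_device.items():
        adjusted = adjust(first_signal, params)  # simulated capture on that device
        models[device_id] = enroll(adjusted)     # device-specific voiceprint model
    return models


# Toy stand-ins, purely illustrative: scale the signal by a per-device gain
# and use the mean absolute amplitude as a one-number "model".
def toy_adjust(signal, params):
    return [sample * params["gain"] for sample in signal]


def toy_enroll(signal):
    return sum(abs(sample) for sample in signal) / len(signal)


models = register_for_devices(
    [1.0, -1.0, 0.5, -0.5],
    {"device_102": {"gain": 0.5}, "device_104": {"gain": 2.0}},
    toy_adjust,
    toy_enroll,
)
print(models)
```

Each resulting model could then be sent to its device or kept on the first electronic device for authenticating signals received from that device, matching the two alternatives described in Method 3.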
It can be understood that, compared with Method 1, Method 3 does not need to collect voice signals from different environments, different sound pickup hardware, and different speakers, so the training cost and the model complexity are both lower. In addition, in Method 1 the voiceprint registration algorithm is preset in the electronic device and is difficult to update when conditions such as the environment or hardware of the electronic device change, which results in a poor user experience. Method 3, by contrast, can update the first parameter information at any time and adjust the first voice signal according to the updated first parameter information, which is more flexible and gives a better user experience. Compared with Method 2, Method 3 does not require voiceprint registration on each of the multiple electronic devices, which also gives a better user experience. Moreover, for an electronic device that does not support voiceprint registration, Method 3 can help that device authenticate users and thus improve the security of voice interaction.
The implementations of the embodiments of this application are described in detail below with reference to the accompanying drawings.
FIG. 1 is a schematic diagram of the architecture of a voiceprint registration system according to an embodiment of this application. The voiceprint registration system may include at least an electronic device 101 and an electronic device 102. Optionally, the voiceprint registration system may further include an electronic device 103 and/or an electronic device 104.
The electronic devices in FIG. 1 (for example, the electronic device 101 and the electronic device 102, or the electronic device 101 and the electronic device 103) may be connected in a wired manner (for example, through a universal serial bus (USB) data cable) or in a wireless manner. The embodiments of this application do not limit the specific connection manner. When the electronic devices in FIG. 1 are connected wirelessly, the wireless communication protocol used may be the wireless fidelity (Wi-Fi) protocol, any of various cellular network protocols (for example, a fourth generation (4G) or fifth generation (5G) communication network protocol), or the like. No specific limitation is imposed here.
In some embodiments, the electronic devices in FIG. 1 may form a super terminal. For example, the electronic devices in FIG. 1 may authenticate one another based on any authentication mechanism (such as the HiChian mechanism), and the electronic devices that pass authentication may form a super terminal. It can be understood that a super terminal may include multiple electronic devices that are networked with one another and that are mutually trusted devices.
In a specific implementation, an electronic device in FIG. 1, such as the electronic device 101, the electronic device 102, the electronic device 103, or the electronic device 104, may be a mobile phone, a tablet computer, a handheld computer, a personal computer (PC), a cellular phone, a personal digital assistant (PDA), a wearable device (such as a smart watch or a smart band), a game console, an augmented reality (AR) or virtual reality (VR) device, or another electronic device. The embodiments of this application do not specifically limit the device form of the electronic devices in FIG. 1. For example, an electronic device in FIG. 1 may also be a smart home device (such as a television or a smart speaker), an in-vehicle computer (also called a head unit), or the like. In the embodiments of this application, the electronic devices in FIG. 1 may have the same device form; for example, the electronic device 101 and the electronic device 102 are both mobile phones. The electronic devices in FIG. 1 may also have different device forms; for example, the electronic device 101 is a mobile phone and the electronic device 102 is a tablet computer, or the electronic device 101 is a smart watch and the electronic device 102 is a PC.
An electronic device in FIG. 1 may be a touchscreen device or a non-touchscreen device. A touchscreen device can be controlled by tapping, sliding, and similar operations on the screen with a finger, a stylus, or the like. A non-touchscreen device can be connected to an input device such as a mouse, a keyboard, or a touch panel and controlled through that input device. In the embodiments of this application, the electronic devices in FIG. 1 are all electronic devices that can run an operating system and install applications. The operating system of an electronic device in FIG. 1 may be HarmonyOS, Android, iOS, Windows, macOS, Linux, or the like, which is not specifically limited in the embodiments of this application. The operating systems of the electronic devices in FIG. 1 may be the same or different. As an example, each electronic device in FIG. 1 may include a memory and a processor, where the memory may be used to store the operating system and the processor may be used to run the operating system stored in the memory.
In the embodiments of this application, the memory may also be referred to as a storage device. It is used to store the operating system and the data on which the processor operates, and may also be used to run the programs of applications installed on the electronic device. As an example, the memory may be the internal memory 121 in FIG. 2.
In the embodiments of this application, a distributed system may be deployed on the electronic devices shown in FIG. 1. Electronic devices on which the distributed system is deployed can perform the voiceprint registration method provided in the embodiments of this application. With this method, one electronic device can adjust a voice signal according to the parameters with which another electronic device collects voice signals, and generate a voiceprint model from the adjusted voice signal. It can then either perform voiceprint authentication, according to the generated voiceprint model, on voice signals collected by the other electronic device, which can improve the accuracy of voiceprint authentication, or send the generated voiceprint model to the other electronic device so that the other electronic device can perform voiceprint authentication on the voice signals it collects according to that model.
The voiceprint registration system shown in FIG. 1 is only an example and is not intended to limit the technical solutions of the embodiments of this application. A person skilled in the art should understand that, in a specific implementation, the voiceprint registration system may further include other devices, and the number of electronic devices may be determined according to specific needs, without limitation.
In the embodiments of this application, a mobile phone is used as an example of the electronic device. FIG. 2 is a schematic structural diagram of a mobile phone according to an embodiment of this application. The methods in the following embodiments can be implemented in a mobile phone having the following hardware structure.
As shown in FIG. 2, the mobile phone may include a processor 110, an external memory interface 120, an internal memory 121, a universal serial bus (USB) interface 130, an antenna 1, an antenna 2, a mobile communication module 150, a wireless communication module 160, an audio module 170, a speaker 170A, a receiver 170B, a microphone 170C, a headset jack 170D, a sensor module 180, and the like.
It can be understood that the structure illustrated in this embodiment of this application does not constitute a specific limitation on the mobile phone. In other embodiments of this application, the mobile phone may include more or fewer components than shown, combine some components, split some components, or arrange the components differently. The illustrated components may be implemented in hardware, software, or a combination of software and hardware.
The processor 110 may include one or more processing units. For example, the processor 110 may include an application processor (AP), a modem processor, a graphics processing unit (GPU), an image signal processor (ISP), a controller, a memory, a video codec, a digital signal processor (DSP), a baseband processor, and/or a neural-network processing unit (NPU). Different processing units may be independent devices or may be integrated in one or more processors.
A memory may also be provided in the processor 110 to store instructions and data. In some embodiments, the memory in the processor 110 is a cache. This memory may hold instructions or data that the processor 110 has just used or uses cyclically. If the processor 110 needs the instructions or data again, it can fetch them directly from this memory. This avoids repeated accesses and reduces the waiting time of the processor 110, thereby improving the efficiency of the system.
The wireless communication function of the mobile phone can be implemented by the antenna 1, the antenna 2, the mobile communication module 150, the wireless communication module 160, the modem processor, the baseband processor, and the like.
The antenna 1 and the antenna 2 are used to transmit and receive electromagnetic wave signals. Each antenna in the mobile phone can cover one or more communication frequency bands. Different antennas can also be multiplexed to improve antenna utilization. For example, the antenna 1 can be multiplexed as a diversity antenna of a wireless local area network. In other embodiments, an antenna may be used in combination with a tuning switch.
The mobile communication module 150 can provide wireless communication solutions applied to the mobile phone, including 2G/3G/4G/5G. The mobile communication module 150 may include at least one filter, a switch, a power amplifier, a low noise amplifier (LNA), and the like. The mobile communication module 150 can receive electromagnetic waves through the antenna 1, filter and amplify the received electromagnetic waves, and send them to the modem processor for demodulation. The mobile communication module 150 can also amplify a signal modulated by the modem processor and radiate it as electromagnetic waves through the antenna 1. In some embodiments, at least some functional modules of the mobile communication module 150 may be provided in the processor 110. In some embodiments, at least some functional modules of the mobile communication module 150 and at least some modules of the processor 110 may be provided in the same device.
The wireless communication module 160 can provide wireless communication solutions applied to the mobile phone, including wireless local area networks (WLAN) (such as a wireless fidelity (Wi-Fi) network), Bluetooth (BT), global navigation satellite system (GNSS), frequency modulation (FM), near field communication (NFC), and infrared (IR) technology. The wireless communication module 160 may be one or more devices integrating at least one communication processing module. The wireless communication module 160 receives electromagnetic waves via the antenna 2, performs frequency modulation and filtering on the electromagnetic wave signal, and sends the processed signal to the processor 110. The wireless communication module 160 can also receive a signal to be sent from the processor 110, perform frequency modulation and amplification on it, and radiate it as electromagnetic waves through the antenna 2.
In some embodiments, the antenna 1 of the mobile phone is coupled to the mobile communication module 150, and the antenna 2 is coupled to the wireless communication module 160, so that the mobile phone can communicate with networks and other devices through wireless communication technologies. The wireless communication technologies may include global system for mobile communications (GSM), general packet radio service (GPRS), code division multiple access (CDMA), wideband code division multiple access (WCDMA), time-division code division multiple access (TD-SCDMA), long term evolution (LTE), BT, GNSS, WLAN, NFC, FM, and/or IR technologies. The GNSS may include the global positioning system (GPS), the global navigation satellite system (GLONASS), the Beidou navigation satellite system (BDS), the quasi-zenith satellite system (QZSS), and/or satellite based augmentation systems (SBAS).
The mobile phone implements the display function through the GPU, the display screen 194, the application processor, and the like. The GPU is a microprocessor for image processing and is connected to the display screen 194 and the application processor. The GPU performs mathematical and geometric calculations for graphics rendering. The processor 110 may include one or more GPUs that execute program instructions to generate or change display information.
The display screen 194 is used to display images, videos, and the like. The display screen 194 includes a display panel. The display panel may be a liquid crystal display (LCD), an organic light-emitting diode (OLED), an active-matrix organic light emitting diode (AMOLED), a flex light-emitting diode (FLED), Miniled, MicroLed, Micro-oLed, quantum dot light emitting diodes (QLED), or the like. In some embodiments, the mobile phone may include 1 or N display screens 194, where N is a positive integer greater than 1.
The mobile phone can implement the shooting function through the ISP, the camera 193, the video codec, the GPU, the display screen 194, the application processor, and the like.
The ISP is used to process data fed back by the camera 193. For example, when a photo is taken, the shutter opens, light passes through the lens to the photosensitive element of the camera, and the optical signal is converted into an electrical signal. The photosensitive element passes the electrical signal to the ISP, which converts it into an image visible to the naked eye. The ISP can also algorithmically optimize the noise, brightness, and skin tone of the image, as well as parameters such as the exposure and color temperature of the shooting scene. In some embodiments, the ISP may be provided in the camera 193.
The camera 193 is used to capture still images or video. An object produces an optical image through the lens, and the image is projected onto the photosensitive element. The photosensitive element may be a charge coupled device (CCD) or a complementary metal-oxide-semiconductor (CMOS) phototransistor. The photosensitive element converts the optical signal into an electrical signal and passes the electrical signal to the ISP, which converts it into a digital image signal. The ISP outputs the digital image signal to the DSP for processing. The DSP converts the digital image signal into an image signal in a standard format such as RGB or YUV. In some embodiments, the mobile phone may include 1 or N cameras 193, where N is a positive integer greater than 1.
The digital signal processor is used to process digital signals. In addition to digital image signals, it can process other digital signals. For example, when the mobile phone selects a frequency point, the digital signal processor performs a Fourier transform or similar operation on the energy of the frequency point.
The video codec is used to compress or decompress digital video. The mobile phone can support one or more video codecs. In this way, the mobile phone can play or record videos in multiple encoding formats, for example moving picture experts group (MPEG) 1, MPEG2, MPEG3, and MPEG4.
The external memory interface 120 can be used to connect an external memory card, such as a Micro SD card, to expand the storage capacity of the mobile phone. The external memory card communicates with the processor 110 through the external memory interface 120 to implement a data storage function, for example saving files such as music and videos on the external memory card.
The internal memory 121 can be used to store computer-executable program code, which includes instructions. The processor 110 executes the various functional applications and data processing of the mobile phone by running the instructions stored in the internal memory 121. The internal memory 121 may include a program storage area and a data storage area. The program storage area can store an operating system and the application programs required by at least one function (such as a sound playback function or an image playback function). The data storage area can store data created during use of the mobile phone (such as audio data and the phone book). In addition, the internal memory 121 may include a high-speed random access memory and may also include a non-volatile memory, such as at least one magnetic disk storage device, a flash memory device, or universal flash storage (UFS).
The mobile phone can implement audio functions, such as music playback and recording, through the audio module 170, the speaker 170A, the receiver 170B, the microphone 170C, the earphone interface 170D, the application processor, and the like.
The audio module 170 is used to convert digital audio information into an analog audio signal for output, and to convert an analog audio input into a digital audio signal. The audio module 170 can also be used to encode and decode audio signals. In some embodiments, the audio module 170 may be provided in the processor 110, or some functional modules of the audio module 170 may be provided in the processor 110.
The speaker 170A, also called a "horn", is used to convert an audio electrical signal into a sound signal. The mobile phone can play music or a hands-free call through the speaker 170A.
The receiver 170B, also called an "earpiece", is used to convert an audio electrical signal into a sound signal. When the mobile phone answers a call or plays a voice message, the voice can be heard by holding the receiver 170B close to the ear.
The microphone 170C, also called a "mike" or "mic", is used to convert a sound signal into an electrical signal. When making a call or sending a voice message, the user can speak with the mouth close to the microphone 170C to input the sound signal into the microphone 170C. The mobile phone may be provided with at least one microphone 170C. In other embodiments, the mobile phone may be provided with two microphones 170C, which can implement noise reduction in addition to collecting sound signals. In still other embodiments, the mobile phone may be provided with three, four, or more microphones 170C to collect sound signals, reduce noise, identify the sound source, implement directional recording, and so on.
The earphone interface 170D is used to connect wired earphones. The earphone interface 170D may be the USB interface 130, a 3.5 mm open mobile terminal platform (OMTP) standard interface, or a cellular telecommunications industry association of the USA (CTIA) standard interface.
The sensor module 180 may include one or more of the following sensors: a pressure sensor, a gyroscope sensor, an air pressure sensor, a magnetic sensor, an acceleration sensor, a distance sensor, a proximity light sensor, a fingerprint sensor, a temperature sensor, a touch sensor, an ambient light sensor, a bone conduction sensor, and the like.
Of course, the mobile phone may also include a charging management module, a power management module, a battery, buttons, an indicator, one or more SIM card interfaces, and the like, which is not limited in the embodiments of the present application.
The voiceprint registration method provided by the embodiments of the present application is described below with reference to the accompanying drawings.
It can be understood that, in the embodiments of the present application, the first electronic device may perform some or all of the steps described. These steps are only examples, and the embodiments may also perform other steps or variations of the steps. In addition, the steps may be performed in an order different from that presented in the embodiments, and not all of the steps necessarily need to be performed.
FIG. 3 shows a voiceprint registration method provided by an embodiment of the present application. The voiceprint registration method includes S301-S304a or S301-S304b.
S301: The first electronic device acquires a first voice signal and first parameter information.
The first electronic device may be any electronic device in FIG. 1. For example, the first electronic device is the electronic device 101 or the electronic device 102 in FIG. 1.
In the embodiments of the present application, the first voice signal may or may not be collected by the first electronic device.
For example, if the first voice signal is collected by the first electronic device, the first electronic device collects it through its own voice collection module. The voice collection module may be a chip, a circuit, or a chip system in the first electronic device and is used to collect voice signals, for example by recording and storing what the user says to obtain a voice signal. If the first voice signal is not collected by the first electronic device, the first electronic device receives the first voice signal from a third electronic device. The third electronic device may be an electronic device other than the first electronic device. Taking the voiceprint registration system shown in FIG. 1 as an example, if the first electronic device is the electronic device 101 in FIG. 1, the third electronic device is at least one of the electronic devices 102 to 104 in FIG. 1.
As an example, the third electronic device collects the first voice signal through its voice collection module and sends the first voice signal to the first electronic device. The voice collection module of the third electronic device may be a chip, a circuit, or a chip system in the third electronic device and is used to collect voice signals.
In the embodiments of the present application, the first parameter information may indicate the parameters with which a second electronic device collects voice signals. The second electronic device and the third electronic device may be the same or different.
In a possible implementation, the first parameter information includes at least one of the following: the microphone type of the second electronic device, the sampling rate of the second electronic device, the encoding mode of the second electronic device, or information about the environment in which the second electronic device is located.
For example, the microphone type of the second electronic device includes a dynamic microphone or a condenser microphone. The sampling rate of the second electronic device can be understood as the rate at which the second electronic device samples voice signals, such as 8000 Hz or 16000 Hz. The encoding mode of the second electronic device can be understood as the way the second electronic device encodes voice signals, such as linear pulse coding, nonlinear pulse coding, or adaptive linear coding. The environment of the second electronic device may be the environment in which the second electronic device is usually located, or an environment in which it has been located within a period of time (for example, within one month). For example, the environment may be one or more of a living room, a bedroom, a study, a kitchen, a residential area, a street, a shopping mall, or a car.
As an example, the environment information of the second electronic device may consist of n bits that indicate the environment in which the second electronic device is located, where n is a positive integer. Taking n = 2 as an example: if the value of the environment information is "00", the first parameter information indicates that the second electronic device is in a living room; if the value is "01", it indicates a bedroom; if the value is "10", it indicates a residential area; and if the value is "11", it indicates a car.
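The 2-bit example above can be sketched as a lookup table. This is a minimal illustration only: the names `ENVIRONMENT_BY_CODE` and `decode_environment` are hypothetical, and the mapping simply mirrors the n = 2 example rather than any encoding fixed by the application.

```python
# Hypothetical mapping that mirrors the n = 2 example in the text.
ENVIRONMENT_BY_CODE = {
    "00": "living room",
    "01": "bedroom",
    "10": "residential area",
    "11": "car",
}


def decode_environment(bits: str) -> str:
    """Return the environment named by an n-bit environment field."""
    if bits not in ENVIRONMENT_BY_CODE:
        raise ValueError(f"unknown environment code: {bits!r}")
    return ENVIRONMENT_BY_CODE[bits]
```

A larger n would allow more environments (up to 2^n distinct values) with the same lookup approach.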
It can be understood that the content of the first parameter information described above is only an example. In a specific application, the first parameter information may also include other parameters, which is not specifically limited in the embodiments of the present application.
It can be understood that the first electronic device may acquire the first voice signal and the first parameter information at the same time, or may acquire them separately.
As an example, if the first voice signal is collected by the second electronic device, the second electronic device may send the first parameter information to the first electronic device together with the first voice signal. That is, the first electronic device can acquire the first voice signal and the first parameter information at the same time.
As another example, the first electronic device may acquire the first parameter information after acquiring the first voice signal. For example, after acquiring the first voice signal, the first electronic device sends indication information to the second electronic device requesting the first parameter information, and the second electronic device sends the first parameter information to the first electronic device after receiving the indication information. As a further example, after the first electronic device and the second electronic device establish a connection, the second electronic device sends the first parameter information to the first electronic device, which stores it locally upon receipt. Later, after acquiring the first voice signal, the first electronic device reads the first parameter information from local storage.
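The last variant above (store the parameter information when the connection is established, read it back locally later) can be sketched as follows. All names here (`ParameterInfo`, `ParameterCache`, the field names, the device identifier) are hypothetical and chosen for illustration; the application does not define a data layout.

```python
from dataclasses import dataclass
from typing import Dict, Optional


@dataclass(frozen=True)
class ParameterInfo:
    """Hypothetical container for the first parameter information.

    Every field is optional because the text only requires
    at least one of the items to be present.
    """
    mic_type: Optional[str] = None
    sample_rate_hz: Optional[int] = None
    encoding: Optional[str] = None
    environment_bits: Optional[str] = None


class ParameterCache:
    """Local store kept by the first electronic device."""

    def __init__(self) -> None:
        self._by_device: Dict[str, ParameterInfo] = {}

    def store(self, device_id: str, info: ParameterInfo) -> None:
        # Called when the second device sends its parameters after connecting.
        self._by_device[device_id] = info

    def lookup(self, device_id: str) -> Optional[ParameterInfo]:
        # Called later, once the first voice signal has been acquired.
        return self._by_device.get(device_id)
```

If `lookup` returns `None`, the first device would fall back to requesting the parameter information from the second device, as in the first variant.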
S302: The first electronic device adjusts the first voice signal according to the first parameter information to obtain a second voice signal.
Optionally, the first electronic device acquires the voice collection parameters of the electronic device that collected the first voice signal. In this way, the first electronic device obtains the parameters of the first voice signal, namely one or more of the following: the type of microphone that collected the first voice signal, the sampling rate at which it was collected, its encoding mode, or information about the environment of the electronic device that collected it. For a description of these items, refer to the earlier description of the microphone type, sampling rate, encoding mode, and environment information of the second electronic device; details are not repeated here.
In a possible implementation, the first electronic device uses a first algorithm to bring the parameters of the first voice signal close to the parameters indicated by the first parameter information, thereby obtaining the second voice signal.
For example, when the first parameter information includes the microphone type of the second electronic device, the first electronic device may use the first algorithm to simulate the effect that a microphone of that type has on a voice signal, adjust the first voice signal accordingly, and obtain the second voice signal.
For example, when the first parameter information includes the sampling rate of the second electronic device, the first electronic device may use an audio processing algorithm to convert the sampling rate of the first voice signal to the sampling rate of the second electronic device, obtaining the second voice signal.
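The text does not name the audio processing algorithm. As one simple possibility, the sketch below resamples by linear interpolation; the function name `resample_linear` is hypothetical. Note that it applies no anti-aliasing filter, so a practical downsampler would low-pass filter the signal first.

```python
from typing import List, Sequence


def resample_linear(samples: Sequence[float], src_rate: int, dst_rate: int) -> List[float]:
    """Resample by linear interpolation (assumed technique, no anti-aliasing)."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("sampling rates must be positive")
    if not samples:
        return []
    n_out = max(1, round(len(samples) * dst_rate / src_rate))
    step = src_rate / dst_rate  # source samples advanced per output sample
    out: List[float] = []
    for i in range(n_out):
        pos = i * step
        left = int(pos)
        if left >= len(samples) - 1:
            # Past the last interpolation interval: hold the final sample.
            out.append(float(samples[-1]))
            continue
        frac = pos - left
        out.append(samples[left] * (1.0 - frac) + samples[left + 1] * frac)
    return out
```

For instance, converting a signal recorded at 16000 Hz to the 8000 Hz rate of the second electronic device would be `resample_linear(signal, 16000, 8000)`.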
For example, when the first parameter information includes the encoding mode of the second electronic device, the first electronic device may decode the first voice signal and re-encode it in the encoding mode of the second electronic device, obtaining the second voice signal.
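As an illustration of re-encoding, the sketch below converts linearly coded samples in [-1, 1] to a nonlinear (companded) 8-bit representation using the continuous μ-law curve with μ = 255. This is an assumed stand-in for "nonlinear pulse coding": it is not the bit-exact G.711 codec, and the function names are hypothetical.

```python
import math
from typing import List, Sequence

MU = 255.0  # companding constant of the mu-law curve


def mu_law_compress(x: float) -> float:
    """Map a linear sample in [-1, 1] onto the mu-law curve in [-1, 1]."""
    x = max(-1.0, min(1.0, x))  # clip out-of-range input
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y: float) -> float:
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)


def recode_linear_to_mu_law(samples: Sequence[float]) -> List[int]:
    """Re-encode linear samples as 8-bit codes (0..255) on the mu-law curve."""
    return [round((mu_law_compress(s) + 1.0) / 2.0 * 255.0) for s in samples]
```

The companding curve allocates more code values to quiet samples than to loud ones, which is the property that distinguishes nonlinear from linear pulse coding.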
For example, when the first parameter information includes the environment information of the second electronic device, the first electronic device may superimpose an environmental noise signal and/or a spatial reverberation signal on the first voice signal according to that environment information, obtaining the second voice signal. The environmental noise signal and the spatial reverberation signal may be preconfigured in the first electronic device.
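One common way to superimpose reverberation and noise, shown here as an assumption rather than the application's own method, is to convolve the speech with a room impulse response and then add scaled noise. The names `convolve`, `add_environment`, and `noise_gain` are hypothetical; the noise recording and impulse response stand for the preconfigured signals mentioned above.

```python
from typing import List, Sequence


def convolve(signal: Sequence[float], impulse_response: Sequence[float]) -> List[float]:
    """Direct-form convolution of a signal with an impulse response."""
    if not signal or not impulse_response:
        return []
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def add_environment(
    speech: Sequence[float],
    noise: Sequence[float],
    impulse_response: Sequence[float],
    noise_gain: float = 0.1,
) -> List[float]:
    """Apply reverberation, then mix in looped, scaled environmental noise."""
    # Truncate the reverberant tail so the output keeps the input length.
    reverberant = convolve(speech, impulse_response)[: len(speech)]
    if not noise:
        return reverberant
    return [r + noise_gain * noise[i % len(noise)] for i, r in enumerate(reverberant)]
```

In this sketch the first electronic device would select the noise recording and impulse response that match the environment indicated by the environment information, for example a living room or a car.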
For example, when the first parameter information includes both the microphone type and the sampling rate of the second electronic device, the first electronic device may use an algorithm to simulate the effect that a microphone of that type has on a voice signal and adjust the first voice signal accordingly, and then use an audio processing algorithm to convert the sampling rate of the first voice signal to the sampling rate of the second electronic device, obtaining the second voice signal.
For example, when the first parameter information includes the microphone type, the sampling rate, and the environment information of the second electronic device, the first electronic device may use an algorithm to simulate the effect that a microphone of that type has on a voice signal and adjust the first voice signal accordingly, use an audio processing algorithm to convert the sampling rate of the first voice signal to the sampling rate of the second electronic device, and then superimpose an environmental noise signal and/or a spatial reverberation signal according to the environment information, obtaining the second voice signal. The environmental noise signal and the spatial reverberation signal may be preconfigured in the first electronic device.
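The combined examples apply the adjustments in a fixed order (microphone simulation, then sampling rate conversion, then environment overlay) and skip any parameter that is absent. The sketch below shows only that control flow. The three stage functions are deliberately trivial stand-ins, not real signal processing, and every name in it is hypothetical.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Stage = Callable[[List[float]], List[float]]


def simulate_microphone(signal: List[float]) -> List[float]:
    # Stand-in: a fixed attenuation in place of a real microphone model.
    return [0.5 * s for s in signal]


def convert_sample_rate(signal: List[float]) -> List[float]:
    # Stand-in: keep every second sample in place of a real resampler.
    return signal[::2]


def overlay_environment(signal: List[float]) -> List[float]:
    # Stand-in: a constant offset in place of real noise and reverberation.
    return [s + 0.1 for s in signal]


# Order follows the combined examples in the text.
STAGE_ORDER: List[Tuple[str, Stage]] = [
    ("mic_type", simulate_microphone),
    ("sample_rate_hz", convert_sample_rate),
    ("environment_bits", overlay_environment),
]


def adjust_voice_signal(
    first_signal: Sequence[float], parameter_info: Dict[str, Optional[object]]
) -> List[float]:
    """Run only the stages whose parameter is present; return the second signal."""
    signal = list(first_signal)
    for key, stage in STAGE_ORDER:
        if parameter_info.get(key) is not None:
            signal = stage(signal)
    return signal
```

Keeping the stages in a single ordered table makes it straightforward to add a further parameter, such as the encoding mode, without changing the driver function.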
It can be understood that, in a specific application, the device that adjusts the first voice signal may also be an electronic device other than the first electronic device. For example, after acquiring the first voice signal and the first parameter information, the first electronic device may send them to a fifth electronic device. After receiving the first voice signal and the first parameter information, the fifth electronic device may adjust the first voice signal according to the first parameter information to obtain the second voice signal, and send the second voice signal to the first electronic device. The fifth electronic device is different from the first electronic device.
S303: The first electronic device generates a first voiceprint model according to the second voice signal.
一种可能的实现方式,第一电子设备对第二语音信号进行特征提取,根据提取的特征生成第一声纹模型。可以理解的,第一声纹模型生成后即完成了声纹注册。后续,可以通过第一声纹模型对用户进行认证。例如,可以将一个语音信号作为输入,输入到第一声纹模型中,该第一声纹模型可以输出该语音信号和第一语音信号是否是来自同一个用户。In a possible implementation manner, the first electronic device extracts features from the second voice signal, and generates the first voiceprint model according to the extracted features. It can be understood that the voiceprint registration is completed after the first voiceprint model is generated. Subsequently, the user may be authenticated through the first voiceprint model. For example, a voice signal may be input into the first voiceprint model, and the first voiceprint model may output whether the voice signal and the first voice signal are from the same user.
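A minimal sketch of registration followed by authentication is given here for orientation. It assumes a toy feature extractor (mean frame energy and zero-crossing rate) and a cosine-similarity decision with an arbitrary threshold; this application does not specify either, and a practical system would use a proper speaker-embedding front end. The names `extract_features`, `VoiceprintModel`, and `register` are hypothetical.

```python
import math
from dataclasses import dataclass
from typing import List, Sequence


def extract_features(signal: Sequence[float], frame: int = 4) -> List[float]:
    """Toy stand-in for feature extraction: mean frame energy and zero-crossing rate."""
    energies = [
        sum(x * x for x in signal[i:i + frame]) / frame
        for i in range(0, len(signal) - frame + 1, frame)
    ]
    mean_energy = sum(energies) / len(energies) if energies else 0.0
    crossings = sum(1 for a, b in zip(signal, signal[1:]) if (a < 0) != (b < 0))
    zcr = crossings / max(1, len(signal) - 1)
    return [mean_energy, zcr]


def cosine(u: Sequence[float], v: Sequence[float]) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return sum(a * b for a, b in zip(u, v)) / norm if norm else 0.0


@dataclass
class VoiceprintModel:
    template: List[float]    # features of the enrolled (second) voice signal
    threshold: float = 0.9   # assumed decision threshold, not taken from the source

    def same_user(self, signal: Sequence[float]) -> bool:
        """Report whether the signal is judged to come from the enrolled user."""
        return cosine(self.template, extract_features(signal)) >= self.threshold


def register(second_signal: Sequence[float]) -> VoiceprintModel:
    """Voiceprint registration: build the model from the adjusted signal."""
    return VoiceprintModel(template=extract_features(second_signal))
```

Registration is a single call, `model = register(second_signal)`, after which `model.same_user(candidate)` plays the role of the authentication step.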
It can be understood that the electronic device that generates the first voiceprint model may also be a device other than the first electronic device. For example, after obtaining the second voice signal, the first electronic device may send the second voice signal to a sixth electronic device. After receiving the second voice signal, the sixth electronic device may generate the first voiceprint model according to the second voice signal and send the first voiceprint model to the first electronic device. The sixth electronic device and the fifth electronic device may be the same or different. Optionally, if the sixth electronic device is different from the fifth electronic device, the fifth electronic device may send the second voice signal to the sixth electronic device instead of to the first electronic device, so that the sixth electronic device generates the first voiceprint model according to the second voice signal and sends the first voiceprint model to the first electronic device.
S304a: The first electronic device authenticates, according to the first voiceprint model, a voice signal collected by the second electronic device.
In a possible implementation, the first electronic device may receive, from the second electronic device, a voice signal collected by the second electronic device, and input that voice signal into the first voiceprint model for voiceprint authentication.
As an example, the second electronic device collects voice signal 1 through its voice collection module and sends voice signal 1 to the first electronic device. After receiving voice signal 1, the first electronic device inputs it into the first voiceprint model for voiceprint authentication. If the first voiceprint model outputs 0, voice signal 1 and the first voice signal are not from the same user, and authentication fails. If the first voiceprint model outputs 1, voice signal 1 and the first voice signal are from the same user, and authentication succeeds. Because the first voiceprint model is generated from the second voice signal (that is, the voice signal that the first electronic device simulates, from the first voice signal and the first parameter information, as collected by the second electronic device), using the first voiceprint model to authenticate voice signals collected by the second electronic device can improve the accuracy of voiceprint authentication.
In the embodiments of this application, S304a may also be replaced with S304b.
S304b: The first electronic device sends the first voiceprint model to the second electronic device. Correspondingly, the second electronic device receives the first voiceprint model from the first electronic device.
It can be understood that the first electronic device may send the first voiceprint model to the second electronic device directly, or through one or more intermediate electronic devices. After receiving the first voiceprint model, the second electronic device may authenticate the voice signals it collects according to the first voiceprint model. For example, the second electronic device inputs a voice signal that it has collected into the first voiceprint model for voiceprint authentication.
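How the first voiceprint model is packaged for transmission is not specified in this application. As one hedged possibility, the sketch below serializes a hypothetical model record to JSON bytes on the first electronic device and restores it on the second electronic device; the `VoiceprintModel` fields and both function names are assumptions made for illustration, and the transport itself (direct or relayed) is left out.

```python
import json
from dataclasses import asdict, dataclass
from typing import List


@dataclass
class VoiceprintModel:
    """Hypothetical model record; the real model format is not given in the source."""
    target_device: str      # device whose capture conditions were simulated
    template: List[float]   # features derived from the second voice signal
    threshold: float        # assumed decision threshold


def serialize_model(model: VoiceprintModel) -> bytes:
    """Encode the model as UTF-8 JSON, ready to send over any transport."""
    return json.dumps(asdict(model)).encode("utf-8")


def deserialize_model(payload: bytes) -> VoiceprintModel:
    """Rebuild the model on the receiving device."""
    return VoiceprintModel(**json.loads(payload.decode("utf-8")))
```

Because the payload is plain bytes, it can pass unchanged through intermediate devices before the second electronic device calls `deserialize_model` and authenticates locally.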
It can be understood that the first electronic device may also send the first voiceprint model to an electronic device other than the second electronic device, so that the other electronic device can also authenticate voice signals collected by the second electronic device according to the first voiceprint model.
Based on the method shown in FIG. 3, the first electronic device can acquire the first voice signal and the first parameter information corresponding to the second electronic device, adjust the first voice signal according to the first parameter information to obtain a second voice signal suited to the second electronic device (the second voice signal is equivalent to a voice signal collected by the second electronic device; that is, the first electronic device can simulate, from the first voice signal and the first parameter information, a voice signal collected by the second electronic device), and generate the first voiceprint model according to the second voice signal. In this way, a voice signal needs to be collected only once: the voice signal collected by the second electronic device is simulated from it, and voiceprint registration is performed on the simulated voice signal (that is, the second voice signal). Because the first electronic device simulates the second voice signal according to the parameters with which the second electronic device collects voice signals, the second voice signal is very similar to a voice signal actually collected by the second electronic device. Using the first voiceprint model generated from the second voice signal to perform voiceprint authentication on voice signals collected by the second electronic device can therefore improve the accuracy of voiceprint authentication.
It can be understood that, in the method shown in FIG. 3, the first electronic device simulates a voice signal collected by the second electronic device and performs voiceprint registration according to that signal. In a specific application, in addition to the second electronic device, the first electronic device may also simulate, from the first voice signal, a voice signal collected by at least one other electronic device, and perform voiceprint registration according to the simulated voice signal. For example, the first electronic device may simulate, from the first voice signal, a voice signal collected by the first electronic device itself and perform voiceprint registration according to it; for details, see the method shown in FIG. 4 below. As another example, the first electronic device may simulate, from the first voice signal, a voice signal collected by a fourth electronic device and perform voiceprint registration according to it; for details, see the method shown in FIG. 5 below.
Optionally, as shown in FIG. 4, the method shown in FIG. 3 further includes S305-S308.
S305: The first electronic device acquires second parameter information.
The second parameter information may indicate the parameters with which the first electronic device collects voice signals. For example, the second parameter information includes at least one of the following: the microphone type of the first electronic device, the sampling rate of the first electronic device, the encoding mode of the first electronic device, or information about the environment in which the first electronic device is located. For details of the second parameter information, refer to the foregoing description of the first parameter information; they are not repeated here.
In a possible implementation, the first electronic device acquires the second parameter information locally.
S306: The first electronic device adjusts the first voice signal according to the second parameter information to obtain a third voice signal.
S307: The first electronic device generates a second voiceprint model according to the third voice signal.
For the specific process of S306-S307, refer to the corresponding description of S302-S303 above; it is not repeated here.
S308: The first electronic device authenticates, according to the second voiceprint model, a voice signal collected by the first electronic device.
In a possible implementation, the first electronic device collects a voice signal through its voice collection module and inputs the collected voice signal into the second voiceprint model for voiceprint authentication. For details, refer to the corresponding description of S304a above; they are not repeated here.
It can be understood that, after acquiring the first voice signal, the first electronic device may first generate the first voiceprint model (acquire the first parameter information, adjust the first voice signal according to the first parameter information to obtain the second voice signal, and generate the first voiceprint model according to the second voice signal) and then generate the second voiceprint model (acquire the second parameter information, adjust the first voice signal according to the second parameter information to obtain the third voice signal, and generate the second voiceprint model according to the third voice signal). The first electronic device may also generate the second voiceprint model first and the first voiceprint model afterwards, or carry out the two processes at the same time; this is not limited.
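Because the two registration processes depend only on the shared first voice signal, they can run in either order or concurrently. The sketch below illustrates this with a thread pool; the function `register_for_devices` and its `adjusters` and `build_model` parameters are hypothetical placeholders for the per-device adjustment and model-generation steps, not names from this application.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, Dict, List, Sequence, Tuple

Signal = Sequence[float]
Model = List[float]  # placeholder model type used only in this sketch


def register_for_devices(
    first_signal: Signal,
    adjusters: Dict[str, Callable[[Signal], Signal]],
    build_model: Callable[[Signal], Model],
) -> Dict[str, Model]:
    """Build one voiceprint model per target device from a single recording."""

    def enroll(item: Tuple[str, Callable[[Signal], Signal]]) -> Tuple[str, Model]:
        device_id, adjust = item
        # Adjust the shared first voice signal for this device, then build its model.
        return device_id, build_model(adjust(first_signal))

    # Each device's pipeline is independent, so the ordering is irrelevant.
    with ThreadPoolExecutor() as pool:
        return dict(pool.map(enroll, adjusters.items()))
```

A caller would pass one adjuster per device, for example one built from the first parameter information and one from the second parameter information, and receive a dictionary keyed by device.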
It can be understood that the first electronic device may also send the second voiceprint model to an electronic device other than the first electronic device, so that the other electronic device can also authenticate voice signals collected by the first electronic device according to the second voiceprint model.
Optionally, as shown in FIG. 5, the method shown in FIG. 3 further includes S309-S312a or S309-S312b.
S309: The first electronic device acquires third parameter information.
The third parameter information indicates the parameters with which a fourth electronic device collects voice signals. For example, the third parameter information includes at least one of the following: the microphone type of the fourth electronic device, the sampling rate of the fourth electronic device, the encoding mode of the fourth electronic device, or information about the environment in which the fourth electronic device is located. For details of the third parameter information, refer to the foregoing description of the first parameter information; they are not repeated here.
The fourth electronic device is different from both the first electronic device and the second electronic device. For example, if the first electronic device is the electronic device 101 in FIG. 1 and the second electronic device is the electronic device 102 in FIG. 1, the fourth electronic device is the electronic device 103 or the electronic device 104 in FIG. 1.
S310: The first electronic device adjusts the first voice signal according to the third parameter information to obtain a fourth voice signal.
S311: The first electronic device generates a third voiceprint model according to the fourth voice signal.
S312a: The first electronic device authenticates, according to the third voiceprint model, a voice signal collected by the fourth electronic device.
In the embodiments of this application, S312a may also be replaced with S312b.
S312b: The first electronic device sends the third voiceprint model to the fourth electronic device. Correspondingly, the fourth electronic device receives the third voiceprint model from the first electronic device.
For the specific process of S310-S312b, refer to the corresponding description of S302-S304b above; it is not repeated here.
It can be understood that the first electronic device may also send the third voiceprint model to an electronic device other than the fourth electronic device, so that the other electronic device can also authenticate voice signals collected by the fourth electronic device according to the third voiceprint model.
It can be understood that, after acquiring the first voice signal, the first electronic device may first generate the first voiceprint model (acquire the first parameter information, adjust the first voice signal according to the first parameter information to obtain the second voice signal, and generate the first voiceprint model according to the second voice signal) and then generate the third voiceprint model (acquire the third parameter information, adjust the first voice signal according to the third parameter information to obtain the fourth voice signal, and generate the third voiceprint model according to the fourth voice signal). The first electronic device may also generate the third voiceprint model first and the first voiceprint model afterwards, or carry out the two processes at the same time; this is not limited.
It can be understood that S309-S312b may also be performed in the method shown in FIG. 4, for example, after the first electronic device acquires the first voice signal, after S303, after S308, or at the same time as S305-S308; this is not limited.
It can be understood that, in the foregoing embodiments, the methods and/or steps implemented by the first electronic device may also be implemented by a component (for example, a chip or a circuit) usable in the first electronic device.
It can be understood that, to implement the foregoing functions, the electronic device includes corresponding hardware structures and/or software modules for performing each function. A person skilled in the art should readily appreciate that, in combination with the units and algorithm steps of the examples described in the embodiments disclosed herein, the embodiments of this application can be implemented in hardware or in a combination of hardware and computer software. Whether a function is performed by hardware or by computer software driving hardware depends on the particular application and design constraints of the technical solution. A person skilled in the art may use different methods to implement the described functions for each particular application, but such an implementation should not be considered to go beyond the scope of the embodiments of this application.
In the embodiments of this application, the electronic device may be divided into functional modules according to the foregoing method examples. For example, one functional module may be defined for each function, or two or more functions may be integrated into one processing module. The integrated module may be implemented in the form of hardware or in the form of a software functional module. It should be noted that the division into modules in the embodiments of this application is illustrative and is merely a division by logical function; other divisions are possible in an actual implementation.
As shown in FIG. 6, an embodiment of this application discloses an electronic device 600, which may be the first electronic device in the foregoing embodiments. The electronic device 600 may specifically include: an input device 601 (for example, a mouse, a keyboard, or a touchscreen); one or more processors 602; a memory 603; one or more application programs (not shown); and one or more computer programs 604. These components may be connected through one or more communication buses 605. Optionally, the electronic device further includes a voice collection device (such as a recording device) for collecting voice signals. The one or more computer programs 604 are stored in the memory 603 and configured to be executed by the one or more processors 602. The one or more computer programs 604 include instructions that can be used to perform the relevant steps in the foregoing embodiments. In one example, the electronic device 600 may be the electronic device 101, the electronic device 102, the electronic device 103, or the electronic device 104 in FIG. 1.
An embodiment of this application further provides a chip system, including at least one processor and an interface. The at least one processor is coupled to a memory through the interface, and when the at least one processor executes a computer program or instructions in the memory, the method in any of the foregoing method embodiments is performed. In a possible implementation, the chip system further includes the memory. Optionally, the chip system may consist of chips, or may include chips and other discrete components; this is not specifically limited in the embodiments of this application.
An embodiment of this application further provides a computer-readable storage medium storing computer program code. When a processor executes the computer program code, an electronic device performs the method in the foregoing embodiments.
An embodiment of this application further provides a computer program product. When the computer program product runs on a computer, the computer is caused to perform the method in the foregoing embodiments.
The electronic device 600, the computer-readable storage medium, and the computer program product provided in the embodiments of this application are all used to perform the corresponding methods provided above. For the beneficial effects they can achieve, refer to the beneficial effects of the corresponding methods provided above; they are not repeated here.
From the foregoing description of the implementations, a person skilled in the art can clearly understand that, for convenience and brevity of description, only the foregoing division into functional modules is used as an example. In practical applications, the foregoing functions may be assigned to different functional modules as required; that is, the internal structure of the apparatus may be divided into different functional modules to perform all or some of the functions described above.
In the several embodiments provided in this application, it should be understood that the disclosed apparatus and method may be implemented in other ways. For example, the apparatus embodiments described above are merely illustrative. The division into modules or units is merely a division by logical function, and other divisions are possible in an actual implementation: multiple units or components may be combined or integrated into another apparatus, or some features may be omitted or not performed. In addition, the mutual couplings, direct couplings, or communication connections shown or discussed may be indirect couplings or communication connections through some interfaces, apparatuses, or units, and may be electrical, mechanical, or in other forms.
In addition, the functional units in the embodiments of this application may be integrated into one processing unit, each unit may exist physically on its own, or two or more units may be integrated into one unit. The integrated unit may be implemented in the form of hardware or in the form of a software functional unit.
If the integrated unit is implemented in the form of a software functional unit and sold or used as an independent product, it may be stored in a readable storage medium. Based on this understanding, the technical solutions of the embodiments of this application, in essence, or the part that contributes to the prior art, or all or part of the technical solutions, may be embodied in the form of a software product. The software product is stored in a storage medium and includes several instructions for causing a device (which may be a single-chip microcomputer, a chip, or the like) or a processor to perform all or some of the steps of the methods described in the embodiments of this application. The storage medium includes any medium that can store program code, such as a USB flash drive, a removable hard disk, a ROM, a magnetic disk, or an optical disc.
The foregoing is merely specific implementations of this application, but the protection scope of this application is not limited thereto. Any variation or replacement within the technical scope disclosed in this application shall fall within the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (9)

  1. A voiceprint registration method, characterized in that the method is applied to a first electronic device and comprises:
    acquiring a first voice signal and first parameter information, wherein the first parameter information indicates the parameters with which a second electronic device collects voice signals;
    adjusting the first voice signal according to the first parameter information to obtain a second voice signal;
    generating a first voiceprint model according to the second voice signal; and
    sending the first voiceprint model to the second electronic device, or authenticating, according to the first voiceprint model, a voice signal collected by the second electronic device.
  2. The method according to claim 1, characterized in that the first parameter information comprises at least one of the following: a microphone type of the second electronic device, a sampling rate of the second electronic device, an encoding mode of the second electronic device, or information about the environment in which the second electronic device is located.
  3. The method according to claim 1 or 2, characterized in that the adjusting the first voice signal according to the first parameter information to obtain a second voice signal comprises:
    using a first algorithm to make the parameters of the first voice signal approach the parameters indicated by the first parameter information, to obtain the second voice signal.
  4. The method according to any one of claims 1 to 3, characterized in that the acquiring a first voice signal comprises:
    receiving the first voice signal from a third electronic device; or
    collecting the first voice signal.
  5. The method according to any one of claims 1 to 4, characterized in that the authenticating, according to the first voiceprint model, a voice signal collected by the second electronic device comprises:
    receiving, from the second electronic device, the voice signal collected by the second electronic device; and
    inputting the voice signal collected by the second electronic device into the first voiceprint model for voiceprint authentication.
  6. The method according to any one of claims 1 to 5, characterized in that the method further comprises:
    acquiring second parameter information, wherein the second parameter information indicates the parameters with which the first electronic device collects voice signals;
    adjusting the first voice signal according to the second parameter information to obtain a third voice signal;
    generating a second voiceprint model according to the third voice signal; and
    authenticating, according to the second voiceprint model, a voice signal collected by the first electronic device.
  7. The method according to any one of claims 1 to 6, characterized in that the method further comprises:
    acquiring third parameter information, wherein the third parameter information indicates the parameters with which a fourth electronic device collects voice signals, and the fourth electronic device is different from the second electronic device;
    adjusting the first voice signal according to the third parameter information to obtain a fourth voice signal;
    generating a third voiceprint model according to the fourth voice signal; and
    sending the third voiceprint model to the fourth electronic device, or authenticating, according to the third voiceprint model, a voice signal collected by the fourth electronic device.
  8. An electronic device, characterized by comprising a processor, wherein the processor is coupled to a memory, the memory is configured to store a program or instructions, and when the program or instructions are executed by the processor, the electronic device is caused to perform the method according to any one of claims 1 to 7.
  9. A computer-readable storage medium storing a computer program or instructions, characterized in that, when the computer program or instructions are executed, a computer is caused to perform the method according to any one of claims 1 to 7.
PCT/CN2022/123912 2021-10-28 2022-10-08 Voiceprint registration method and electronic devices WO2023071730A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202111266367.4A CN116052692A (en) 2021-10-28 2021-10-28 Voiceprint registration method and electronic equipment
CN202111266367.4 2021-10-28

Publications (1)

Publication Number Publication Date
WO2023071730A1 (en) 2023-05-04

Family

ID=86113746

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/123912 WO2023071730A1 (en) 2021-10-28 2022-10-08 Voiceprint registration method and electronic devices

Country Status (2)

Country Link
CN (1) CN116052692A (en)
WO (1) WO2023071730A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103685185A (en) * 2012-09-14 2014-03-26 上海掌门科技有限公司 Mobile equipment voiceprint registration and authentication method and system
US20160035350A1 (en) * 2014-07-29 2016-02-04 Samsung Electronics Co., Ltd. Electronic apparatus and control method thereof
WO2020139058A1 (en) * 2018-12-28 2020-07-02 Samsung Electronics Co., Ltd. Cross-device voiceprint recognition
CN113196236A (en) * 2021-02-04 2021-07-30 华为技术有限公司 Cross-device authentication method and electronic device
CN113470653A (en) * 2020-03-31 2021-10-01 华为技术有限公司 Voiceprint recognition method, electronic equipment and system

Also Published As

Publication number Publication date
CN116052692A (en) 2023-05-02

Similar Documents

Publication Publication Date Title
CN112312366B (en) Method, electronic equipment and system for realizing functions through NFC (near field communication) tag
WO2021104114A1 (en) Method for providing wireless fidelity (wifi) network access service, and electronic device
JP7173670B2 (en) VOICE CONTROL COMMAND GENERATION METHOD AND TERMINAL
WO2022100610A1 (en) Screen projection method and apparatus, and electronic device and computer-readable storage medium
WO2020216160A1 (en) Automatic routing method for se, and electronic device
CN113938720A (en) Multi-device cooperation method, electronic device and multi-device cooperation system
CN114339698A (en) Method for establishing wireless connection through equipment touch, electronic equipment and chip
WO2022148319A1 (en) Video switching method and apparatus, storage medium, and device
CN114422340A (en) Log reporting method, electronic device and storage medium
WO2022022319A1 (en) Image processing method, electronic device, image processing system and chip system
WO2020051852A1 (en) Method for recording and displaying information in communication process, and terminals
US20230010492A1 (en) Method for customizing key of foldable device, device, and storage medium
US20230350629A1 (en) Double-Channel Screen Mirroring Method and Electronic Device
CN113473013A (en) Display method and device for beautifying effect of image and terminal equipment
CN109285563B (en) Voice data processing method and device in online translation process
CN114120950B (en) Human voice shielding method and electronic equipment
US20240178771A1 (en) Method and apparatus for adjusting vibration waveform of linear motor
WO2023071730A1 (en) Voiceprint registration method and electronic devices
US20230319217A1 (en) Recording Method and Device
CN114120987B (en) Voice wake-up method, electronic equipment and chip system
CN113407076A (en) Method for starting application and electronic equipment
CN115185441A (en) Control method, control device, electronic equipment and readable storage medium
CN114157412A (en) Information verification method, electronic device and computer readable storage medium
CN114327198A (en) Control function pushing method and device
CN114844542A (en) Antenna selection method and device, electronic equipment and readable storage medium

Legal Events

Date Code Title Description
121 EP: the EPO has been informed by WIPO that EP was designated in this application

Ref document number: 22885611

Country of ref document: EP

Kind code of ref document: A1