CN108847243B - Voiceprint feature updating method and device, storage medium and electronic equipment - Google Patents

Voiceprint feature updating method and device, storage medium and electronic equipment Download PDF

Info

Publication number
CN108847243B
CN108847243B CN201810632316.0A CN201810632316A CN108847243B CN 108847243 B CN108847243 B CN 108847243B CN 201810632316 A CN201810632316 A CN 201810632316A CN 108847243 B CN108847243 B CN 108847243B
Authority
CN
China
Prior art keywords
owner
terminal
time
voiceprint
recording process
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201810632316.0A
Other languages
Chinese (zh)
Other versions
CN108847243A (en
Inventor
黄粟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Guangdong Oppo Mobile Telecommunications Corp Ltd
Original Assignee
Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Guangdong Oppo Mobile Telecommunications Corp Ltd filed Critical Guangdong Oppo Mobile Telecommunications Corp Ltd
Priority to CN201810632316.0A priority Critical patent/CN108847243B/en
Publication of CN108847243A publication Critical patent/CN108847243A/en
Application granted granted Critical
Publication of CN108847243B publication Critical patent/CN108847243B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/22Interactive procedures; Man-machine interfaces

Abstract

The application discloses a voiceprint feature updating method and device, a storage medium and electronic equipment. The method comprises the following steps: when the voiceprint feature of a terminal owner needs to be updated, acquiring a history record of starting a recording process in the terminal; acquiring voice information from the owner according to the history record; extracting target voiceprint features from the voice information; and updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics. The embodiment can improve the flexibility of updating the stored voiceprint features by the terminal.

Description

Voiceprint feature updating method and device, storage medium and electronic equipment
Technical Field
The present application belongs to the field, and in particular, to a voiceprint feature updating method, apparatus, storage medium, and electronic device.
Background
With the development of technology, the interaction mode between human and machine becomes richer and richer. In the related art, a user can control a terminal through voice, that is, after receiving voice information sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information and perform voiceprint recognition on the user according to the extracted voiceprint features. And only after the voiceprint recognition is passed, the terminal can execute the control instruction corresponding to the voice information. However, in the related art, the terminal has poor flexibility in updating the saved voiceprint feature.
Disclosure of Invention
The embodiment of the application provides a voiceprint feature updating method and device, a storage medium and electronic equipment, which can improve the flexibility of updating the stored voiceprint features by a terminal.
The embodiment of the application provides a voiceprint feature updating method, which comprises the following steps:
when the voiceprint feature of a terminal owner needs to be updated, acquiring a history record of starting a recording process in the terminal;
acquiring voice information from the owner according to the history record;
extracting target voiceprint features from the voice information;
and updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
The embodiment of the application provides a voiceprint feature updating device, including:
the first acquisition module is used for acquiring a history record of starting a recording process in the terminal when the voiceprint feature of a terminal owner needs to be updated;
the second acquisition module is used for acquiring voice information from the owner according to the historical record;
the extraction module is used for extracting target voiceprint characteristics from the voice information;
and the updating module is used for updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
The embodiment of the present application provides a storage medium, on which a computer program is stored, and when the computer program is executed on a computer, the computer is caused to execute the steps in the voiceprint feature updating method provided by the embodiment of the present application.
The embodiment of the present application further provides an electronic device, which includes a memory and a processor, where the processor is configured to execute the steps in the voiceprint feature updating method provided in the embodiment of the present application by calling the computer program stored in the memory.
In this embodiment, when the voiceprint feature of the owner stored in the terminal needs to be updated, the terminal may obtain the voice information of the owner according to the history record that the recording process is started. Then, the terminal can extract the target voiceprint feature from the voice message, and update the voiceprint feature of the owner stored in the terminal according to the target voiceprint feature. Because the embodiment can acquire the main voice for updating the voiceprint feature according to the history that the recording process is started, the embodiment can improve the flexibility of the terminal when the voiceprint feature is updated.
Drawings
The technical solution and the advantages of the present invention will be apparent from the following detailed description of the embodiments of the present invention with reference to the accompanying drawings.
Fig. 1 is a schematic flow chart of a voiceprint feature updating method provided in an embodiment of the present application.
Fig. 2 is another schematic flow chart of a voiceprint feature updating method according to an embodiment of the present application.
Fig. 3 to fig. 5 are scene diagrams of a voiceprint feature updating method provided in an embodiment of the present application.
Fig. 6 is a schematic structural diagram of a voiceprint feature updating apparatus according to an embodiment of the present application.
Fig. 7 is another schematic structural diagram of a voiceprint feature updating apparatus according to an embodiment of the present application.
Fig. 8 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
Fig. 9 is another schematic structural diagram of a mobile terminal according to an embodiment of the present application.
Detailed Description
Referring now to the drawings, in which like numerals represent like elements, the principles of the present invention are illustrated as being implemented in a suitable computing environment. The following description is based on illustrated embodiments of the invention and should not be taken as limiting the invention with regard to other embodiments that are not detailed herein.
It can be understood that the execution subject of the embodiment of the present application may be a terminal device such as a smart phone or a tablet computer.
Referring to fig. 1, fig. 1 is a schematic flow chart of a voiceprint feature updating method according to an embodiment of the present application, where the flow chart may include:
in 101, when it is determined that the voiceprint feature of the terminal owner needs to be updated, a history that a recording process in the terminal is started is obtained.
With the development of technology, the interaction mode between human and machine becomes richer and richer. In the related art, a user can control a terminal through voice, that is, after receiving voice information sent by the user, the terminal can analyze the voice information to obtain a control instruction. Before executing the control instruction, the terminal needs to extract voiceprint features from the voice information and perform voiceprint recognition on the user according to the extracted voiceprint features. And only after the voiceprint recognition is passed, the terminal can execute the control instruction corresponding to the voice information. However, in the related art, the terminal has poor flexibility in updating the saved voiceprint feature.
In this embodiment, for example, when it is determined that the voiceprint feature of the owner stored in the terminal needs to be updated, the terminal may first obtain a history that the recording process is started.
It should be noted that the recording process in the terminal may be started by the terminal when the user makes a call, or by the terminal when the user sends voice information when using the instant messaging application, or by the terminal when the user starts the recording application to record an audio/video file, or the like. In these cases, the terminal recording process in the terminal wakes up the microphone and picks up the user's voice using the microphone.
At 102, voice information is obtained from the master based on the history.
For example, after acquiring the history that the recording process in the terminal is started, the terminal may acquire the voice information from the terminal owner according to the history.
In 103, target voiceprint features are extracted from the speech information.
For example, after acquiring voice information from the terminal owner, the terminal may extract voiceprint features from the voice information, i.e., obtain target voiceprint features.
At 104, the voiceprint feature of the owner stored in the terminal is updated based on the target voiceprint feature.
For example, after obtaining the target voiceprint feature, the terminal may update the previously saved voiceprint feature of the owner in the terminal according to the target voiceprint feature.
It can be understood that, in this embodiment, when the voiceprint feature of the owner saved in the terminal needs to be updated, the terminal may obtain the voice information of the owner according to the history that the recording process is started. Then, the terminal can extract the target voiceprint feature from the voice message, and update the voiceprint feature of the owner stored in the terminal according to the target voiceprint feature. Because the embodiment can acquire the main voice for updating the voiceprint feature according to the history that the recording process is started, the embodiment can improve the flexibility of the terminal when the voiceprint feature is updated.
In addition, since the voiceprint characteristics of the user may change in some cases, for example, when the throat of the user is inflamed, the present embodiment may improve the accuracy of the terminal in performing voiceprint recognition using the voiceprint characteristics by updating the voiceprint characteristics of the owner stored in the terminal, and avoid inconvenience in operation of the user due to the fact that the voiceprint characteristics of the owner stored in the terminal are too old.
Referring to fig. 2, fig. 2 is another schematic flow chart of a voiceprint feature updating method according to an embodiment of the present application, where the flow chart may include:
in 201, when the number of times that the voiceprint feature of the user is continuously unmatched with the owner voiceprint feature stored in the terminal reaches a preset value, the terminal performs owner identity authentication on the user.
For example, when a user completes a certain operation through a voice control terminal, the terminal needs to extract a voiceprint feature from a voice uttered by the user, and match the voiceprint feature with a voiceprint feature of a user owner stored in the terminal. And only when the two are matched, the terminal can execute corresponding operation. And if the voiceprint feature of the current user acquired by the terminal is not matched with the voiceprint feature of the owner stored in the terminal continuously, and the continuous unmatched times reach a preset numerical value, the terminal can carry out owner identity verification on the user.
For example, when the voiceprint feature of the user does not match the voiceprint feature of the owner stored in the terminal for 5 consecutive times, the terminal may perform owner authentication on the user. In one embodiment, the terminal may perform the owner identity verification on the user through one or more of fingerprint recognition, iris recognition, face recognition, or password verification.
If the user does not pass the owner identity authentication, the terminal can judge the user as an illegal user and refuse to execute corresponding operation.
If the user passes the owner identity verification, 202 is entered.
In 202, if the owner identity verification passes, the terminal determines that the saved voiceprint feature of the owner needs to be updated.
For example, if the current user passes the owner identity authentication, that is, the current user is actually the owner, but the voiceprint feature of the current user cannot be matched with the voiceprint feature of the owner stored in the terminal, it may be considered that the voiceprint feature changes due to reasons such as throat inflammation, so that the voiceprint feature cannot be successfully matched with the voiceprint feature of the owner stored in the terminal. In this case, the terminal may determine that the voiceprint feature of the owner stored in the terminal needs to be updated.
In 203, the terminal acquires a history that the recording process is started.
For example, when it is determined that the voiceprint feature of the owner stored in the terminal needs to be updated, the terminal may obtain a history that the recording process is started. It is understood that the history of the recording process being started may include information such as the time the recording process was started.
It should be noted that the recording process in the terminal may be started by the terminal when the user makes a call, or by the terminal when the user sends voice information when using the instant messaging application, or by the terminal when the user starts the recording application to record an audio/video file, or the like. In these cases, the terminal recording process in the terminal wakes up the microphone and picks up the user's voice using the microphone.
At 204, the terminal predicts the time when the recording process is started next time according to the history record, and acquires the current time.
For example, after obtaining a history record of the recording process in the terminal being started, the terminal may predict the time when the recording process is started next time according to the history record, and obtain the current time.
Then, the terminal may obtain a predicted interval between the next time the recording process is started and the current time, and detect whether the interval is less than or equal to a preset interval.
If it is detected that the interval is less than or equal to the preset interval, 205 is entered.
If the interval is detected to be greater than the preset interval, then 206 is entered.
In 205, if the interval between the next time the recording process is started and the current time is less than or equal to the preset interval, the terminal obtains the voice information from the host when the recording process is started next time.
For example, if the terminal detects that the interval between the current time and the predicted time when the recording process is started next time is less than or equal to the preset interval, the terminal may obtain the voice information from the host when the recording process is started next time.
For example, on a weekday, a user may frequently make voice calls with friends or buddies during various time periods of the day. The terminal predicts the time of the next time that the recording process is started to be 10:00 and the current time to be 9:55 according to the history that the recording process is started, and the time interval between the time and the current time is 5 minutes and is less than the preset interval of 30 minutes. In this case, the terminal may acquire the voice information uttered by the owner when the recording process is turned on next time.
After the owner's voice is obtained, 209 may be entered.
It can be understood that, in this embodiment, the terminal predicts the time when the recording process is started next time through the history of the start of the recording process, and if the interval between the time and the current time is short, the terminal may obtain the voice uttered by the user when the recording process is started next time. By the method, the voice used for updating the voiceprint can be acquired under the condition that the user feels no, the user does not need to be reminded to output a section of voice additionally, extra time of the user can not be occupied, and therefore user experience is improved.
In 206, if the interval between the next time the recording process is started and the current time is greater than the preset interval, the terminal generates text information.
At 207, the terminal presents the text message to the owner and prompts the owner to read the text message.
At 208, upon detecting that the owner speaks the text message, the terminal obtains voice information from the owner.
For example, 206, 207, and 208 may include:
for example, according to the history that the recording process is started, the terminal predicts that the interval between the time when the recording process is started next time and the current time is larger than a preset interval, for example, the history of the recording process shows that the owner rarely makes a call or sends voice information to friends, buddies and the like. In this case, for example, the terminal predicts that the recording process is started next time after 24 hours, the terminal may generate the text message.
After generating the text message, the terminal may display the text message on a screen, thereby presenting the text message to the owner and prompting the owner to read the text message aloud.
When detecting that the owner reads the text information, the terminal can acquire the voice information sent by the owner. After the voice information of the owner is acquired, 209 is entered.
It can be understood that, in this embodiment, when the interval between the time when the recording process predicted by the terminal is started next time and the current time is longer, the terminal may generate the text message and prompt the owner to read the text message, so that when the owner reads the text message, the owner obtains the voice of the owner. By the mode, the voiceprint feature of the owner can be updated rapidly, so that the user can use the terminal conveniently.
In 209, the terminal extracts the target voiceprint features from the voice information.
At 210, the terminal updates the saved voiceprint characteristics of the owner based on the target voiceprint characteristics.
For example, 209 and 210 may include:
after the voice of the owner is acquired, the terminal can extract the target voiceprint feature from the voice, and then update the previously stored voiceprint feature of the owner according to the target voiceprint feature.
In an implementation manner, after the step of obtaining the current time in 204, this embodiment may further include the following steps:
if the interval between the time of starting the recording process next time and the current time is larger than the preset interval, the terminal prompts the owner to output random voice, wherein the voice time needs to reach the preset time;
when the owner outputs random voice, the terminal acquires voice information from the owner.
For example, according to the history that the recording process is started, the terminal predicts that the interval between the next time the recording process is started and the current time is greater than the preset interval, for example, the terminal predicts that the next time the recording process is started is 24 hours later, then the terminal may prompt the owner to randomly output a section of voice at this time, and the duration of the voice needs to reach the preset duration. For example, the duration of the voice needs to be at least 30 seconds.
When detecting that the owner outputs random voice, the terminal can acquire the voice information of the owner.
Then, the terminal may extract a target voiceprint feature from the voice, and then update the previously saved voiceprint feature of the owner according to the target voiceprint feature.
In some embodiments, after updating the voiceprint feature of the owner, the terminal may further obtain physiological feature information of the owner, and predict a time period required for the sound of the owner to return to normal according to the physiological feature information. For example, if the voiceprint characteristics of the owner change due to throat inflammation of the owner, the terminal can acquire physiological characteristic information of the owner and predict the time required for the throat inflammation of the owner to heal according to the physiological characteristic information. After the inflammation of the throat of the owner is healed, the voice of the owner is recovered to be normal, so the voice print characteristic of the owner stored in the terminal can be updated again by the terminal, and the use of a user is facilitated.
Referring to fig. 3 to 5, fig. 3 to 5 are schematic scene diagrams of a voiceprint feature updating method according to an embodiment of the present application.
The unique characteristics of the voiceprint are determined primarily by two factors, the first being the size of the vocal cavity, including specifically the throat, nasal cavity, oral cavity, etc., the shape, size and location of these organs determining the magnitude of vocal cord tension and the range of vocal frequencies. The second factor that determines the characteristics of the sound is the manner in which the organs of the sound are manipulated, including the muscles of the lips, teeth, tongue, soft palate, and palate, which interact to produce clear speech. That is, the voiceprint is closely related to the physiological characteristics of the human body. In everyday life, for example, a user's cold with inflamed throat can cause the user's voice to become dull. In this case, the user's voiceprint characteristics may change.
For example, a user may experience a change in vocal print characteristics due to inflammation of the throat. In this case, the user issues a voice instruction of "xiaohu please open the instant messaging application a" to the terminal, as shown in fig. 3. After receiving the voice command, the terminal can extract the voiceprint feature of the current user from the voice command so as to verify the validity of the identity of the current user.
For example, the terminal detects that the voiceprint feature of the current user does not match the voiceprint feature of the owner saved in advance. At this time, the terminal may prompt the user that the voiceprints are not matched, the voice instruction for opening the instant messaging application a cannot be executed, and prompt the user to re-input the voice instruction. For example, the user then utters "xiaohu" twice in succession, please open the instant messaging application a ", and the voiceprint features extracted by the terminal from the two speeches also do not match the voiceprint features of the owner that are saved in advance.
Because the voiceprint feature of the current user is not matched with the voiceprint feature of the owner saved in advance for 3 times continuously, the terminal can carry out owner identity verification on the current user at the moment. For example, as shown in fig. 4, the terminal may prompt the user to enter fingerprint information to verify the owner identity.
Thereafter, the current user inputs fingerprint information. The terminal can match the fingerprint information with the fingerprint information of the owner which is stored in advance. For example, the fingerprint information of the current user is successfully matched with the fingerprint information of the owner saved in advance, so that the terminal can determine that the current user is the owner. In this case, the terminal may determine that the voiceprint feature of the owner stored in the terminal needs to be updated.
When it is determined that the voiceprint feature of the owner stored in the terminal needs to be updated, the terminal can acquire the history record of the starting of the recording process, predict the time of the starting of the recording process next time according to the history record, and then acquire the current time.
Then, the terminal may obtain a predicted interval between the next time the recording process is started and the current time, and detect whether the interval is less than or equal to a preset interval. For example, the preset interval is 15 minutes.
For example, the time when the terminal predicts that the recording process is started next time is 10:00 of today, while the current time is 09:05, and the interval between the two is 55 minutes and is larger than the preset interval by 15 minutes. In this case, the terminal may generate a piece of text information and display the text information on the screen while prompting the current user to read the piece of text information, as shown in fig. 5.
When the text information read aloud by the user is detected, the terminal can acquire the voice information of the current user. Then, the terminal may extract voiceprint feature information of the current user from the voice information and determine it as a target voiceprint feature. Then, the terminal can update the previously saved voiceprint feature of the owner in the terminal according to the target voiceprint feature.
Referring to fig. 6, fig. 6 is a schematic structural diagram of a voiceprint feature updating apparatus according to an embodiment of the present application. The voiceprint feature updating apparatus 300 may include: a first obtaining module 301, a second obtaining module 302, an extracting module 303, and an updating module 304.
A first obtaining module 301, configured to obtain a history record that a recording process in the terminal is started when it is determined that a voiceprint feature of a terminal owner needs to be updated.
A second obtaining module 302, configured to obtain voice information from the owner according to the history.
An extracting module 303, configured to extract a target voiceprint feature from the voice information.
An updating module 304, configured to update the voiceprint feature of the owner stored in the terminal according to the target voiceprint feature.
In one embodiment, the second obtaining module 302 may be configured to:
predicting the time of starting the recording process in the terminal next time according to the history record, and acquiring the current time;
and if the interval between the time when the recording process is started next time and the current time is less than or equal to the preset interval, acquiring voice information from the owner when the recording process is started next time.
In one embodiment, the second obtaining module 302 may be configured to:
if the interval between the time when the recording process is started next time and the current time is larger than the preset interval, generating text information;
displaying the text information to the owner and prompting the owner to read the text information;
and when the text information is detected to be read aloud by the owner, acquiring voice information from the owner.
In one embodiment, the second obtaining module 302 may be configured to:
if the interval between the time when the recording process is started next time and the current time is larger than a preset interval, prompting the owner to output random voice, wherein the voice time needs to reach the preset time;
and when the owner outputs random voice, acquiring voice information from the owner.
Referring to fig. 7, fig. 7 is another schematic structural diagram of a voiceprint feature update apparatus according to an embodiment of the present application. In an embodiment, the voiceprint feature updating apparatus 300 may further include: a module 305 is determined.
A determining module 305, configured to perform owner identity verification on the user when the number of times that the voiceprint feature of the user is continuously unmatched with the owner voiceprint feature stored in the terminal reaches a preset value; and if the owner identity authentication is passed, determining that the voiceprint feature of the owner stored in the terminal needs to be updated.
The present application provides a computer-readable storage medium, on which a computer program is stored, and when the computer program is executed on a computer, the computer is caused to execute the steps in the voiceprint feature updating method provided in this embodiment.
The embodiment of the present application further provides an electronic device, which includes a memory and a processor, where the processor is configured to execute the steps in the voiceprint feature updating method provided in this embodiment by calling the computer program stored in the memory.
For example, the electronic device may be a mobile terminal such as a tablet computer or a smart phone. Referring to fig. 8, fig. 8 is a schematic structural diagram of a mobile terminal according to an embodiment of the present application.
The mobile terminal 400 may include components such as a microphone 401, memory 402, processor 403, and the like. Those skilled in the art will appreciate that the mobile terminal architecture shown in fig. 8 is not intended to be limiting of mobile terminals and may include more or fewer components than those shown, or some components may be combined, or a different arrangement of components.
The microphone 401 may be used to pick up voice information uttered by the user, and the like.
The memory 402 may be used to store applications and data. The memory 402 stores applications containing executable code. The application programs may constitute various functional modules. The processor 403 executes various functional applications and data processing by running an application program stored in the memory 402.
The processor 403 is a control center of the mobile terminal, connects various parts of the entire mobile terminal using various interfaces and lines, and performs various functions of the mobile terminal and processes data by running or executing an application program stored in the memory 402 and calling data stored in the memory 402, thereby performing overall monitoring of the mobile terminal.
In this embodiment, the processor 403 in the mobile terminal loads the executable code corresponding to the process of one or more application programs into the memory 402 according to the following instructions, and the processor 403 runs the application programs stored in the memory 402, thereby implementing the steps:
when the voiceprint feature of a terminal owner needs to be updated, acquiring a history record of starting a recording process in the terminal;
acquiring voice information from the owner according to the history record;
extracting target voiceprint features from the voice information;
and updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
Referring to fig. 9, the mobile terminal 500 may include a microphone 501, a memory 502, a processor 503, an input unit 504, an output unit 505, and the like.
The microphone 501 may be used to pick up voice information uttered by a user, etc.
The memory 502 may be used to store applications and data. Memory 502 stores applications containing executable code. The application programs may constitute various functional modules. The processor 503 executes various functional applications and data processing by running an application program stored in the memory 502.
The processor 503 is a control center of the mobile terminal, connects various parts of the entire mobile terminal using various interfaces and lines, and performs various functions of the mobile terminal and processes data by running or executing an application program stored in the memory 502 and calling data stored in the memory 502, thereby performing overall monitoring of the mobile terminal.
The input unit 504 may be used to receive input numbers, character information, or user characteristic information (such as a fingerprint), and to generate keyboard, mouse, joystick, optical, or trackball signal inputs related to user settings and function control.
The output unit 505 may be used to display information input by or provided to a user and various graphic user interfaces of the mobile terminal, which may be configured by graphics, text, icons, video, and any combination thereof. The output unit may include a display panel.
In this embodiment, the processor 503 in the mobile terminal loads the executable code corresponding to the process of one or more application programs into the memory 502 according to the following instructions, and the processor 503 runs the application programs stored in the memory 502, thereby implementing the steps:
when the voiceprint feature of a terminal owner needs to be updated, acquiring a history record of starting a recording process in the terminal;
acquiring voice information from the owner according to the history record;
extracting target voiceprint features from the voice information;
and updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
In one embodiment, when the processor 503 executes the step of obtaining the voice information from the owner according to the history, it may execute: predicting the time of starting the recording process in the terminal next time according to the history record, and acquiring the current time; and if the interval between the time when the recording process is started next time and the current time is less than or equal to the preset interval, acquiring voice information from the owner when the recording process is started next time.
In one embodiment, after the step of obtaining the current time, the processor 503 may further perform: if the interval between the time when the recording process is started next time and the current time is larger than the preset interval, generating text information; displaying the text information to the owner and prompting the owner to read the text information; and when the text information is detected to be read aloud by the owner, acquiring voice information from the owner.
In one embodiment, after the step of obtaining the current time, the processor 503 may further perform: if the interval between the time when the recording process is started next time and the current time is larger than a preset interval, prompting the owner to output random voice, wherein the voice time needs to reach the preset time; and when the owner outputs random voice, acquiring voice information from the owner.
In one embodiment, when the processor 503 performs the step of determining that the voiceprint feature of the terminal owner needs to be updated, it may perform: when the number of times that the voiceprint characteristics of the user are continuously unmatched with the main voiceprint characteristics stored in the terminal reaches a preset value, performing main identity authentication on the user; and if the owner identity authentication is passed, determining that the voiceprint feature of the owner stored in the terminal needs to be updated.
In the above embodiments, the descriptions of the embodiments have respective emphasis, and a part that is not described in detail in a certain embodiment may refer to the above detailed description of the voiceprint feature updating method, and is not described herein again.
The voiceprint feature updating device provided in the embodiment of the present application and the voiceprint feature updating method in the above embodiment belong to the same concept, and any one of the methods provided in the voiceprint feature updating method embodiment may be run on the voiceprint feature updating device, and a specific implementation process thereof is described in the voiceprint feature updating method embodiment, and is not described herein again.
It should be noted that, for the voiceprint feature updating method described in the embodiment of the present application, it can be understood by those skilled in the art that all or part of the process of implementing the voiceprint feature updating method described in the embodiment of the present application can be implemented by controlling the related hardware through a computer program, where the computer program can be stored in a computer-readable storage medium, such as a memory, and executed by at least one processor, and during the execution, the process of the embodiment of the voiceprint feature updating method can be included. The storage medium may be a magnetic disk, an optical disk, a Read Only Memory (ROM), a Random Access Memory (RAM), or the like.
For the voiceprint feature updating apparatus in the embodiment of the present application, each functional module may be integrated into one processing chip, or each module may exist alone physically, or two or more modules are integrated into one module. The integrated module can be realized in a hardware mode, and can also be realized in a software functional module mode. The integrated module, if implemented in the form of a software functional module and sold or used as a stand-alone product, may also be stored in a computer readable storage medium, such as a read-only memory, a magnetic or optical disk, or the like.
The voiceprint feature updating method, the voiceprint feature updating device, the storage medium and the electronic device provided by the embodiment of the application are introduced in detail, a specific example is applied in the description to explain the principle and the implementation manner of the invention, and the description of the embodiment is only used for helping to understand the method and the core idea of the invention; meanwhile, for those skilled in the art, according to the idea of the present invention, there may be variations in the specific embodiments and the application scope, and in summary, the content of the present specification should not be construed as a limitation to the present invention.

Claims (8)

1. A voiceprint feature update method, comprising:
when determining that the voiceprint feature of the terminal owner changes and needs to be updated, acquiring a history record of starting a recording process in the terminal;
acquiring voice information from the owner according to the history record, predicting the time when a recording process in the terminal is started next time according to the history record, and acquiring the current time; if the interval between the time when the recording process is started next time and the current time is less than or equal to the preset interval, acquiring voice information from the owner when the recording process is started next time;
extracting target voiceprint features from the voice information;
and updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
2. The voiceprint feature update method according to claim 1, further comprising, after the step of obtaining the current time:
if the interval between the time when the recording process is started next time and the current time is larger than the preset interval, generating text information;
displaying the text information to the owner and prompting the owner to read the text information;
and when the text information is detected to be read aloud by the owner, acquiring voice information from the owner.
3. The voiceprint feature update method according to claim 1, further comprising, after the step of obtaining the current time:
if the interval between the time when the recording process is started next time and the current time is larger than a preset interval, prompting the owner to output random voice, wherein the voice time needs to reach the preset time;
and when the owner outputs random voice, acquiring voice information from the owner.
4. The voiceprint feature updating method according to claim 1, wherein the step of determining that the voiceprint feature of the terminal owner needs to be updated comprises:
when the number of times that the voiceprint characteristics of the user are continuously unmatched with the main voiceprint characteristics stored in the terminal reaches a preset value, performing main identity authentication on the user;
and if the owner identity authentication is passed, determining that the voiceprint feature of the owner stored in the terminal needs to be updated.
5. A voiceprint feature update apparatus, comprising:
the first acquisition module is used for acquiring a history record of starting a recording process in the terminal when the change of the voiceprint characteristics of the terminal owner is determined and the voiceprint characteristics of the terminal owner need to be updated;
the second acquisition module is used for acquiring voice information from the owner according to the history record, predicting the time when the recording process in the terminal is started next time according to the history record and acquiring the current time; if the interval between the time when the recording process is started next time and the current time is less than or equal to the preset interval, acquiring voice information from the owner when the recording process is started next time;
the extraction module is used for extracting target voiceprint characteristics from the voice information;
and the updating module is used for updating the voiceprint characteristics of the owner stored in the terminal according to the target voiceprint characteristics.
6. The apparatus according to claim 5, wherein the second obtaining module is further configured to:
if the interval between the time when the recording process is started next time and the current time is larger than the preset interval, generating text information;
displaying the text information to the owner and prompting the owner to read the text information;
and when the text information is detected to be read aloud by the owner, acquiring voice information from the owner.
7. A storage medium having stored thereon a computer program, characterized in that the computer program, when executed on a computer, causes the computer to execute the method according to any of claims 1 to 4.
8. An electronic device comprising a memory, a processor, wherein the processor is configured to perform the method of any of claims 1 to 4 by invoking a computer program stored in the memory.
CN201810632316.0A 2018-06-19 2018-06-19 Voiceprint feature updating method and device, storage medium and electronic equipment Active CN108847243B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810632316.0A CN108847243B (en) 2018-06-19 2018-06-19 Voiceprint feature updating method and device, storage medium and electronic equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810632316.0A CN108847243B (en) 2018-06-19 2018-06-19 Voiceprint feature updating method and device, storage medium and electronic equipment

Publications (2)

Publication Number Publication Date
CN108847243A CN108847243A (en) 2018-11-20
CN108847243B true CN108847243B (en) 2020-07-07

Family

ID=64202963

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810632316.0A Active CN108847243B (en) 2018-06-19 2018-06-19 Voiceprint feature updating method and device, storage medium and electronic equipment

Country Status (1)

Country Link
CN (1) CN108847243B (en)

Families Citing this family (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110459242A (en) * 2019-08-21 2019-11-15 广州国音智能科技有限公司 Change of voice detection method, terminal and computer readable storage medium
CN110660398B (en) * 2019-09-19 2020-11-20 北京三快在线科技有限公司 Voiceprint feature updating method and device, computer equipment and storage medium
CN112580390B (en) * 2019-09-27 2023-10-17 百度在线网络技术(北京)有限公司 Security monitoring method and device based on intelligent sound box, sound box and medium
CN111091837A (en) * 2019-12-27 2020-05-01 中国人民解放军陆军工程大学 Time-varying voiceprint authentication method and system based on online learning

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1905445B (en) * 2005-07-27 2012-02-15 国际商业机器公司 System and method of speech identification using mobile speech identification card
US9154482B2 (en) * 2013-02-15 2015-10-06 Verizon Patent And Licensing Inc. Secure access credential updating
US10764424B2 (en) * 2014-12-05 2020-09-01 Microsoft Technology Licensing, Llc Intelligent digital assistant alarm system for application collaboration with notification presentation
CN105955818A (en) * 2016-04-15 2016-09-21 奇酷软件(深圳)有限公司 Reminding method, reminding device and terminal

Also Published As

Publication number Publication date
CN108847243A (en) 2018-11-20

Similar Documents

Publication Publication Date Title
CN108847243B (en) Voiceprint feature updating method and device, storage medium and electronic equipment
CN106782536B (en) Voice awakening method and device
CN108831477B (en) Voice recognition method, device, equipment and storage medium
CN110505504B (en) Video program processing method and device, computer equipment and storage medium
CN110225386A (en) A kind of display control method, display equipment
CN111105796A (en) Wireless earphone control device and control method, and voice control setting method and system
CN111261195A (en) Audio testing method and device, storage medium and electronic equipment
CN111312222A (en) Awakening and voice recognition model training method and device
CN110544473A (en) Voice interaction method and device
CN113327620A (en) Voiceprint recognition method and device
CN110580897B (en) Audio verification method and device, storage medium and electronic equipment
CN109032554A (en) A kind of audio-frequency processing method and electronic equipment
WO2019228135A1 (en) Method and device for adjusting matching threshold, storage medium and electronic device
CN111081260A (en) Method and system for identifying voiceprint of awakening word
CN110858479B (en) Voice recognition model updating method and device, storage medium and electronic equipment
KR102501083B1 (en) Method for voice detection and electronic device using the same
US20210082405A1 (en) Method for Location Reminder and Electronic Device
WO2019041871A1 (en) Voice object recognition method and device
CN111641751B (en) Screen unlocking method and device of terminal equipment, terminal equipment and storage medium
CN111161745A (en) Awakening method, device, equipment and medium for intelligent equipment
CN109064720B (en) Position prompting method and device, storage medium and electronic equipment
CN108922523B (en) Position prompting method and device, storage medium and electronic equipment
CN108877773B (en) Voice recognition method and electronic equipment
CN111124512B (en) Awakening method, device, equipment and medium for intelligent equipment
CN111754989B (en) Avoiding method for voice false wake-up and electronic equipment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant