WO2020024415A1 - Voiceprint recognition processing method and apparatus, electronic device and storage medium - Google Patents

Voiceprint recognition processing method and apparatus, electronic device and storage medium Download PDF

Info

Publication number
WO2020024415A1
WO2020024415A1 PCT/CN2018/107954 CN2018107954W WO2020024415A1 WO 2020024415 A1 WO2020024415 A1 WO 2020024415A1 CN 2018107954 W CN2018107954 W CN 2018107954W WO 2020024415 A1 WO2020024415 A1 WO 2020024415A1
Authority
WO
WIPO (PCT)
Prior art keywords
random code
voiceprint recognition
voice
voiceprint
prompt information
Prior art date
Application number
PCT/CN2018/107954
Other languages
French (fr)
Chinese (zh)
Inventor
潘燕飞
Original Assignee
平安科技(深圳)有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 平安科技(深圳)有限公司 filed Critical 平安科技(深圳)有限公司
Publication of WO2020024415A1 publication Critical patent/WO2020024415A1/en

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/06Decision making techniques; Pattern matching strategies

Definitions

  • the present application relates to the technical field of voiceprint recognition, and in particular, to a voiceprint recognition processing method, device, electronic device, and storage medium.
  • Voiceprint recognition is a system that recognizes the identity of a speaker based on the characteristics of the person's voice, and uses voice to verify the identity of the speaker. This technology has good features such as convenience, stability, measurability, accuracy, and security. As a non-contact collection and identification technology, voiceprint has low acquisition cost, convenient acquisition, and simple use. It has great application prospects in banks, social security, public security, smart home, and mobile payment.
  • the embodiments of the present application provide a voiceprint recognition processing method, device, electronic device, and storage medium, which can solve the security problem caused by the inability to determine the sound source in voiceprint recognition.
  • an embodiment of the present application provides a voiceprint recognition processing method.
  • the method includes: if an instruction to recognize a voiceprint is obtained, outputting prompt information including a first random code, where the prompt information refers to The information provided by the identified person should contain information to prompt; obtain the speech provided by the identified person containing the first random code; convert the speech to text through speech recognition, and extract the text containing A second random code; and verifying the second random code using the first random code; if the second random code is successfully verified, performing voiceprint recognition to obtain a voiceprint recognition result.
  • an embodiment of the present application further provides a voiceprint recognition processing device, wherein the device includes: an output unit configured to output prompt information including a first random code if an instruction to recognize the voiceprint is obtained, so that The prompt information refers to information for prompting the content provided by the identified person's voice; the acquisition unit is used to acquire the voice provided by the identified person containing the first random code; the extraction unit is used to pass Speech recognition, converting the speech into text, and extracting a second random code included in the text; and a verification unit for verifying the second random code using the first random code, if The second random code is successfully verified, and voiceprint recognition is performed to obtain a voiceprint recognition result.
  • the device includes: an output unit configured to output prompt information including a first random code if an instruction to recognize the voiceprint is obtained, so that The prompt information refers to information for prompting the content provided by the identified person's voice; the acquisition unit is used to acquire the voice provided by the identified person containing the first random code; the extraction unit is used to pass Speech recognition, converting the
  • an embodiment of the present application further provides an electronic device including a memory and a processor.
  • the memory stores a computer program
  • the processor implements the voiceprint recognition processing method when the processor executes the computer program.
  • an embodiment of the present application further provides a computer-readable storage medium.
  • the storage medium stores a computer program, where the computer program includes program instructions, and the program instructions can implement the foregoing sound when executed by a processor. Pattern recognition processing method.
  • Embodiments of the present application provide a voiceprint recognition processing method, device, electronic device, and storage medium.
  • the voiceprint recognition device outputs prompt information including a first random code, thereby obtaining a voice provided by the identified person, obtaining the second random code from the voice, and converting the first random code. Match the second random code, and if the second random code is successfully verified, further perform voiceprint recognition of the voice to obtain a voiceprint recognition result, which can ensure that the voice provider is a living body, thereby improving voice Pattern recognition security.
  • FIG. 1 is a schematic diagram of an application scenario of a voiceprint recognition processing method according to an embodiment of the present application
  • FIG. 2 is a schematic flowchart of a voiceprint recognition and processing method according to an embodiment of the present application
  • FIG. 3 is a schematic flowchart of a voiceprint recognition processing method according to another embodiment of the present application.
  • FIG. 4 is a schematic block diagram of a voiceprint recognition processing apparatus according to an embodiment of the present application.
  • FIG. 5 is a schematic block diagram of an electronic device according to an embodiment of the present application.
  • FIG. 1 is a schematic diagram of an application scenario of a voiceprint recognition processing method according to an embodiment of the present application.
  • the application scenario includes:
  • Voiceprint recognition device an electronic device, can be a terminal device. Voiceprint recognition is performed by acquiring the voice provided by the identified person.
  • the voiceprint recognition device can be obtained through the microphone component included in the voiceprint recognition device itself. The voice of the identified person.
  • each subject in Figure 1 The working process of each subject in Figure 1 is as follows: if the voiceprint recognition device obtains an instruction to recognize the voiceprint, it outputs a prompt message, and the identified person sends out a voice according to the prompt information. The voiceprint recognition result is obtained, and the identified person is identified and identified.
  • FIG. 1 only shows one identified person. In the actual operation process, there can be multiple identified persons.
  • the voiceprint recognition device can be a separate electronic device terminal or other electronic devices. Components, components or functional units included in other electronic devices, such as the functional units of smart terminal devices, to complete all or part of the functions of voiceprint recognition.
  • the application scenario of the above voiceprint recognition processing method is only used to illustrate the technology of this application The solution is not intended to limit the technical solution of the present application.
  • FIG. 2 is a schematic flowchart of a voiceprint recognition processing method according to an embodiment of the present application.
  • the voiceprint recognition processing method is applied to the voiceprint recognition device in FIG. 1.
  • the voiceprint recognition device may be a separate electronic device terminal, or may be a component or component of another electronic device to complete all of the voiceprint recognition. Or some features.
  • FIG. 2 is a schematic flowchart of a voiceprint recognition processing method according to an embodiment of the present application. As shown in Figure 2, the method includes the following steps S210-S240:
  • prompt information including a first random code is output, and the prompt information refers to information used to prompt the content provided by the identified person's voice.
  • voiceprint recognition refers to the identification of the speaker's identity based on the characteristics of the speaker's sound waves, and to identify or confirm the identity of the speaker who issued the voice.
  • Voiceprint recognition processing is widely used in wisdom that requires confirmation of the identity of the person. Construction, smart home, financial security and other fields.
  • the random code is a series of characters with a predetermined number of bits randomly generated by the voiceprint recognition device.
  • the random code can include numbers, characters, letters, and combinations of the above forms.
  • the random code can be "6589", "technology”, “jym”, or "6589jym”.
  • the prompt information refers to information for prompting the content that should be included in the voice provided by the identified person, and is used to make a clear prompt for the content that should be included in the voice provided by the user.
  • the prompt information shown may include random information.
  • the code can also include environmental information for voiceprint recognition, such as the address and floor information of the identified person, or the company address and company name of the identified person, which are used to determine the identity of the identified person and can be narrowed down in the database. Data matching range.
  • the prompt information may further include a segment of random text for the identified person to provide voice according to the prompt random text. Further, the provided random text ensures that the voiceprint recognition voice provider is a living body, further ensuring the security of voiceprint recognition.
  • the voiceprint recognition can be effectively applied to the security access control of public areas of a building, for example, the access control of a residential area or an office area.
  • the usual method is to obtain a voice, and then match the voiceprint in the database based on the acquired voice to verify the identity of the identified person.
  • this method cannot determine whether the acquired voice is provided by a living body or a recording obtained through a recording medium. Therefore, voiceprint recognition has a security problem.
  • the voiceprint recognition device obtains an instruction to recognize a voiceprint, it outputs prompt information including a first random code, and the prompt information is used to prompt the content that the voice provided by the identified person should contain.
  • the voiceprint recognition device detects the human body to activate voiceprint recognition through infrared detection, or activate the voiceprint recognition through the access control button. If the voiceprint recognition device activates voiceprint recognition, when performing personnel recognition, the voiceprint recognition device outputs prompt information including the first random code, so that the identified person provides a voice including the first random code, and the voiceprint recognition The device obtains the voice provided by the identified person, analyzes the content of the first random code through the voiceprint recognition processing technology, and verifies the first random code to realize the live detection in the voiceprint recognition processing to prevent recording counterfeiting. Personnel perform voiceprint recognition to achieve the purpose of living body detection, ensure the security of voiceprint recognition processing, and further identify the identity of the identified person based on the voiceprint characteristics of the identified person.
  • the user who logs in to the account can also be authenticated through voiceprint recognition processing.
  • the terminal enters the personnel identification interface, the terminal can also output a prompt message containing a first random code. , Verify the voiceprint recognition process for the identity of the person who logged in to the account.
  • the voiceprint recognition device outputs prompt information including the first random code
  • the prompt information may be the prompt information displayed by the voiceprint recognition device in a text form on the display interface of the voiceprint recognition device, or the voiceprint
  • the recognition device prompts the user to provide the prompt information included in the voice in the form of a voice broadcast, and may also simultaneously display in the form of a text display and a voice broadcast.
  • the identified person provides a voice including the first random code according to the prompt information output by the voiceprint recognition device, For example, the identified person reads the first random code "6589", “technology”, “jym” or “6589jym”, etc., for verification by the voiceprint recognition device when the voiceprint recognition device performs the voiceprint recognition.
  • the voice provider is a living body, and the voiceprint recognition device obtains the voice provided by the recognized person through a component such as a microphone.
  • S230 Convert the voice into text through speech recognition, and extract a second random code included in the text.
  • ASR Automatic Speech Recognition
  • the voiceprint recognition device uses ASR voice recognition technology to convert the voice into text and segment the converted text, according to the first paragraph defined in the prompt information in step S220.
  • Information of a random code the information of the first random code including the number of bits, content of the first random code, and the position of the first random code in the voice, etc. are extracted from the converted text The second random code. For example, if the information of the first random code included in step S220 includes a two-digit random code at the front of the speech, then in the converted text, the first two digits are taken as the second random code.
  • S240 Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  • the voiceprint recognition device implements verification of the second random code by comparing the first random code with the second random code, and then determines whether the identified person provided by the voice is a living body. If the first random code and the second random code are the same, it means that the identified person provided by the voice is alive, otherwise, it indicates that the identified person provided by the voice may be provided by a recording, which is not safe.
  • the second random code contained in the text obtained by the voiceprint recognition device through the voice recognition technology is different from the first random code output by the voiceprint recognition device, the second random code check fails, and a jump occurs There is a process, and the voiceprint recognition device prompts that the identification fails. Further, the prompt information including the first random code may be output again. Otherwise, if the second random code contained in the text obtained by the voice recognition technology is the same as the first random code output by the voiceprint recognition device, and the second random code is successfully checked, the voice is performed according to the voice. Pattern recognition processing, query to find out the voice of the person / person bank corresponding to this voice information, and then compare the voiceprints one to one or more than one. Through voiceprint recognition, get the voiceprint recognition results and perform user identification. Verification to ensure the security of voiceprint recognition processing.
  • the voiceprint recognition device outputs prompt information including a first random code, thereby obtaining a voice provided by the identified person, obtaining the second random code from the voice, and converting the first random code. Match with the second random code, if the second random code is successfully verified, and then perform voiceprint recognition of the voice to obtain a voiceprint recognition result, it can ensure that the voice provider is a living body, thereby improving the voice Pattern recognition security.
  • FIG. 3 is a schematic flowchart of a voiceprint recognition processing method according to another embodiment of the present application.
  • the voiceprint recognition processing method includes the following steps S310-S340:
  • prompt information including a first random code and preset content is output, where the prompt information refers to information used to prompt the content provided by the recognized voice.
  • the preset content is preset voice content provided by the user when the voiceprint recognition device is registered.
  • the preset content may be related to an application environment of voiceprint recognition, and the application environment includes a position or a name during voiceprint recognition. Users can set different preset contents according to different application environments. For example, in a residential area, the preset content may be "I live in xxx room xxx", while in an office area, the preset content may be "my company's address is xxx" or "my company's name is xxx" Wait.
  • the voiceprint recognition device obtains an instruction to recognize a voiceprint, it outputs prompt information including a first random code and preset content.
  • the voiceprint recognition device displays prompt information in a text form on a screen of the voiceprint recognition device through a display interface, or when the prompt information is output by a voice broadcast, the preset content uses a linguistic designation
  • the metaphor refers to that the specific content contained in the preset content is not explicitly prompted, and the preset content is not clear and specific.
  • the voiceprint recognition device prompts the user to enter a voice containing "first random code + my address”, or the voiceprint recognition device prompts the user to enter a voice containing "random code + my company's address” or "first random code + my company” Name "in the form of voice, etc., without prompting" My address "," My company's address "or” My company's name ".
  • the preset content can not only limit the identification of the identified person in different use environments, but also the voiceprint recognition device can reduce the range of data matching during voiceprint recognition through the preset content when performing voiceprint recognition. For example, in a residential area, if the preset content is "I live in xx building xx room", the prompt message output by the voiceprint recognition device is "random code + my address", and when the voiceprint recognition device performs voiceprint recognition If it is detected that the acquired voice content contains "I live in xx room xx room", when performing data matching in the database, the data range matched during voiceprint recognition can be narrowed down to the data range containing the "xx building" keyword It is not necessary to match the data in the entire database one by one when performing voiceprint recognition processing, thereby improving the efficiency of voiceprint recognition processing.
  • the voiceprint recognition device detects voiceprint data in the database that matches the voiceprint contained in the acquired voice information, it passes the identity verification of the identified person; otherwise, it does not pass the identity verification of the identified person.
  • the first random code and the prompt information of the preset content as the voice content of the voiceprint recognition process, not only can the verification of the random code ensure that the voiceprint recognition process is the voice provided by the living body, but also can pass the preset content.
  • the included user information guarantees security during voiceprint recognition processing, and can reduce the range of voiceprint data capacity matching during voiceprint recognition processing, improving the efficiency of voiceprint recognition processing.
  • the order of the first random code and the preset content is defined in the voice.
  • the voiceprint recognition device outputs prompt information including the first random code and preset content, and an order of the first random code and preset content is limited in the voice.
  • the sequence of voice content required by the voiceprint recognition device to be provided by the identified person may be "first random code + preset content", or “preset content + first random code”, or both of the above-mentioned sequential cycles, or the above Two orders are output randomly.
  • the recognition obtains the first random code and preset content included in the voice, and further uses the first random code and the preset content to perform voiceprint recognition processing verification, thereby achieving unpredictability provided by the voice.
  • the voiceprint recognition device performs voiceprint recognition, it can detect the sequence of voice content to further ensure that the voice is a real-time voice provided by the living body, and ensure the security of voiceprint recognition.
  • the voiceprint recognition device outputs prompt information including the first random code and preset content, and the prompt information may be displayed in text form on a display interface of the display screen of the voiceprint recognition device, such as "random code + during registration" "Preset content", or the voiceprint recognition device prompts the user with the content of the voice that should be provided in the form of a voice announcement.
  • the terminal device obtains a segment of speech provided by the identified person according to a specific use scenario, which includes the first random code and preset content.
  • the voiceprint recognition device prompts the prompt information that the voiceprint recognition requires for voiceprint recognition
  • the identified person provides a voice including the first random code and the preset content according to different usage scenarios, such as in In residential areas, presets including "random code and residential address provided during registration" can be provided, and in office areas, presets including "random code and office address or registered office name provided during registration” can be provided Content for voiceprint recognition processing by the voiceprint recognition device.
  • the voiceprint recognition device obtains a segment of speech provided by the identified person according to the current usage scenario, including the first random code and preset content.
  • the voiceprint recognition device obtains the speech provided by the identified person, the speech is converted into text by ASR speech recognition technology, and the converted text is segmented according to the first random included in the prompt information in step S330.
  • the position and content of the code and the preset content in the voice, and the number of digits of the second random code, and extract the information contained in the second random code and the preset content from the converted text By comparing the second random code extracted from the acquired voice with the first random code prompted by the voiceprint recognition device, and matching the corresponding data in a database according to the user information extracted from the acquired voice, In order to realize the identification of the user's identity through voiceprint recognition.
  • the voiceprint recognition device matches the voiceprint model in the database with xx building, thereby reducing the range of voiceprint model matching during voiceprint recognition and improving The efficiency of voiceprint recognition.
  • S340 Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition according to the preset content to obtain a voiceprint recognition result.
  • the voiceprint recognition device according to the first random code and the voice sequence of the preset content included in the prompt information, and the second random code obtained by segmenting the voice into text by voice recognition And comparing with the first random code. If the second random code contained in the text obtained by speech recognition is different from the first random code output by the voiceprint recognition device, the second random code check fails, and there is a flow, the voiceprint recognition device It is prompted that the identification fails, and further, the prompt information including the first random code and the preset content may be output again.
  • the Set the user information contained in the content perform voiceprint recognition matching within the data range corresponding to the user information, query to find the sound of the person / person library corresponding to this voice information, and then perform one-to-one or one-to-many Comparison of individual voiceprints, user authentication is performed through voiceprint recognition.
  • the preset content is "I live in xx building xx room”
  • the prompt message output by the voiceprint recognition device is "random code + my address”
  • the voiceprint recognition device performs voiceprint recognition If it is detected that the acquired voice content contains "I live in xx room xx room”, when performing data matching in the database, the data range matched during voiceprint recognition can be narrowed down to the data range containing the "xx building” keyword Or reduce the range of data for voiceprint recognition to family members in the “xx building xx room”, instead of matching the data in the entire database one by one for voiceprint recognition processing, thereby improving voiceprint recognition processing s efficiency.
  • the voiceprint recognition processing data stored in the database of the voiceprint recognition device has been reduced, thereby greatly reducing the voiceprint recognition processing.
  • the comparison amount improves the efficiency and accuracy of voiceprint recognition processing.
  • the step of outputting prompt information including the first random code further includes: the prompt information includes the position of the first random code in the voice.
  • the prompt information includes the position of the first random code in the voice, which refers to the order in which the first random code of the first random code in the voice is prompted in the prompt message, and is preset in the voice. Determining the position of the first random code. Pre-setting the position of the first random code in the voice means that the position of the first random code in the voice is predefined, for example, the identified person first speaks the first random code The position of the first random code in the voice is at the head of the voice, and the identified person finally speaks the first random code, and the position of the first random code in the voice is in the voice The tail of the speech.
  • the first random code included in the voice is obtained according to the position of the first random code in the voice, and the first random code is further verified.
  • the voiceprint recognition device detects only the first random code in the voice, and other voice content included in the voice is not considered.
  • the position of the first random code in the voice is limited.
  • the first random code is in the front of the voice, or the first random code is in the tail of the voice. Taking the first few digits or the last few digits of the speech-transformed text according to the number of bits of the first random code.
  • the position of the first random code in the voice included in the prompt information is randomly defined during each voiceprint recognition process.
  • the position of the first random code in the voice is randomly defined at each voiceprint recognition process, which means that the position of the first random code in the voice is not fixed at each voiceprint recognition process.
  • It can prompt the identified person that the first random code is in the front of the voice, in the middle of the voice, or in the tail of the voice.
  • the voiceprint recognition device randomly defines the The position of the first random code in the voice, and storing the position of the first random code to the identified person in the voice through the prompt information, and in the subsequent steps, according to each time the first A position of a random code in the voice acquires the second random code included in the voice, and further verifies the second random code.
  • the first random code is at the front of the speech once, and the first random code is at the front or tail of the next speech, etc., by the position of the first random code in the speech is Random limitation can realize more flexible security verification for voiceprint recognition processing.
  • the prompt information further includes that the voice is a voice within a preset time length.
  • the voiceprint recognition device requires the identified person to provide a voice within a preset time length.
  • the preset time length can be, for example, a voice within 15 seconds, or a voice between 15 seconds and 30 seconds.
  • the voiceprint recognition processing can be more accurately limited.
  • Conditions to improve the security of voiceprint recognition processing Since the preset time length of the voice is preset in the background, others will not easily know that by limiting the length of the voice provided by the identified person, the voiceprint recognition process can be prevented from being continuously tried, and the voiceprint recognition process can be further guaranteed. Security.
  • the time length of the voice is randomly limited, and the identified person is prompted to require the identified person to provide the speech within a preset time length, such as requiring the identified person to provide a period of speech within 15 seconds, or The identified person is required to provide a voice within 20 seconds, which can be randomly limited by the preset time length of the voice, and can also realize the detection of the living body in the voiceprint recognition processing to prevent the voice recorder from performing voiceprint recognition.
  • FIG. 4 is a schematic block diagram of a voiceprint recognition processing device according to an embodiment of the present application.
  • the voiceprint recognition processing device includes a unit for performing the above-mentioned voiceprint recognition processing method, and the device may be configured in an electronic device such as a desktop computer, a notebook, or a smart phone.
  • the voiceprint recognition processing device includes an output unit 401, an obtaining unit 402, an extraction unit 403, and a verification unit 404.
  • the output unit 401 is configured to output prompt information including a first random code if an instruction to recognize a voiceprint is obtained, where the prompt information refers to information used to prompt the content provided by the recognized voice;
  • An obtaining unit 402 configured to obtain a voice provided by the identified person and including the first random code
  • An extraction unit 403 configured to convert the speech into text through speech recognition, and extract a second random code included in the text
  • the verification unit 404 is configured to verify the second random code by using the first random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  • the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
  • the checking unit 404 is configured to check the second random code using the first random code, and if the second random code is successfully checked, perform voiceprint recognition according to the preset content, Get voiceprint recognition results.
  • the prompt information output by the output unit 401 includes a position of the first random code in the voice and the voice is a voice within a preset time length.
  • each unit in the voiceprint recognition processing device is only for illustration.
  • the voiceprint recognition processing device can be divided into different units as required, and the voiceprint recognition processing can also be Each unit in the device adopts different connection sequences and methods to complete all or part of the functions of the voiceprint recognition processing device.
  • the above-mentioned voiceprint recognition processing device can be implemented in the form of a computer program, which can be run on an electronic device as shown in FIG. 5.
  • FIG. 5 is a schematic block diagram of an electronic device according to an embodiment of the present application.
  • the electronic device 500 may be a terminal, or a component or component in another device.
  • the terminal may be an electronic device with a communication function, such as a desktop computer.
  • the electronic device 500 includes a processor 502, a memory, a network interface 505, and an audio input interface 506 connected through a system bus 501.
  • the memory may include a non-volatile storage medium 503 and an internal memory 504.
  • the non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032.
  • the computer program 5032 includes program instructions.
  • the processor 502 can execute one of the voiceprint recognition processing methods described above.
  • the processor 502 is used to provide computing and control capabilities to support the operation of the entire electronic device 500.
  • the internal memory 504 provides an environment for running a computer program 5032 in the non-volatile storage medium 503.
  • the processor 502 can execute one of the voiceprint recognition processing methods described above.
  • the network interface 505 is configured to perform network communication with other devices
  • the audio input interface 506 is configured to obtain a voice provided by an identified person
  • the audio input interface 506 may be a microphone (microphone) or the like.
  • FIG. 5 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the electronic device 500 to which the solution of the present application is applied.
  • the specific electronic device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
  • the processor 502 is configured to run a computer program 5032 stored in a memory to implement the voiceprint recognition processing method in the embodiment of the present application.
  • the processor 502 may be a central processing unit (CPU), and the processor 502 may also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), Application-specific integrated circuits (Application Specific Integrated Circuits, ASICs), ready-made programmable gate arrays (Field-Programmable Gate Arrays, FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • the general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
  • the computer program may be stored in a storage medium, and the storage medium is a computer-readable storage medium.
  • the computer program is executed by at least one processor in the computer system to implement the steps of the embodiment of the voiceprint recognition processing method described above.
  • an embodiment of the present application further provides a computer-readable storage medium.
  • the storage medium stores a computer program, and when the computer program is executed by the processor, the processor causes the processor to execute the steps of the voiceprint recognition processing method described in the foregoing embodiments.
  • the storage medium may be various computer-readable storage media that can store a computer program, such as a U disk, a mobile hard disk, a read-only memory (ROM), a magnetic disk, or an optical disk.
  • a computer program such as a U disk, a mobile hard disk, a read-only memory (ROM), a magnetic disk, or an optical disk.

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Computational Linguistics (AREA)
  • Business, Economics & Management (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Game Theory and Decision Science (AREA)
  • Telephonic Communication Services (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

Provided are a voiceprint recognition processing method and apparatus, an electronic device and a storage medium. The method comprises: outputting prompt information containing a first random code if an instruction for recognizing a voiceprint is obtained, wherein the prompt information refers to information used for prompting the content needing to be contained in speech provided by a recognized person (S210); acquiring speech which is provided by the recognized person and contains the first random code (S220); converting the speech into text by means of speech recognition, and extracting a second random code contained in the text (S230); and checking the second random code by using the first random code, and if the second random code is successfully checked, carrying out voiceprint recognition to obtain a voiceprint recognition result (S240).

Description

声纹识别处理方法、装置、电子设备及存储介质Voiceprint recognition processing method, device, electronic equipment and storage medium
本申请要求于2018年8月3日提交中国专利局、申请号为201810877973.1、申请名称为“声纹识别处理方法、装置、电子设备及存储介质”的中国专利申请的优先权,其全部内容通过引用结合在本申请中。This application claims the priority of a Chinese patent application filed with the Chinese Patent Office on August 3, 2018, with the application number 201810877973.1, and the application name is "Voiceprint Recognition Processing Method, Device, Electronic Equipment and Storage Medium" Citations are incorporated in this application.
技术领域Technical field
本申请涉及声纹识别技术领域,尤其涉及一种声纹识别处理方法、装置、电子设备及存储介质。The present application relates to the technical field of voiceprint recognition, and in particular, to a voiceprint recognition processing method, device, electronic device, and storage medium.
背景技术Background technique
声纹识别是根据人的声音的特质来识别说话人身份的系统,采用语音对说话人身份进行验证。这种技术具有较好的便捷性、稳定性、可测量性、准确性和安全性等特点。作为一种非接触式的采集、识别技术,声纹的获取成本较低、获取方便、使用简单,在银行、社保、公安、智能家居、移动支付等领域都有巨大应用前景。Voiceprint recognition is a system that recognizes the identity of a speaker based on the characteristics of the person's voice, and uses voice to verify the identity of the speaker. This technology has good features such as convenience, stability, measurability, accuracy, and security. As a non-contact collection and identification technology, voiceprint has low acquisition cost, convenient acquisition, and simple use. It has great application prospects in banks, social security, public security, smart home, and mobile payment.
传统的声纹识别中,通常采用的方式是,获取一段语音,根据获取的语音,在数据库中进行声纹的匹配,从而对被识别者身份进行验证。但此种声纹识别方式存在安全性问题。In traditional voiceprint recognition, the usual method is to obtain a voice, and then match the voiceprint in the database based on the acquired voice to verify the identity of the identified person. However, this voiceprint recognition method has security problems.
发明内容Summary of the invention
本申请实施例提供了一种声纹识别处理方法、装置、电子设备及存储介质,能够解决声纹识别中由于无法确定声音来源导致的安全性问题。The embodiments of the present application provide a voiceprint recognition processing method, device, electronic device, and storage medium, which can solve the security problem caused by the inability to determine the sound source in voiceprint recognition.
第一方面,本申请实施例提供了一种声纹识别处理方法,所述方法包括:若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;获取被识别者提供的包含所述第一随机码的语音;通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及使用所述第一随机码对所述第二随机码 进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。In a first aspect, an embodiment of the present application provides a voiceprint recognition processing method. The method includes: if an instruction to recognize a voiceprint is obtained, outputting prompt information including a first random code, where the prompt information refers to The information provided by the identified person should contain information to prompt; obtain the speech provided by the identified person containing the first random code; convert the speech to text through speech recognition, and extract the text containing A second random code; and verifying the second random code using the first random code; if the second random code is successfully verified, performing voiceprint recognition to obtain a voiceprint recognition result.
第二方面,本申请实施例还提供了一种声纹识别处理装置,其中,所述装置包括:输出单元,用于若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;获取单元,用于获取被识别者提供的包含所述第一随机码的语音;提取单元,用于通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及校验单元,用于使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。In a second aspect, an embodiment of the present application further provides a voiceprint recognition processing device, wherein the device includes: an output unit configured to output prompt information including a first random code if an instruction to recognize the voiceprint is obtained, so that The prompt information refers to information for prompting the content provided by the identified person's voice; the acquisition unit is used to acquire the voice provided by the identified person containing the first random code; the extraction unit is used to pass Speech recognition, converting the speech into text, and extracting a second random code included in the text; and a verification unit for verifying the second random code using the first random code, if The second random code is successfully verified, and voiceprint recognition is performed to obtain a voiceprint recognition result.
第三方面,本申请实施例还提供了一种电子设备,其包括存储器及处理器,所述存储器上存储有计算机程序,所述处理器执行所述计算机程序时实现上述声纹识别处理方法。In a third aspect, an embodiment of the present application further provides an electronic device including a memory and a processor. The memory stores a computer program, and the processor implements the voiceprint recognition processing method when the processor executes the computer program.
第四方面,本申请实施例还提供了一种计算机可读存储介质,所述存储介质存储有计算机程序,所述计算机程序包括程序指令,所述程序指令当被处理器执行时可实现上述声纹识别处理方法。According to a fourth aspect, an embodiment of the present application further provides a computer-readable storage medium. The storage medium stores a computer program, where the computer program includes program instructions, and the program instructions can implement the foregoing sound when executed by a processor. Pattern recognition processing method.
本申请实施例提供了一种声纹识别处理方法、装置、电子设备及存储介质。本申请实施例通过声纹识别设备输出包含第一随机码的提示信息,进而获取所述被识别者提供的语音,从所述语音中获取所述第二随机码,将所述第一随机码和所述第二随机码进行匹配,若所述第二随机码校验成功,进而进行所述语音的声纹识别,得到声纹识别结果,可以保证所述语音提供者是活体,从而提高声纹识别的安全性。Embodiments of the present application provide a voiceprint recognition processing method, device, electronic device, and storage medium. In the embodiment of the present application, the voiceprint recognition device outputs prompt information including a first random code, thereby obtaining a voice provided by the identified person, obtaining the second random code from the voice, and converting the first random code. Match the second random code, and if the second random code is successfully verified, further perform voiceprint recognition of the voice to obtain a voiceprint recognition result, which can ensure that the voice provider is a living body, thereby improving voice Pattern recognition security.
附图说明BRIEF DESCRIPTION OF THE DRAWINGS
为了更清楚地说明本申请实施例技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其他的附图。In order to explain the technical solutions of the embodiments of the present application more clearly, the drawings used in the description of the embodiments are briefly introduced below. Obviously, the drawings in the following description are some embodiments of the present application. For ordinary technicians, other drawings can be obtained based on these drawings without paying creative work.
图1为本申请实施例提供的声纹识别处理方法的应用场景示意图;FIG. 1 is a schematic diagram of an application scenario of a voiceprint recognition processing method according to an embodiment of the present application; FIG.
图2为本申请实施例提供的声纹识别处理方法的流程示意图;2 is a schematic flowchart of a voiceprint recognition and processing method according to an embodiment of the present application;
图3为本申请另一个实施例提供的声纹识别处理方法的流程示意图;3 is a schematic flowchart of a voiceprint recognition processing method according to another embodiment of the present application;
图4为本申请实施例提供的声纹识别处理装置的示意性框图;FIG. 4 is a schematic block diagram of a voiceprint recognition processing apparatus according to an embodiment of the present application; FIG.
图5为本申请实施例提供的电子设备的示意性框图。FIG. 5 is a schematic block diagram of an electronic device according to an embodiment of the present application.
具体实施方式detailed description
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例是本申请一部分实施例,而不是全部的实施例。基于本申请中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本申请保护的范围。In the following, the technical solutions in the embodiments of the present application will be clearly and completely described with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by a person of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.
请参阅图1,图1为本申请实施例提供的声纹识别处理方法的应用场景示意图。所述应用场景包括:Please refer to FIG. 1. FIG. 1 is a schematic diagram of an application scenario of a voiceprint recognition processing method according to an embodiment of the present application. The application scenario includes:
(1)用户,指被识别者,也就是通过声纹识别设备进行声纹识别的人。(1) User refers to the person being identified, that is, the person performing voiceprint recognition through the voiceprint recognition device.
(2)声纹识别设备,一种电子设备,可以为一种终端设备,通过获取被识别者提供的语音,进行声纹识别,声纹识别设备可以通过声纹识别设备自身包含的麦克风组件获取被识别者发出的语音。(2) Voiceprint recognition device, an electronic device, can be a terminal device. Voiceprint recognition is performed by acquiring the voice provided by the identified person. The voiceprint recognition device can be obtained through the microphone component included in the voiceprint recognition device itself. The voice of the identified person.
图1中的各个主体工作过程如下:声纹识别设备若获得识别声纹的指令,输出提示信息,被识别者根据提示信息发出一段语音,声纹识别设备根据获取的语音,通过声纹识别,获得声纹识别结果,进行被识别者身份辨识和身份确认。The working process of each subject in Figure 1 is as follows: if the voiceprint recognition device obtains an instruction to recognize the voiceprint, it outputs a prompt message, and the identified person sends out a voice according to the prompt information. The voiceprint recognition result is obtained, and the identified person is identified and identified.
需要说明的是,图1中仅仅示意出一个被识别者,在实际操作过程中,被识别者可以有多个,同时,声纹识别设备可以为单独的电子设备终端,也可以是其他电子设备的部件、组件或者包含于其他电子设备的功能单元,比如是智能终端设备的功能单元,以完成声纹识别的全部或者部分功能,上述声纹识别处理方法的应用场景仅仅用于说明本申请技术方案,并不用于限定本申请技术方案。It should be noted that FIG. 1 only shows one identified person. In the actual operation process, there can be multiple identified persons. At the same time, the voiceprint recognition device can be a separate electronic device terminal or other electronic devices. Components, components or functional units included in other electronic devices, such as the functional units of smart terminal devices, to complete all or part of the functions of voiceprint recognition. The application scenario of the above voiceprint recognition processing method is only used to illustrate the technology of this application The solution is not intended to limit the technical solution of the present application.
图2为本申请实施例提供的声纹识别处理方法的示意性流程图。该声纹识别处理方法应用于图1中的声纹识别设备中,所述声纹识别设备可以为单独的电子设备终端,也可以是其他电子设备的部件或者组件,以完成声纹识别的全 部或者部分功能。FIG. 2 is a schematic flowchart of a voiceprint recognition processing method according to an embodiment of the present application. The voiceprint recognition processing method is applied to the voiceprint recognition device in FIG. 1. The voiceprint recognition device may be a separate electronic device terminal, or may be a component or component of another electronic device to complete all of the voiceprint recognition. Or some features.
图2是本申请实施例提供的声纹识别处理方法的流程示意图。如图2所示,该方法包括以下步骤S210-S240:FIG. 2 is a schematic flowchart of a voiceprint recognition processing method according to an embodiment of the present application. As shown in Figure 2, the method includes the following steps S210-S240:
S210、若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息。S210. If an instruction to recognize a voiceprint is obtained, prompt information including a first random code is output, and the prompt information refers to information used to prompt the content provided by the identified person's voice.
具体地,声纹识别,是根据说话人的声波特性对说话人的身份进行身份辨识,识别或确认发出语音的说话人的身份,声纹识别处理广泛应用于需要对人的身份进行确认的智慧建筑、智能家居、金融安全等领域。Specifically, voiceprint recognition refers to the identification of the speaker's identity based on the characteristics of the speaker's sound waves, and to identify or confirm the identity of the speaker who issued the voice. Voiceprint recognition processing is widely used in wisdom that requires confirmation of the identity of the person. Construction, smart home, financial security and other fields.
其中,随机码是声纹识别设备随机产生的一串预设位数的字符。随机码可以包含数字、文字、字母及上述形式的组合等形式,比如,随机码可以为“6589”、“科技”、“jym”或者“6589jym”等。通过在被识别者提供的语音中设置随机码,可以实现声纹识别处理时保证是活体提供的语音,避免通过录音的形式进行对人员的识别,提高声纹识别处理时验证的安全性。需要说明的是,第一随机码也是随机码,只是为了区分不同的随机码而对随机码做的区别。The random code is a series of characters with a predetermined number of bits randomly generated by the voiceprint recognition device. The random code can include numbers, characters, letters, and combinations of the above forms. For example, the random code can be "6589", "technology", "jym", or "6589jym". By setting a random code in the voice provided by the identified person, the voice provided by the living body can be ensured during voiceprint recognition processing, the identification of the person by recording is avoided, and the security of verification during voiceprint recognition processing is improved. It should be noted that the first random code is also a random code, and is only a difference between the random codes in order to distinguish different random codes.
所述提示信息,是指用于对被识别者提供的语音应包含的内容作出提示的信息,用来对用户提供的语音应包含的内容做出明确提示,比如,所示提示信息可以包含随机码,还可以包含进行声纹识别的环境信息,比如被识别者的住址和楼层信息等,或者被识别者的公司地址和公司名称等,用于判断被识别者身份和能够缩小在数据库中进行数据匹配范围。所述提示信息还可以包括一段随机文字,供被识别者根据提示的随机文字提供语音,进一步通过提供的随机文字确保声纹识别的语音提供者是活体,进一步保证声纹识别的安全性。The prompt information refers to information for prompting the content that should be included in the voice provided by the identified person, and is used to make a clear prompt for the content that should be included in the voice provided by the user. For example, the prompt information shown may include random information. The code can also include environmental information for voiceprint recognition, such as the address and floor information of the identified person, or the company address and company name of the identified person, which are used to determine the identity of the identified person and can be narrowed down in the database. Data matching range. The prompt information may further include a segment of random text for the identified person to provide voice according to the prompt random text. Further, the provided random text ensures that the voiceprint recognition voice provider is a living body, further ensuring the security of voiceprint recognition.
具体地,声纹识别可以有效应用于建筑公共区域的安全访问权限管控,比如,应用于住宅区或者办公区的访问权限管控。但传统的声纹识别中,通常采用的方式是,获取一段语音,根据获取的语音,在数据库中进行声纹的匹配,从而对被识别者身份进行验证。但此种方式无法确定获取的语音是活体提供的还是通过录音介质获取的录音,因此,声纹识别存在安全性问题。在本申请实施例中,若声纹识别设备获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息用于对被识别者提供的语音应包含的内容作出提示。比如当 人员在住宅区或者办公区入口处时,若声纹识别设备通过红外检测,检测到人体启动声纹识别,或者通过门禁按键启动声纹识别。若声纹识别设备启动声纹识别,进行人员辨识时,声纹识别设备输出包含所述第一随机码的提示信息,使被识别者提供一段包含所述第一随机码的语音,声纹识别设备通过获取被识别者提供的语音,通过声纹识别处理技术分析出第一随机码的内容,通过对所述第一随机码的验证,实现对声纹识别处理中的活体检测,防止录音仿冒人员进行声纹识别,从而达到活体检测的目的,保证声纹识别处理的安全性,从而进一步根据被识别者的声纹特征对被识别者的身份进行辨识。Specifically, the voiceprint recognition can be effectively applied to the security access control of public areas of a building, for example, the access control of a residential area or an office area. However, in traditional voiceprint recognition, the usual method is to obtain a voice, and then match the voiceprint in the database based on the acquired voice to verify the identity of the identified person. However, this method cannot determine whether the acquired voice is provided by a living body or a recording obtained through a recording medium. Therefore, voiceprint recognition has a security problem. In the embodiment of the present application, if the voiceprint recognition device obtains an instruction to recognize a voiceprint, it outputs prompt information including a first random code, and the prompt information is used to prompt the content that the voice provided by the identified person should contain. For example, when a person is at the entrance of a residential area or an office area, if the voiceprint recognition device detects the human body to activate voiceprint recognition through infrared detection, or activate the voiceprint recognition through the access control button. If the voiceprint recognition device activates voiceprint recognition, when performing personnel recognition, the voiceprint recognition device outputs prompt information including the first random code, so that the identified person provides a voice including the first random code, and the voiceprint recognition The device obtains the voice provided by the identified person, analyzes the content of the first random code through the voiceprint recognition processing technology, and verifies the first random code to realize the live detection in the voiceprint recognition processing to prevent recording counterfeiting. Personnel perform voiceprint recognition to achieve the purpose of living body detection, ensure the security of voiceprint recognition processing, and further identify the identity of the identified person based on the voiceprint characteristics of the identified person.
同样的,若用户在一些终端上进行账号登录时,也可以通过声纹识别处理对登录账号的人员进行身份验证,当终端进入人员识别界面时,终端也可以输出包含第一随机码的提示信息,对登录账户的人员身份进行声纹识别处理的验证。Similarly, when a user logs in to an account on some terminals, the user who logs in to the account can also be authenticated through voiceprint recognition processing. When the terminal enters the personnel identification interface, the terminal can also output a prompt message containing a first random code. , Verify the voiceprint recognition process for the identity of the person who logged in to the account.
进一步地,声纹识别设备输出包含所述第一随机码的提示信息,所述提示信息可以是声纹识别设备以文字形式在声纹识别设备的显示界面上显示的提示信息,或者是声纹识别设备以语音播报形式提示用户应提供的包含于语音的提示信息,还可以是同时以文字形式显示和语音播报的形式同时提示。Further, the voiceprint recognition device outputs prompt information including the first random code, and the prompt information may be the prompt information displayed by the voiceprint recognition device in a text form on the display interface of the voiceprint recognition device, or the voiceprint The recognition device prompts the user to provide the prompt information included in the voice in the form of a voice broadcast, and may also simultaneously display in the form of a text display and a voice broadcast.
S220、获取被识别者提供的包含所述第一随机码的语音。S220. Acquire the voice provided by the identified person and including the first random code.
具体地,声纹识别设备提示声纹识别需要的语音应包含的所述第一随机码的内容后,被识别者根据声纹识别设备输出的提示信息,提供一段包含第一随机码的语音,比如,所述被识别者通过读出包含所述第一随机码“6589”、“科技”、“jym”或者“6589jym”等,以供所述声纹识别设备进行声纹识别时验证所述语音提供者是活体,声纹识别设备通过麦克风等组件获取被识别者提供的语音。Specifically, after the voiceprint recognition device prompts the content of the first random code to be included in the voice required for voiceprint recognition, the identified person provides a voice including the first random code according to the prompt information output by the voiceprint recognition device, For example, the identified person reads the first random code "6589", "technology", "jym" or "6589jym", etc., for verification by the voiceprint recognition device when the voiceprint recognition device performs the voiceprint recognition. The voice provider is a living body, and the voiceprint recognition device obtains the voice provided by the recognized person through a component such as a microphone.
S230、通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码。S230: Convert the voice into text through speech recognition, and extract a second random code included in the text.
其中,语音识别技术,也被称为自动语音识别,英文为Automatic Speech Recognition,简写为ASR,其目标是将人类的语音中的词汇内容转换为计算机可读的输入。Among them, speech recognition technology, also known as automatic speech recognition, English is Automatic Speech Recognition, abbreviated as ASR, and its goal is to convert vocabulary content in human speech into computer-readable input.
具体地,声纹识别设备获取被识别者提供的语音后,通过ASR语音识别技术,将语音转换为文字,将转换后的文字进行分割,根据步骤S220中所述提示信息中限定的所述第一随机码的信息,所述第一随机码的信息包括所述第一随机码的位数、内容及所述第一随机码在所述语音中的位置等,从转换后的文字中提取出第二随机码。比如,若步骤S220中包含的所述第一随机码的信息包括两位随机码在语音的前部,则在转换后的文字中,取前两位为所述第二随机码。Specifically, after the voiceprint recognition device obtains the voice provided by the identified person, it uses ASR voice recognition technology to convert the voice into text and segment the converted text, according to the first paragraph defined in the prompt information in step S220. Information of a random code, the information of the first random code including the number of bits, content of the first random code, and the position of the first random code in the voice, etc. are extracted from the converted text The second random code. For example, if the information of the first random code included in step S220 includes a two-digit random code at the front of the speech, then in the converted text, the first two digits are taken as the second random code.
S240、使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。S240. Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
具体地,声纹识别设备通过对所述第一随机码和所述第二随机码的比较,实现对所述第二随机码的校验,进而判断语音提供的被识别者是否是活体。若所述第一随机码和所述第二随机码相同,则表示语音提供的被识别者是活体,否则,表示语音提供的被识别者有可能是录音提供的,是不安全的。Specifically, the voiceprint recognition device implements verification of the second random code by comparing the first random code with the second random code, and then determines whether the identified person provided by the voice is a living body. If the first random code and the second random code are the same, it means that the identified person provided by the voice is alive, otherwise, it indicates that the identified person provided by the voice may be provided by a recording, which is not safe.
若声纹识别设备通过语音识别技术获取的文字中包含的所述第二随机码与所述声纹识别设备输出的所述第一随机码不相同,所述第二随机码检验失败,跳出现有流程,声纹识别设备提示身份识别失败。进一步地,可以重新输出包含所述第一随机码的提示信息。否则,若通过语音识别技术获取的文字中包含的所述第二随机码与声纹识别设备输出的所述第一随机码相同,所述第二随机码检验成功,则根据所述语音进行声纹识别处理,查询找出这段语音信息所对应的人员/人员库的声音,再进行一比一或者一比多个的声纹对比,通过声纹识别,得到声纹识别结果,进行用户身份验证,从而保证声纹识别处理的安全性。If the second random code contained in the text obtained by the voiceprint recognition device through the voice recognition technology is different from the first random code output by the voiceprint recognition device, the second random code check fails, and a jump occurs There is a process, and the voiceprint recognition device prompts that the identification fails. Further, the prompt information including the first random code may be output again. Otherwise, if the second random code contained in the text obtained by the voice recognition technology is the same as the first random code output by the voiceprint recognition device, and the second random code is successfully checked, the voice is performed according to the voice. Pattern recognition processing, query to find out the voice of the person / person bank corresponding to this voice information, and then compare the voiceprints one to one or more than one. Through voiceprint recognition, get the voiceprint recognition results and perform user identification. Verification to ensure the security of voiceprint recognition processing.
本申请实施例通过声纹识别设备输出包含第一随机码的提示信息,进而获取所述被识别者提供的语音,从所述语音中获取所述第二随机码,将所述第一随机码和所述第二随机码进行匹配,若所述第二随机码校验成功,进而进行所述语音的声纹识别,获得声纹识别结果,可以保证所述语音提供者是活体,从而提高声纹识别的安全性。In the embodiment of the present application, the voiceprint recognition device outputs prompt information including a first random code, thereby obtaining a voice provided by the identified person, obtaining the second random code from the voice, and converting the first random code. Match with the second random code, if the second random code is successfully verified, and then perform voiceprint recognition of the voice to obtain a voiceprint recognition result, it can ensure that the voice provider is a living body, thereby improving the voice Pattern recognition security.
进一步地,若某一场景下有N个人,比如一个小区中居住有N人,或者一栋里住有N人,进行声纹识别时,数据库中有N个声纹模型数据需要进行匹配,若N的数值比较大,则声纹识别需要进行声纹模型的逐一比对,声纹识别处理 过程中需要进行处理的声纹匹配数量较大,导致声纹识别处理的识别率较低。请参阅图3,图3为本申请另一个实施例提供的声纹识别处理方法的流程示意图,所述声纹识别处理方法包括以下步骤S310-S340:Further, if there are N people in a scene, such as N people living in a community, or N people living in a building, for voiceprint recognition, there are N voiceprint model data in the database that need to be matched. The larger the value of N is, the voiceprint recognition needs to compare voiceprint models one by one. The number of voiceprint matching processes that need to be processed during voiceprint recognition processing is large, resulting in a low recognition rate of voiceprint recognition processing. Please refer to FIG. 3. FIG. 3 is a schematic flowchart of a voiceprint recognition processing method according to another embodiment of the present application. The voiceprint recognition processing method includes the following steps S310-S340:
S310、若获得识别声纹的指令,输出包含第一随机码和预设内容的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息。S310. If an instruction to recognize a voiceprint is obtained, prompt information including a first random code and preset content is output, where the prompt information refers to information used to prompt the content provided by the recognized voice.
其中,预设内容是用户在声纹识别设备注册时提供的预先设定的语音内容。所述预设内容可以与声纹识别的应用环境有关,所述应用环境包括声纹识别时的位置或者名称等。用户可以根据不同的应用环境,设置不同的预设内容。比如,在住宅区,预设内容可以为“我住在xxx栋xxx房间”,而在办公区域,所述预设内容可以为“我公司的地址是xxx”或者“我公司的名字是xxx”等。The preset content is preset voice content provided by the user when the voiceprint recognition device is registered. The preset content may be related to an application environment of voiceprint recognition, and the application environment includes a position or a name during voiceprint recognition. Users can set different preset contents according to different application environments. For example, in a residential area, the preset content may be "I live in xxx room xxx", while in an office area, the preset content may be "my company's address is xxx" or "my company's name is xxx" Wait.
具体地,声纹识别设备若获得识别声纹的指令,输出包含第一随机码和预设内容的提示信息。其中,为了确保用户隐私,声纹识别设备在声纹识别设备的屏幕通过显示界面以文字形式显示提示信息,或者通过以语音播报的方式输出所述提示信息时,所述预设内容采用隐语指代的形式,所述隐语指代是指所述预设内容包含的具体内容未被明确提示,所述预设内容不明确和具体的表述出来。比如,声纹识别设备提示用户输入包含“第一随机码+我的住址”的语音,或者声纹识别设备提示用户输入包含“随机码+我公司的地址”或者“第一随机码+我公司名字”的语音等形式,而不提示“我的住址”、“我公司的地址”或者“我公司的名字”具体是什么。Specifically, if the voiceprint recognition device obtains an instruction to recognize a voiceprint, it outputs prompt information including a first random code and preset content. Wherein, in order to ensure user privacy, the voiceprint recognition device displays prompt information in a text form on a screen of the voiceprint recognition device through a display interface, or when the prompt information is output by a voice broadcast, the preset content uses a linguistic designation In the form of generation, the metaphor refers to that the specific content contained in the preset content is not explicitly prompted, and the preset content is not clear and specific. For example, the voiceprint recognition device prompts the user to enter a voice containing "first random code + my address", or the voiceprint recognition device prompts the user to enter a voice containing "random code + my company's address" or "first random code + my company" Name "in the form of voice, etc., without prompting" My address "," My company's address "or" My company's name ".
所述预设内容不但可以限定被识别者在不同使用环境中的身份识别,而且声纹识别设备在进行声纹识别时还可以通过所述预设内容缩小声纹识别时数据匹配的范围。比如,在住宅区,若所述预设内容为“我住在xx栋xx房间”,声纹识别设备输出的提示信息为“随机码+我的住址”,声纹识别设备进行声纹识别时,若检测到获取的语音内容中包含“我住在xx栋xx房间”,在数据库中进行数据匹配时,可以将声纹识别时匹配的数据范围缩小到包含“xx栋”关键词的数据范围内,而不用将整个数据库中的数据逐一进行声纹识别处理时的匹配,从而提高声纹识别处理的效率。若所述声纹识别设备在数据库中检测到与获取 的语音信息中包含的声纹匹配的声纹数据,通过被识别者的身份验证,否则,不通过被识别者的身份验证。通过所述第一随机码结合预设内容的提示信息作为声纹识别处理的语音内容,不但可以通过随机码的验证,确保声纹识别处理是活体提供的语音,而且可以通过所述预设内容包含的用户信息,保证声纹识别处理时的安全性,且能够缩小声纹识别处理行时声纹数据容匹配的范围,提高声纹识别处理的效率。The preset content can not only limit the identification of the identified person in different use environments, but also the voiceprint recognition device can reduce the range of data matching during voiceprint recognition through the preset content when performing voiceprint recognition. For example, in a residential area, if the preset content is "I live in xx building xx room", the prompt message output by the voiceprint recognition device is "random code + my address", and when the voiceprint recognition device performs voiceprint recognition If it is detected that the acquired voice content contains "I live in xx room xx room", when performing data matching in the database, the data range matched during voiceprint recognition can be narrowed down to the data range containing the "xx building" keyword It is not necessary to match the data in the entire database one by one when performing voiceprint recognition processing, thereby improving the efficiency of voiceprint recognition processing. If the voiceprint recognition device detects voiceprint data in the database that matches the voiceprint contained in the acquired voice information, it passes the identity verification of the identified person; otherwise, it does not pass the identity verification of the identified person. By using the first random code and the prompt information of the preset content as the voice content of the voiceprint recognition process, not only can the verification of the random code ensure that the voiceprint recognition process is the voice provided by the living body, but also can pass the preset content. The included user information guarantees security during voiceprint recognition processing, and can reduce the range of voiceprint data capacity matching during voiceprint recognition processing, improving the efficiency of voiceprint recognition processing.
在一个实施例中,所述第一随机码和所述预设内容的顺序在所述语音中被限定。In one embodiment, the order of the first random code and the preset content is defined in the voice.
具体地,声纹识别设备输出包含所述第一随机码和预设内容的提示信息,所述第一随机码和预设内容的顺序在所述语音中被限定。比如,声纹识别设备要求被识别者提供的语音内容的顺序可以是“第一随机码+预设内容”,或者“预设内容+第一随机码”,或者上述两种顺序循环,或者上述两种顺序随机输出。通过所述提示信息中包含的所述第一随机码和预设内容的不同顺序的限定,后续步骤中根据每一次所述第一随机码和预设内容在所述语音中的位置,从语音识别中获取所述语音中包含的所述第一随机码和预设内容,进而利用所述第一随机码和所述预设内容进行声纹识别处理验证,实现语音提供的不可预测性。声纹识别设备在进行声纹识别时,可以进行对语音内容顺序的检测,进一步确保语音是活体提供的实时语音,保证声纹识别的安全性。Specifically, the voiceprint recognition device outputs prompt information including the first random code and preset content, and an order of the first random code and preset content is limited in the voice. For example, the sequence of voice content required by the voiceprint recognition device to be provided by the identified person may be "first random code + preset content", or "preset content + first random code", or both of the above-mentioned sequential cycles, or the above Two orders are output randomly. By limiting the different order of the first random code and the preset content included in the prompt information, in subsequent steps, according to each time the position of the first random code and the preset content in the voice, The recognition obtains the first random code and preset content included in the voice, and further uses the first random code and the preset content to perform voiceprint recognition processing verification, thereby achieving unpredictability provided by the voice. When the voiceprint recognition device performs voiceprint recognition, it can detect the sequence of voice content to further ensure that the voice is a real-time voice provided by the living body, and ensure the security of voiceprint recognition.
进一步地,声纹识别设备输出包含所述第一随机码和预设内容的提示信息,可以是以文字形式在声纹识别设备显示屏的显示界面上显示提示信息,比如“随机码+注册时预设的内容”,或者是声纹识别设备以语音播报的形式提示用户应提供的语音包含的内容。Further, the voiceprint recognition device outputs prompt information including the first random code and preset content, and the prompt information may be displayed in text form on a display interface of the display screen of the voiceprint recognition device, such as "random code + during registration" "Preset content", or the voiceprint recognition device prompts the user with the content of the voice that should be provided in the form of a voice announcement.
S320、获取被识别者提供的包含所述第一随机码和所述预设内容的语音。S320. Acquire a voice provided by the identified person and including the first random code and the preset content.
终端设备获取被识别者根据特定的使用场景,提供的一段包含所述第一随机码和预设内容的语音。The terminal device obtains a segment of speech provided by the identified person according to a specific use scenario, which includes the first random code and preset content.
具体地,声纹识别设备提示声纹识别需要的语音应包含的提示信息后,被识别者根据使用场景的不同,提供一段包含所述第一随机码和所述预设内容的语音,比如在住宅区,可以提供包含“随机码和注册时提供的住宅地址”的预 设内容,在办公区,可以提供包含“随机码和注册时提供的办公区的地址或者办公区的名字”的预设内容,以供声纹识别设备进行声纹识别处理。声纹识别设备获取被识别者根据当前的使用场景,提供的一段包含所述第一随机码和预设内容的语音。Specifically, after the voiceprint recognition device prompts the prompt information that the voiceprint recognition requires for voiceprint recognition, the identified person provides a voice including the first random code and the preset content according to different usage scenarios, such as in In residential areas, presets including "random code and residential address provided during registration" can be provided, and in office areas, presets including "random code and office address or registered office name provided during registration" can be provided Content for voiceprint recognition processing by the voiceprint recognition device. The voiceprint recognition device obtains a segment of speech provided by the identified person according to the current usage scenario, including the first random code and preset content.
S330、通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码和所述预设内容。S330. Convert the voice into text through speech recognition, and extract a second random code and the preset content included in the text.
具体地,声纹识别设备获取被识别者提供的语音后,通过ASR语音识别技术,将语音转换为文字,将转换后的文字进行分割,根据步骤S330中提示信息中包含的所述第一随机码和所述预设内容在所述语音中的位置、内容及所述第二随机码的位数,从转换后的文字中提取出所述第二随机码和所述预设内容包含的信息,通过从获取的语音中提取的所述第二随机码和声纹识别设备提示的所述第一随机码进行比较,根据从获取的语音中提取的用户信息在数据库中进行相应数据的匹配,以实现通过声纹识别对用户身份的验证。比如,若采取的是两位所述第一随机码在所述预设内容前面的形式,则在转换后的文字中,取前两位为所述第二随机码。若所述预设内容为“我住在xx栋xx房间”,则声纹识别设备在数据库中与涉及xx栋的声纹模型进行匹配,从而缩减声纹识别时的声纹模型匹配范围,提高声纹识别的效率。Specifically, after the voiceprint recognition device obtains the speech provided by the identified person, the speech is converted into text by ASR speech recognition technology, and the converted text is segmented according to the first random included in the prompt information in step S330. The position and content of the code and the preset content in the voice, and the number of digits of the second random code, and extract the information contained in the second random code and the preset content from the converted text By comparing the second random code extracted from the acquired voice with the first random code prompted by the voiceprint recognition device, and matching the corresponding data in a database according to the user information extracted from the acquired voice, In order to realize the identification of the user's identity through voiceprint recognition. For example, if the first random code of two bits is in front of the preset content, the first two bits of the converted text are taken as the second random code. If the preset content is "I live in xx room xx room", the voiceprint recognition device matches the voiceprint model in the database with xx building, thereby reducing the range of voiceprint model matching during voiceprint recognition and improving The efficiency of voiceprint recognition.
S340、使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。S340. Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition according to the preset content to obtain a voiceprint recognition result.
具体地,声纹识别设备根据所述提示信息包含的所述第一随机码和预设内容的语音顺序,及通过语音识别,将所述语音转换为文字后分割获取的所述第二随机码与所述第一随机码进行比较。若通过语音识别获取的文字中包含的所述第二随机码与声纹识别设备输出的所述第一随机码不相同,所述第二随机码检验失败,跳出现有流程,声纹识别设备提示身份识别失败,进一步,可以重新输出包含所述第一随机码和所述预设内容的提示信息。否则,若通过语音识别获取的文字中包含的所述第二随机码与声纹识别设备输出的所述第一随机码相同,所述第二随机码检验成功,则根据所述语音提供的预设内容包含的用户信息,在所述用户信息对应的数据范围内进行声纹识别的匹配,查询找出这段 语音信息所对应的人员/人员库的声音,再进行一比一或者一比多个的声纹对比,通过声纹识别进行用户身份验证。比如,在住宅区,若所述预设内容为“我住在xx栋xx房间”,声纹识别设备输出的提示信息为“随机码+我的住址”,声纹识别设备进行声纹识别时,若检测到获取的语音内容中包含“我住在xx栋xx房间”,在数据库中进行数据匹配时,可以将声纹识别时匹配的数据范围缩小到包含“xx栋”关键词的数据范围内,或者将声纹识别时匹配的数据范围缩小到“xx栋xx房间”的家庭成员之间,而不用将整个数据库中的数据逐一进行声纹识别处理时的匹配,从而提高声纹识别处理的效率。由于根据用户信息确定的这段语音信息所对应的人员/人员库的声音,相对声纹识别设备的数据库中存储的声纹识别处理的数据已经缩小,从而极大的减小声纹识别处理中的比对量,提高声纹识别处理的效率和准确性。Specifically, the voiceprint recognition device according to the first random code and the voice sequence of the preset content included in the prompt information, and the second random code obtained by segmenting the voice into text by voice recognition And comparing with the first random code. If the second random code contained in the text obtained by speech recognition is different from the first random code output by the voiceprint recognition device, the second random code check fails, and there is a flow, the voiceprint recognition device It is prompted that the identification fails, and further, the prompt information including the first random code and the preset content may be output again. Otherwise, if the second random code included in the text obtained through speech recognition is the same as the first random code output by the voiceprint recognition device, and the second random code is successfully checked, then the Set the user information contained in the content, perform voiceprint recognition matching within the data range corresponding to the user information, query to find the sound of the person / person library corresponding to this voice information, and then perform one-to-one or one-to-many Comparison of individual voiceprints, user authentication is performed through voiceprint recognition. For example, in a residential area, if the preset content is "I live in xx building xx room", the prompt message output by the voiceprint recognition device is "random code + my address", and when the voiceprint recognition device performs voiceprint recognition If it is detected that the acquired voice content contains "I live in xx room xx room", when performing data matching in the database, the data range matched during voiceprint recognition can be narrowed down to the data range containing the "xx building" keyword Or reduce the range of data for voiceprint recognition to family members in the “xx building xx room”, instead of matching the data in the entire database one by one for voiceprint recognition processing, thereby improving voiceprint recognition processing s efficiency. Because the voice of the person / person bank corresponding to this piece of voice information determined according to the user information, the voiceprint recognition processing data stored in the database of the voiceprint recognition device has been reduced, thereby greatly reducing the voiceprint recognition processing. The comparison amount improves the efficiency and accuracy of voiceprint recognition processing.
在一个实施例中,所述若获得识别声纹的指令,输出包含第一随机码的提示信息的步骤还包括:所述提示信息中包含所述第一随机码在语音中的位置。In one embodiment, if the instruction for identifying the voiceprint is obtained, the step of outputting prompt information including the first random code further includes: the prompt information includes the position of the first random code in the voice.
具体地,所述提示信息中包含所述第一随机码在语音中的位置,是指在提示信息中提示被识别者所述第一随机码在语音中的顺序,在所述语音中预先设定所述第一随机码的位置。在所述语音中预先设定所述第一随机码的位置,是指所述第一随机码在所述语音中的位置被预先限定,比如,被识别者首先说出所述第一随机码,所述第一随机码在所述语音中的位置在所述语音的首部,被识别者最后说出所述第一随机码,所述第一随机码在所述语音中的位置在所述语音的尾部。后续步骤中根据所述第一随机码在所述语音中的位置获取所述语音中包含的所述第一随机码,进而对所述第一随机码进行验证。此种情形下,声纹识别设备只针对所述语音中的所述第一随机码进行检测,所述语音中包括的其他语音内容不加考虑。此时,限定所述第一随机码在所述语音中的位置,比如,所述第一随机码在所述语音的前部,或者所述第一随机码在所述语音的尾部等,则根据所述第一随机码的位数取所述语音转换后的文字中的前几位或者后几位。Specifically, the prompt information includes the position of the first random code in the voice, which refers to the order in which the first random code of the first random code in the voice is prompted in the prompt message, and is preset in the voice. Determining the position of the first random code. Pre-setting the position of the first random code in the voice means that the position of the first random code in the voice is predefined, for example, the identified person first speaks the first random code The position of the first random code in the voice is at the head of the voice, and the identified person finally speaks the first random code, and the position of the first random code in the voice is in the voice The tail of the speech. In a subsequent step, the first random code included in the voice is obtained according to the position of the first random code in the voice, and the first random code is further verified. In this case, the voiceprint recognition device detects only the first random code in the voice, and other voice content included in the voice is not considered. At this time, the position of the first random code in the voice is limited. For example, the first random code is in the front of the voice, or the first random code is in the tail of the voice. Taking the first few digits or the last few digits of the speech-transformed text according to the number of bits of the first random code.
进一步地,所述提示信息中包含所述第一随机码在语音中的位置在每一次声纹识别处理时被随机限定。Further, the position of the first random code in the voice included in the prompt information is randomly defined during each voiceprint recognition process.
具体地,所述第一随机码在所述语音中的位置在每一次声纹识别处理时被随机限定,是指每一次声纹识别时,所述第一随机码在语音中的位置不固定,可以提示被识别者所述第一随机码在所述语音的前部,在所述语音的中部,或者在所述语音的尾部,在每一次声纹识别时,声纹识别设备随机限定所述第一随机码在所述语音中的位置,并存储该次所述第一随机码通过所述提示信息提示给所述被识别者在语音中的位置,后续步骤中根据每一次所述第一随机码在所述语音中的位置获取所述语音中包含的所述第二随机码,进而对所述第二随机码进行验证。比如,所述第一随机码在一次所述语音的前部,所述第一随机码在下一次所述语音的前部或者尾部等,通过所述第一随机码在所述语音中的位置被随机限定,可以实现对声纹识别处理更加灵活的安全验证。Specifically, the position of the first random code in the voice is randomly defined at each voiceprint recognition process, which means that the position of the first random code in the voice is not fixed at each voiceprint recognition process. , It can prompt the identified person that the first random code is in the front of the voice, in the middle of the voice, or in the tail of the voice. At each voiceprint recognition, the voiceprint recognition device randomly defines the The position of the first random code in the voice, and storing the position of the first random code to the identified person in the voice through the prompt information, and in the subsequent steps, according to each time the first A position of a random code in the voice acquires the second random code included in the voice, and further verifies the second random code. For example, the first random code is at the front of the speech once, and the first random code is at the front or tail of the next speech, etc., by the position of the first random code in the speech is Random limitation can realize more flexible security verification for voiceprint recognition processing.
在一个实施例中,所述提示信息中还包括语音是预设时间长度内的语音。In one embodiment, the prompt information further includes that the voice is a voice within a preset time length.
具体地,声纹识别设备要求被识别者提供一段预设时间长度内的语音,通过对所述语音的时间长度的限定,可以进一步保证声纹识别处理的安全性。所述预设时间长度,比如,可以是15秒内的语音,也可以是15秒到30秒之间的语音,通过所述语音的时间长度的设置,可以更精确的限定声纹识别处理的条件,提高声纹识别处理的安全性。由于所述语音的预设时间长度是后台预先设定的,其他人不会轻易得知,通过限定被识别者提供的语音长度,可以防止声纹识别处理被不断尝试,进一步保证声纹识别处理的安全性。Specifically, the voiceprint recognition device requires the identified person to provide a voice within a preset time length. By limiting the time length of the voice, the security of the voiceprint recognition processing can be further ensured. The preset time length can be, for example, a voice within 15 seconds, or a voice between 15 seconds and 30 seconds. By setting the time length of the voice, the voiceprint recognition processing can be more accurately limited. Conditions to improve the security of voiceprint recognition processing. Since the preset time length of the voice is preset in the background, others will not easily know that by limiting the length of the voice provided by the identified person, the voiceprint recognition process can be prevented from being continuously tried, and the voiceprint recognition process can be further guaranteed. Security.
在一个实施例中,所述语音的时间长度被随机限定,并提示给被识别者,要求被识别者提供预设时间长度内的语音,比如要求被识别者提供一段15秒以内的语音,或者要求被识别者提供一段20秒之内的语音等,可以通过语音的预设时间长度的随机限定,也可以实现对声纹识别处理中的活体检测,防止录音仿冒人员进行声纹识别。In one embodiment, the time length of the voice is randomly limited, and the identified person is prompted to require the identified person to provide the speech within a preset time length, such as requiring the identified person to provide a period of speech within 15 seconds, or The identified person is required to provide a voice within 20 seconds, which can be randomly limited by the preset time length of the voice, and can also realize the detection of the living body in the voiceprint recognition processing to prevent the voice recorder from performing voiceprint recognition.
需要说明的是,上述各个实施例所述的声纹识别处理方法,可以根据需要将不同方法中包含的技术特征重新进行组合,以获取组合后的实施方案,但都在本申请要求的保护范围之内。It should be noted that the voiceprint recognition and processing methods described in the foregoing embodiments may be combined with technical features included in different methods as needed to obtain a combined implementation solution, but all fall within the protection scope required by this application. within.
请参阅图4,对应于上述声纹识别处理方法,本申请实施例还提供一种声纹识别处理装置。图4是本申请实施例提供的一种声纹识别处理装置的示意性框 图。该声纹识别处理装置包括用于执行上述声纹识别处理方法的单元,该装置可以被配置于台式电脑、笔记本、智能手机等电子设备中。具体地,请参阅图4,该声纹识别处理装置包括输出单元401、获取单元402、提取单元403以及校验单元404。Referring to FIG. 4, corresponding to the voiceprint recognition processing method described above, an embodiment of the present application further provides a voiceprint recognition processing device. FIG. 4 is a schematic block diagram of a voiceprint recognition processing device according to an embodiment of the present application. The voiceprint recognition processing device includes a unit for performing the above-mentioned voiceprint recognition processing method, and the device may be configured in an electronic device such as a desktop computer, a notebook, or a smart phone. Specifically, referring to FIG. 4, the voiceprint recognition processing device includes an output unit 401, an obtaining unit 402, an extraction unit 403, and a verification unit 404.
其中,输出单元401,用于若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;The output unit 401 is configured to output prompt information including a first random code if an instruction to recognize a voiceprint is obtained, where the prompt information refers to information used to prompt the content provided by the recognized voice;
获取单元402,用于获取被识别者提供的包含所述第一随机码的语音;An obtaining unit 402, configured to obtain a voice provided by the identified person and including the first random code;
提取单元403,用于通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及An extraction unit 403, configured to convert the speech into text through speech recognition, and extract a second random code included in the text; and
校验单元404,用于使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。The verification unit 404 is configured to verify the second random code by using the first random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
在一个实施例中,所述提示信息中还包括预设内容,所述第一随机码和所述预设内容的顺序在所述语音中被限定;In one embodiment, the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
所述校验单元404,用于使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。The checking unit 404 is configured to check the second random code using the first random code, and if the second random code is successfully checked, perform voiceprint recognition according to the preset content, Get voiceprint recognition results.
在一个实施例中,所述输出单元401所输出的所述提示信息中包含所述第一随机码在语音中的位置和所述语音是预设时间长度内的语音。In one embodiment, the prompt information output by the output unit 401 includes a position of the first random code in the voice and the voice is a voice within a preset time length.
需要说明的是,所属领域的技术人员可以清楚地了解到,上述声纹识别处理装置400和各单元的具体实现过程,可以参考前述方法实施例中的相应描述,为了描述的方便和简洁,在此不再赘述。It should be noted that those skilled in the art can clearly understand that the specific implementation process of the voiceprint recognition processing device 400 and each unit can refer to the corresponding descriptions in the foregoing method embodiments. For convenience and concise description, This will not be repeated here.
同时,上述声纹识别处理装置中各个单元的划分和连接方式仅用于举例说明,在其他实施例中,可将声纹识别处理装置按照需要划分为不同的单元,也可将声纹识别处理装置中各单元采取不同的连接顺序和方式,以完成上述声纹识别处理装置的全部或部分功能。At the same time, the division and connection of each unit in the voiceprint recognition processing device is only for illustration. In other embodiments, the voiceprint recognition processing device can be divided into different units as required, and the voiceprint recognition processing can also be Each unit in the device adopts different connection sequences and methods to complete all or part of the functions of the voiceprint recognition processing device.
上述声纹识别处理装置可以实现为一种计算机程序的形式,该计算机程序可以在如图5所示的电子设备上运行。The above-mentioned voiceprint recognition processing device can be implemented in the form of a computer program, which can be run on an electronic device as shown in FIG. 5.
请参阅图5,图5是本申请实施例提供的一种电子设备的示意性框图。该电子设备500可以是终端,也可以是其他设备中的组件或者部件,其中,终端可以是台式电脑等具有通信功能的电子设备。Please refer to FIG. 5, which is a schematic block diagram of an electronic device according to an embodiment of the present application. The electronic device 500 may be a terminal, or a component or component in another device. The terminal may be an electronic device with a communication function, such as a desktop computer.
参阅图5,该电子设备500包括通过系统总线501连接的处理器502、存储器、网络接口505和音频输入接口506,其中,存储器可以包括非易失性存储介质503和内存储器504。Referring to FIG. 5, the electronic device 500 includes a processor 502, a memory, a network interface 505, and an audio input interface 506 connected through a system bus 501. The memory may include a non-volatile storage medium 503 and an internal memory 504.
该非易失性存储介质503可存储操作系统5031和计算机程序5032。该计算机程序5032包括程序指令,该程序指令被执行时,可使得处理器502执行一种上述声纹识别处理方法。The non-volatile storage medium 503 can store an operating system 5031 and a computer program 5032. The computer program 5032 includes program instructions. When the program instructions are executed, the processor 502 can execute one of the voiceprint recognition processing methods described above.
该处理器502用于提供计算和控制能力,以支撑整个电子设备500的运行。The processor 502 is used to provide computing and control capabilities to support the operation of the entire electronic device 500.
该内存储器504为非易失性存储介质503中的计算机程序5032的运行提供环境,该计算机程序5032被处理器502执行时,可使得处理器502执行一种上述声纹识别处理方法。The internal memory 504 provides an environment for running a computer program 5032 in the non-volatile storage medium 503. When the computer program 5032 is executed by the processor 502, the processor 502 can execute one of the voiceprint recognition processing methods described above.
该网络接口505用于与其它设备进行网络通信,该音频输入接口506用于获取被识别者提供的语音,所述音频输入接口506可以为话筒(麦克风)等。本领域技术人员可以理解,图5中示出的结构,仅仅是与本申请方案相关的部分结构的框图,并不构成对本申请方案所应用于其上的电子设备500的限定,具体的电子设备500可以包括比图中所示更多或更少的部件,或者组合某些部件,或者具有不同的部件布置。The network interface 505 is configured to perform network communication with other devices, the audio input interface 506 is configured to obtain a voice provided by an identified person, and the audio input interface 506 may be a microphone (microphone) or the like. Those skilled in the art can understand that the structure shown in FIG. 5 is only a block diagram of a part of the structure related to the solution of the present application, and does not constitute a limitation on the electronic device 500 to which the solution of the present application is applied. The specific electronic device 500 may include more or fewer components than shown in the figure, or combine certain components, or have a different component arrangement.
其中,所述处理器502用于运行存储在存储器中的计算机程序5032,以实现本申请实施例的声纹识别处理方法。The processor 502 is configured to run a computer program 5032 stored in a memory to implement the voiceprint recognition processing method in the embodiment of the present application.
应当理解,在本申请实施例中,处理器502可以是中央处理单元(Central Processing Unit,CPU),该处理器502还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。其中,通用处理器可以是微处理器或者该处理器也可以是任何常规的处理器等。It should be understood that, in the embodiment of the present application, the processor 502 may be a central processing unit (CPU), and the processor 502 may also be another general-purpose processor, a digital signal processor (Digital Signal Processor, DSP), Application-specific integrated circuits (Application Specific Integrated Circuits, ASICs), ready-made programmable gate arrays (Field-Programmable Gate Arrays, FPGAs) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. The general-purpose processor may be a microprocessor, or the processor may be any conventional processor.
本领域普通技术人员可以理解的是实现上述实施例的方法中的全部或部分 流程,是可以通过计算机程序来完成。该计算机程序可存储于一存储介质中,该存储介质为计算机可读存储介质。该计算机程序被该计算机系统中的至少一个处理器执行,以实现上述声纹识别处理方法的实施例的步骤。A person of ordinary skill in the art can understand that all or part of the processes in the methods of the foregoing embodiments can be implemented by a computer program. The computer program may be stored in a storage medium, and the storage medium is a computer-readable storage medium. The computer program is executed by at least one processor in the computer system to implement the steps of the embodiment of the voiceprint recognition processing method described above.
因此,本申请实施例还提供一种计算机可读存储介质。该存储介质存储有计算机程序,其中计算机程序被处理器执行时使处理器执行以上各实施例中所描述的声纹识别处理方法的步骤。Therefore, an embodiment of the present application further provides a computer-readable storage medium. The storage medium stores a computer program, and when the computer program is executed by the processor, the processor causes the processor to execute the steps of the voiceprint recognition processing method described in the foregoing embodiments.
所述存储介质可以是U盘、移动硬盘、只读存储器(Read-Only Memory,ROM)、磁碟或者光盘等各种可以存储计算机程序的计算机可读存储介质。The storage medium may be various computer-readable storage media that can store a computer program, such as a U disk, a mobile hard disk, a read-only memory (ROM), a magnetic disk, or an optical disk.
本领域普通技术人员可以意识到,结合本文中所公开的实施例描述的各示例的单元及算法步骤,能够以电子硬件、计算机软件或者二者的结合来实现,为了清楚地说明硬件和软件的可互换性,在上述说明中已经按照功能一般性地描述了各示例的组成及步骤。这些功能究竟以硬件还是软件方式来执行,取决于技术方案的特定应用和设计约束条件。专业技术人员可以对每个特定的应用来使用不同方法来实现所描述的功能,但是这种实现不应认为超出本申请的范围。Those of ordinary skill in the art may realize that the units and algorithm steps of each example described in combination with the embodiments disclosed herein can be implemented by electronic hardware, computer software, or a combination of the two. In order to clearly illustrate the hardware and software, Interchangeability. In the above description, the composition and steps of each example have been described generally in terms of functions. Whether these functions are performed by hardware or software depends on the specific application and design constraints of the technical solution. Professional technicians can use different methods to implement the described functions for each specific application, but such implementation should not be considered to be beyond the scope of this application.
以上所述,仅为本申请的具体实施方式,但本申请的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本申请揭露的技术范围内,可轻易想到各种等效的修改或替换,这些修改或替换都应涵盖在本申请的保护范围之内。因此,本申请的保护范围应以权利要求的保护范围为准。The above is only a specific implementation of this application, but the scope of protection of this application is not limited to this. Any person skilled in the art can easily think of various equivalents within the technical scope disclosed in this application. Modifications or replacements, and these modifications or replacements should be covered by the protection scope of this application. Therefore, the protection scope of this application shall be subject to the protection scope of the claims.

Claims (20)

  1. 一种声纹识别处理方法,包括:A voiceprint recognition processing method includes:
    若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;If an instruction to recognize a voiceprint is obtained, a prompt message including a first random code is output, and the prompt information refers to information for prompting what should be included in the voice provided by the identified person;
    获取被识别者提供的包含所述第一随机码的语音;Acquiring the speech provided by the identified person and including the first random code;
    通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及Converting the speech into text through speech recognition, and extracting a second random code contained in the text; and
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  2. 根据权利要求1所述声纹识别处理方法,其中,所述提示信息中还包括预设内容,所述第一随机码和所述预设内容的顺序在所述语音中被限定;The voiceprint recognition processing method according to claim 1, wherein the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
    所述使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果的步骤包括:The verifying the second random code using the first random code, and if the verification of the second random code is successful, the step of performing voiceprint recognition to obtain a voiceprint recognition result includes:
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。Use the first random code to verify the second random code, and if the second random code is successfully verified, perform voiceprint recognition according to the preset content to obtain a voiceprint recognition result.
  3. 根据权利要求1所述声纹识别处理方法,其中,所述若获得识别声纹的指令,输出包含第一随机码的提示信息的步骤还包括:The voiceprint recognition processing method according to claim 1, wherein the step of outputting prompt information including a first random code if an instruction to recognize a voiceprint is obtained, further comprising:
    所述提示信息中包含所述第一随机码在语音中的位置。The prompt information includes a position of the first random code in the voice.
  4. 根据权利要求3所述声纹识别处理方法,其中,所述提示信息中包含所述第一随机码在语音中的位置在每一次声纹识别时被随机限定。The voiceprint recognition processing method according to claim 3, wherein the prompt information includes the position of the first random code in the voice is randomly defined each time the voiceprint recognition is performed.
  5. 根据权利要求1所述声纹识别处理方法,其中其特征在于,所述提示信息中还包括语音是预设时间长度内的语音。The voiceprint recognition processing method according to claim 1, wherein the prompt information further comprises that the voice is a voice within a preset time length.
  6. 一种声纹识别处理装置,包括:A voiceprint recognition processing device includes:
    输出单元,用于若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;An output unit, configured to output prompt information including a first random code if an instruction to recognize a voiceprint is obtained, where the prompt information refers to information used to prompt the content provided by the identified person's voice;
    获取单元,用于获取被识别者提供的包含所述第一随机码的语音;An obtaining unit, configured to obtain a voice provided by the identified person and including the first random code;
    提取单元,用于通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及An extraction unit, configured to convert the speech into text through speech recognition, and extract a second random code included in the text; and
    校验单元,用于使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。The verification unit is configured to verify the second random code by using the first random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  7. 根据权利要求6所述声纹识别处理装置,其中,所述提示信息中还包括预设内容,所述第一随机码和所述预设内容的顺序在所述语音中被限定;The voiceprint recognition processing device according to claim 6, wherein the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
    所述校验单元,用于使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。The verification unit is configured to verify the second random code by using the first random code. If the verification of the second random code is successful, perform voiceprint recognition according to the preset content to obtain Voiceprint recognition results.
  8. 根据权利要求6所述声纹识别处理装置,其中,所述输出单元所输出的所述提示信息中包含所述第一随机码在语音中的位置。The voiceprint recognition processing device according to claim 6, wherein the prompt information output by the output unit includes a position of the first random code in a voice.
  9. 根据权利要求8所述声纹识别处理装置,其中,所述输出单元所输出的所述提示信息中包含所述第一随机码在语音中的位置在每一次声纹识别时被随机限定。The voiceprint recognition processing device according to claim 8, wherein the prompt information output by the output unit includes the position of the first random code in the voice is randomly defined each time the voiceprint is recognized.
  10. 根据权利要求6所述声纹识别处理装置,其中,所述输出单元所输出的所述提示信息中还包含语音是预设时间长度内的语音。The voiceprint recognition processing device according to claim 6, wherein the prompt information output by the output unit further includes that the voice is a voice within a preset time length.
  11. 一种电子设备,包括存储器及处理器,其中,所述存储器上存储有计算机程序,所述处理器执行所述计算机程序时实现以下步骤:An electronic device includes a memory and a processor, wherein the memory stores a computer program, and the processor implements the following steps when the processor executes the computer program:
    若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;If an instruction to recognize a voiceprint is obtained, a prompt message including a first random code is output, and the prompt information refers to information for prompting what should be included in the voice provided by the identified person;
    获取被识别者提供的包含所述第一随机码的语音;Acquiring the speech provided by the identified person and including the first random code;
    通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及Converting the speech into text through speech recognition, and extracting a second random code contained in the text; and
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  12. 根据权利要求11所述电子设备,其中,所述提示信息中还包括预设内容,所述第一随机码和所述预设内容的顺序在所述语音中被限定;The electronic device according to claim 11, wherein the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
    所述使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果的步骤包括:The verifying the second random code using the first random code, and if the verification of the second random code is successful, the step of performing voiceprint recognition to obtain a voiceprint recognition result includes:
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校 验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。The second random code is verified using the first random code. If the second random code is successfully verified, voiceprint recognition is performed according to the preset content to obtain a voiceprint recognition result.
  13. 根据权利要求11所述电子设备,其中,所述若获得识别声纹的指令,输出包含第一随机码的提示信息的步骤还包括:The electronic device according to claim 11, wherein if the instruction for identifying the voiceprint is obtained, the step of outputting the prompt information including the first random code further comprises:
    所述提示信息中包含所述第一随机码在语音中的位置。The prompt information includes a position of the first random code in the voice.
  14. 根据权利要求13所述电子设备,其中,所述提示信息中包含所述第一随机码在语音中的位置在每一次声纹识别时被随机限定。The electronic device according to claim 13, wherein the position of the first random code in the voice included in the prompt information is randomly defined every time the voiceprint is recognized.
  15. 根据权利要求11所述电子设备,其中,所述提示信息中还包括语音是预设时间长度内的语音。The electronic device according to claim 11, wherein the prompt information further comprises that the voice is a voice within a preset time length.
  16. 一种存储介质,其中,所述存储介质存储有计算机程序,所述计算机程序当被处理器执行时可实现如下操作:A storage medium, wherein the storage medium stores a computer program, and the computer program, when executed by a processor, can implement the following operations:
    若获得识别声纹的指令,输出包含第一随机码的提示信息,所述提示信息是指用于对被识别者提供的语音应包含的内容作出提示的信息;If an instruction to recognize a voiceprint is obtained, a prompt message including a first random code is output, and the prompt information refers to information for prompting what should be included in the voice provided by the identified person;
    获取被识别者提供的包含所述第一随机码的语音;Acquiring the speech provided by the identified person and including the first random code;
    通过语音识别,将所述语音转换为文字,并提取所述文字中包含的第二随机码;以及Converting the speech into text through speech recognition, and extracting a second random code contained in the text; and
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果。Use the first random code to verify the second random code. If the second random code is successfully verified, perform voiceprint recognition to obtain a voiceprint recognition result.
  17. 根据权利要求16所述存储介质,其中,所述提示信息中还包括预设内容,所述第一随机码和所述预设内容的顺序在所述语音中被限定;The storage medium according to claim 16, wherein the prompt information further includes preset content, and an order of the first random code and the preset content is limited in the voice;
    所述使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,进行声纹识别得到声纹识别结果的步骤包括:The verifying the second random code using the first random code, and if the verification of the second random code is successful, the step of performing voiceprint recognition to obtain a voiceprint recognition result includes:
    使用所述第一随机码对所述第二随机码进行校验,若对所述第二随机码校验成功,根据所述预设内容进行声纹识别,得到声纹识别结果。Use the first random code to verify the second random code, and if the second random code is successfully verified, perform voiceprint recognition according to the preset content to obtain a voiceprint recognition result.
  18. 根据权利要求16所述存储介质,其中,所述若获得识别声纹的指令,输出包含第一随机码的提示信息的步骤还包括:The storage medium according to claim 16, wherein if the instruction for identifying the voiceprint is obtained, the step of outputting the prompt information including the first random code further comprises:
    所述提示信息中包含所述第一随机码在语音中的位置。The prompt information includes a position of the first random code in the voice.
  19. 根据权利要求18所述存储介质,其中,所述提示信息中包含所述第一随机码在语音中的位置在每一次声纹识别时被随机限定。The storage medium according to claim 18, wherein the position of the first random code in the voice included in the prompt information is randomly defined every time the voiceprint is recognized.
  20. 根据权利要求16所述存储介质,其中,所述提示信息中还包括语音是预设时间长度内的语音。The storage medium according to claim 16, wherein the prompt information further includes that the voice is a voice within a preset time length.
PCT/CN2018/107954 2018-08-03 2018-09-27 Voiceprint recognition processing method and apparatus, electronic device and storage medium WO2020024415A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201810877973.1A CN109087647B (en) 2018-08-03 2018-08-03 Voiceprint recognition processing method and device, electronic equipment and storage medium
CN201810877973.1 2018-08-03

Publications (1)

Publication Number Publication Date
WO2020024415A1 true WO2020024415A1 (en) 2020-02-06

Family

ID=64833567

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/107954 WO2020024415A1 (en) 2018-08-03 2018-09-27 Voiceprint recognition processing method and apparatus, electronic device and storage medium

Country Status (2)

Country Link
CN (1) CN109087647B (en)
WO (1) WO2020024415A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115759649A (en) * 2022-11-22 2023-03-07 北京丹灵云科技有限责任公司 Police material figure interconnection safety management and control method

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110085228A (en) * 2019-04-28 2019-08-02 广西盖德科技有限公司 Phonetic code application method, applications client and system
CN112309060A (en) * 2019-08-02 2021-02-02 广东美的制冷设备有限公司 Security and protection equipment and indoor monitoring method, control device and readable storage medium thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102916815A (en) * 2012-11-07 2013-02-06 华为终端有限公司 Method and device for checking identity of user
CN105913850A (en) * 2016-04-20 2016-08-31 上海交通大学 Text related vocal print password verification method
CN106357411A (en) * 2016-10-14 2017-01-25 深圳天珑无线科技有限公司 Identity verification method and device
CN106506524A (en) * 2016-11-30 2017-03-15 百度在线网络技术(北京)有限公司 Method and apparatus for verifying user
CN107147499A (en) * 2017-05-17 2017-09-08 刘光明 The method and system verified using phonetic entry

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20030037004A1 (en) * 2001-08-14 2003-02-20 Chuck Buffum Dialog-based voiceprint security for business transactions
CN102413101A (en) * 2010-09-25 2012-04-11 盛乐信息技术(上海)有限公司 Voice-print authentication system having voice-print password voice prompting function and realization method thereof
CN102142254A (en) * 2011-03-25 2011-08-03 北京得意音通技术有限责任公司 Voiceprint identification and voice identification-based recording and faking resistant identity confirmation method
CN102737634A (en) * 2012-05-29 2012-10-17 百度在线网络技术(北京)有限公司 Authentication method and device based on voice
CN102708867A (en) * 2012-05-30 2012-10-03 北京正鹰科技有限责任公司 Method and system for identifying faked identity by preventing faked recordings based on voiceprint and voice
CN103986725A (en) * 2014-05-29 2014-08-13 中国农业银行股份有限公司 Client side, server side and identity authentication system and method
US10008208B2 (en) * 2014-09-18 2018-06-26 Nuance Communications, Inc. Method and apparatus for performing speaker recognition
CN105635087B (en) * 2014-11-20 2019-09-20 阿里巴巴集团控股有限公司 Pass through the method and device of voice print verification user identity
CN105933272A (en) * 2015-12-30 2016-09-07 中国银联股份有限公司 Voiceprint recognition method capable of preventing recording attack, server, terminal, and system
CN107068154A (en) * 2017-03-13 2017-08-18 平安科技(深圳)有限公司 The method and system of authentication based on Application on Voiceprint Recognition
CN107919961A (en) * 2017-12-07 2018-04-17 广州势必可赢网络科技有限公司 Identity authentication protocol and server based on dynamic code and dynamic voiceprint update

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102916815A (en) * 2012-11-07 2013-02-06 华为终端有限公司 Method and device for checking identity of user
CN105913850A (en) * 2016-04-20 2016-08-31 上海交通大学 Text related vocal print password verification method
CN106357411A (en) * 2016-10-14 2017-01-25 深圳天珑无线科技有限公司 Identity verification method and device
CN106506524A (en) * 2016-11-30 2017-03-15 百度在线网络技术(北京)有限公司 Method and apparatus for verifying user
CN107147499A (en) * 2017-05-17 2017-09-08 刘光明 The method and system verified using phonetic entry

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN115759649A (en) * 2022-11-22 2023-03-07 北京丹灵云科技有限责任公司 Police material figure interconnection safety management and control method
CN115759649B (en) * 2022-11-22 2024-03-29 北京丹灵云科技有限责任公司 Police material character interconnection safety control method

Also Published As

Publication number Publication date
CN109087647A (en) 2018-12-25
CN109087647B (en) 2023-06-13

Similar Documents

Publication Publication Date Title
WO2017197953A1 (en) Voiceprint-based identity recognition method and device
US9230547B2 (en) Metadata extraction of non-transcribed video and audio streams
WO2019179036A1 (en) Deep neural network model, electronic device, identity authentication method, and storage medium
US9979721B2 (en) Method, server, client and system for verifying verification codes
WO2018149209A1 (en) Voice recognition method, electronic device, and computer storage medium
US9728191B2 (en) Speaker verification methods and apparatus
WO2019090834A1 (en) Express cabinet pickup method and apparatus based on voiceprint
US20170118205A1 (en) User biological feature authentication method and system
WO2019179029A1 (en) Electronic device, identity verification method and computer-readable storage medium
WO2019019743A1 (en) Information auditing method and apparatus, electronic device and computer readable storage medium
WO2020024415A1 (en) Voiceprint recognition processing method and apparatus, electronic device and storage medium
JP7123871B2 (en) Identity authentication method, identity authentication device, electronic device and computer-readable storage medium
CN104980402B (en) Method and device for identifying malicious operation
CN110111798B (en) Method, terminal and computer readable storage medium for identifying speaker
WO2019228135A1 (en) Method and device for adjusting matching threshold, storage medium and electronic device
US11776543B2 (en) Authentication system, authentication method, and, non-transitory computer-readable information recording medium for recording program
CN116013324A (en) Robot voice control authority management method based on voiceprint recognition
CN111142834A (en) Service processing method and system
KR101181060B1 (en) Voice recognition system and method for speaker recognition using thereof
WO2019140851A1 (en) Telemarketing prompt method, electronic device, and readable storage medium
CN111090846B (en) Login authentication method, login authentication device, electronic equipment and computer readable storage medium
CN112417412A (en) Bank account balance inquiry method, device and system
CN109388695B (en) User intention recognition method, apparatus and computer-readable storage medium
JP2015055835A (en) Speaker recognition device, speaker recognition method, and speaker recognition program
CN108985035B (en) Control method and device for user operation authority, storage medium and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18928328

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18928328

Country of ref document: EP

Kind code of ref document: A1