CN111261171A - Method and system for voiceprint verification of customizable text - Google Patents

Method and system for voiceprint verification of customizable text Download PDF

Info

Publication number
CN111261171A
CN111261171A CN202010055493.4A CN202010055493A CN111261171A CN 111261171 A CN111261171 A CN 111261171A CN 202010055493 A CN202010055493 A CN 202010055493A CN 111261171 A CN111261171 A CN 111261171A
Authority
CN
China
Prior art keywords
voiceprint
user
text
content
verification
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010055493.4A
Other languages
Chinese (zh)
Inventor
吴毅鑫
李稀敏
肖龙源
蔡振华
刘晓葳
谭玉坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Xiamen Kuaishangtong Technology Co Ltd
Original Assignee
Xiamen Kuaishangtong Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Xiamen Kuaishangtong Technology Co Ltd filed Critical Xiamen Kuaishangtong Technology Co Ltd
Priority to CN202010055493.4A priority Critical patent/CN111261171A/en
Publication of CN111261171A publication Critical patent/CN111261171A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/02Preprocessing operations, e.g. segment selection; Pattern representation or modelling, e.g. based on linear discriminant analysis [LDA] or principal components; Feature selection or extraction
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/06Decision making techniques; Pattern matching strategies

Abstract

The invention discloses a voiceprint verification method of a customizable text, which comprises the following steps: s1, displaying the content of the customized text on the interactive interface to prompt the user to read aloud, wherein the customized text is the user-defined content; s2, comparing and analyzing the collected voiceprint in the reading audio with the voiceprint model in the database; and S3, judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database. The invention not only improves the user experience, but also meets the requirement of the user for personalized customization; in addition, the product diversity is improved, and the product popularization is facilitated.

Description

Method and system for voiceprint verification of customizable text
Technical Field
The invention relates to the technical field of voiceprint recognition, in particular to a voiceprint verification method and a voiceprint verification system for customizable texts.
Background
With the advent of the artificial intelligence era, more and more fields are beginning to use artificial intelligence to assist offices to improve productivity. Voiceprint recognition is an indispensable part of the field of artificial intelligence. The application range of the voiceprint recognition in the bank is very wide, and the voiceprint recognition can be applied to a call center to be used as the identity verification of a user and to build a voiceprint blacklist. In fact, fixed text recognition is a key part in voiceprint recognition. At present, fixed text recognition can only fix a word or a sentence for voice recognition. Similar to the existing awakening words such as "love classmates" in the market, the wrong utterance will cause the verification failure. However, such techniques can result in a user experience that is too boring to provide personalized customization. When the users are identified with fixed text, all users must use keywords set by the service provider. Such as words or sentences like 'love classmates', 'opening the door with sesame', etc. When the voiceprint recognition system receives the user voice, the voiceprint and the content are judged at the same time. The voiceprint recognition system will pass the user if and only if both pass at the same time.
Therefore, the current fixed text recognition technology unintentionally kills the right of user selection, and cannot provide personalized customization requirements for users. Thereby reducing the user experience of using the product.
Disclosure of Invention
The invention aims to solve the technical problem of providing a method and a system for verifying the voiceprint of the customizable text aiming at the defects of the prior art, so that the personalized customization requirements of users can be better met, and the use experience of products is further improved.
To achieve the above object, the present invention provides a method for voiceprint verification of customizable text, the method comprising:
s1, displaying the content of the customized text on the interactive interface to prompt the user to read aloud, wherein the customized text is the user-defined content;
s2, comparing and analyzing the collected voiceprint in the reading audio with the voiceprint model in the database;
and S3, judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
Preferably, the user-defined content is a word or a sentence designed by the user according to needs.
Preferably, the voiceprint model establishes a connection with corresponding user-defined content when stored in the database.
Preferably, the voiceprint model is a voiceprint model established for the user by extracting voice voiceprint features of the user.
To achieve the above object, the present invention further provides a system for voiceprint verification of customizable text, the system comprising:
the prompting unit is used for prompting the user to read the customized text by displaying the content of the customized text on the interactive interface, wherein the customized text is the user-defined content;
the comparison unit is used for comparing and analyzing the collected voiceprints in the reading audio with the voiceprint models in the database;
and the verification unit is used for judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
Preferably, the user-defined content is a word or a sentence designed by the user according to needs.
Preferably, the voiceprint model establishes a connection with corresponding user-defined content when stored in the database.
Preferably, the voiceprint model is a voiceprint model established for the user by extracting voice voiceprint features of the user.
According to the scheme, the personalized requirements of the user can be met, and when the user enters the database for the first time, the user can input words or sentences which the user wants to customize on the user interaction interface. After the user inputs, the user recites the content according to the input content. After the voiceprint recognition system receives the user's speech, content recognition is performed. It is determined whether the user recited content is user input content. And if the two are consistent, the voiceprint is put in a storage, and the statement is recorded. During the use stage of the user, the system presents the recorded sentences on the user interaction interface to prompt the user. When the user uses the voiceprint to carry out verification, the voiceprint recognition system can carry out content recognition and voiceprint recognition at the same time, and when the two types of voiceprint recognition and voiceprint recognition pass, the verification is passed. Therefore, the invention has the following advantages:
1. the user experience is improved, and the requirement of the user for personalized customization is met;
2. promote the product diversity, do benefit to the product popularization.
Drawings
Fig. 1 is a flowchart of a method for voiceprint verification of customizable text according to an embodiment of the present invention;
fig. 2 is a block diagram illustrating a structure of a system for voiceprint verification of customizable text according to an embodiment of the present invention.
The implementation, functional features and advantages of the objects of the present invention will be further explained with reference to the accompanying drawings.
Detailed Description
It should be understood that the specific embodiments described herein are merely illustrative of the invention and are not intended to limit the invention.
The present invention will be described in detail with reference to the following examples.
Referring to fig. 1, a flowchart of a method for voiceprint verification of a customizable text is provided according to an embodiment of the present invention. The method comprises the following steps:
s1, displaying the content of the customized text on the interactive interface to prompt the user to read aloud, wherein the customized text is the user-defined content;
s2, comparing and analyzing the collected voiceprint in the reading audio with the voiceprint model in the database;
and S3, judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
Preferably, the user-defined content is a word or a sentence designed by the user according to needs.
Preferably, the voiceprint model establishes a connection with corresponding user-defined content when stored in the database.
Preferably, the voiceprint model is a voiceprint model established for the user by extracting voice voiceprint features of the user.
The scheme solves the problem that the user cannot freely customize the text in the current fixed text recognition. The user personalized experience is ignored in the fixed text recognition as the most critical part for improving the user experience. The reason why the fixed text is required for the voiceprint recognition is that the recognition accuracy is not high in the state of the phrase-voice free text. The fixed text fixes the content of the voice, thereby improving the recognition rate. The improvement of the recognition rate mainly depends on that the content spoken by the user when the user is in storage in the voiceprint modeling is consistent with all the content used for verifying the identity of the user later. Therefore, the voiceprint recognition system does not need to remove semantic information in the audio, and voiceprint feature extraction is directly carried out, so that the recognition accuracy is improved. The scheme maintains the accuracy of phrase voice and voice print recognition and meets the personalized requirements of users.
In addition, the present invention further provides a system for voiceprint verification of customizable text, which is shown in fig. 2 and is a block diagram of a structure of the system for voiceprint verification of customizable text provided in an embodiment of the present invention.
The system comprises:
the prompting unit is used for prompting the user to read the customized text by displaying the content of the customized text on the interactive interface, wherein the customized text is the user-defined content;
the comparison unit is used for comparing and analyzing the collected voiceprints in the reading audio with the voiceprint models in the database;
and the verification unit is used for judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
Preferably, the user-defined content is a word or a sentence designed by the user according to needs.
Preferably, the voiceprint model establishes a connection with corresponding user-defined content when stored in the database.
Preferably, the voiceprint model is a voiceprint model established for the user by extracting voice voiceprint features of the user.
In the stage of the user voiceprint modeling and warehousing, the user sets words or sentences according to the requirement of the user and recites the contents of the words or the sentences. After the voiceprint recognition system receives the user's audio, content recognition is performed first, and the recognized content corresponds to the text entered by the user. If the text is consistent with the recognized content, feature extraction (modeling) is carried out on the section of audio and the section of audio is put in storage, and the text is recorded and is connected with the corresponding voiceprint model.
When the user verifies the voiceprint using the voiceprint recognition system, the voiceprint recognition system will display the text entered when the user modeled the voiceprint on the user interaction interface as a prompt. The user can recite the personalized and customized text at the moment. When the voiceprint recognition system receives the user audio, content recognition and voiceprint recognition are performed simultaneously. When the two pass through at the same time, the voiceprint recognition system can judge that the user is the user himself.
The invention not only retains the advantage that the fixed text improves the recognition accuracy of the voiceprint recognition system, but also meets the requirement that the user customizes own text in a personalized way. The invention can obviously improve the product experience of the user and meet the individual requirements of the user. Meanwhile, the interestingness is generated, and a user can customize interesting sentences according to individual imagination, so that the propaganda effect is played for the popularization of the product on the premise.
The embodiments in the above embodiments can be further combined or replaced, and the embodiments are only used for describing the preferred embodiments of the present invention, and do not limit the concept and scope of the present invention, and various changes and modifications made to the technical solution of the present invention by those skilled in the art without departing from the design idea of the present invention belong to the protection scope of the present invention.

Claims (8)

1. A method for voiceprint verification of customizable text, said method comprising:
s1, displaying the content of the customized text on the interactive interface to prompt the user to read aloud, wherein the customized text is the user-defined content;
s2, comparing and analyzing the collected voiceprint in the reading audio with the voiceprint model in the database;
and S3, judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
2. The method for voiceprint verification of customizable text according to claim 1, wherein the user-defined content is words or sentences designed by a user according to needs.
3. A method for voiceprint verification of customizable text according to any one of claims 1 or 2, characterized in that said voiceprint model is associated with corresponding user-defined content when stored in a database.
4. The method of claim 1, wherein the voiceprint model is a voiceprint model created for the user by extracting voice voiceprint features of the user.
5. A system for voiceprint verification of customizable text, said system comprising:
the prompting unit is used for prompting the user to read the customized text by displaying the content of the customized text on the interactive interface, wherein the customized text is the user-defined content;
the comparison unit is used for comparing and analyzing the collected voiceprints in the reading audio with the voiceprint models in the database;
and the verification unit is used for judging that the verification is passed when the text content is correct and the collected voiceprint is consistent with the voiceprint model in the database.
6. The system of claim 5, wherein the user-defined content is words or sentences designed by the user according to the needs.
7. A system for voiceprint verification of customizable text according to any one of claims 5 or 6, wherein said voiceprint model is associated with corresponding user-defined content when stored in a database.
8. A system for voiceprint verification of customizable text according to claim 5, wherein said voiceprint model is a voiceprint model built for a user by extracting voiceprint features of his voice.
CN202010055493.4A 2020-01-17 2020-01-17 Method and system for voiceprint verification of customizable text Pending CN111261171A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010055493.4A CN111261171A (en) 2020-01-17 2020-01-17 Method and system for voiceprint verification of customizable text

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010055493.4A CN111261171A (en) 2020-01-17 2020-01-17 Method and system for voiceprint verification of customizable text

Publications (1)

Publication Number Publication Date
CN111261171A true CN111261171A (en) 2020-06-09

Family

ID=70947134

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010055493.4A Pending CN111261171A (en) 2020-01-17 2020-01-17 Method and system for voiceprint verification of customizable text

Country Status (1)

Country Link
CN (1) CN111261171A (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103646646A (en) * 2013-11-27 2014-03-19 联想(北京)有限公司 Voice control method and electronic device
CN103685185A (en) * 2012-09-14 2014-03-26 上海掌门科技有限公司 Mobile equipment voiceprint registration and authentication method and system
AU2013315343A1 (en) * 2012-09-11 2015-04-30 Auraya Pty Ltd Voice authentication system and method
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
CN108735209A (en) * 2018-04-28 2018-11-02 广东美的制冷设备有限公司 Wake up word binding method, smart machine and storage medium

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
AU2013315343A1 (en) * 2012-09-11 2015-04-30 Auraya Pty Ltd Voice authentication system and method
CN103685185A (en) * 2012-09-14 2014-03-26 上海掌门科技有限公司 Mobile equipment voiceprint registration and authentication method and system
CN103646646A (en) * 2013-11-27 2014-03-19 联想(北京)有限公司 Voice control method and electronic device
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
CN108735209A (en) * 2018-04-28 2018-11-02 广东美的制冷设备有限公司 Wake up word binding method, smart machine and storage medium

Similar Documents

Publication Publication Date Title
JP6394709B2 (en) SPEAKER IDENTIFYING DEVICE AND FEATURE REGISTRATION METHOD FOR REGISTERED SPEECH
Segal Narrative comprehension and the role of deictic shift theory
EP2109097B1 (en) A method for personalization of a service
Yankelovich How do users know what to say?
US9070363B2 (en) Speech translation with back-channeling cues
US7716050B2 (en) Multilingual speech recognition
CN110689877A (en) Voice end point detection method and device
US20070055520A1 (en) Incorporation of speech engine training into interactive user tutorial
CN110517668B (en) Chinese and English mixed speech recognition system and method
Karat et al. Conversational interface technologies
Alghamdi et al. Saudi accented Arabic voice bank
US20150254238A1 (en) System and Methods for Maintaining Speech-To-Speech Translation in the Field
CN106910499A (en) The control method and device of application program
Fellbaum et al. Principles of electronic speech processing with applications for people with disabilities
Shahin Studying and enhancing talking condition recognition in stressful and emotional talking environments based on HMMs, CHMM2s and SPHMMs
CN109102807A (en) Personalized speech database creation system, speech recognition control system and terminal
CN112309406A (en) Voiceprint registration method, voiceprint registration device and computer-readable storage medium
Deka et al. Speech corpora of under resourced languages of north-east india
CN111261171A (en) Method and system for voiceprint verification of customizable text
Minker et al. Spoken dialogue systems technology and design
Gilbert et al. Intelligent virtual agents for contact center automation
WO2004034355A2 (en) System and methods for comparing speech elements
CN109035896A (en) A kind of Oral Training method and facility for study
CN101304457A (en) Method and apparatus for implementing automatic spoken language training based on voice telephone
CN108831473A (en) A kind of audio-frequency processing method and device

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20200609

RJ01 Rejection of invention patent application after publication