CN107331400A - A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium - Google Patents
A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium Download PDFInfo
- Publication number
- CN107331400A CN107331400A CN201710741564.4A CN201710741564A CN107331400A CN 107331400 A CN107331400 A CN 107331400A CN 201710741564 A CN201710741564 A CN 201710741564A CN 107331400 A CN107331400 A CN 107331400A
- Authority
- CN
- China
- Prior art keywords
- vocal print
- print feature
- sample
- user
- current
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 33
- 230000006872 improvement Effects 0.000 title claims abstract description 18
- 230000001755 vocal effect Effects 0.000 claims abstract description 173
- 238000000605 extraction Methods 0.000 claims abstract description 18
- 239000000284 extract Substances 0.000 claims abstract description 7
- 230000001960 triggered effect Effects 0.000 claims description 5
- 238000004590 computer program Methods 0.000 claims description 2
- 230000007246 mechanism Effects 0.000 abstract description 4
- 238000005516 engineering process Methods 0.000 description 9
- 230000006870 function Effects 0.000 description 9
- 238000012545 processing Methods 0.000 description 6
- 230000003287 optical effect Effects 0.000 description 5
- 230000005291 magnetic effect Effects 0.000 description 4
- 238000004891 communication Methods 0.000 description 2
- 210000000056 organ Anatomy 0.000 description 2
- 230000001133 acceleration Effects 0.000 description 1
- 230000008859 change Effects 0.000 description 1
- 238000000151 deposition Methods 0.000 description 1
- 238000011161 development Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000000694 effects Effects 0.000 description 1
- 230000005611 electricity Effects 0.000 description 1
- 230000002452 interceptive effect Effects 0.000 description 1
- 238000013507 mapping Methods 0.000 description 1
- 239000013307 optical fiber Substances 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 230000035479 physiological effects, processes and functions Effects 0.000 description 1
- 230000008569 process Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 238000012360 testing method Methods 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
- G10L17/14—Use of phonemic categorisation or speech recognition prior to speaker recognition or verification
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G10L15/1815—Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/22—Interactive procedures; Man-machine interfaces
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computational Linguistics (AREA)
- Artificial Intelligence (AREA)
- Business, Economics & Management (AREA)
- Computer Vision & Pattern Recognition (AREA)
- Game Theory and Decision Science (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention discloses a kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium, wherein, this method includes:Obtain the voice open command of user's input;Determine whether the voice open command matches with default guiding text;If matching, extracts the corresponding vocal print feature of the voice open command;The vocal print feature of extraction is matched with predetermined sample vocal print feature, if the match is successful, performs and opens operation, wherein the sample vocal print feature is extracted in advance from the semantic voice messaging for the guiding text.Personalized speech of the invention by obtaining user, the personalized speech information extraction according to user obtains the sample vocal print feature of user, and performing follow-up unlatching according to the matching result of the voice open command of user and sample vocal print feature operates.So as to no longer be limited by speech samples amount, fault tolerant mechanism is improved, the accuracy rate and Consumer's Experience of Application on Voiceprint Recognition is improved.
Description
Technical field
The present embodiments relate to sound groove recognition technology in e field, more particularly to a kind of Application on Voiceprint Recognition performance improvement method, dress
Put, terminal and storage medium.
Background technology
Sound groove recognition technology in e belongs to one kind of biological identification technology, is one and is spoken human physiology and row according to reaction in voice
The speech parameter being characterized recognizes the technology of voice words person's identity.Because everyone phonatory organ is in terms of size and form
It is not quite similar, therefore vocal print also just turns into a kind of means of identification for differentiating speaker's identity.
With the fast development of speech recognition technology, increasing intelligent electric appliance is increased using sound groove recognition technology in e
The Consumer's Experience of strong user, user can lock personal account according to sound groove recognition technology in e, and private category is carried out to personal account
Property definition, therefore user can use voice to rapidly enter device systems and obtain personal account information and function.Therefore, vocal print
The degree of accuracy of identification is very crucial.
The content of the invention
The embodiments of the invention provide a kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium, Neng Gouzeng
Plus speech samples amount, the accuracy of Application on Voiceprint Recognition is improved, strengthens Consumer's Experience.
In a first aspect, the embodiments of the invention provide a kind of Application on Voiceprint Recognition performance improvement method, including:
Obtain the voice open command of user's input;
Determine whether the voice open command matches with default guiding text;
If matching, extracts the corresponding vocal print feature of the voice open command;
The vocal print feature of extraction is matched with predetermined sample vocal print feature, if the match is successful, held
Row opens operation, wherein the sample vocal print feature is extracted in advance from the semantic voice messaging for the guiding text.
Second aspect, the embodiments of the invention provide a kind of Application on Voiceprint Recognition performance boost device, including:
Phonetic order acquisition module, the voice open command for obtaining user's input;
Sound identification module, for determining whether the voice open command matches with default guiding text;
Vocal print feature extraction module, for when the voice open command is with default guiding text matches, extracting institute
The corresponding vocal print feature of predicate sound open command;
Vocal print feature matching module, for the vocal print feature of extraction and predetermined sample vocal print feature to be carried out
Matching, if the match is successful, performs and opens operation, wherein the sample vocal print feature is from the semantic language for the guiding text
Extracted in advance in message breath.
The third aspect, the embodiments of the invention provide a kind of terminal, including:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are by one or more of computing devices so that one or more of processing
Device realizes the Application on Voiceprint Recognition performance improvement method described in any embodiment of the present invention.
Fourth aspect, the embodiments of the invention provide a kind of computer-readable recording medium, is stored thereon with computer journey
Sequence, realizes the Application on Voiceprint Recognition performance improvement method described in any embodiment of the present invention when the program is executed by processor.
A kind of Application on Voiceprint Recognition performance improvement method provided in an embodiment of the present invention, device, terminal and storage medium, by obtaining
Take the personalized of family input and guide voice, the personalized guiding voice according to user extracts the sample vocal print spy for obtaining user
Levy, and the corresponding vocal print feature of voice open command is matched according to sample vocal print feature.Due to guiding the content of text
Can be by user's sets itself, personalization guiding voice improves fault tolerant mechanism, the degree of accuracy of Application on Voiceprint Recognition is improved, so as to carry
The high degree of accuracy of sample vocal print feature, correspondingly, improves the degree of accuracy of follow-up vocal print feature matching, improves user's body
Test.
Brief description of the drawings
Fig. 1 is a kind of flow chart for Application on Voiceprint Recognition performance improvement method that the embodiment of the present invention one is provided;
Fig. 2 is a kind of flow chart for Application on Voiceprint Recognition performance improvement method that the embodiment of the present invention two is provided;
Fig. 3 is a kind of structural representation for Application on Voiceprint Recognition performance boost device that the embodiment of the present invention three is provided;
Fig. 4 is a kind of structural representation for terminal that the embodiment of the present invention four is provided.
Embodiment
The present invention is described in further detail with reference to the accompanying drawings and examples.It is understood that this place is retouched
The specific embodiment stated is used only for explaining the present invention, rather than limitation of the invention.It also should be noted that, in order to just
Part related to the present invention rather than entire infrastructure are illustrate only in description, accompanying drawing.
Embodiment one
Fig. 1 is a kind of flow chart for Application on Voiceprint Recognition performance improvement method that the embodiment of the present invention one is provided, and the present embodiment can
Situation suitable for controlling smart machine by phonetic order, this method can be performed by Application on Voiceprint Recognition performance boost device,
The Application on Voiceprint Recognition performance boost device can be realized by the way of software and/or hardware.With reference to Fig. 1, this method specifically can be with
Including as follows:
S110, the voice open command for obtaining user's input.
Intelligent terminal can be detected that intelligent terminal in a dormant state detects residing ring in real time to surrounding environment
When there is phonetic order in border, the voice open command of user's input is obtained.Intelligent terminal is support interactive voice with multimedia
The smart machine of function, such as has the function of supporting in terms of audio, video, data, can be intelligent robot, intelligent sound box
Deng.
S120, determine voice open command with it is default guiding text whether match;If matching, continues executing with S130;
Otherwise, execution S160 is redirected.
Wherein, guiding text refers to that the voice that validated user is pre-set wakes up the corresponding text of instruction, and voice, which wakes up, to be referred to
Make for controlling intelligent terminal to be in a dormant state switched to running status.For example, using intelligent terminal in validated user
During such as intelligent terminal first by use, the personalized voice of prompting user's input wakes up and instructed, voice is waken up and referred to
Order carries out the guiding text that semantic analysis obtains personalization.
If specifically, voice open command and guiding text matches success, active user is probably the conjunction of intelligent terminal
Method user, continues executing with subsequent operation;If voice open command and guiding text matches failure, active user will not be legal
User, can directly shield the voice open command.
S130, the corresponding vocal print feature of extraction voice open command.
S140, the vocal print feature of extraction matched with predetermined sample vocal print feature, wherein sample vocal print is special
Levy is extracted in advance from the semantic voice messaging for guiding text;If the match is successful, S150 is continued executing with;Otherwise, jump
Turn to perform S160.
Wherein, the determination of the sample vocal print feature can include:During voiceprint registration, provide a user in recording
Pass passage;Show personalized speech input prompting message;The personalized speech content that user inputs is analyzed, obtains described
The sample vocal print feature of user.
It should be noted that during voiceprint registration, the personalized speech content that user inputs is not especially limited,
Guiding content of text is not especially limited, it is allowed to which user uses personalized guiding text.Also, to personalized speech
Quantity and volume are also not construed as limiting, and user carries out repeatedly guiding voice typing with can not limiting number of times using multiple usual volumes.
In the range of certain amount, the personalized speech quantity that user inputs during voiceprint registration is more, and personalized speech is entered
The degree of accuracy for the sample vocal print feature that row analysis is determined is higher.Sample vocal print feature is not limited by speech samples amount in the present embodiment
System, improves fault tolerant mechanism, so as to improve the degree of accuracy of sample vocal print feature.
Operation is opened in S150, execution.
S160, without any operation.
It is further to note that intelligent terminal can have multiple validated users, different validated users are to that should have sample sound
Line feature and guiding text, then the incidence relation being also stored between guiding text and sample vocal print feature in intelligent terminal, or
Person is stored with validated user with guiding the mapping relations between text, and validated user and sample vocal print feature.
By taking the opening process of intelligent sound box as an example, the corresponding guiding texts of user A are " intelligent sound box that please start me ", and
Extract the sample vocal print feature for obtaining user A.Guiding text corresponding to user B is " intelligent sound box quickly starts ", and is extracted
Obtain user B sample vocal print feature.User C does not store any open command, guiding text and sample sound to the intelligent sound box
Line feature.During intelligent sound box use, if it is " intelligent sound box that please start me " that user A, which says content to intelligent sound box,
Voice open command, now voice open command and guiding text matches success, and " intelligent sound box that please start me " is corresponding
The match is successful for current vocal print feature and user A sample vocal print feature, and intelligent sound box starts.
However, when user A says the voice open command that content is " intelligent sound box quickly starts " to intelligent sound box, though
Right voice open command and user B guiding text matches success, but current vocal print feature and user B sample vocal print feature
It fails to match, and intelligent sound box starts failure.
The technical scheme of the present embodiment, by obtaining the personalized guiding voice that user inputs, the personalization according to user
Guiding voice extracts the sample vocal print feature for obtaining user, and according to sample vocal print feature to the corresponding vocal print of voice open command
Feature is matched.Due to guiding the content of text can be by user's sets itself, personalization guiding voice improves fault-tolerant machine
System, improves the degree of accuracy of Application on Voiceprint Recognition, so as to improve the degree of accuracy of sample vocal print feature, correspondingly, improves follow-up sound
The degree of accuracy of line characteristic matching, improves Consumer's Experience.
Embodiment two
There is provided a kind of update method of sample vocal print feature on the basis of above-described embodiment one for the present embodiment.Fig. 2 is
The flow chart for a kind of Application on Voiceprint Recognition performance improvement method that the embodiment of the present invention two is provided, as shown in Fig. 2 this method specifically can be with
Including following:
S210, when detecting vocal print update event, obtain user input current speech information
Wherein, it is triggered detecting default vocal print more new button, or when detecting the presence of sample vocal print feature
Between length be more than default time span threshold value when, generate vocal print update event.
S220, current speech information is identified, extraction obtains current vocal print feature.
S230, according to current vocal print feature and predetermined sample vocal print feature, obtain new sample vocal print feature.
Exemplary, S230 can include:Determine that the current vocal print feature and the predetermined sample vocal print are special
Whether identical owning user is levied, if identical, using predetermined coefficient to the current vocal print feature and predetermined
Sample vocal print feature is merged, and obtains the new sample vocal print feature.Wherein, coefficient can be experience set in advance
Value.
Exemplary, it is determined that whether current vocal print feature and predetermined sample vocal print feature owning user are identical, can
With including:It is determined that current similarity between vocal print feature and predetermined sample vocal print feature, presets if similarity is more than
Similarity threshold, it is determined that current vocal print feature is identical with predetermined sample vocal print feature owning user.
S240, the voice open command for obtaining user's input.
S250, determine voice open command with it is default guiding text whether match.
If S260, matching, the corresponding vocal print feature of voice open command is extracted.
S270, the vocal print feature of extraction matched with new sample vocal print feature, if the match is successful, performed
Open operation.
Exemplary, it is triggered detecting default vocal print more new button, or detect depositing for sample vocal print feature
When time span is more than default time span threshold value, the vocal print update event is generated.
Because everyone phonatory organ is not quite similar in terms of size and form, and vary, therefore work as constantly
, it is necessary to be carried out more to sample vocal print feature when the existence time length of sample vocal print feature is more than default time span threshold value
Newly, vocal print update event is generated, to ensure the accuracy rate of Application on Voiceprint Recognition.
The technical scheme of the present embodiment, when detecting vocal print update event, passes through voice messaging identification and vocal print feature
Matching, to judge whether the user profile of active user is consistent with equipment user now, when user is consistent, using advance
The coefficient of determination is merged to current vocal print feature and predetermined sample vocal print feature, obtains new sample vocal print special
Levy, complete the renewal of sample vocal print feature.Regularly updating for the sample vocal print feature in smart machine is ensured with this, sound is improved
The accuracy rate of line identification.
Embodiment three
Fig. 3 is a kind of structural representation for Application on Voiceprint Recognition performance boost device that the embodiment of the present invention three is provided, this implementation
Example is applicable to control the situation of smart machine by phonetic order, can perform the vocal print knowledge that any embodiment of the present invention is provided
The method of other performance boost.With reference to Fig. 3, the concrete structure of the device is as follows:
Phonetic order acquisition module 310, the voice open command for obtaining user's input;
Sound identification module 320, for determining whether voice open command matches with default guiding text;
Vocal print feature extraction module 330, for when voice open command is with default guiding text matches, extracting voice
The corresponding vocal print feature of open command;
Vocal print feature matching module 340, for the vocal print feature of extraction and predetermined sample vocal print feature to be carried out
Matching, if the match is successful, performs and opens operation, and wherein sample vocal print feature is from the semantic voice messaging for guiding text
Extract in advance.
Further, the device includes sample vocal print feature determining module 350, specifically for:
During voiceprint registration, recording uploading channel is provided a user;
Show personalized speech input prompting message;
The personalized speech content that user inputs is analyzed, the sample vocal print feature of user is obtained.
Further, the device also includes sample vocal print update module 360, specifically for:
When detecting vocal print update event, the current speech information of user's input is obtained;
Current speech information is identified, extraction obtains current vocal print feature;
According to current vocal print feature and predetermined sample vocal print feature, new sample vocal print feature is obtained.
On the basis of such scheme, sample vocal print update module 360, specifically for:
It is determined that whether current vocal print feature and predetermined sample vocal print feature owning user are identical, if identical, adopt
Current vocal print feature and predetermined sample vocal print feature are merged with predetermined coefficient, new sample sound is obtained
Line feature.
Preferably, it is determined that current similarity between vocal print feature and predetermined sample vocal print feature, if similar
Degree is more than default similarity threshold, it is determined that current vocal print feature and the predetermined sample vocal print feature owning user
It is identical.
Further, the device also includes vocal print update event generation module 370, specifically for:
It is triggered detecting default vocal print more new button, or detects the existence time length of sample vocal print feature
During more than default time span threshold value, vocal print update event is generated.
The technical scheme of the present embodiment, by the mutual cooperation between modules, realize speech recognition, voice print matching,
The functions such as user's identification, the determination of sample vocal print and the renewal of sample vocal print, have reached lifting fault tolerant mechanism, have improved Application on Voiceprint Recognition
The effect of accuracy rate and Consumer's Experience.
Example IV
Fig. 4 is a kind of structural representation for terminal that the embodiment of the present invention four is provided, and Fig. 4 is shown suitable for being used for realizing this
The block diagram of the exemplary terminal of invention embodiment.The terminal that Fig. 4 is shown/and it is only an example, should not be to present invention implementation
Example function and using range band any limitation.
The terminal 12 that Fig. 4 is shown is only an example, should not be come to the function of the embodiment of the present invention and using range band
Any limitation.
As shown in figure 4, terminal 12 is showed in the form of universal computing device.The component of terminal 12 can include but not limit
In:One or more processor or processing unit 16, system storage 28, connection different system component (including system is deposited
Reservoir 28 and processing unit 16) bus 18.
Bus 18 represents the one or more in a few class bus structures, including memory bus or Memory Controller,
Peripheral bus, graphics acceleration port, processor or the local bus using any bus structures in a variety of bus structures.Lift
For example, these architectures include but is not limited to industry standard architecture (ISA) bus, MCA (MAC)
Bus, enhanced isa bus, VESA's (VESA) local bus and periphery component interconnection (PCI) bus.
Terminal 12 typically comprises various computing systems computer-readable recording medium.These media can be it is any can be by terminal 12
The usable medium of access, including volatibility and non-volatile media, moveable and immovable medium.
System storage 28 can include the computer system readable media of form of volatile memory, such as arbitrary access
Memory (RAM) 30 and/or cache memory 32.Terminal 12 may further include it is other it is removable/nonremovable,
Volatile/non-volatile computer system storage medium.Only as an example, storage system 34 can be used for read-write it is irremovable
, non-volatile magnetic media (Fig. 4 do not show, commonly referred to as " hard disk drive ").Although not shown in Fig. 4, use can be provided
In the disc driver to may move non-volatile magnetic disk (such as " floppy disk ") read-write, and to may move anonvolatile optical disk
The CD drive of (such as CD-ROM, DVD-ROM or other optical mediums) read-write.In these cases, each driver can
To be connected by one or more data media interfaces with bus 18.Memory 28 can include at least one program product,
The program product has one group of (for example, at least one) program module, and these program modules are configured to perform each implementation of the invention
The function of example.
Program/utility 40 with one group of (at least one) program module 42, can be stored in such as memory 28
In, such program module 42 include but is not limited to operating system, one or more application program, other program modules and
The realization of network environment is potentially included in each or certain combination in routine data, these examples.Program module 42 is usual
Perform the function and/or method in embodiment described in the invention.
Terminal 12 can also communicate with one or more external equipments 14 (such as keyboard, sensing equipment, display 24),
Can also enable a user to the equipment communication interacted with the terminal 12 with one or more, and/or with enable the terminal 12 with
Any equipment (such as network interface card, modem etc.) communication that one or more of the other computing device is communicated.It is this logical
Letter can be carried out by input/output (I/O) interface 22.Also, terminal 12 can also by network adapter 20 and one or
The multiple networks of person (such as LAN (LAN), wide area network (WAN) and/or public network, such as internet) communicate.As illustrated,
Network adapter 20 is communicated by bus 18 with other modules of terminal 12.It should be understood that although not shown in the drawings, can combine
Terminal 12 uses other hardware and/or software module, includes but is not limited to:Microcode, device driver, redundant processing unit,
External disk drive array, RAID system, tape drive and data backup storage system etc..
Processing unit 16 is stored in program in system storage 28 by operation, thus perform various function application and
Data processing, for example, realize the Application on Voiceprint Recognition performance improvement method that the embodiment of the present invention is provided.
Embodiment five
The embodiment of the present invention five also provides a kind of computer-readable recording medium, be stored thereon with computer program (or
For computer executable instructions), it is used to perform a kind of Application on Voiceprint Recognition performance improvement method, the party when program is executed by processor
Method includes:
Obtain the voice open command of user's input;
Determine whether voice open command matches with default guiding text;
If matching, the corresponding vocal print feature of voice open command is extracted;
The vocal print feature of extraction is matched with predetermined sample vocal print feature, if the match is successful, execution is opened
Operation is opened, wherein sample vocal print feature is extracted in advance from the semantic voice messaging for guiding text.
The computer-readable storage medium of the embodiment of the present invention, can be using any of one or more computer-readable media
Combination.Computer-readable medium can be computer-readable signal media or computer-readable recording medium.It is computer-readable
Storage medium for example may be-but not limited to-the system of electricity, magnetic, optical, electromagnetic, infrared ray or semiconductor, device or
Device, or any combination above.The more specifically example (non exhaustive list) of computer-readable recording medium includes:Tool
There are the electrical connections of one or more wires, portable computer diskette, hard disk, random access memory (RAM), read-only storage
(ROM), erasable programmable read only memory (EPROM or flash memory), optical fiber, portable compact disc read-only storage (CD-
ROM), light storage device, magnetic memory device or above-mentioned any appropriate combination.In this document, computer-readable storage
Medium can be it is any include or storage program tangible medium, the program can be commanded execution system, device or device
Using or it is in connection.
Computer-readable signal media can be included in a base band or as the data-signal of carrier wave part propagation,
Wherein carry computer-readable program code.The data-signal of this propagation can take various forms, including but not limit
In electromagnetic signal, optical signal or above-mentioned any appropriate combination.Computer-readable signal media can also be that computer can
Any computer-readable medium beyond storage medium is read, the computer-readable medium, which can send, propagates or transmit, to be used for
Used by instruction execution system, device or device or program in connection.
The program code included on computer-readable medium can be transmitted with any appropriate medium, including --- but do not limit
In wireless, electric wire, optical cable, RF etc., or above-mentioned any appropriate combination.
It can be write with one or more programming languages or its combination for performing the computer that the present invention is operated
Program code, described program design language includes object oriented program language-such as Java, Smalltalk, C++,
Also include conventional procedural programming language-such as " C " language or similar programming language.Program code can be with
Fully perform, partly perform on the user computer on the user computer, as independent software kit execution, a portion
Divide part execution or the execution completely on remote computer or server on the remote computer on the user computer.
Be related in the situation of remote computer, remote computer can be by the network of any kind --- including LAN (LAN) or
Wide area network (WAN)-be connected to subscriber computer, or, it may be connected to outer computer (is for example carried using Internet service
Come for business by Internet connection).
Note, above are only presently preferred embodiments of the present invention and institute's application technology principle.It will be appreciated by those skilled in the art that
The invention is not restricted to specific embodiment described here, can carry out for a person skilled in the art it is various it is obvious change,
Readjust and substitute without departing from protection scope of the present invention.Therefore, although the present invention is carried out by above example
It is described in further detail, but the present invention is not limited only to above example, without departing from the inventive concept, also
Other more equivalent embodiments can be included, and the scope of the present invention is determined by scope of the appended claims.
Claims (14)
1. a kind of Application on Voiceprint Recognition performance improvement method, it is characterised in that including:
Obtain the voice open command of user's input;
Determine whether the voice open command matches with default guiding text;
If matching, extracts the corresponding vocal print feature of the voice open command;
The vocal print feature of extraction is matched with predetermined sample vocal print feature, if the match is successful, execution is opened
Operation is opened, wherein the sample vocal print feature is extracted in advance from the semantic voice messaging for the guiding text.
2. according to the method described in claim 1, it is characterised in that the determination of the sample vocal print feature includes:
During voiceprint registration, recording uploading channel is provided a user;
Show personalized speech input prompting message;
The personalized speech content that user inputs is analyzed, the sample vocal print feature of the user is obtained.
3. according to the method described in claim 1, it is characterised in that also include:
When detecting vocal print update event, the current speech information of user's input is obtained;
The current speech information is identified, extraction obtains current vocal print feature;
According to the current vocal print feature and the predetermined sample vocal print feature, new sample vocal print feature is obtained.
4. method according to claim 3, it is characterised in that according to the current vocal print feature and described predetermined
Sample vocal print feature, obtains the new sample vocal print feature, including:
Determine whether the current vocal print feature and the predetermined sample vocal print feature owning user are identical, if identical,
Then the current vocal print feature and predetermined sample vocal print feature are merged using predetermined coefficient, institute is obtained
State new sample vocal print feature.
5. method according to claim 4, it is characterised in that determine the current vocal print feature and described predetermined
Whether sample vocal print feature owning user is identical, including:
The similarity between the current vocal print feature and the predetermined sample vocal print feature is determined, if similarity is more than
Default similarity threshold, it is determined that the current vocal print feature and the predetermined sample vocal print feature owning user phase
Together.
6. method according to claim 3, it is characterised in that
It is triggered detecting default vocal print more new button, or detects the existence time length of sample vocal print feature and is more than
During default time span threshold value, the vocal print update event is generated.
7. a kind of Application on Voiceprint Recognition performance boost device, it is characterised in that including:
Phonetic order acquisition module, the voice open command for obtaining user's input;
Sound identification module, for determining whether the voice open command matches with default guiding text;
Vocal print feature extraction module, for when the voice open command is with default guiding text matches, extracting institute's predicate
The corresponding vocal print feature of sound open command;
A vocal print feature matching module, for the vocal print feature of extraction and predetermined sample vocal print feature to be carried out
Match somebody with somebody, if the match is successful, perform and open operation, wherein the sample vocal print feature is from the semantic voice for the guiding text
Extracted in advance in information.
8. device according to claim 7, it is characterised in that including sample vocal print feature determining module, the sample sound
Line characteristic determination module specifically for:
During voiceprint registration, recording uploading channel is provided a user;
Show personalized speech input prompting message;
The personalized speech content that user inputs is analyzed, the sample vocal print feature of the user is obtained.
9. device according to claim 7, it is characterised in that also include:Sample vocal print update module, the sample vocal print
Update module specifically for:
When detecting vocal print update event, the current speech information of user's input is obtained;
The current speech information is identified, extraction obtains current vocal print feature;
According to the current vocal print feature and the predetermined sample vocal print feature, new sample vocal print feature is obtained.
10. device according to claim 9, it is characterised in that the sample vocal print update module specifically for:
Determine whether the current vocal print feature and the predetermined sample vocal print feature owning user are identical, if identical,
Then the current vocal print feature and predetermined sample vocal print feature are merged using predetermined coefficient, institute is obtained
State new sample vocal print feature.
11. device according to claim 10, it is characterised in that the sample vocal print update module specifically for:
The similarity between the current vocal print feature and the predetermined sample vocal print feature is determined, if similarity is more than
Default similarity threshold, it is determined that the current vocal print feature and the predetermined sample vocal print feature owning user phase
Together.
12. device according to claim 9, it is characterised in that also including vocal print update event generation module, the vocal print
Update event generation module specifically for:
It is triggered detecting default vocal print more new button, or detects the existence time length of sample vocal print feature and is more than
During default time span threshold value, the vocal print update event is generated.
13. a kind of terminal, it is characterised in that including:
One or more processors;
Memory, for storing one or more programs,
When one or more of programs are by one or more of computing devices so that one or more of processors are real
The existing Application on Voiceprint Recognition performance improvement method as any one of claim 1 to 6.
14. a kind of computer-readable recording medium, is stored thereon with computer program, it is characterised in that the program is by processor
The Application on Voiceprint Recognition performance improvement method as any one of claim 1 to 6 is realized during execution.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710741564.4A CN107331400A (en) | 2017-08-25 | 2017-08-25 | A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710741564.4A CN107331400A (en) | 2017-08-25 | 2017-08-25 | A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107331400A true CN107331400A (en) | 2017-11-07 |
Family
ID=60224958
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710741564.4A Pending CN107331400A (en) | 2017-08-25 | 2017-08-25 | A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107331400A (en) |
Cited By (17)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107863098A (en) * | 2017-12-07 | 2018-03-30 | 广州市艾涛普电子有限公司 | A kind of voice identification control method and device |
CN108428455A (en) * | 2018-02-13 | 2018-08-21 | 上海爱优威软件开发有限公司 | The acquisition method and system of vocal print feature |
CN108718357A (en) * | 2018-03-13 | 2018-10-30 | 上海与德科技有限公司 | Method and device, mobile terminal and the computer readable storage medium of interface locking |
CN109065056A (en) * | 2018-09-26 | 2018-12-21 | 珠海格力电器股份有限公司 | A kind of method and device of voice control air-conditioning |
CN109147797A (en) * | 2018-10-18 | 2019-01-04 | 平安科技(深圳)有限公司 | Client service method, device, computer equipment and storage medium based on Application on Voiceprint Recognition |
WO2019127897A1 (en) * | 2017-12-29 | 2019-07-04 | 广州势必可赢网络科技有限公司 | Updating method and device for self-learning voiceprint recognition |
WO2020019176A1 (en) * | 2018-07-24 | 2020-01-30 | 华为技术有限公司 | Method for updating wake-up voice of voice assistant by terminal, and terminal |
WO2020029367A1 (en) * | 2018-08-09 | 2020-02-13 | 平安科技(深圳)有限公司 | Case processing opinion input method and device, computer apparatus, and storage medium |
CN111161741A (en) * | 2019-12-19 | 2020-05-15 | 五八有限公司 | Personalized information identification method and device, electronic equipment and storage medium |
CN111261172A (en) * | 2020-01-21 | 2020-06-09 | 北京爱数智慧科技有限公司 | Voiceprint recognition method and device |
CN111369985A (en) * | 2018-12-26 | 2020-07-03 | 深圳市优必选科技有限公司 | Voice interaction method, device, equipment and medium |
CN111462760A (en) * | 2019-01-21 | 2020-07-28 | 阿里巴巴集团控股有限公司 | Voiceprint recognition system, method and device and electronic equipment |
CN111684444A (en) * | 2019-07-18 | 2020-09-18 | 深圳海付移通科技有限公司 | Identity authentication method, terminal equipment and storage medium |
CN111768789A (en) * | 2020-08-03 | 2020-10-13 | 上海依图信息技术有限公司 | Electronic equipment and method, device and medium for determining identity of voice sender thereof |
CN111816174A (en) * | 2020-06-24 | 2020-10-23 | 北京小米松果电子有限公司 | Speech recognition method, device and computer readable storage medium |
CN111833867A (en) * | 2020-06-08 | 2020-10-27 | 北京嘀嘀无限科技发展有限公司 | Voice instruction recognition method and device, readable storage medium and electronic equipment |
CN113407922A (en) * | 2021-07-14 | 2021-09-17 | 上海万向区块链股份公司 | Intelligent intention recognition and analysis system and method based on block chain technology |
Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN2763935Y (en) * | 2003-12-12 | 2006-03-08 | 北京大学 | Spenker certification identifying system by combined lexeme and sound groove information |
CN202841290U (en) * | 2012-06-04 | 2013-03-27 | 百度在线网络技术(北京)有限公司 | Unlocking device of mobile terminal and mobile terminal having unlocking device |
CN103546622A (en) * | 2012-07-12 | 2014-01-29 | 百度在线网络技术(北京)有限公司 | Control method, device and system for identifying login on basis of voiceprint |
CN104202486A (en) * | 2014-09-26 | 2014-12-10 | 上海华勤通讯技术有限公司 | Mobile terminal and screen unlocking method thereof |
US20150269946A1 (en) * | 2014-03-21 | 2015-09-24 | Wells Fargo Bank, N.A. | Fraud detection database |
CN205788350U (en) * | 2016-02-25 | 2016-12-07 | 上海大学 | A kind of intelligent sound electronic lock |
CN106506524A (en) * | 2016-11-30 | 2017-03-15 | 百度在线网络技术(北京)有限公司 | Method and apparatus for verifying user |
-
2017
- 2017-08-25 CN CN201710741564.4A patent/CN107331400A/en active Pending
Patent Citations (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN2763935Y (en) * | 2003-12-12 | 2006-03-08 | 北京大学 | Spenker certification identifying system by combined lexeme and sound groove information |
CN202841290U (en) * | 2012-06-04 | 2013-03-27 | 百度在线网络技术(北京)有限公司 | Unlocking device of mobile terminal and mobile terminal having unlocking device |
CN103546622A (en) * | 2012-07-12 | 2014-01-29 | 百度在线网络技术(北京)有限公司 | Control method, device and system for identifying login on basis of voiceprint |
US20150269946A1 (en) * | 2014-03-21 | 2015-09-24 | Wells Fargo Bank, N.A. | Fraud detection database |
CN104202486A (en) * | 2014-09-26 | 2014-12-10 | 上海华勤通讯技术有限公司 | Mobile terminal and screen unlocking method thereof |
CN205788350U (en) * | 2016-02-25 | 2016-12-07 | 上海大学 | A kind of intelligent sound electronic lock |
CN106506524A (en) * | 2016-11-30 | 2017-03-15 | 百度在线网络技术(北京)有限公司 | Method and apparatus for verifying user |
Non-Patent Citations (1)
Title |
---|
田景熙等: "《物联网概论 第2版》", 31 July 2017 * |
Cited By (26)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107863098A (en) * | 2017-12-07 | 2018-03-30 | 广州市艾涛普电子有限公司 | A kind of voice identification control method and device |
WO2019127897A1 (en) * | 2017-12-29 | 2019-07-04 | 广州势必可赢网络科技有限公司 | Updating method and device for self-learning voiceprint recognition |
CN108428455A (en) * | 2018-02-13 | 2018-08-21 | 上海爱优威软件开发有限公司 | The acquisition method and system of vocal print feature |
CN108718357A (en) * | 2018-03-13 | 2018-10-30 | 上海与德科技有限公司 | Method and device, mobile terminal and the computer readable storage medium of interface locking |
CN111742361B (en) * | 2018-07-24 | 2023-08-22 | 华为技术有限公司 | Method for updating wake-up voice of voice assistant by terminal and terminal |
WO2020019176A1 (en) * | 2018-07-24 | 2020-01-30 | 华为技术有限公司 | Method for updating wake-up voice of voice assistant by terminal, and terminal |
CN111742361A (en) * | 2018-07-24 | 2020-10-02 | 华为技术有限公司 | Method for updating voice wake-up of voice assistant by terminal and terminal |
WO2020029367A1 (en) * | 2018-08-09 | 2020-02-13 | 平安科技(深圳)有限公司 | Case processing opinion input method and device, computer apparatus, and storage medium |
CN109065056B (en) * | 2018-09-26 | 2021-05-11 | 珠海格力电器股份有限公司 | Method and device for controlling air conditioner through voice |
CN109065056A (en) * | 2018-09-26 | 2018-12-21 | 珠海格力电器股份有限公司 | A kind of method and device of voice control air-conditioning |
CN109147797A (en) * | 2018-10-18 | 2019-01-04 | 平安科技(深圳)有限公司 | Client service method, device, computer equipment and storage medium based on Application on Voiceprint Recognition |
CN111369985A (en) * | 2018-12-26 | 2020-07-03 | 深圳市优必选科技有限公司 | Voice interaction method, device, equipment and medium |
CN111462760B (en) * | 2019-01-21 | 2023-09-26 | 阿里巴巴集团控股有限公司 | Voiceprint recognition system, voiceprint recognition method, voiceprint recognition device and electronic equipment |
CN111462760A (en) * | 2019-01-21 | 2020-07-28 | 阿里巴巴集团控股有限公司 | Voiceprint recognition system, method and device and electronic equipment |
CN111684444A (en) * | 2019-07-18 | 2020-09-18 | 深圳海付移通科技有限公司 | Identity authentication method, terminal equipment and storage medium |
CN111161741B (en) * | 2019-12-19 | 2023-06-27 | 五八有限公司 | Personalized information identification method and device, electronic equipment and storage medium |
CN111161741A (en) * | 2019-12-19 | 2020-05-15 | 五八有限公司 | Personalized information identification method and device, electronic equipment and storage medium |
CN111261172B (en) * | 2020-01-21 | 2023-02-10 | 北京爱数智慧科技有限公司 | Voiceprint recognition method and device |
CN111261172A (en) * | 2020-01-21 | 2020-06-09 | 北京爱数智慧科技有限公司 | Voiceprint recognition method and device |
CN111833867A (en) * | 2020-06-08 | 2020-10-27 | 北京嘀嘀无限科技发展有限公司 | Voice instruction recognition method and device, readable storage medium and electronic equipment |
CN111833867B (en) * | 2020-06-08 | 2023-12-05 | 北京嘀嘀无限科技发展有限公司 | Voice instruction recognition method and device, readable storage medium and electronic equipment |
CN111816174A (en) * | 2020-06-24 | 2020-10-23 | 北京小米松果电子有限公司 | Speech recognition method, device and computer readable storage medium |
CN111768789A (en) * | 2020-08-03 | 2020-10-13 | 上海依图信息技术有限公司 | Electronic equipment and method, device and medium for determining identity of voice sender thereof |
CN111768789B (en) * | 2020-08-03 | 2024-02-23 | 上海依图信息技术有限公司 | Electronic equipment, and method, device and medium for determining identity of voice generator of electronic equipment |
CN113407922A (en) * | 2021-07-14 | 2021-09-17 | 上海万向区块链股份公司 | Intelligent intention recognition and analysis system and method based on block chain technology |
CN113407922B (en) * | 2021-07-14 | 2022-06-03 | 上海万向区块链股份公司 | Intelligent intention recognition and analysis system and method based on block chain technology |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107331400A (en) | A kind of Application on Voiceprint Recognition performance improvement method, device, terminal and storage medium | |
US11100934B2 (en) | Method and apparatus for voiceprint creation and registration | |
CN108133707B (en) | Content sharing method and system | |
CN107134279A (en) | A kind of voice awakening method, device, terminal and storage medium | |
US10811005B2 (en) | Adapting voice input processing based on voice input characteristics | |
CN107622770A (en) | voice awakening method and device | |
CN107886944B (en) | Voice recognition method, device, equipment and storage medium | |
WO2020029500A1 (en) | Voice command customization method, device, apparatus, and computer storage medium | |
CN107516526B (en) | Sound source tracking and positioning method, device, equipment and computer readable storage medium | |
JP2021533397A (en) | Speaker dialification using speaker embedding and a trained generative model | |
CN108376543A (en) | A kind of control method of electrical equipment, device, equipment and storage medium | |
CN104123939A (en) | Substation inspection robot based voice interaction control method | |
CN104282302A (en) | Apparatus and method for recognizing voice and text | |
US11721338B2 (en) | Context-based dynamic tolerance of virtual assistant | |
WO2020024620A1 (en) | Voice information processing method and device, apparatus, and storage medium | |
US10540973B2 (en) | Electronic device for performing operation corresponding to voice input | |
US11393490B2 (en) | Method, apparatus, device and computer-readable storage medium for voice interaction | |
CN109215646A (en) | Voice interaction processing method, device, computer equipment and storage medium | |
KR20190068021A (en) | User adaptive conversation apparatus based on monitoring emotion and ethic and method for thereof | |
KR20180121761A (en) | Electronic apparatus for processing user utterance | |
US10714087B2 (en) | Speech control for complex commands | |
CN108447478A (en) | A kind of sound control method of terminal device, terminal device and device | |
US20220068267A1 (en) | Method and apparatus for recognizing speech, electronic device and storage medium | |
CN111009240A (en) | Voice keyword screening method and device, travel terminal, equipment and medium | |
CN112087726B (en) | Method and system for identifying polyphonic ringtone, electronic equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20171107 |
|
RJ01 | Rejection of invention patent application after publication |