CN109584860A - A kind of voice wakes up word and defines method and system - Google Patents

A kind of voice wakes up word and defines method and system Download PDF

Info

Publication number
CN109584860A
CN109584860A CN201710889765.9A CN201710889765A CN109584860A CN 109584860 A CN109584860 A CN 109584860A CN 201710889765 A CN201710889765 A CN 201710889765A CN 109584860 A CN109584860 A CN 109584860A
Authority
CN
China
Prior art keywords
word
voice
wake
user
module
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201710889765.9A
Other languages
Chinese (zh)
Other versions
CN109584860B (en
Inventor
朱泽春
时春平
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Joyoung Co Ltd
Original Assignee
Joyoung Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Joyoung Co Ltd filed Critical Joyoung Co Ltd
Priority to CN201710889765.9A priority Critical patent/CN109584860B/en
Publication of CN109584860A publication Critical patent/CN109584860A/en
Application granted granted Critical
Publication of CN109584860B publication Critical patent/CN109584860B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/063Training
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F21/00Security arrangements for protecting computers, components thereof, programs or data against unauthorised activity
    • G06F21/30Authentication, i.e. establishing the identity or authorisation of security principals
    • G06F21/31User authentication
    • G06F21/32User authentication using biometric data, e.g. fingerprints, iris scans or voiceprints
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • G10L15/1815Semantic context, e.g. disambiguation of the recognition hypotheses based on word meaning
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • G10L15/34Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification techniques
    • G10L17/22Interactive procedures; Man-machine interfaces
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • G10L2015/223Execution procedure of a spoken command

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Health & Medical Sciences (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Security & Cryptography (AREA)
  • Artificial Intelligence (AREA)
  • General Physics & Mathematics (AREA)
  • General Engineering & Computer Science (AREA)
  • Computer Hardware Design (AREA)
  • Software Systems (AREA)
  • Computing Systems (AREA)
  • Mathematical Physics (AREA)
  • Electrically Operated Instructional Devices (AREA)

Abstract

The application proposes that a kind of voice wakes up word and defines method and system, is related to technical field of voice interaction, which comprises be trained by voice training model to customized wake-up word according to word information is waken up;Voice library file is returned to according to training result, the voice library file supports user to use customized wake-up word wake-up device for being matched with phonetic order;The voice library file includes audio file corresponding with the wake-up word information, or including voice match algorithm corresponding with the wake-up word information and voice match parameter.By voice training model, the customized wake-up word of training forms voice library file, realizes that voice system wake-up word user can be customized.

Description

A kind of voice wakes up word and defines method and system
Technical field
The present invention relates to technical field of voice interaction, and in particular to a kind of voice wakes up word and defines method and system.
Background technique
Smart home has become a kind of life style of young man, and voice wake-up is all used in online voice wake-up system Function, voice wake-up are a kind of forms of speech recognition technology, are not directly contacted with hardware device, can be by equipment by voice Operation is waken up, voice wakes up the word that word is typically all fixed noun such as " hello, robot " etc, because this is called out Word of waking up needs local identification, therefore wake-up word cannot be replaced arbitrarily.
Summary of the invention
The present invention, which provides a kind of voice and wakes up word, defines method and system, realizes that the customized voice system of user wakes up word.
In order to achieve the above-mentioned object of the invention, the technical solution adopted by the present invention is as follows:
It wakes up word in a first aspect, the present invention provides a kind of voice and defines method, comprising:
Customized wake-up word is trained by voice training model according to word information is waken up;
Voice library file is returned to according to training result, the voice library file with phonetic order for being matched to support User uses customized wake-up word wake-up device;
The voice library file includes audio file corresponding with the wake-up word information, or including with the wake-up The corresponding voice match algorithm of word information and voice match parameter.
Preferably, the method also includes:
The identity information of user is verified to determine whether user has and carry out customized wake-up word permission to wake-up word;
Determined whether to execute corresponding wake-up word training step according to verification result.
Preferably, according to the cycle of training for waking up word information determination and/or feedback wake-up word, the wake-up word information includes Close degree between the content for waking up word, the public use frequency for waking up word, the length for waking up word and the word for waking up word.
Preferably,
After cycle of training expires, customized wake-up word is prompted to be in available mode to the user for waking up word permission;With/ Or,
When wake-up device fails, word and/or customized wake-up are initially waken up to not having the user's prompt for waking up word permission Word.
Preferably, the method also includes:
It receives user's customized word that wakes up by voice input and instructs or receive what user was inputted by terminal device Customized wake-up word instruction.
Preferably, it when user inputs customized wake-up word instruction by voice, extracts and wakes up word Self-definition process In customized wake-up word audio-frequency information;Before cycle of training expires, by the wake up instruction of input and the audio-frequency information into Row similarity compares, when similarity is more than threshold value, wake-up device.
Preferably, the method also includes:
Interactive voice information of user during interactive voice is extracted according to the identity information of user,
The interactive voice information is trained by voice training model, is extracted in the interactive voice information The parameters,acoustic of least speech unit;
The voice library file is synthesized according to the parameters,acoustic splicing for waking up word information and least speech unit.
Preferably, after the identity information of verifying user, before being trained to customized wake-up word further include:
Speech recognition and semantic understanding are carried out to the wake-up word information.
Second aspect wakes up word the present invention also provides a kind of voice and defines system, comprising:
Voice wake-up module and voice training module are provided with voice training model in the voice training module, should Voice training module is used to be trained customized wake-up word by voice training model according to wake-up word information, and according to instruction Practice result and returns to voice library file;The voice wake-up module is used for voice library file and the phonetic order progress according to return It is equipped with and user is supported to use customized wake-up word wake-up device;The voice library file includes corresponding with the wake-up word information Audio file, or including voice match algorithm corresponding with the wake-up word information and voice match parameter.
Preferably, the system also includes identification modules;The voice wake-up module is set to local end equipment, institute Predicate sound training module is set to cloud platform;The local end equipment further includes voice acquisition module, noise processed module, language Sound transmission control module and the first voice transfer module;The cloud platform further includes the second voice transfer module, voice knowledge Other module and semantic understanding module;The voice transfer control module is according to the stream of the type adjustment voice signal of phonetic order To wake up local end equipment or voice signal be sent to cloud platform;The cloud platform passes through speech recognition module and language Adopted Understanding Module parses the voice signal, and returns to corresponding parsing result or voice according to the type of phonetic order Library file.
The present invention compared to the prior art, by being trained to customized wake-up word;It obtains to use customized call out The voice library file for word wake-up device of waking up;It has the following beneficial effects:
1, technical solution of the present invention forms voice library file by voice training model, the customized wake-up word of training, real Existing voice system wakes up word user can be customized.
2, the present invention can identify whether to be administrator's identity by Application on Voiceprint Recognition, and administrator can modify wake-up word.
3, the present invention can store multiple wake-up words, and user can pass through multiple wake-up word wake-up devices when in use.
4, voice acquisition module, noise processed module, voice transfer control module are in work always in the embodiment of the present invention Make state, speech recognition module, semantic understanding module and the voice training module of cloud server are in work shape after triggering State realizes the real-time identification of voice, and energy conservation.
5, the present invention can be inputted by voice or the customized wake-up word of terminal device input instructs, the selection of user Customized wake-up word is added or modified to mode appropriate.
Detailed description of the invention
Fig. 1 is that the voice of the embodiment of the present invention wakes up the flow chart that word defines method;
Fig. 2 is that the voice of the embodiment of the present invention wakes up the structural schematic diagram that word defines system.
Specific embodiment
To keep goal of the invention of the invention, technical scheme and beneficial effects more clear, with reference to the accompanying drawing to this The embodiment of invention is illustrated, it should be noted that in the absence of conflict, in the embodiment and embodiment in the application Feature can mutual any combination.
Embodiment one
The present embodiment, which is illustrated with reference to Fig. 1 a kind of voice and wakes up word, defines method, comprising:
S101, customized wake-up word is trained by voice training model according to wake-up word information;
S102, voice library file is returned to according to training result, the voice library file with phonetic order for being matched To support user to use customized wake-up word wake-up device;
The voice library file includes audio file corresponding with the wake-up word information, or including with the wake-up The corresponding voice match algorithm of word information and voice match parameter.
The embodiment of the present invention forms voice library file, realizes voice by voice training model, the customized wake-up word of training System wake-up word user can be customized.
Preferably, before the method further include:
The identity information of user is verified to determine whether user has and carry out customized wake-up word permission to wake-up word;
Determined whether to execute corresponding wake-up word training step according to verification result.
Specifically, be arranged by way of preparatory typing or default user identity information and corresponding permission, when with When family modification wakes up word, the identity information verifying of user is carried out, when user wake-up word customized with permission, execution is called out The step that word of waking up is trained;When user wake-up word customized without permission, the step for waking up word training is not executed.
Be arranged by way of preparatory typing in the embodiment of the present invention user identity information and corresponding permission, Ke Yili It being configured with voiceprint, the voiceprint of preparatory typing administrator, setting administrator has the customized wake-up word of permission, into When the identity information verifying of row user, user speech is inputted and is compared with administrator's voiceprint of preparatory typing, sound is worked as When line information matches, determine that user has the customized wake-up word of permission, when voiceprint mismatches, user does not have permission certainly Definition wakes up word.
The embodiment of the present invention identifies whether to be administrator's identity, administrator can modify wake-up word by Application on Voiceprint Recognition
Be arranged by way of default in the embodiment of the present invention user identity information and corresponding permission, can be set The user of one wake-up device is administrator, and setting administrator has the customized wake-up word of permission, carries out the identity information of user When verifying, user speech is inputted and is compared with administrator's voiceprint of preparatory typing, when voiceprint matching, determined User has the customized wake-up word of permission, and when voiceprint mismatches, user does not have the customized wake-up word of permission.
The mode that the mode receives customized wake-up word instruction includes: to receive that user is by voice input customized to be called out Word of waking up instructs or receives user and instructed by the customized wake-up word that terminal device inputs.The selection of user mode appropriate adds Add or modify customized wake-up word
Authority Verification information can store in the memory of intelligent terminal the machine in the embodiment of the present invention, in intelligent terminal sheet Authentication is carried out on machine determines that the user is match with the Authority Verification information subscriber identity information The no permission that there is modification to wake up word.The wake-up word pre-established can be equipped in the embodiment of the present invention in intelligent terminal the machine Sound training pattern, the intelligent terminal carry out voice training to the customized wake-up word according to the voice training model.
Authority Verification information can store on server beyond the clouds in the embodiment of the present invention, and smart machine takes to the cloud Business device sends the subscriber identity information;Authentication is carried out with by the subscriber identity information and the power by cloud server Limit verification information carries out matching and determines the permission whether user there is modification to wake up word.Voice training in the embodiment of the present invention Model can store on server beyond the clouds, and the cloud server is equipped with the wake-up word sound training pattern pre-established, The cloud server carries out voice instruction to the customized wake-up word by voice training model according to the wake-up word information Practice.
Embodiment two
The embodiment of the present invention illustrates that the voice before expiring cycle of training and after training at the expiration wakes up word and defines method flow:
According to the cycle of training for waking up word information determination and/or feedback wake-up word, the wake-up word information includes waking up word Content, wake up word the public use frequency, wake up word length and wake up word word between close degree.
Different customized wake-up words are different cycle of training, and the embodiment of the present invention can will return to user cycle of training, more Customized wake-up word comes into force after long-time, may remind the user that how long later the customized wake-up word can be used in this way.
Wherein, the public use frequency for waking up word indicates: using certain intelligent appliance or using the intelligent family of certain brand Electricity or using certain manufacturer intelligent appliance user using the customized wake-up word pre-seted in the embodiment of the present invention quantity, Described more using the customized quantity for waking up word, corresponding cycle of training is shorter, uses the customized wake-up word Quantity is fewer, and corresponding cycle of training is longer.For example, the intelligent appliance of nine positive brands, user A modify customized wake-up word 1 and are " small sun, small sun ", user B modify customized wake-ups word 2 be " small nine, small nine ", since numerous users use " small positive, small sun ", Therefore " the small sun, small sun " cycle of training of customized wake-up word 1 than customized wake-up word 2 " small nine, small nine " cycle of training it is short.
Close degree indicates between waking up the word of word: the close degree of the customized pronunciation for waking up each word of word or sound, Difference of pronouncing is smaller, and corresponding cycle of training is longer, and pronunciation difference is bigger, and corresponding cycle of training is smaller, for example, nine positive brands Intelligent appliance, it is " persimmon, persimmon " that user C, which modifies customized wake-ups word 3, each word of customized wake-up word 1 " small positive, small sun " Between pronounce and be weak in pronunciation greatly between difference word more each than customized wake-ups word 3 " persimmon, persimmon ", therefore customized wake-up word 1 is " small Positive, small sun " cycle of training is shorter cycle of training than customized wake-up word 3 " persimmon, persimmon ".
Preferably, it after expiring cycle of training, can be used to having the user for waking up word permission that customized wake-up word is prompted to be in State;And/or
When wake-up device fails, word and/or customized wake-up are initially waken up to not having the user's prompt for waking up word permission Word.
The embodiment of the present invention can store multiple wake-up words, and user can be waken up by multiple wake-up words set when in use It is standby.
It instructs to solve to receive customized wake-up word to waking up the wake-up word connection problem between expiring word cycle of training, The embodiment of the present invention may include:
When user inputs customized wake-up word instruction by voice, extracts and wake up making by oneself in word Self-definition process Justice wakes up the audio-frequency information of word;Before cycle of training expires, the wake up instruction of input and the audio-frequency information are subjected to similarity It compares, when similarity is more than threshold value, wake-up device.
Before cycle of training expires, after the customized customized wake-up word of user for waking up word of permission, it can be used certainly Definition wakes up word wake-up device, solves customized wake-up word and asks to the linking of the wake-up word between expiring word cycle of training is waken up Topic, other users by initially wake up word still can wake-up device, distinguish whether be have permission it is customized wake up word user when, It is compared using audio-frequency information, it is high with the customized audio-frequency information similarity for waking up word if it is administrator, if it is other User is low with the customized wake-up audio-frequency information similarity of word.In the embodiment of the present invention with it is customized wake up word audio-frequency information Similarity judged using threshold value, according to judging result, it is determined whether wake-up device.
The method further include:
Interactive voice information of user during interactive voice is extracted according to the identity information of user,
The interactive voice information is trained by voice training model, is extracted in the interactive voice information The parameters,acoustic of least speech unit;
The voice library file is synthesized according to the parameters,acoustic splicing for waking up word information and least speech unit.
In the embodiment of the present invention during daily interactive voice, according to the interactive voice information of extract management person, benefit Be trained with voice interactive information daily, advantageously reduce cycle of training, accelerate user using it is customized wake up word when Between.
It can use synthetic method in the embodiment of the present invention and generate voice library file by phonetic rules.Store the smallest language The parameters,acoustic of sound unit, and form word by phoneme group syllabication, by syllable, be composed of words sentence and control tone, weight The various rules of the rhythms such as sound.After providing voice data to be synthesized, automatically converted voice data to using rule continuous Speech sound waves.It is Pitch synchronous overlap add technology for waveform concatenation and prosodic control, more representational algorithm (PSOLA), this method was not only able to maintain the main segment5al feature to be pronounced, but can be adjusted flexibly in splicing its fundamental frequency, duration and The super-segmental features such as intensity.Its core concept is directly to splice to the voice of storage with PSOLA algorithm, thus whole Synthesize complete voice.It is different from traditional concept and only closes the waveform compilation that different voice units carries out simple concatenation At ruled synthesis first has in a large amount of sound banks, selects most suitable voice unit to be used to splice, and during selecting sound The technology for often using Various Complex will use such as PSOLA algorithm finally in splicing, and the rhythm that voice is synthesized to it is special Sign is modified, so that the voice of synthesis be enable to reach very high sound quality.
Embodiment three
It wakes up word as shown in Fig. 2, the embodiment of the present invention provides a kind of voice and defines system, comprising:
Voice wake-up module 111 and voice training module 221 are provided with voice instruction in the voice training module 221 Practice model, which is used to customized wake-up word is carried out by voice training model according to wake-up word information Training, and voice library file is returned to according to training result;The voice wake-up module is with 111 in the voice library file according to return It is matched with phonetic order to support user to use customized wake-up word wake-up device;The voice library file include with it is described The corresponding audio file of word information is waken up, or including voice match algorithm corresponding with the wake-up word information and voice With parameter.
The system also includes identification modules 222;The voice wake-up module 111 is set to local end equipment 11, The voice training module 221 is set to cloud server 22;It is described local end equipment 11 further include voice acquisition module 112, Noise processed module 113, voice transfer control module 114 and the first voice transfer module 115;The cloud server 22 is also Including the second voice transfer module 225, speech recognition module 223 and semantic understanding module 224;The voice transfer controls mould Block 114 is sent to according to the flow direction of the type adjustment voice signal of phonetic order with the local end equipment 11 of wake-up or by voice signal Cloud server 22;The cloud server 22 believes the voice by speech recognition module 223 and semantic understanding module 224 It number is parsed, and corresponding parsing result or voice library file is returned to according to the type of phonetic order.
The embodiment of the present invention verifying user identity information after, customized wake-up word is trained before include: Speech recognition and semantic understanding are carried out to the wake-up word information.
Voice acquisition module 112, noise processed module 113, voice transfer control 114 pieces of mould always in the embodiment of the present invention It is in running order, when voice transfer control module 114 is determined as the instruction of customized wake-up word according to the type of phonetic order, Voice signal is sent to cloud server 22 by the first voice transfer module 115, the voice of cloud server 22 is known at this time Other module 223, semantic understanding module 224 and voice training module 221 are in running order, start to execute wake-up word information Corresponding operation, when voice transfer control module 114 is determined as the instruction of non-custom wake-up word according to the type of phonetic order, Voice wake-up module 111 is in running order, wakes up local end equipment.
Example IV
Illustrate that the present invention realizes that voice wakes up the customized method of word in voice interactive system in conjunction with soil 2, comprising:
When system handles working condition, voice acquisition module 112, noise processed module 113, voice transfer control mould 114 are constantly in working condition, wait the input of user speech, when user wakes up word using modification, the first voice transfer mould Block 115, speech recognition module 223, semantic understanding module 223 are started to work, the final real-time identification for realizing voice;
In speech recognition process when user wakes up word using preset order modification, for example " I will modify and call out for use Awake word ", the meeting of cloud server 22 identifies the identity of the user by identification module 222, identifies whether the user manages Reason person, if having permission modification and wake up word;Then allow to modify if it is manager to wake up word, feeds back to use if not manager Family can not modify wake-up word.Meeting starts voice the wake-up word last time of the offer of user to cloud server 22 after being identified by Training, it is different that difference wakes up the word training time, therefore using that can return to user, how long afterwards the wake-up word comes into force, this How long sample can be used later if may remind the user that;It can be in the following way when the typing of manager's identity: using for the first time When equipment can prompt user's typing administrator information, according to prompt typing voice messaging, sound of the equipment the user after typing Line information is stored in cloud server 22.
After the completion of voice training, voice document library is returned to local end equipment 11, local end equipment by cloud server 22 After 11 receive voice document library, store into memory;
The memory can store multiple wake-up words simultaneously, and user can be waken up by multiple wake-up words set when in use It is standby.Addition and overlay strategy: whether original wake-up word is covered by phonetic order prompt, selects addition customized by user It wakes up word or covers original wake-up word, it is general to support to wake up word within storage 5.
A button can be arranged in the embodiment of the present invention in equipment, can be triggered by the button, restore voice and wake up Word function.After equipment receives the instruction, restore original voice data library file from storage, at the same user storage from Definition wakes up word and deletes, and realizes that voice wakes up the recovery function of word.
It checks that voice wakes up word by equipment, and is modified by copy editor and wake up word.When modification or addition wake up word, Equipment sends the text information of modification to cloud server, verifies identity by cloud server, authentication is by then allowing Modification wakes up word, and by the way that the wake-up word is passed to during training pattern is trained, cloud server will estimate training time and anti- Feed user.
Although disclosed embodiment is as above, its content is only to facilitate understand technical side of the invention Case and the embodiment used, are not intended to limit the present invention.Any those skilled in the art to which this invention pertains, not Under the premise of being detached from disclosed core technology scheme, any modification and change can be made in form and details in implementation Change, but protection scope defined by the present invention, the range that the appended claims that must still be subject to limits.

Claims (10)

1. a kind of voice wakes up word and defines method, it is characterised in that: include:
Customized wake-up word is trained by voice training model according to word information is waken up;
Voice library file is returned to according to training result, the voice library file with phonetic order for being matched to support user Use customized wake-up word wake-up device;
The voice library file includes audio file corresponding with the wake-up word information, or including believing with the wake-up word Cease corresponding voice match algorithm and voice match parameter.
2. the method as described in claim 1, which is characterized in that the method also includes:
The identity information of user is verified to determine whether user has and carry out customized wake-up word permission to wake-up word;
Determined whether to execute corresponding wake-up word training step according to verification result.
3. method according to claim 2, which is characterized in that according to the instruction for waking up word information determination and/or feedback wake-up word Practice the period, the word information that wakes up includes the content for waking up word, the public use frequency for waking up word, the length for waking up word and calls out Close degree between the word of awake word.
4. according to the method described in claim 3, it is characterized in that,
After cycle of training expires, customized wake-up word is prompted to be in available mode to the user for waking up word permission;And/or
When wake-up device fails, word and/or customized wake-up word are initially waken up to not having the user's prompt for waking up word permission.
5. such as method of any of claims 1-4, which is characterized in that the method also includes:
It receives user's customized wake-up word instruction by voice input or receives user and made by oneself by what terminal device inputted Justice wakes up word instruction.
6. method as claimed in claim 5, which is characterized in that instructed when user inputs the customized wake-up word by voice When, extract the audio-frequency information for waking up the customized wake-up word in word Self-definition process;Before cycle of training expires, by calling out for input Instruction of waking up carries out similarity with the audio-frequency information and compares, when similarity is more than threshold value, wake-up device.
7. method according to claim 2, which is characterized in that the method also includes:
Interactive voice information of user during interactive voice is extracted according to the identity information of user,
The interactive voice information is trained by voice training model, extracts the minimum in the interactive voice information The parameters,acoustic of phonetic unit;
The voice library file is synthesized according to the parameters,acoustic splicing for waking up word information and least speech unit.
8. method according to claim 2, which is characterized in that after the identity information of verifying user, to customized wake-up Before word is trained further include:
Speech recognition and semantic understanding are carried out to the wake-up word information.
9. a kind of voice wakes up word and defines system characterized by comprising
Voice wake-up module and voice training module are provided with voice training model in the voice training module, the voice Training module is used to be trained customized wake-up word by voice training model according to wake-up word information, and is tied according to training Fruit returns to voice library file;The voice wake-up module be used to be matched according to the voice library file of return with phonetic order with User is supported to use customized wake-up word wake-up device;The voice library file includes sound corresponding with the wake-up word information Frequency file, or including voice match algorithm corresponding with the wake-up word information and voice match parameter.
10. system as claimed in claim 9, which is characterized in that the system also includes identification modules;The voice is called out Awake module is set to local end equipment, and the voice training module is set to cloud platform;The local end equipment further includes language Sound acquisition module, noise processed module, voice transfer control module and the first voice transfer module;The cloud platform also wraps Include the second voice transfer module, speech recognition module and semantic understanding module;The voice transfer control module is according to voice The flow direction of the type adjustment voice signal of instruction, to wake up local end equipment or voice signal is sent to cloud platform;The cloud End platform parses the voice signal by speech recognition module and semantic understanding module, and according to the class of phonetic order Type returns to corresponding parsing result or voice library file.
CN201710889765.9A 2017-09-27 2017-09-27 Voice wake-up word definition method and system Active CN109584860B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710889765.9A CN109584860B (en) 2017-09-27 2017-09-27 Voice wake-up word definition method and system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710889765.9A CN109584860B (en) 2017-09-27 2017-09-27 Voice wake-up word definition method and system

Publications (2)

Publication Number Publication Date
CN109584860A true CN109584860A (en) 2019-04-05
CN109584860B CN109584860B (en) 2021-08-03

Family

ID=65912523

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710889765.9A Active CN109584860B (en) 2017-09-27 2017-09-27 Voice wake-up word definition method and system

Country Status (1)

Country Link
CN (1) CN109584860B (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110364147A (en) * 2019-08-29 2019-10-22 厦门市思芯微科技有限公司 A kind of wake-up training word acquisition system and method
CN110534107A (en) * 2019-09-11 2019-12-03 北京安云世纪科技有限公司 Sound control method, device, system and the electronic equipment of smart machine
CN110808030A (en) * 2019-11-22 2020-02-18 珠海格力电器股份有限公司 Voice awakening method, system, storage medium and electronic equipment
CN110827836A (en) * 2019-10-23 2020-02-21 珠海格力电器股份有限公司 Method and device for resetting awakening words, electronic equipment and storage medium
CN112009493A (en) * 2020-09-03 2020-12-01 三一专用汽车有限责任公司 Awakening method of vehicle-mounted control system, vehicle-mounted control system and vehicle
CN112153213A (en) * 2019-06-28 2020-12-29 青岛海信移动通信技术股份有限公司 Method and equipment for determining voice information
CN112164395A (en) * 2020-09-18 2021-01-01 北京百度网讯科技有限公司 Vehicle-mounted voice starting method and device, electronic equipment and storage medium
CN112201239A (en) * 2020-09-25 2021-01-08 海尔优家智能科技(北京)有限公司 Target device determination method and apparatus, storage medium, and electronic apparatus
CN113299275A (en) * 2021-05-21 2021-08-24 阿里巴巴新加坡控股有限公司 Method and system for realizing voice interaction, service end, client and intelligent sound box
CN113534780A (en) * 2021-06-21 2021-10-22 上汽通用五菱汽车股份有限公司 Remote control parking parameter and function definition method, automobile and readable storage medium
CN113590207A (en) * 2021-07-30 2021-11-02 思必驰科技股份有限公司 Method and device for improving awakening effect
CN113963695A (en) * 2021-10-13 2022-01-21 深圳市欧瑞博科技股份有限公司 Awakening method, awakening device, equipment and storage medium of intelligent equipment

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105206271A (en) * 2015-08-25 2015-12-30 北京宇音天下科技有限公司 Intelligent equipment voice wake-up method and system for realizing method
CN105895096A (en) * 2016-03-30 2016-08-24 乐视控股(北京)有限公司 Identity identification and voice interaction operating method and device
CN105913842A (en) * 2016-07-03 2016-08-31 朱小龙 Method for waking up mobile phone by custom voice
CN106157950A (en) * 2016-09-29 2016-11-23 合肥华凌股份有限公司 Speech control system and awakening method, Rouser and household electrical appliances, coprocessor
WO2017092189A1 (en) * 2015-11-30 2017-06-08 中兴通讯股份有限公司 Method realizing voice wake-up, device, terminal, and computer storage medium
CN106847283A (en) * 2017-02-28 2017-06-13 广东美的制冷设备有限公司 Intelligent electrical appliance control and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN103414560A (en) * 2013-07-05 2013-11-27 北京车音网科技有限公司 Starting method of application, device thereof, system thereof and application server
CN105575395A (en) * 2014-10-14 2016-05-11 中兴通讯股份有限公司 Voice wake-up method and apparatus, terminal, and processing method thereof
CN104575504A (en) * 2014-12-24 2015-04-29 上海师范大学 Method for personalized television voice wake-up by voiceprint and voice identification
CN105989841B (en) * 2015-02-17 2019-12-27 上海汽车集团股份有限公司 Vehicle-mounted voice control method and device
CN105654943A (en) * 2015-10-26 2016-06-08 乐视致新电子科技(天津)有限公司 Voice wakeup method, apparatus and system thereof

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105206271A (en) * 2015-08-25 2015-12-30 北京宇音天下科技有限公司 Intelligent equipment voice wake-up method and system for realizing method
WO2017092189A1 (en) * 2015-11-30 2017-06-08 中兴通讯股份有限公司 Method realizing voice wake-up, device, terminal, and computer storage medium
CN105895096A (en) * 2016-03-30 2016-08-24 乐视控股(北京)有限公司 Identity identification and voice interaction operating method and device
CN105913842A (en) * 2016-07-03 2016-08-31 朱小龙 Method for waking up mobile phone by custom voice
CN106157950A (en) * 2016-09-29 2016-11-23 合肥华凌股份有限公司 Speech control system and awakening method, Rouser and household electrical appliances, coprocessor
CN106847283A (en) * 2017-02-28 2017-06-13 广东美的制冷设备有限公司 Intelligent electrical appliance control and device

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112153213A (en) * 2019-06-28 2020-12-29 青岛海信移动通信技术股份有限公司 Method and equipment for determining voice information
CN110364147B (en) * 2019-08-29 2021-08-20 厦门市思芯微科技有限公司 Awakening training word acquisition system and method
CN110364147A (en) * 2019-08-29 2019-10-22 厦门市思芯微科技有限公司 A kind of wake-up training word acquisition system and method
CN110534107A (en) * 2019-09-11 2019-12-03 北京安云世纪科技有限公司 Sound control method, device, system and the electronic equipment of smart machine
CN110827836A (en) * 2019-10-23 2020-02-21 珠海格力电器股份有限公司 Method and device for resetting awakening words, electronic equipment and storage medium
CN110827836B (en) * 2019-10-23 2022-05-03 珠海格力电器股份有限公司 Method and device for resetting awakening words, electronic equipment and storage medium
CN110808030A (en) * 2019-11-22 2020-02-18 珠海格力电器股份有限公司 Voice awakening method, system, storage medium and electronic equipment
CN112009493A (en) * 2020-09-03 2020-12-01 三一专用汽车有限责任公司 Awakening method of vehicle-mounted control system, vehicle-mounted control system and vehicle
CN112164395A (en) * 2020-09-18 2021-01-01 北京百度网讯科技有限公司 Vehicle-mounted voice starting method and device, electronic equipment and storage medium
CN112201239A (en) * 2020-09-25 2021-01-08 海尔优家智能科技(北京)有限公司 Target device determination method and apparatus, storage medium, and electronic apparatus
CN112201239B (en) * 2020-09-25 2024-05-24 海尔优家智能科技(北京)有限公司 Determination method and device of target equipment, storage medium and electronic device
CN113299275A (en) * 2021-05-21 2021-08-24 阿里巴巴新加坡控股有限公司 Method and system for realizing voice interaction, service end, client and intelligent sound box
CN113534780A (en) * 2021-06-21 2021-10-22 上汽通用五菱汽车股份有限公司 Remote control parking parameter and function definition method, automobile and readable storage medium
CN113590207A (en) * 2021-07-30 2021-11-02 思必驰科技股份有限公司 Method and device for improving awakening effect
CN113963695A (en) * 2021-10-13 2022-01-21 深圳市欧瑞博科技股份有限公司 Awakening method, awakening device, equipment and storage medium of intelligent equipment

Also Published As

Publication number Publication date
CN109584860B (en) 2021-08-03

Similar Documents

Publication Publication Date Title
CN109584860A (en) A kind of voice wakes up word and defines method and system
CN108766441B (en) Voice control method and device based on offline voiceprint recognition and voice recognition
CN106486121B (en) Voice optimization method and device applied to intelligent robot
CN106653021A (en) Voice wake-up control method and device and terminal
WO2020253509A1 (en) Situation- and emotion-oriented chinese speech synthesis method, device, and storage medium
CN103543979A (en) Voice outputting method, voice interaction method and electronic device
CN106652995A (en) Voice broadcasting method and system for text
CN105304080A (en) Speech synthesis device and speech synthesis method
JP2020034895A (en) Responding method and device
CN106504742B (en) Synthesize transmission method, cloud server and the terminal device of voice
JP2018146715A (en) Voice interactive device, processing method of the same and program
JP2000122687A (en) Language model updating method
US10593319B1 (en) Parallelization of instruction steps
CN110473556A (en) Audio recognition method, device and mobile terminal
CN111128175B (en) Spoken language dialogue management method and system
CN209328511U (en) A kind of portable AI interactive voice control system
CN114694651A (en) Intelligent terminal control method and device, electronic equipment and storage medium
CN106710587A (en) Speech recognition data pre-processing method
CN114283820A (en) Multi-character voice interaction method, electronic equipment and storage medium
CN105023574B (en) A kind of method and system for realizing synthesis speech enhan-cement
JP6299563B2 (en) Response generation method, response generation apparatus, and response generation program
CN104809923A (en) Self-complied and self-guided method and system for generating intelligent voice communication
CN112242134A (en) Speech synthesis method and device
CN112463108B (en) Voice interaction processing method and device, electronic equipment and storage medium
CN102104657A (en) Alarm clock reminding method, device and mobile terminal

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant