CN107180631A - Voice interaction method and device - Google Patents
Voice interaction method and device Download PDFInfo
- Publication number
- CN107180631A CN107180631A CN201710372523.2A CN201710372523A CN107180631A CN 107180631 A CN107180631 A CN 107180631A CN 201710372523 A CN201710372523 A CN 201710372523A CN 107180631 A CN107180631 A CN 107180631A
- Authority
- CN
- China
- Prior art keywords
- voice
- instruction
- data
- progress icon
- collecting
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
- 238000000034 method Methods 0.000 title claims abstract description 35
- 230000003993 interaction Effects 0.000 title claims abstract description 10
- 230000005059 dormancy Effects 0.000 claims description 46
- 230000002452 interceptive effect Effects 0.000 claims description 41
- 230000033001 locomotion Effects 0.000 claims description 32
- 230000006870 function Effects 0.000 claims description 22
- 230000002618 waking effect Effects 0.000 claims 1
- 230000000875 corresponding effect Effects 0.000 description 55
- 238000010586 diagram Methods 0.000 description 20
- 230000000284 resting effect Effects 0.000 description 9
- 230000008569 process Effects 0.000 description 8
- 238000004378 air conditioning Methods 0.000 description 7
- 238000004590 computer program Methods 0.000 description 7
- 238000005516 engineering process Methods 0.000 description 7
- 238000012545 processing Methods 0.000 description 6
- 238000003860 storage Methods 0.000 description 6
- 235000009967 Erodium cicutarium Nutrition 0.000 description 5
- 238000004458 analytical method Methods 0.000 description 5
- 230000005611 electricity Effects 0.000 description 5
- 230000008859 change Effects 0.000 description 3
- 241001269238 Data Species 0.000 description 2
- 241000238558 Eucarida Species 0.000 description 2
- 230000005540 biological transmission Effects 0.000 description 2
- 230000004087 circulation Effects 0.000 description 2
- 230000007613 environmental effect Effects 0.000 description 2
- 239000011521 glass Substances 0.000 description 2
- 238000012905 input function Methods 0.000 description 2
- 238000009434 installation Methods 0.000 description 2
- 239000000203 mixture Substances 0.000 description 2
- 238000004321 preservation Methods 0.000 description 2
- 230000004044 response Effects 0.000 description 2
- 238000012163 sequencing technique Methods 0.000 description 2
- 235000008694 Humulus lupulus Nutrition 0.000 description 1
- 230000006399 behavior Effects 0.000 description 1
- 238000004364 calculation method Methods 0.000 description 1
- 238000005520 cutting process Methods 0.000 description 1
- 230000007547 defect Effects 0.000 description 1
- 230000009191 jumping Effects 0.000 description 1
- 238000004519 manufacturing process Methods 0.000 description 1
- 238000012544 monitoring process Methods 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000007430 reference method Methods 0.000 description 1
- 230000035807 sensation Effects 0.000 description 1
- 230000001960 triggered effect Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/26—Speech to text systems
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- User Interface Of Digital Computer (AREA)
Abstract
The invention discloses a voice interaction method, which comprises the following steps: when a first control instruction is received, enabling a voice acquisition function, outputting a first voice prompt, and starting a voice acquisition progress icon to enable the voice acquisition progress icon to move along a set direction; when the voice acquisition progress icon moves along a set direction, acquiring voice signals in the current environment; when a voice signal is acquired before the voice acquisition progress icon moves to a limit position along a set direction, analyzing the voice signal to obtain voice data; matching the voice data with instruction data in a local instruction library; and when the voice data is successfully matched with the instruction data in the local instruction library, outputting a second voice prompt corresponding to the instruction data, and executing the voice instruction corresponding to the voice data. The invention also discloses a voice interaction device.
Description
Technical field
The present invention relates to interactive voice technology, and in particular to a kind of voice interactive method and device.
Background technology
Voice control technology is the advanced subject of world today's smart machine control field, it is therefore intended that allow equipment according to people
Password accurately perform predetermined behavior.Main information technology (IT, Information Technology) in the world at present
Company releases the speech recognition engine of oneself, SIRI, the Google of Google (Google) company of such as Apple Inc. one after another
The Now and Cortana of Microsoft.Domestic IT companies are also proposed the speech-recognition services of oneself, and such as Baidu's voice is helped
Hand etc..The release of these voice platforms presents the magical magic power of speech ciphering equipment control, and equipment starts if can understanding people
Language, and acted according to our wish.In the prior art, the mode of speech control system acquisition phonetic order generally includes following
Two kinds:
1) traditional voice interactive mode, such as SIRI, the working method of the voice assistant such as Cortana, user click on figure manually
The button of correspondence phonetic entry on shape interface, triggering system enters order reception pattern, and at this moment user begins to send out phonetic order.
If the system detects that phonetic entry, then system the phonetic order of phonetic entry is identified, the operation such as semantic analysis, and root
Corresponding actions are performed according to recognition result.If system is not detected by phonetic entry within the specified period, system thinks language
Sound recognition failures, this interactive voice terminates.User needs to click on the button of correspondence phonetic entry on graphical interfaces again, starts
Interactive voice next time.
Traditional voice interactive mode, is mostly near field voice interaction, quality of speech signal is of a relatively high, and has touch-screen
Auxiliary, so the processing of voice signal is relatively easy, the accuracy rate of identification is also higher.But, what traditional voice interaction was present lacks
It is that user sends phonetic order each time to fall into, and is required for the button of correspondence phonetic entry on triggering graphical interfaces manually.Can not be real
Existing complete Voice command.And the single phonetic entry time is longer, causes system response time long, recognition accuracy is by environment shadow
Ring big.
2) man machine language's interactive mode, the object of interactive voice is probably robot or smart machine.It is remote due to being related to
Field interactive voice, therefore environment is more complicated, and without screen interaction.Interactive voice object must continuously monitor voice letter
Number, according to acoustic energy, the change of frequency judges the beginning and end of each interactive voice.
Man machine language's interactive mode, closer to the talk between the mankind, therefore give people it is a kind of naturally, smooth sensation,
Even think that oneself talks with a true man.But the defect that man machine language's interactive mode is present is:When being interacted in far field, voice
Interactive quality is protected from environmental, and greatly environmental noise, accent, volume all directly affects the accuracy rate of speech recognition, application
Occasion is very limited.And system is after phonetic entry is received, link is confirmed without voice, user does not know that system identification goes out
Instruction whether be exactly instruction that user sends.
The content of the invention
To solve existing technical problem, the embodiment of the present invention is expected to provide a kind of voice interactive method and device,
The accuracy of speech recognition can be improved.
What the technical scheme of the embodiment of the present invention was realized in:
One side according to embodiments of the present invention includes there is provided a kind of voice interactive method, methods described:
When receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start voice collecting
Progress icon, makes the voice collecting progress icon be moved along direction initialization;
When the voice collecting progress icon is moved along direction initialization, the voice signal in current environment is adopted
Collection;
The voice collecting progress icon moved to along direction initialization place restrictions on collect voice signal before position when, parsing
The voice signal, obtains speech data;
The speech data is matched with the director data in local instruction database;
The director data for determining in the speech data and local instruction database is when the match is successful, output and the director data
Corresponding second voice message;
Perform the corresponding phonetic order of the speech data.
In such scheme, the voice collecting progress icon at least includes setting in tempo instructions frame, the tempo instructions frame
It is equipped with the progress indicator strip of uniform motion;
The progress indicator strip reaches the tempo instructions frame from one end of the tempo instructions frame to another end motion
The other end when stop motion.
In such scheme, the voice enabled acquisition function, including:
When first control instruction received is the open command of speech recognition mode, show that the voice collecting enters
Icon is spent, and is started counting up;
Or, when first control instruction received is the wake-up instruction in the local instruction database, display is described
Voice collecting progress icon, and start counting up.
In such scheme, methods described also includes:
The voice signal is not collected before the voice collecting progress icon moves to along direction initialization and places restrictions on position
When, first voice message is exported again.
In such scheme, the second voice message corresponding with the director data is exported, including:
When determining that the speech data is matched with the dormancy instruction in local instruction database, export corresponding with the dormancy instruction
Dormancy prompt tone;
Or, when determining that the speech data is matched with the work order in local instruction database, output refers to the work
Make corresponding work prompt tone.
It is again defeated when determining that the speech data is mismatched with the director data in local instruction database in such scheme
Go out first voice message.
Another aspect according to embodiments of the present invention includes there is provided a kind of voice interaction device, described device:Output is single
Member, collecting unit, resolution unit, judging unit and execution unit;
Wherein, the output unit, for receiving during the first control instruction, voice enabled acquisition function, output first
Voice message, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to really
The director data for determining in speech data and local instruction database exports the second voice corresponding with the director data when the match is successful
Prompting;
The collecting unit, for when the voice collecting progress icon is moved along direction initialization, in current environment
Voice signal be acquired;
The resolution unit, for being gathered before the voice collecting progress icon moves to along direction initialization and places restrictions on position
During to voice signal, the voice signal is parsed, speech data is obtained;
The judging unit, for the speech data to be matched with the director data in local instruction database;
The execution unit, when the match is successful for determining the director data in the speech data and local instruction database,
Perform the corresponding phonetic order of the speech data.
In such scheme, the voice collecting progress icon at least includes setting in tempo instructions frame, the tempo instructions frame
It is equipped with the progress indicator strip of uniform motion;The progress indicator strip from one end of the tempo instructions frame to another end motion, and
Stop motion when reaching the other end of the tempo instructions frame.
In such scheme, described device also includes:
Display unit, when first control instruction for receiving is the open command of speech recognition mode, display
The voice collecting progress icon, and start counting up;Or, first control instruction received is the local instruction database
In wake-up instruction when, show the voice collecting progress icon, and start counting up.
In such scheme, the output unit is additionally operable to move to along direction initialization in the voice collecting progress icon
Place restrictions on when not collecting the voice signal before position, first voice message is exported again.
In such scheme, the output unit, specifically for determining the speech data and the dormancy in local instruction database
During instructions match, dormancy prompt tone corresponding with the dormancy instruction is exported;Or, determine the speech data and local instruction
When work order in storehouse is matched, work prompt tone corresponding with the work order is exported.
In such scheme, the output unit is additionally operable to determine the speech data and the instruction number in local instruction database
During according to mismatching, first voice message is exported again.
A kind of voice interactive method and device provided in an embodiment of the present invention, by before each phonetic entry all to user
Voice message is sent, to remind user to start phonetic entry, so, it is possible to make user extremely accurate send phonetic order, from
And improve the recognition accuracy of voice signal;In addition, the acquisition time of voice signal is limited by voice collecting progress icon,
Identifying system, which can be shortened, is used for the time of recognition of speech signals, so as to improve the speed of response;Furthermore, by default voice
The voice signal that instruction database is sent to user is inquired about, the voice signal that need not be not only received by cloud service interface differential technique
Corresponding phonetic order carries out semantic analysis, but also supports processed offline, and user only relies on can be achieved really by means of voice message
Voice command in meaning, completely without manually operated.
Brief description of the drawings
Fig. 1 is a kind of method flow schematic diagram of interactive voice of the embodiment of the present invention;
Fig. 2 is the implementation process schematic diagram of interactive voice of the embodiment of the present invention;
Fig. 3 is the view of voice APP installations on mobile terminals;
Fig. 4 is the view that voice APP is arranged on wearable device;
Fig. 5 is Fig. 3 and Fig. 4 workflow schematic diagram;
Fig. 6 is a kind of device composition schematic diagram of interactive voice of the embodiment of the present invention.
Embodiment
The embodiment to the present invention is described in detail below in conjunction with the accompanying drawings.It should be appreciated that this place is retouched
The embodiment stated is merely to illustrate and explain the present invention, and is not intended to limit the invention.
Fig. 1 is a kind of schematic flow sheet of voice interactive method of the embodiment of the present invention;As shown in figure 1, methods described includes:
Step 101, when receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start
Voice collecting progress icon, makes the voice collecting progress icon be moved along direction initialization;
The embodiment of the present invention is mainly used in voice interaction device, and described device can specifically be provided with voice APP
Electronic equipment, the function that the voice interactive method is realized can by the processor caller code in electronic equipment come
Realize, certain program code can be stored in computer-readable storage medium, it is seen then that the electronic equipment at least includes processor and deposited
Storage media.
The electronic equipment includes:Mobile terminal, Wearable terminal, fixed terminal, car-mounted terminal, bank transaction are whole
The delivery terminal at end, supermarket's transaction terminal and express delivery mailbag.Wherein mobile terminal can at least include mobile phone, it is tablet personal computer, individual
Personal digital assistant (PDA, Personal Digital Assistant), navigator, game machine, intelligent toy etc., Wearable are whole
End can at least include intelligent watch, intelligent glasses, intelligent running shoes etc., and fixed terminal can at least include desktop computer, table
Intelligence in face computer, integral computer, television set, projecting apparatus, sound equipment etc., above intelligent toy, intelligent watch refers to equipment
Include processor and storage medium, so as to automatically or according to the setting of operator such as user perform some sequencing
Instruction.
In the embodiment of the present invention, first control instruction that the electronic equipment is received is opening for speech recognition mode
When opening instruction, the voice collecting progress icon is shown, and start counting up;Or, first control instruction received is
When wake-up in the local instruction database is instructed, the voice collecting progress icon is shown, and is started counting up, is adopted with voice enabled
Collect function, export the first voice message.First voice message is used to inform that custom system immediately enters speech recognition state,
User is reminded to start phonetic entry.And start voice collecting progress icon, make the voice collecting progress icon along direction initialization
Motion.Here, the voice collecting progress icon can be voice progress bar, progress circle or progress percentage.In addition, described
One voice message can be the prompt tone defined by user oneself, for example:The voice message such as " please say " or " please indicate ".The electricity
Sub- equipment immediately enters speech recognition state when first voice message output is finished.
Step 102, when the voice collecting progress icon is moved along direction initialization, to the voice signal in current environment
It is acquired;
In the embodiment of the present invention, the electronic equipment is after speech recognition state is entered, the voice collecting progress icon
Moved with uniform velocity along direction initialization to position is placed restrictions on, and timing of starting from scratch, now, the electronic equipment is in current environment
Voice signal be acquired.And during voice signal is gathered, voice collecting progress chart target described in real-time update
Progress, i.e., described voice collecting progress chart target progress gradually increases.For example, the voice collecting progress icon is voice progress
During bar, the voice collecting progress icon at least includes being provided with uniform motion in tempo instructions frame, the tempo instructions frame
Progress indicator strip;The progress indicator strip is referred to from one end of the tempo instructions frame to another end motion, and the arrival progress
Stop motion when showing the other end of frame.Here, the time that the voice collecting progress icon moves to the other end from one end is 1-
15 seconds.In order to prevent the voice collecting progress chart target movement velocity too fast, the electronic equipment does not collect user's input
Voice signal, or, the voice collecting progress chart target movement velocity is too slow, influences the acquisition time of the electronic equipment
And recognition accuracy, in the embodiment of the present invention, it may be preferable that be set to voice collecting progress chart target motion duration 5 seconds.
Here, it is within described 5 seconds the upper limit of the electronic equipment single acquisition Speech time, if the electronic equipment was completed in 3 seconds
Voice collecting, the voice collecting progress icon also can stop motion immediately, then the time of this interactive voice is exactly 3 seconds.
User is hearing that first voice message or display interface in the voice APP see that the voice collecting enters
When degree icon starts timing, phonetic entry is proceeded by, and position is placed restrictions on ensuring that the voice collecting progress icon is reached
Before, complete the phonetic entry.Here, it is the voice collecting progress icon that the voice collecting progress chart target, which places restrictions on position,
Maximum progress threshold value, that is to say, that the electronic equipment single allows the maximum time value of phonetic entry, i.e. time-out time.Such as
This, the time of the electronic equipment single acquisition voice signal is limited according to voice collecting progress icon, can shorten the electricity
Sub- equipment is to the recognition time of voice signal, while improving the signal identification efficiency of the electronic equipment.
Step 103, voice letter is collected before the voice collecting progress icon moves to along direction initialization and places restrictions on position
Number when, parse the voice signal, obtain speech data;
In the embodiment of the present invention, the electronic equipment determines to move to along direction initialization in the voice collecting progress icon
Place restrictions on before position, when collecting voice signal, show this phonetic entry success, then the voice signal is divided into length certain
Speech frame, is then asked for each frame speech data the average pitch cycle, obtains voice number corresponding with the voice signal
According to.
If on the contrary, the electronic equipment is moved in the voice collecting progress icon along direction initialization places restrictions on position
Before, when not collecting the voice signal, show that phonetic entry fails, then terminate this interactive voice, the first language is exported again
Sound is pointed out.
Step 104, the speech data is matched with the director data in local instruction database;
In the embodiment of the present invention, the electronic equipment is then searched after the speech data is obtained in preset instructions storehouse
Director data corresponding with the speech data, and obtain lookup result.Here, the preset instructions storehouse is according to certainly by user
The instruction database of own requirement definition.Specifically, user by the electronic equipment to APP pairs of the voice installed on the electronic equipment
The voice server answered sends instruction database request to create, and the voice server is responded after the request to create, controls the electricity
The establishment interface in the display interface idsplay order storehouse of sub- equipment, user is at the establishment interface of the instruction database according to the demand of oneself
Create the instruction database.For example, user can carry out phonetic entry by the speech voice input function in the establishment interface, with complete
It into the establishment of the instruction database, can also directly be inputted at the establishment interface by word, complete the establishment of the instruction database.
Wherein, the instruction database of establishment includes father's instruction and sub-instructions.For example, father's instruction is:Music, video, intelligent family
Occupy, then the sub-instructions of music can be:Next bent, upper one bent, Chinese song, English song;The sub-instructions of video can be:Tengxun regards
Frequently, QQ videos, youku.com's video;The sub-instructions of smart home can be:Curtain, bedroom air-conditioning, electric light etc..In this way, electronic equipment
The corresponding speech data of the voice signal of collection is identified by user-defined instruction database, can effectively ensure voice
The accuracy rate of identification.
Further, since the default instruction database is limited instruction set, alone word identification technology is used,
Therefore, the embodiment of the present invention, without carrying out semantic analysis by voice cloud service, can be provided the user in speech recognition process
Offline service.
In embodiments of the present invention, user can also be configured in preset instructions storehouse to signal acquisition periods.Specifically
Ground, user is asked by the voice APP settings for sending signal acquisition periods to the voice server, the voice service
Device is received after the setting request of the signal acquisition periods, controls the display interface of the voice APP to show that the signal is adopted
The setting interface in collection cycle, user is according to oneself demand at the setting interface of the signal acquisition periods to signal acquisition week
Phase is configured, and after the setup, is sent to the voice server and set successfully request, and the voice server exists
Receive after the successful request of the setting, preserve the setting of the signal acquisition periods, and in interactive voice next time,
Electronic equipment is acquired according to the signal acquisition periods of preservation to the voice signal of user.
In embodiments of the present invention, user can also be configured to the voice collecting progress chart target type, specifically
Ground, user sends voice collecting progress chart target type to the voice server by the voice APP and sets request, described
Voice server receives the voice collecting progress chart target and set after request, controls the display interface of the voice APP to show
Show that voice collecting progress chart target sets interface, user sets interface to select oneself needs according to oneself demand in progress chart target
Voice collecting progress icon, and after the setup, sent to the voice server and successfully request, the voice be set
Server preserves the voice collecting progress chart target and set after the successful request of the setting is received, and next time
Interactive voice in, show preserve voice collecting progress icon.
Step 105, the director data for determining in the speech data and local instruction database is when the match is successful, output with it is described
Corresponding second voice message of director data.
In the embodiment of the present invention, work instruction data, dormancy instruction data and wake-up are included in the preset instructions storehouse
Director data, the work instruction data that the electronic equipment determines in the speech data and local instruction database is when the match is successful,
Show that the speech data is recognized successfully, then the electronic equipment exports the second voice corresponding with the work instruction data and carried
Show, for example, the work instruction data is:" music ", then second voice message is " music ", for reporting to user's input
Speech data recognize successfully;Or, when the electronic equipment determines that the speech data refers to the work in preset instructions storehouse
Data are made to mismatch, and during with the dormancy instruction Data Matching, then the electronic equipment is exported and the dormancy instruction data
Corresponding dormancy prompt tone.For example, the dormancy instruction data are:" rest ", then the dormancy prompt tone be:" I rests
, it is busy to be me ".Afterwards, the speech recognition mode is changed to park mode;Conversely, when the electronic equipment determines institute's predicate
When sound data are mismatched with all director datas in preset instructions storehouse, show this speech data recognition failures, terminate this
Secondary interactive voice, and first voice message is exported again, interactive voice is realized in the way of continuously circulating.Here, it is described
Work instruction data refers to that the voice signal that the electronic equipment is inputted according to active user performs the data of command adapted thereto group.Example
Such as, the instruction group included in the electronic equipment has " bedroom ", " amusement ", if the voice signal corresponding instruction group being currently received
When " bedroom ", then the electronic equipment performs the instruction group " bedroom ", and shows that the son in the instruction group " bedroom " refers to
Order, for example, sub-instructions are:Lamp, bedroom air-conditioning, curtain;If during the voice signal corresponding instruction group " amusement " being currently received,
The electronic equipment performs the instruction group " amusement ", and shows the sub-instructions in the instruction group " amusement ", for example, described
Sub-instructions are:Music, game.
Step 106, the corresponding phonetic order of the speech data is performed.
In the embodiment of the present invention, the electronic equipment the second voice message output finish after, immediately hop to it is described
The corresponding sub-instructions storehouse of speech data, and in the sub-instructions storehouse, continue to gather the voice signal that user sends.For example, institute
Stating speech data is:Parlor, then the electronic equipment is when successfully identifying " parlor ", playing alert tones " parlor ", and redirects
To parlor sub-instructions storehouse corresponding with parlor.For example, the parlor sub-instructions storehouse includes:Curtain, lamp, bedroom air-conditioning, then it is described
Electronic equipment continues to gather the voice signal that user sends in the parlor sub-instructions storehouse, for example, collecting user's transmission
The corresponding instruction of voice signal is " lamp ", then the electronic equipment performs the control operation to " lamp ".
By the voice collecting progress icon in the embodiment of the present invention, user can be helped to understand when oneself should send
Phonetic order, also, according to voice collecting progress chart target state change and prompt tone, user can be made to understand oneself input
Whether phonetic order is successfully identified, so that user has at fingertips whole speech control process.
Fig. 2 is the implementation process schematic diagram of interactive voice of the embodiment of the present invention;As shown in Figure 2:Including:
Step 201, instruction database is created;
Here, the instruction database is the instruction database defined by user oneself by way of phonetic entry or word input.
For example, user-defined instruction database includes:Voice message data, work instruction data, dormancy instruction data, wake-up director data
With voice collecting progress icon.Wherein, the work instruction data includes at least one work sub-instructions data.For example, described
Work instruction data is amusement, then also includes in the work instruction data:The sons such as game, TV, film and camera refer to
Make data.In this way, the voice signal that the local instruction database that system is created according to user oneself is sent to user is identified, not only
Speech recognition accuracy can be improved, and without carrying out semantic analysis by cloud service, offline service can be provided the user.
Step 202, the first voice message is exported;
Here, first voice message can be the prompt tone of system default, for example, " please say " or by with
The prompt tone that family is defined, for example, the voice message sound such as " owner please tell ", first voice message is mainly used in reminding user
It is ready for phonetic entry.And when first voice message is finished, system immediately enters speech recognition state.
In the embodiment of the present invention, while the electronic equipment is playing first voice message, start voice and adopt
Collection progress icon, makes the voice collecting progress icon be moved along direction initialization, voice collecting progress icon edge setting side
Placed restrictions on to moving to before position, when not collecting voice signal, the voice collecting progress icon makees even from one end to the other side
Speed motion, and stop motion when reaching the other end.For example, the voice collecting progress icon include tempo instructions frame, it is described enter
Degree indicates to be provided with the progress indicator strip of uniform motion in frame;The progress indicator strip is from one end of the tempo instructions frame to another
One end motion, and stop motion when reaching the other end of the tempo instructions frame.
The electronic equipment does not collect voice before voice collecting progress icon moves to along direction initialization and places restrictions on position
During input, show that phonetic entry fails, first voice message is exported again, remind user to re-start phonetic entry.
In the embodiment of the present invention, the electronic equipment does not inquire what is matched with the phonetic order in preset instructions storehouse
During sub-instructions, show that this interactive voice fails, terminate this interactive voice, resend first voice message, now,
The voice collecting progress chart target progress zero, and after first voice message is finished, the voice collecting
Progress chart indicated weight is newly started from scratch timing, and is moved along direction initialization.
In the embodiment of the present invention, the electronic equipment exports first voice message when receiving wake-up instruction, this
When, the voice collecting progress icon zero.User is reminded to start phonetic entry.And played in first voice message
Bi Hou, the voice collecting progress chart indicated weight is newly started from scratch timing.
Step 203, when the voice collecting progress icon is moved along direction initialization, to the voice signal in current environment
It is acquired;
Here, user is after first voice message is finished, or is seeing the voice collecting progress icon
When being moved along direction initialization, phonetic entry is carried out, the electronic equipment is transported in the voice collecting progress icon along direction initialization
Move to placing restrictions on before position, gather voice signal.
Step 204, before the voice collecting progress icon moves to along direction initialization and places restrictions on position, voice letter is collected
Number when, perform step 205;When not collecting voice signal, return and perform step 202;
Here, the electronic equipment is adopted before the voice collecting progress icon moves to along direction initialization and places restrictions on position
When collecting voice signal, show the phonetic entry success of user;, whereas if in voice collecting progress icon edge setting side
Placed restrictions on to moving to before position, voice signal is not collected, then show that this phonetic entry fails.
Step 205, speech data corresponding with the voice signal is searched in preset instructions storehouse, the voice number is determined
According to whether being matched with work instruction data, when being matched with work instruction data, step 206 is performed, with work instruction data not
Timing, performs step 208;Here, the work instruction data includes father's director data and sub-instructions data, for example, father instructs
Data are:" amusement ", sub-instructions data are:" game ".The work instruction data can be described in detail below.
Step 206, prompt tone corresponding with the phonetic order data is played;
For example, the phonetic order data are " amusements ", then the corresponding prompt tone of the phonetic order data is " amusement ",
To remind the phonetic entry of user to be identified successfully.
Step 207, the corresponding instruction of the phonetic order data is performed;
Here, the electronic equipment finds work instruction data and the speech data collected in preset instructions storehouse
Timing, exports work prompt tone corresponding with the work instruction data, and the phonetic entry to report to user is recognized successfully, and
After the work prompt tone is finished, sub-instructions storehouse corresponding with the phonetic order data is immediately hopped to, this is represented
Interactive voice is completed, and re-executes step 203.Or, the electronic equipment is after the work prompt tone is finished, immediately
Corresponding function is performed, without jump instruction storehouse.For example, the work prompt tone is to play music, then the electronic equipment exists
After the work prompt tone is finished, music playback function is immediately performed.
Step 208, speech data corresponding with the voice signal is searched in preset instructions storehouse, the voice number is determined
According to whether with dormancy instruction Data Matching, during with dormancy instruction Data Matching, perform step 209, with dormancy instruction data not
Timing, performs step 202;
Here, dormancy instruction data refer to allow the voice APP in the electronic equipment to enter the instruction of resting state.Example
Such as, dormancy instruction data are " heronsbill rests ".
Step 209, dormancy prompt tone corresponding with dormancy instruction data is sent;
Here, the electronic equipment determines the speech data collected and the dormancy instruction data in preset instructions storehouse
During matching, dormancy prompt tone corresponding with dormancy instruction data is played, and after the dormancy prompt tone is finished, institute's predicate
Sound APP enters resting state, while control voice collection progress icon enters park mode, and performs step 210.
Step 210, wait and wake up instruction;
Here, the voice APP of the electronic equipment in a dormant state when, only receive wake up instruction, it is other instruction without exception
Do not receive.
Step 211, if receive wake-up instruction, when having been received by wake-up instruction, performs step 212, exits dormancy shape
State, re-executes step 202, when not receiving wake-up instruction, re-executes step 210.
Step 212, resting state is exited.
Fig. 3 is the view of voice APP installations on mobile terminals;As shown in figure 3, the mobile terminal is hand
Machine, and the entitled heronsbill voice assistant of the voice APP on mobile phone, including heronsbill voice assistant work shape
State schematic diagram 301a and heronsbill voice assistant resting state schematic diagram 301b.Wherein, Figure 30 1a in the operating condition, progress chart
Mode of operation is designated as, corresponding instruction group is shown.For example, parlor, bedroom, amusement, navigation, return.And it can receive any
Phonetic order;And Figure 30 1b are in the dormant state, progress chart is designated as resting state, and system then only receives and wakes up instruction, do not receive
Other any phonetic orders.
In the embodiment of the present invention, each node in multiway tree can regard the corresponding instruction of a work instruction data as
Group.As shown in Fig. 3-301a, " happy cabin ", " bedroom ", " parlor ", " navigation ", " amusement " are considered as an instruction group,
And each instruction group includes multiple sub-instructions again.For example, instruction group " happy cabin " include " bedroom ", " parlor ", " navigation ",
" amusement " four sub-instructions, instruction group " bedroom " includes:Curtain, lamp, three sub-instructions of bedroom air-conditioning;In instruction group " parlor "
Including:Parlor monitoring, parlor air-conditioning, robot, television set, five sub-instructions of video recorder;Instruction group " amusement " includes:Electricity
Shadow, music, three sub-instructions of camera;Then without sub-instructions in instruction group " navigation ".
Instruction in the embodiment of the present invention includes two types, is respectively:Jump instruction and execute instruction.Wherein, it is described
Jump instruction refers to the instruction that turn function is performed between each instruction.For example, the instruction group for being currently at working condition is " fast
Happy cabin ", then when the phonetic order received is " bedroom ", be currently at instruction group " happy cabin " switching of working condition
To instruction group " bedroom ".
The execute instruction refers to the instruction for performing specific function.For example, the instruction group for being currently at working condition is " sound
It is happy ", then when it is " music " to receive phonetic order, then music playback function is performed, not cutting between execute instruction group
Change operation.
In the embodiment of the present invention, only allow an instruction group in running order in the same time, such as, currently, instruction
When group " amusement " is in running order, system currently only supports " film ", " music " and " camera " corresponding phonetic order.
Shown in Fig. 3-301, in running order instruction group is " happy cabin ", and the rightmost side of current display interface is arranged
List all instructions of present instruction group " happy cabin " in table, including " parlor ", " bedroom ", " amusement ", " navigation ", " return
Return " five instructions.Wherein, in five instructions, " parlor ", " bedroom ", " amusement ", " navigation " are jump instructions, are respectively used to
Execute instruction turn function, for example, " parlor " instruction performs and jumps to instruction group " parlor " from present instruction group " happy cabin ".
And " return " instruction is then used to jump to upper level instruction from present instruction group.For example, being currently at the instruction group of working condition
It is " music ", then when performing " return " instruction, then jumps to father's instruction group " amusement " of " music " instruction.
Fig. 3 also realizes schematic diagram 302a including phonetic order (one);As shown in Figure 30 2a:
User inputs phonetic order " amusement ", electronic equipment identification when instruction group " happy cabin " is in running order
When to go out the phonetic order be " amusement ", output with after the corresponding voice message of " amusement " instruction, it is " happy small from instruction group immediately
Room " jumps to instruction group " amusement ", and the control voice APP sub-instructions information that includes of display screen idsplay order group " amusement ".
For example, the command information that instruction group " amusement " is included is:Music, film, camera, return.
Fig. 3 also realizes schematic diagram 302b including phonetic order (two);As shown in Figure 30 2b:
User's input speech signal " music ", electronic equipment identifies that the corresponding phonetic order of the voice signal is " sound
It is happy " when, export with after the corresponding voice message of " music " instruction, jumping to instruction group " sound from present instruction group " amusement " immediately
It is happy ", and the control voice APP command information that includes of display screen idsplay order group " music ".For example, the instruction group " music "
Including command information be:" next ", " pause ", " broadcasting ", " upper one is first ", " end ", " song of Little Bear ", " English song
Song ", " national language song ", " Music on Demand " and " return ".Wherein, " next ", " pause ", " broadcasting ", " upper one is first ", " knot
Beam ", " song of Little Bear ", " English songs ", " national language song " are execute instructions." Music on Demand " and " return " is jump instruction.
Fig. 3 also realizes schematic diagram 302c including phonetic order (three), as shown in Figure 30 2c:
User's input phonetic order " broadcastings ", electronic equipment identifies the phonetic order when being " broadcasting ", and output is with " broadcasting
Put " instruct after corresponding voice message, the music in current music storehouse is played immediately, and here, the music of broadcasting can be upper one
A song or the song of system shuffle for the last broadcasting of subsystem record, can also be and set according to user
The song for the music sequential selection put.
Fig. 3 also realizes schematic diagram 302d including phonetic order (four).As shown in Figure 30 2d:
User's input phonetic order " Aladdin rest ", electronic equipment identifies that the phonetic order is that " Aladdin is stopped
During breath ", after output voice message " I rests, busy to be me " corresponding with " Aladdin rest " instruction, system
Immediately enter resting state.
Fig. 4 is the view that voice APP is arranged on wearable device;As shown in figure 4, the wearable device is hand
Table, including working state schematic representation 401a and resting state schematic diagram 401b, when voice APP is in running order, such as scheme
Shown in 401a, progress icon is in mode of operation, and display multiple instruction group, and can receive any phonetic order;Work as language
Sound APP in a dormant state when, as shown in Figure 40 1b, progress icon be in park mode, except wake up instruction in addition to do not receive appoint
What phonetic order.Voice APP recognizes that the method for phonetic order is consistent with Fig. 3 in described Fig. 4, its voice APP's distinguished
Carrier is installed different, in the embodiment of the present invention, method reference picture 1, Fig. 2 of the voice APP identifications and execution phonetic order
With described by Fig. 3, it will not be repeated here.
Fig. 5 is Fig. 3 and Fig. 4 workflow schematic diagram;As shown in figure 5, including:
Step 501, voice APP loads default instruction database;
Here, when the voice APP in a dormant state when, user can send to electronic equipment and wake up instruction to start
The voice APP, and after the electronic equipment opens the voice APP, load user-defined local instruction database, example
Such as, the local instruction database includes:Voice message sound data " please say ", work instruction data " bedroom, parlor, navigation, amusement,
Return ", dormancy instruction data " Aladdin rest ", dormancy prompt tone data " I rests, busy to be me ", wake-up instruction
Data " calling Latin " and voice collecting progress icon " progress circle " (referring to shown in Figure 30 1a).
Step 502, voice message sound " please say " is played;
Here, user starts phonetic entry after the voice message sound is heard.
Step 503, voice collecting progress icon is moved along direction initialization, and timing of starting from scratch, and voice APP opens language
Sound identification function;
Step 504, user's input phonetic order " amusement " (referring to shown in Figure 30 2a);
Step 505, when electronic equipment collects voice signal, the voice signal is parsed into speech data and to described
Speech data is identified, if recognizing successfully, performs step 506, if recognition failures, re-executes step 502;
Step 506, whether the speech data matches with work instruction data.When being matched with work instruction data, perform
Step 507, when being mismatched with work instruction data, step 509 is performed;
Step 507, instruction group " amusement " matching corresponding with work instruction data, plays voice message sound " amusement ";
Here, the voice message sound is used to point out user to identify phonetic order " amusement ".
Step 508, phonetic order " amusement " is performed;
Here, the electronic equipment jumps to instruction group " amusement " node from current instruction group node, and described
Instruction group " amusement " node includes sub-instructions " film, music, camera, return ", afterwards, re-executes step 503.
In the embodiment of the present invention, the voice APP circulations perform step 502 to step 505, it is determined that collecting user's transmission
During voice signal, the voice signal collected is identified electronic equipment, for example, identifying that the voice signal correspondingly refers to
When making group " music ", instruction group " music " is jumped to from present instruction group.It can refer in the instruction group " music " including son
Make " next, pause, play, upper one, end, the song of Little Bear, English songs, national language song, Music on Demand, return " (join
As shown in Figure 30 2b), afterwards, re-execute step 503.
In the embodiment of the present invention, the voice APP circulations perform step 502 to step 505, collect user and send voice
During signal, the voice signal is identified.For example, (referring to figure when determining voice signal corresponding instruction " broadcasting "
Shown in 302c), music playback function is performed, afterwards, step 503 is re-executed.
Step 509, the speech data whether with dormancy instruction Data Matching, determine the speech data and dormancy instruction
During Data Matching, step 510 is performed, when determining that the speech data is mismatched with dormancy instruction data, step is re-executed
502;
Here, the voice APP that the electronic equipment is installed identifies the voice letter that user sends in default instruction database
It is number corresponding when being dormancy instruction " Aladdin rest ", perform step 510, it is unidentified go out dormancy instruction " Aladdin rest "
When, re-execute step 502.
Step 510, dormancy prompt tone " I rests, busy to be me " is sent;
Here, voice APP is sent after the dormancy prompt tone, and the voice APP enters resting state (referring to Figure 30 2d institutes
Show), perform step 511;
Step 511, wait and wake up instruction " calling Latin ", and perform step 512;
Step 512, if receive wake-up instruction " calling Latin ", when having been received by wake-up instruction " calling Latin ", holds
Row step 513, when not receiving wake-up instruction " calling Latin ", re-executes step 511;
Step 513, resting state is exited, step 502 is re-executed.
Fig. 6 is a kind of composition schematic diagram of voice interaction device of the embodiment of the present invention:As shown in fig. 6, described device includes:
Output unit 601, collecting unit 602, resolution unit 603, judging unit 604 and execution unit 605;
Wherein, the output unit 601, for receiving during the first control instruction, voice enabled acquisition function, output the
One voice message, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to
The director data for determining in speech data and local instruction database exports the second language corresponding with the director data when the match is successful
Sound is pointed out;
The collecting unit 602, for when the voice collecting progress icon is moved along direction initialization, to current environment
In voice signal be acquired;
The resolution unit 603, for before the voice collecting progress icon moves to along direction initialization and places restrictions on position
When collecting voice signal, the voice signal is parsed, speech data is obtained;
The judging unit 604, for the speech data to be matched with the director data in local instruction database;
The execution unit 605, for determining the speech data, the match is successful with the director data in local instruction database
When, perform the corresponding phonetic order of the speech data.
In the embodiment of the present invention, described device can be specifically the electronic equipment for being provided with voice APP.The electronic equipment
Including:Mobile terminal, Wearable terminal, fixed terminal, car-mounted terminal, bank transaction terminal, supermarket's transaction terminal and express delivery
The delivery terminal of mailbag.Wherein mobile terminal can at least include mobile phone, tablet personal computer, PDA, navigator, game machine, intelligence object for appreciation
Tool etc., Wearable terminal can at least include intelligent watch, intelligent glasses, intelligent running shoes etc., and fixed terminal can at least be wrapped
Include in desktop computer, desktop computer, integral computer, television set, projecting apparatus, sound equipment etc., above intelligent toy, intelligent watch
Intelligence refers to that equipment includes processor and storage medium, so as to automatically or according to the setting of operator such as user hold
The instruction of some sequencing of row.
In the embodiment of the present invention, described device also includes display unit 606, for referring in first control received
When order is the open command of speech recognition mode, the voice collecting progress icon is shown;Or, first control received
When system instruction is the wake-up instruction in the local instruction database, the voice collecting progress icon is shown, to enable voice collecting
Function, and the first voice message is exported from the output unit 601 to user, first voice message is used to inform that user is
System immediately enters speech recognition state, reminds user to start phonetic entry.And start voice collecting progress icon, make the voice
Collection progress icon is moved along direction initialization.Here, the voice collecting progress icon can be voice progress bar, progress circle or
Progress percentage.In addition, first voice message can be the prompt tone defined by user oneself, such as:" please say " or " please
The voice messages such as instruction ".The electronic equipment immediately enters speech recognition shape when first voice message output is finished
State.
In the embodiment of the present invention, voice APP in said device is installed after speech recognition state is entered, the voice
Collection progress icon moves with uniform velocity along direction initialization to position is placed restrictions on, and timing of starting from scratch.Trigger the collecting unit
Voice signal in 602 pairs of current environments is acquired.The collecting unit 602 is additionally operable to the gatherer process in voice signal
In, voice collecting progress chart target progress described in real-time update, i.e., described voice collecting progress chart target progress gradually increases.Example
Such as, when the voice collecting progress icon is voice progress bar, the voice collecting progress icon at least includes tempo instructions frame,
The progress indicator strip of uniform motion is provided with the tempo instructions frame;The progress indicator strip is by the one of the tempo instructions frame
Stop motion when holding to another end motion, and reaching the other end of the tempo instructions frame.Here, the voice collecting progress chart
It is 1-15 seconds to mark the time for moving to the other end from one end.In order to prevent the voice collecting progress chart target movement velocity too
It hurry up, described device does not collect the voice signal of user's input, or, the voice collecting progress chart target movement velocity is too
Slowly, influence in the acquisition time and recognition accuracy of described device, the embodiment of the present invention, it is preferable that enter the voice collecting
The motion duration of degree icon is set to 5 seconds.Specifically, it is within described 5 seconds the upper of the single acquisition Speech time of collecting unit 602
Limit, if the collecting unit 602 completed voice collecting in 3 seconds, the voice collecting progress icon can also stop immediately
Motion, then the time of this interactive voice is exactly 3 seconds.
In the embodiment of the present invention, user is hearing that first voice message or display interface in the voice APP see
To the voice collecting progress chart timestamp, phonetic entry is proceeded by, and ensuring the voice collecting progress icon arrival
Place restrictions on before position, complete the phonetic entry.Here, it is the voice collecting that the voice collecting progress chart target, which places restrictions on position,
Progress chart target maximum progress threshold value, that is to say, that described device single allows the maximum time value of phonetic entry, i.e., when overtime
Between.In this way, limiting the time of described device single acquisition voice signal according to voice collecting progress icon, the dress can be shortened
The recognition time to voice signal is put, while improving the signal identification efficiency of described device.
In the embodiment of the present invention, the collecting unit 602 is moved in the voice collecting progress icon along direction initialization
Place restrictions on before position, it is determined that when collecting voice signal, showing this phonetic entry success, the resolution unit 603 being triggered, by institute
State resolution unit 603 and the voice signal is divided into the certain speech frame of length, then each frame speech data is asked for average
Pitch period, obtains speech data corresponding with the voice signal.
If on the contrary, voice collecting progress icon described in the collecting unit 602 moves to along direction initialization and places restrictions on position
Before, when not collecting the voice signal, show that phonetic entry fails, then terminate this interactive voice, trigger the output single
Member 601 exports the first voice message again.
In the embodiment of the present invention, the resolution unit 603 triggers the judging unit after the speech data is obtained
604, director data corresponding with the speech data is searched in preset instructions storehouse by the judging unit 604, and looked into
Look for result.Here, the preset instructions storehouse is the instruction database according to oneself requirement definition by user.Specifically, user passes through electricity
Sub- equipment sends instruction database request to create, institute's predicate to the corresponding voice servers of the voice APP installed on the electronic equipment
Sound server is responded after the request to create, controls the establishment interface in the display interface idsplay order storehouse of the voice APP, user
The instruction database is created according to the demand of oneself at the interface that creates of the instruction database.For example, user can be created by described
Speech voice input function in interface carries out phonetic entry, to complete the establishment of the instruction database, directly can also be created described
Interface is inputted by word, completes the establishment of the instruction database.Wherein, father's instruction can be included in the instruction database of establishment and son refers to
Order, wherein, father's instruction can be:Music, video, smart home, the sub-instructions of music can be:A next bent, upper song,
Chinese song, English song;The sub-instructions of video can be:Tengxun's video, QQ videos, youku.com's video;The sub-instructions of smart home can
To be:Curtain, bedroom air-conditioning, electric light etc..In this way, electronic equipment by user-defined instruction database to the voice signal that identifies
Corresponding phonetic order is identified, and can effectively ensure the accuracy rate of speech recognition.
Further, since the default instruction database is limited instruction set, alone word identification technology is used,
Therefore, the embodiment of the present invention, without carrying out semantic analysis by voice cloud service, can be provided the user in speech recognition process
Offline service.
In embodiments of the present invention, user can be to be configured in preset instructions storehouse to signal acquisition periods.Specifically,
User sends signal acquisition periods by electronic equipment to the corresponding voice servers of the voice APP installed on the electronic equipment
Setting request, the voice server received after the setting request of the signal acquisition periods, controls the voice APP's
Display interface shows the setting interface of the signal acquisition periods, user's setting in the signal acquisition periods according to oneself demand
Put interface to be configured the signal acquisition periods, and after the setup, send and set successfully to the voice server
Request, the voice server receiving the setting successfully after request, preserving the setting of the signal acquisition periods,
And in interactive voice next time, the collecting unit 602 is according to the voice signal of the signal acquisition periods of preservation to user
It is acquired.
In embodiments of the present invention, user can also be configured to voice collecting progress icon, and specifically, user passes through
Electronic equipment sends voice collecting progress chart target to voice server and sets request, and the voice server receives institute's predicate
Sound collection progress chart target is set after request, controls the display interface of the voice APP to show that voice collecting progress chart target is set
Interface is put, user sets the voice collecting progress that interface selects oneself to need according to oneself demand in voice collecting progress chart target
Icon, and after the setup, sent to the voice server and successfully request is set, the voice server is being received
The setting successfully after request, preserves the voice collecting progress chart target and sets, and the display unit 606 is next time
In interactive voice, the voice collecting progress icon preserved is shown.
In the embodiment of the present invention, include in the preset instructions storehouse work instruction data, dormancy instruction data, wake up refer to
Data are made, the output unit 601 determines the work in the speech data and preset instructions storehouse that the collecting unit 602 is collected
When director data is matched, show that the speech data is recognized successfully, then export the second language corresponding with the work instruction data
Sound is pointed out, for example, the work instruction data is:" music ", then second voice message is " music ", for reporting to user
The phonetic order of input is recognized successfully;Or, when the output unit 601 is determined in the speech data and preset instructions storehouse
Work instruction data is mismatched, and during with the dormancy instruction Data Matching, is then exported corresponding with the dormancy instruction data
Dormancy prompt tone.For example, the dormancy instruction data are:" heronsbill rest ", then the dormancy prompt tone is:" I rests
, it is busy to be me ".Afterwards, the speech recognition mode is changed to park mode;Conversely, described in being determined when the output unit 601
When speech data is mismatched with all director datas in preset instructions storehouse, show this phonetic order recognition failures, terminate
This interactive voice, and first voice message is exported again.Interactive voice is realized in the way of continuously circulating.
In the embodiment of the present invention, the output unit 601 is after the second voice message output is finished, and triggering is described to perform list
Member 605, immediately hops to sub-instructions storehouse corresponding with the phonetic order data, and trigger described by the execution unit 605
Collecting unit 602 continues to gather the voice signal that user sends in the sub-instructions storehouse.For example, the phonetic order data are:
Parlor, then described device is when successfully identifying " parlor ", after the playing alert tones of output unit 601 " parlor " finish, institute
State execution unit 605 and immediately hop to parlor sub-instructions storehouse corresponding with parlor.For example, the parlor sub-instructions storehouse includes:Window
Curtain, lamp, bedroom air-conditioning.Fig. 2 descriptions in specific interactive voice implementation process reference method embodiment, will not be repeated here.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program
Product.Therefore, the shape of the embodiment in terms of the present invention can use hardware embodiment, software implementation or combine software and hardware
Formula.Moreover, the present invention can be used can use storage in one or more computers for wherein including computer usable program code
The form for the computer program product that medium is implemented on (including but is not limited to magnetic disk storage and optical memory etc.).
The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product
Figure and/or block diagram are described.It should be understood that can be by every first-class in computer program instructions implementation process figure and/or block diagram
Journey and/or the flow in square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided
The processor of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce
A raw machine so that produced by the instruction of computer or the computing device of other programmable data processing devices for real
The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy
Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which is produced, to be included referring to
The manufacture set by dress is made, the command device is realized in one flow of flow chart or multiple flows and/or one side of block diagram
The function of being specified in frame or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that in meter
Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or
The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one
The step of function of being specified in individual square frame or multiple square frames.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.
Claims (12)
1. a kind of voice interactive method, it is characterised in that methods described includes:
When receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start voice collecting progress
Icon, makes the voice collecting progress icon be moved along direction initialization;
When the voice collecting progress icon is moved along direction initialization, the voice signal in current environment is acquired;
The voice collecting progress icon moved to along direction initialization place restrictions on collect voice signal before position when, parsing is described
Voice signal, obtains speech data;
The speech data is matched with the director data in local instruction database;
The director data for determining in the speech data and local instruction database is exported corresponding with the director data when the match is successful
The second voice message;
Perform the corresponding phonetic order of the speech data.
2. according to the method described in claim 1, it is characterised in that the voice collecting progress icon at least includes tempo instructions
The progress indicator strip of uniform motion is provided with frame, the tempo instructions frame;
The progress indicator strip reaches the another of the tempo instructions frame from one end of the tempo instructions frame to another end motion
Stop motion during one end.
3. according to the method described in claim 1, it is characterised in that the voice enabled acquisition function, including:
When first control instruction received is the open command of speech recognition mode, the voice collecting progress chart is shown
Mark, and start counting up;
Or, when first control instruction received is the wake-up instruction in the local instruction database, show the voice
Collection progress icon, and start counting up.
4. according to the method described in claim 1, it is characterised in that methods described also includes:
The voice collecting progress icon moved to along direction initialization place restrictions on do not collect the voice signal before position when, weight
Newly export first voice message.
5. according to the method described in claim 1, it is characterised in that output the second voice corresponding with the director data is carried
Show, including:
When determining that the speech data is matched with the dormancy instruction in local instruction database, stop corresponding with the dormancy instruction is exported
Dormancy prompt tone;
Or, when determining that the speech data is matched with the work order in local instruction database, output and the work order pair
The work prompt tone answered.
6. according to the method described in claim 1, it is characterised in that determine the speech data and the instruction in local instruction database
When data are mismatched, first voice message is exported again.
7. a kind of voice interaction device, it is characterised in that described device includes:Output unit, collecting unit, resolution unit, sentence
Disconnected unit and execution unit;
Wherein, the output unit, for receiving during the first control instruction, voice enabled acquisition function exports the first voice
Prompting, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to determine language
Director data in sound data and local instruction database exports the second voice corresponding with the director data and carried when the match is successful
Show;
The collecting unit, for when the voice collecting progress icon is moved along direction initialization, to the language in current environment
Message number is acquired;
The resolution unit, for collecting language before the voice collecting progress icon moves to along direction initialization and places restrictions on position
During message, the voice signal is parsed, speech data is obtained;
The judging unit, for the speech data to be matched with the director data in local instruction database;
The execution unit, when the match is successful for determining the director data in the speech data and local instruction database, is performed
The corresponding phonetic order of the speech data.
8. device according to claim 7, it is characterised in that the voice collecting progress icon at least includes tempo instructions
The progress indicator strip of uniform motion is provided with frame, the tempo instructions frame;The progress indicator strip is by the tempo instructions frame
One end to another end motion, and stop motion when reaching the other end of the tempo instructions frame.
9. device according to claim 7, it is characterised in that described device also includes:
Display unit, when first control instruction for receiving is the open command of speech recognition mode, display is described
Voice collecting progress icon, and start counting up;Or, during first control instruction received is the local instruction database
When waking up instruction, the voice collecting progress icon is shown, and start counting up.
10. device according to claim 7, it is characterised in that the output unit, is additionally operable to enter in the voice collecting
Degree icon is moved to along direction initialization is placed restrictions on when not collecting the voice signal before position, and first voice is exported again and is carried
Show.
11. device according to claim 7, it is characterised in that the output unit, specifically for determining the voice number
During according to being matched with the dormancy instruction in local instruction database, dormancy prompt tone corresponding with the dormancy instruction is exported;Or, it is determined that
When the speech data is matched with the work order in local instruction database, work prompting corresponding with the work order is exported
Sound.
12. device according to claim 7, it is characterised in that the output unit, is additionally operable to determine the speech data
When being mismatched with the director data in local instruction database, first voice message is exported again.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710372523.2A CN107180631A (en) | 2017-05-24 | 2017-05-24 | Voice interaction method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201710372523.2A CN107180631A (en) | 2017-05-24 | 2017-05-24 | Voice interaction method and device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN107180631A true CN107180631A (en) | 2017-09-19 |
Family
ID=59831498
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201710372523.2A Pending CN107180631A (en) | 2017-05-24 | 2017-05-24 | Voice interaction method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN107180631A (en) |
Cited By (15)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN108873713A (en) * | 2018-06-25 | 2018-11-23 | 广州市锐尚展柜制作有限公司 | A kind of man-machine interaction method and system applied in smart home |
CN108877791A (en) * | 2018-05-23 | 2018-11-23 | 百度在线网络技术(北京)有限公司 | Voice interactive method, device, server, terminal and medium based on view |
CN109176537A (en) * | 2018-08-09 | 2019-01-11 | 北京云迹科技有限公司 | content displaying method and device for robot |
CN109360570A (en) * | 2018-10-19 | 2019-02-19 | 歌尔科技有限公司 | Audio recognition method, speech ciphering equipment and the readable storage medium storing program for executing of speech ciphering equipment |
CN109903758A (en) * | 2017-12-08 | 2019-06-18 | 阿里巴巴集团控股有限公司 | Audio-frequency processing method, device and terminal device |
CN109960537A (en) * | 2019-03-29 | 2019-07-02 | 北京金山安全软件有限公司 | Interaction method and device and electronic equipment |
CN110767222A (en) * | 2019-06-19 | 2020-02-07 | 北京嘀嘀无限科技发展有限公司 | Order receiving method and device |
CN111583923A (en) * | 2020-04-28 | 2020-08-25 | 北京小米松果电子有限公司 | Information control method and device, and storage medium |
CN111833858A (en) * | 2019-04-17 | 2020-10-27 | 百度在线网络技术(北京)有限公司 | Voice interaction state display method and device based on loudspeaker box |
TWI739067B (en) * | 2019-02-13 | 2021-09-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform and operation method thereof |
CN113539252A (en) * | 2020-04-22 | 2021-10-22 | 庄连豪 | Barrier-free intelligent voice system and control method thereof |
CN113658601A (en) * | 2021-08-18 | 2021-11-16 | 开放智能机器(上海)有限公司 | Voice interaction method, device, terminal equipment, storage medium and program product |
TWI767499B (en) * | 2019-02-13 | 2022-06-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform integrating online custom service system and its operation method |
TWI767498B (en) * | 2019-02-13 | 2022-06-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform integrating machine learning and operation method thereof |
TWI769653B (en) * | 2019-02-13 | 2022-07-01 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform capable of reassembling voice segment and its operation method |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN102262890A (en) * | 2010-05-31 | 2011-11-30 | 鸿富锦精密工业(深圳)有限公司 | Electronic device and marking method thereof |
US20120081530A1 (en) * | 2009-06-13 | 2012-04-05 | Rolestar, Inc. | System for Juxtaposition of Separately Recorded Scenes |
CN105244025A (en) * | 2015-10-29 | 2016-01-13 | 惠州Tcl移动通信有限公司 | Voice identification method and system based on intelligent wearable device |
CN106356059A (en) * | 2015-07-17 | 2017-01-25 | 中兴通讯股份有限公司 | Voice control method, device and projector |
-
2017
- 2017-05-24 CN CN201710372523.2A patent/CN107180631A/en active Pending
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US20120081530A1 (en) * | 2009-06-13 | 2012-04-05 | Rolestar, Inc. | System for Juxtaposition of Separately Recorded Scenes |
CN102262890A (en) * | 2010-05-31 | 2011-11-30 | 鸿富锦精密工业(深圳)有限公司 | Electronic device and marking method thereof |
CN106356059A (en) * | 2015-07-17 | 2017-01-25 | 中兴通讯股份有限公司 | Voice control method, device and projector |
CN105244025A (en) * | 2015-10-29 | 2016-01-13 | 惠州Tcl移动通信有限公司 | Voice identification method and system based on intelligent wearable device |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN109903758A (en) * | 2017-12-08 | 2019-06-18 | 阿里巴巴集团控股有限公司 | Audio-frequency processing method, device and terminal device |
CN109903758B (en) * | 2017-12-08 | 2023-06-23 | 阿里巴巴集团控股有限公司 | Audio processing method and device and terminal equipment |
CN108877791B (en) * | 2018-05-23 | 2021-10-08 | 百度在线网络技术(北京)有限公司 | Voice interaction method, device, server, terminal and medium based on view |
CN108877791A (en) * | 2018-05-23 | 2018-11-23 | 百度在线网络技术(北京)有限公司 | Voice interactive method, device, server, terminal and medium based on view |
US11727927B2 (en) | 2018-05-23 | 2023-08-15 | Baidu Online Network Technology (Beijing) Co., Ltd. | View-based voice interaction method, apparatus, server, terminal and medium |
CN108873713A (en) * | 2018-06-25 | 2018-11-23 | 广州市锐尚展柜制作有限公司 | A kind of man-machine interaction method and system applied in smart home |
CN109176537A (en) * | 2018-08-09 | 2019-01-11 | 北京云迹科技有限公司 | content displaying method and device for robot |
CN109176537B (en) * | 2018-08-09 | 2022-05-10 | 北京云迹科技股份有限公司 | Content display method and device for robot |
CN109360570A (en) * | 2018-10-19 | 2019-02-19 | 歌尔科技有限公司 | Audio recognition method, speech ciphering equipment and the readable storage medium storing program for executing of speech ciphering equipment |
TWI767499B (en) * | 2019-02-13 | 2022-06-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform integrating online custom service system and its operation method |
TWI739067B (en) * | 2019-02-13 | 2021-09-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform and operation method thereof |
TWI767498B (en) * | 2019-02-13 | 2022-06-11 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform integrating machine learning and operation method thereof |
TWI769653B (en) * | 2019-02-13 | 2022-07-01 | 華南商業銀行股份有限公司 | Cross-channel artificial intelligence dialogue platform capable of reassembling voice segment and its operation method |
CN109960537A (en) * | 2019-03-29 | 2019-07-02 | 北京金山安全软件有限公司 | Interaction method and device and electronic equipment |
CN111833858A (en) * | 2019-04-17 | 2020-10-27 | 百度在线网络技术(北京)有限公司 | Voice interaction state display method and device based on loudspeaker box |
CN110767222B (en) * | 2019-06-19 | 2021-03-09 | 北京嘀嘀无限科技发展有限公司 | Order receiving method and device |
CN110767222A (en) * | 2019-06-19 | 2020-02-07 | 北京嘀嘀无限科技发展有限公司 | Order receiving method and device |
CN113539252A (en) * | 2020-04-22 | 2021-10-22 | 庄连豪 | Barrier-free intelligent voice system and control method thereof |
CN111583923A (en) * | 2020-04-28 | 2020-08-25 | 北京小米松果电子有限公司 | Information control method and device, and storage medium |
CN111583923B (en) * | 2020-04-28 | 2023-11-14 | 北京小米松果电子有限公司 | Information control method and device and storage medium |
CN113658601A (en) * | 2021-08-18 | 2021-11-16 | 开放智能机器(上海)有限公司 | Voice interaction method, device, terminal equipment, storage medium and program product |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN107180631A (en) | Voice interaction method and device | |
JP6977169B2 (en) | Digital Voice Assistant Coordinating signal processing between computing devices | |
CN105657535B (en) | A kind of audio identification methods and device | |
CN104714981B (en) | Voice message searching method, device and system | |
CN109147779A (en) | Voice data processing method and device | |
US11457061B2 (en) | Creating a cinematic storytelling experience using network-addressable devices | |
CN108351872A (en) | Equipment selection for providing response | |
CN106356059A (en) | Voice control method, device and projector | |
CN107948672B (en) | Method and system for storing video data, server and wearable device | |
CN104866275B (en) | Method and device for acquiring image information | |
CN108449493A (en) | Voice communication data processing method, device, storage medium and mobile terminal | |
CN107680614B (en) | Audio signal processing method, apparatus and storage medium | |
CN106210266B (en) | A kind of acoustic signal processing method and audio signal processor | |
CN108874904A (en) | Speech message searching method, device, computer equipment and storage medium | |
KR20160106075A (en) | Method and device for identifying a piece of music in an audio stream | |
CN108694947A (en) | Sound control method, device, storage medium and electronic equipment | |
WO2019045816A1 (en) | Graphical data selection and presentation of digital content | |
CN109509472A (en) | Method, apparatus and system based on voice platform identification background music | |
CN104092809A (en) | Communication sound recording method and recorded communication sound playing method and device | |
CN112270918A (en) | Information processing method, device, system, electronic equipment and storage medium | |
CN106601242A (en) | Executing method and device of operation event and terminal | |
CN109686370A (en) | The method and device of fighting landlord game is carried out based on voice control | |
CN109686372B (en) | Resource playing control method and device | |
JP2023526285A (en) | Test method and apparatus for full-duplex voice interaction system | |
JP2022036953A (en) | Adjustment of signal processing between digital voice assistant computing devices |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
RJ01 | Rejection of invention patent application after publication |
Application publication date: 20170919 |
|
RJ01 | Rejection of invention patent application after publication |