CN107180631A - Voice interaction method and device - Google Patents

Voice interaction method and device Download PDF

Info

Publication number
CN107180631A
CN107180631A CN201710372523.2A CN201710372523A CN107180631A CN 107180631 A CN107180631 A CN 107180631A CN 201710372523 A CN201710372523 A CN 201710372523A CN 107180631 A CN107180631 A CN 107180631A
Authority
CN
China
Prior art keywords
voice
instruction
data
progress icon
collecting
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201710372523.2A
Other languages
Chinese (zh)
Inventor
刘平舟
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN201710372523.2A priority Critical patent/CN107180631A/en
Publication of CN107180631A publication Critical patent/CN107180631A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/26Speech to text systems

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • User Interface Of Digital Computer (AREA)

Abstract

The invention discloses a voice interaction method, which comprises the following steps: when a first control instruction is received, enabling a voice acquisition function, outputting a first voice prompt, and starting a voice acquisition progress icon to enable the voice acquisition progress icon to move along a set direction; when the voice acquisition progress icon moves along a set direction, acquiring voice signals in the current environment; when a voice signal is acquired before the voice acquisition progress icon moves to a limit position along a set direction, analyzing the voice signal to obtain voice data; matching the voice data with instruction data in a local instruction library; and when the voice data is successfully matched with the instruction data in the local instruction library, outputting a second voice prompt corresponding to the instruction data, and executing the voice instruction corresponding to the voice data. The invention also discloses a voice interaction device.

Description

A kind of voice interactive method and device
Technical field
The present invention relates to interactive voice technology, and in particular to a kind of voice interactive method and device.
Background technology
Voice control technology is the advanced subject of world today's smart machine control field, it is therefore intended that allow equipment according to people Password accurately perform predetermined behavior.Main information technology (IT, Information Technology) in the world at present Company releases the speech recognition engine of oneself, SIRI, the Google of Google (Google) company of such as Apple Inc. one after another The Now and Cortana of Microsoft.Domestic IT companies are also proposed the speech-recognition services of oneself, and such as Baidu's voice is helped Hand etc..The release of these voice platforms presents the magical magic power of speech ciphering equipment control, and equipment starts if can understanding people Language, and acted according to our wish.In the prior art, the mode of speech control system acquisition phonetic order generally includes following Two kinds:
1) traditional voice interactive mode, such as SIRI, the working method of the voice assistant such as Cortana, user click on figure manually The button of correspondence phonetic entry on shape interface, triggering system enters order reception pattern, and at this moment user begins to send out phonetic order. If the system detects that phonetic entry, then system the phonetic order of phonetic entry is identified, the operation such as semantic analysis, and root Corresponding actions are performed according to recognition result.If system is not detected by phonetic entry within the specified period, system thinks language Sound recognition failures, this interactive voice terminates.User needs to click on the button of correspondence phonetic entry on graphical interfaces again, starts Interactive voice next time.
Traditional voice interactive mode, is mostly near field voice interaction, quality of speech signal is of a relatively high, and has touch-screen Auxiliary, so the processing of voice signal is relatively easy, the accuracy rate of identification is also higher.But, what traditional voice interaction was present lacks It is that user sends phonetic order each time to fall into, and is required for the button of correspondence phonetic entry on triggering graphical interfaces manually.Can not be real Existing complete Voice command.And the single phonetic entry time is longer, causes system response time long, recognition accuracy is by environment shadow Ring big.
2) man machine language's interactive mode, the object of interactive voice is probably robot or smart machine.It is remote due to being related to Field interactive voice, therefore environment is more complicated, and without screen interaction.Interactive voice object must continuously monitor voice letter Number, according to acoustic energy, the change of frequency judges the beginning and end of each interactive voice.
Man machine language's interactive mode, closer to the talk between the mankind, therefore give people it is a kind of naturally, smooth sensation, Even think that oneself talks with a true man.But the defect that man machine language's interactive mode is present is:When being interacted in far field, voice Interactive quality is protected from environmental, and greatly environmental noise, accent, volume all directly affects the accuracy rate of speech recognition, application Occasion is very limited.And system is after phonetic entry is received, link is confirmed without voice, user does not know that system identification goes out Instruction whether be exactly instruction that user sends.
The content of the invention
To solve existing technical problem, the embodiment of the present invention is expected to provide a kind of voice interactive method and device, The accuracy of speech recognition can be improved.
What the technical scheme of the embodiment of the present invention was realized in:
One side according to embodiments of the present invention includes there is provided a kind of voice interactive method, methods described:
When receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start voice collecting Progress icon, makes the voice collecting progress icon be moved along direction initialization;
When the voice collecting progress icon is moved along direction initialization, the voice signal in current environment is adopted Collection;
The voice collecting progress icon moved to along direction initialization place restrictions on collect voice signal before position when, parsing The voice signal, obtains speech data;
The speech data is matched with the director data in local instruction database;
The director data for determining in the speech data and local instruction database is when the match is successful, output and the director data Corresponding second voice message;
Perform the corresponding phonetic order of the speech data.
In such scheme, the voice collecting progress icon at least includes setting in tempo instructions frame, the tempo instructions frame It is equipped with the progress indicator strip of uniform motion;
The progress indicator strip reaches the tempo instructions frame from one end of the tempo instructions frame to another end motion The other end when stop motion.
In such scheme, the voice enabled acquisition function, including:
When first control instruction received is the open command of speech recognition mode, show that the voice collecting enters Icon is spent, and is started counting up;
Or, when first control instruction received is the wake-up instruction in the local instruction database, display is described Voice collecting progress icon, and start counting up.
In such scheme, methods described also includes:
The voice signal is not collected before the voice collecting progress icon moves to along direction initialization and places restrictions on position When, first voice message is exported again.
In such scheme, the second voice message corresponding with the director data is exported, including:
When determining that the speech data is matched with the dormancy instruction in local instruction database, export corresponding with the dormancy instruction Dormancy prompt tone;
Or, when determining that the speech data is matched with the work order in local instruction database, output refers to the work Make corresponding work prompt tone.
It is again defeated when determining that the speech data is mismatched with the director data in local instruction database in such scheme Go out first voice message.
Another aspect according to embodiments of the present invention includes there is provided a kind of voice interaction device, described device:Output is single Member, collecting unit, resolution unit, judging unit and execution unit;
Wherein, the output unit, for receiving during the first control instruction, voice enabled acquisition function, output first Voice message, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to really The director data for determining in speech data and local instruction database exports the second voice corresponding with the director data when the match is successful Prompting;
The collecting unit, for when the voice collecting progress icon is moved along direction initialization, in current environment Voice signal be acquired;
The resolution unit, for being gathered before the voice collecting progress icon moves to along direction initialization and places restrictions on position During to voice signal, the voice signal is parsed, speech data is obtained;
The judging unit, for the speech data to be matched with the director data in local instruction database;
The execution unit, when the match is successful for determining the director data in the speech data and local instruction database, Perform the corresponding phonetic order of the speech data.
In such scheme, the voice collecting progress icon at least includes setting in tempo instructions frame, the tempo instructions frame It is equipped with the progress indicator strip of uniform motion;The progress indicator strip from one end of the tempo instructions frame to another end motion, and Stop motion when reaching the other end of the tempo instructions frame.
In such scheme, described device also includes:
Display unit, when first control instruction for receiving is the open command of speech recognition mode, display The voice collecting progress icon, and start counting up;Or, first control instruction received is the local instruction database In wake-up instruction when, show the voice collecting progress icon, and start counting up.
In such scheme, the output unit is additionally operable to move to along direction initialization in the voice collecting progress icon Place restrictions on when not collecting the voice signal before position, first voice message is exported again.
In such scheme, the output unit, specifically for determining the speech data and the dormancy in local instruction database During instructions match, dormancy prompt tone corresponding with the dormancy instruction is exported;Or, determine the speech data and local instruction When work order in storehouse is matched, work prompt tone corresponding with the work order is exported.
In such scheme, the output unit is additionally operable to determine the speech data and the instruction number in local instruction database During according to mismatching, first voice message is exported again.
A kind of voice interactive method and device provided in an embodiment of the present invention, by before each phonetic entry all to user Voice message is sent, to remind user to start phonetic entry, so, it is possible to make user extremely accurate send phonetic order, from And improve the recognition accuracy of voice signal;In addition, the acquisition time of voice signal is limited by voice collecting progress icon, Identifying system, which can be shortened, is used for the time of recognition of speech signals, so as to improve the speed of response;Furthermore, by default voice The voice signal that instruction database is sent to user is inquired about, the voice signal that need not be not only received by cloud service interface differential technique Corresponding phonetic order carries out semantic analysis, but also supports processed offline, and user only relies on can be achieved really by means of voice message Voice command in meaning, completely without manually operated.
Brief description of the drawings
Fig. 1 is a kind of method flow schematic diagram of interactive voice of the embodiment of the present invention;
Fig. 2 is the implementation process schematic diagram of interactive voice of the embodiment of the present invention;
Fig. 3 is the view of voice APP installations on mobile terminals;
Fig. 4 is the view that voice APP is arranged on wearable device;
Fig. 5 is Fig. 3 and Fig. 4 workflow schematic diagram;
Fig. 6 is a kind of device composition schematic diagram of interactive voice of the embodiment of the present invention.
Embodiment
The embodiment to the present invention is described in detail below in conjunction with the accompanying drawings.It should be appreciated that this place is retouched The embodiment stated is merely to illustrate and explain the present invention, and is not intended to limit the invention.
Fig. 1 is a kind of schematic flow sheet of voice interactive method of the embodiment of the present invention;As shown in figure 1, methods described includes:
Step 101, when receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start Voice collecting progress icon, makes the voice collecting progress icon be moved along direction initialization;
The embodiment of the present invention is mainly used in voice interaction device, and described device can specifically be provided with voice APP Electronic equipment, the function that the voice interactive method is realized can by the processor caller code in electronic equipment come Realize, certain program code can be stored in computer-readable storage medium, it is seen then that the electronic equipment at least includes processor and deposited Storage media.
The electronic equipment includes:Mobile terminal, Wearable terminal, fixed terminal, car-mounted terminal, bank transaction are whole The delivery terminal at end, supermarket's transaction terminal and express delivery mailbag.Wherein mobile terminal can at least include mobile phone, it is tablet personal computer, individual Personal digital assistant (PDA, Personal Digital Assistant), navigator, game machine, intelligent toy etc., Wearable are whole End can at least include intelligent watch, intelligent glasses, intelligent running shoes etc., and fixed terminal can at least include desktop computer, table Intelligence in face computer, integral computer, television set, projecting apparatus, sound equipment etc., above intelligent toy, intelligent watch refers to equipment Include processor and storage medium, so as to automatically or according to the setting of operator such as user perform some sequencing Instruction.
In the embodiment of the present invention, first control instruction that the electronic equipment is received is opening for speech recognition mode When opening instruction, the voice collecting progress icon is shown, and start counting up;Or, first control instruction received is When wake-up in the local instruction database is instructed, the voice collecting progress icon is shown, and is started counting up, is adopted with voice enabled Collect function, export the first voice message.First voice message is used to inform that custom system immediately enters speech recognition state, User is reminded to start phonetic entry.And start voice collecting progress icon, make the voice collecting progress icon along direction initialization Motion.Here, the voice collecting progress icon can be voice progress bar, progress circle or progress percentage.In addition, described One voice message can be the prompt tone defined by user oneself, for example:The voice message such as " please say " or " please indicate ".The electricity Sub- equipment immediately enters speech recognition state when first voice message output is finished.
Step 102, when the voice collecting progress icon is moved along direction initialization, to the voice signal in current environment It is acquired;
In the embodiment of the present invention, the electronic equipment is after speech recognition state is entered, the voice collecting progress icon Moved with uniform velocity along direction initialization to position is placed restrictions on, and timing of starting from scratch, now, the electronic equipment is in current environment Voice signal be acquired.And during voice signal is gathered, voice collecting progress chart target described in real-time update Progress, i.e., described voice collecting progress chart target progress gradually increases.For example, the voice collecting progress icon is voice progress During bar, the voice collecting progress icon at least includes being provided with uniform motion in tempo instructions frame, the tempo instructions frame Progress indicator strip;The progress indicator strip is referred to from one end of the tempo instructions frame to another end motion, and the arrival progress Stop motion when showing the other end of frame.Here, the time that the voice collecting progress icon moves to the other end from one end is 1- 15 seconds.In order to prevent the voice collecting progress chart target movement velocity too fast, the electronic equipment does not collect user's input Voice signal, or, the voice collecting progress chart target movement velocity is too slow, influences the acquisition time of the electronic equipment And recognition accuracy, in the embodiment of the present invention, it may be preferable that be set to voice collecting progress chart target motion duration 5 seconds. Here, it is within described 5 seconds the upper limit of the electronic equipment single acquisition Speech time, if the electronic equipment was completed in 3 seconds Voice collecting, the voice collecting progress icon also can stop motion immediately, then the time of this interactive voice is exactly 3 seconds.
User is hearing that first voice message or display interface in the voice APP see that the voice collecting enters When degree icon starts timing, phonetic entry is proceeded by, and position is placed restrictions on ensuring that the voice collecting progress icon is reached Before, complete the phonetic entry.Here, it is the voice collecting progress icon that the voice collecting progress chart target, which places restrictions on position, Maximum progress threshold value, that is to say, that the electronic equipment single allows the maximum time value of phonetic entry, i.e. time-out time.Such as This, the time of the electronic equipment single acquisition voice signal is limited according to voice collecting progress icon, can shorten the electricity Sub- equipment is to the recognition time of voice signal, while improving the signal identification efficiency of the electronic equipment.
Step 103, voice letter is collected before the voice collecting progress icon moves to along direction initialization and places restrictions on position Number when, parse the voice signal, obtain speech data;
In the embodiment of the present invention, the electronic equipment determines to move to along direction initialization in the voice collecting progress icon Place restrictions on before position, when collecting voice signal, show this phonetic entry success, then the voice signal is divided into length certain Speech frame, is then asked for each frame speech data the average pitch cycle, obtains voice number corresponding with the voice signal According to.
If on the contrary, the electronic equipment is moved in the voice collecting progress icon along direction initialization places restrictions on position Before, when not collecting the voice signal, show that phonetic entry fails, then terminate this interactive voice, the first language is exported again Sound is pointed out.
Step 104, the speech data is matched with the director data in local instruction database;
In the embodiment of the present invention, the electronic equipment is then searched after the speech data is obtained in preset instructions storehouse Director data corresponding with the speech data, and obtain lookup result.Here, the preset instructions storehouse is according to certainly by user The instruction database of own requirement definition.Specifically, user by the electronic equipment to APP pairs of the voice installed on the electronic equipment The voice server answered sends instruction database request to create, and the voice server is responded after the request to create, controls the electricity The establishment interface in the display interface idsplay order storehouse of sub- equipment, user is at the establishment interface of the instruction database according to the demand of oneself Create the instruction database.For example, user can carry out phonetic entry by the speech voice input function in the establishment interface, with complete It into the establishment of the instruction database, can also directly be inputted at the establishment interface by word, complete the establishment of the instruction database. Wherein, the instruction database of establishment includes father's instruction and sub-instructions.For example, father's instruction is:Music, video, intelligent family Occupy, then the sub-instructions of music can be:Next bent, upper one bent, Chinese song, English song;The sub-instructions of video can be:Tengxun regards Frequently, QQ videos, youku.com's video;The sub-instructions of smart home can be:Curtain, bedroom air-conditioning, electric light etc..In this way, electronic equipment The corresponding speech data of the voice signal of collection is identified by user-defined instruction database, can effectively ensure voice The accuracy rate of identification.
Further, since the default instruction database is limited instruction set, alone word identification technology is used, Therefore, the embodiment of the present invention, without carrying out semantic analysis by voice cloud service, can be provided the user in speech recognition process Offline service.
In embodiments of the present invention, user can also be configured in preset instructions storehouse to signal acquisition periods.Specifically Ground, user is asked by the voice APP settings for sending signal acquisition periods to the voice server, the voice service Device is received after the setting request of the signal acquisition periods, controls the display interface of the voice APP to show that the signal is adopted The setting interface in collection cycle, user is according to oneself demand at the setting interface of the signal acquisition periods to signal acquisition week Phase is configured, and after the setup, is sent to the voice server and set successfully request, and the voice server exists Receive after the successful request of the setting, preserve the setting of the signal acquisition periods, and in interactive voice next time, Electronic equipment is acquired according to the signal acquisition periods of preservation to the voice signal of user.
In embodiments of the present invention, user can also be configured to the voice collecting progress chart target type, specifically Ground, user sends voice collecting progress chart target type to the voice server by the voice APP and sets request, described Voice server receives the voice collecting progress chart target and set after request, controls the display interface of the voice APP to show Show that voice collecting progress chart target sets interface, user sets interface to select oneself needs according to oneself demand in progress chart target Voice collecting progress icon, and after the setup, sent to the voice server and successfully request, the voice be set Server preserves the voice collecting progress chart target and set after the successful request of the setting is received, and next time Interactive voice in, show preserve voice collecting progress icon.
Step 105, the director data for determining in the speech data and local instruction database is when the match is successful, output with it is described Corresponding second voice message of director data.
In the embodiment of the present invention, work instruction data, dormancy instruction data and wake-up are included in the preset instructions storehouse Director data, the work instruction data that the electronic equipment determines in the speech data and local instruction database is when the match is successful, Show that the speech data is recognized successfully, then the electronic equipment exports the second voice corresponding with the work instruction data and carried Show, for example, the work instruction data is:" music ", then second voice message is " music ", for reporting to user's input Speech data recognize successfully;Or, when the electronic equipment determines that the speech data refers to the work in preset instructions storehouse Data are made to mismatch, and during with the dormancy instruction Data Matching, then the electronic equipment is exported and the dormancy instruction data Corresponding dormancy prompt tone.For example, the dormancy instruction data are:" rest ", then the dormancy prompt tone be:" I rests , it is busy to be me ".Afterwards, the speech recognition mode is changed to park mode;Conversely, when the electronic equipment determines institute's predicate When sound data are mismatched with all director datas in preset instructions storehouse, show this speech data recognition failures, terminate this Secondary interactive voice, and first voice message is exported again, interactive voice is realized in the way of continuously circulating.Here, it is described Work instruction data refers to that the voice signal that the electronic equipment is inputted according to active user performs the data of command adapted thereto group.Example Such as, the instruction group included in the electronic equipment has " bedroom ", " amusement ", if the voice signal corresponding instruction group being currently received When " bedroom ", then the electronic equipment performs the instruction group " bedroom ", and shows that the son in the instruction group " bedroom " refers to Order, for example, sub-instructions are:Lamp, bedroom air-conditioning, curtain;If during the voice signal corresponding instruction group " amusement " being currently received, The electronic equipment performs the instruction group " amusement ", and shows the sub-instructions in the instruction group " amusement ", for example, described Sub-instructions are:Music, game.
Step 106, the corresponding phonetic order of the speech data is performed.
In the embodiment of the present invention, the electronic equipment the second voice message output finish after, immediately hop to it is described The corresponding sub-instructions storehouse of speech data, and in the sub-instructions storehouse, continue to gather the voice signal that user sends.For example, institute Stating speech data is:Parlor, then the electronic equipment is when successfully identifying " parlor ", playing alert tones " parlor ", and redirects To parlor sub-instructions storehouse corresponding with parlor.For example, the parlor sub-instructions storehouse includes:Curtain, lamp, bedroom air-conditioning, then it is described Electronic equipment continues to gather the voice signal that user sends in the parlor sub-instructions storehouse, for example, collecting user's transmission The corresponding instruction of voice signal is " lamp ", then the electronic equipment performs the control operation to " lamp ".
By the voice collecting progress icon in the embodiment of the present invention, user can be helped to understand when oneself should send Phonetic order, also, according to voice collecting progress chart target state change and prompt tone, user can be made to understand oneself input Whether phonetic order is successfully identified, so that user has at fingertips whole speech control process.
Fig. 2 is the implementation process schematic diagram of interactive voice of the embodiment of the present invention;As shown in Figure 2:Including:
Step 201, instruction database is created;
Here, the instruction database is the instruction database defined by user oneself by way of phonetic entry or word input. For example, user-defined instruction database includes:Voice message data, work instruction data, dormancy instruction data, wake-up director data With voice collecting progress icon.Wherein, the work instruction data includes at least one work sub-instructions data.For example, described Work instruction data is amusement, then also includes in the work instruction data:The sons such as game, TV, film and camera refer to Make data.In this way, the voice signal that the local instruction database that system is created according to user oneself is sent to user is identified, not only Speech recognition accuracy can be improved, and without carrying out semantic analysis by cloud service, offline service can be provided the user.
Step 202, the first voice message is exported;
Here, first voice message can be the prompt tone of system default, for example, " please say " or by with The prompt tone that family is defined, for example, the voice message sound such as " owner please tell ", first voice message is mainly used in reminding user It is ready for phonetic entry.And when first voice message is finished, system immediately enters speech recognition state.
In the embodiment of the present invention, while the electronic equipment is playing first voice message, start voice and adopt Collection progress icon, makes the voice collecting progress icon be moved along direction initialization, voice collecting progress icon edge setting side Placed restrictions on to moving to before position, when not collecting voice signal, the voice collecting progress icon makees even from one end to the other side Speed motion, and stop motion when reaching the other end.For example, the voice collecting progress icon include tempo instructions frame, it is described enter Degree indicates to be provided with the progress indicator strip of uniform motion in frame;The progress indicator strip is from one end of the tempo instructions frame to another One end motion, and stop motion when reaching the other end of the tempo instructions frame.
The electronic equipment does not collect voice before voice collecting progress icon moves to along direction initialization and places restrictions on position During input, show that phonetic entry fails, first voice message is exported again, remind user to re-start phonetic entry.
In the embodiment of the present invention, the electronic equipment does not inquire what is matched with the phonetic order in preset instructions storehouse During sub-instructions, show that this interactive voice fails, terminate this interactive voice, resend first voice message, now, The voice collecting progress chart target progress zero, and after first voice message is finished, the voice collecting Progress chart indicated weight is newly started from scratch timing, and is moved along direction initialization.
In the embodiment of the present invention, the electronic equipment exports first voice message when receiving wake-up instruction, this When, the voice collecting progress icon zero.User is reminded to start phonetic entry.And played in first voice message Bi Hou, the voice collecting progress chart indicated weight is newly started from scratch timing.
Step 203, when the voice collecting progress icon is moved along direction initialization, to the voice signal in current environment It is acquired;
Here, user is after first voice message is finished, or is seeing the voice collecting progress icon When being moved along direction initialization, phonetic entry is carried out, the electronic equipment is transported in the voice collecting progress icon along direction initialization Move to placing restrictions on before position, gather voice signal.
Step 204, before the voice collecting progress icon moves to along direction initialization and places restrictions on position, voice letter is collected Number when, perform step 205;When not collecting voice signal, return and perform step 202;
Here, the electronic equipment is adopted before the voice collecting progress icon moves to along direction initialization and places restrictions on position When collecting voice signal, show the phonetic entry success of user;, whereas if in voice collecting progress icon edge setting side Placed restrictions on to moving to before position, voice signal is not collected, then show that this phonetic entry fails.
Step 205, speech data corresponding with the voice signal is searched in preset instructions storehouse, the voice number is determined According to whether being matched with work instruction data, when being matched with work instruction data, step 206 is performed, with work instruction data not Timing, performs step 208;Here, the work instruction data includes father's director data and sub-instructions data, for example, father instructs Data are:" amusement ", sub-instructions data are:" game ".The work instruction data can be described in detail below.
Step 206, prompt tone corresponding with the phonetic order data is played;
For example, the phonetic order data are " amusements ", then the corresponding prompt tone of the phonetic order data is " amusement ", To remind the phonetic entry of user to be identified successfully.
Step 207, the corresponding instruction of the phonetic order data is performed;
Here, the electronic equipment finds work instruction data and the speech data collected in preset instructions storehouse Timing, exports work prompt tone corresponding with the work instruction data, and the phonetic entry to report to user is recognized successfully, and After the work prompt tone is finished, sub-instructions storehouse corresponding with the phonetic order data is immediately hopped to, this is represented Interactive voice is completed, and re-executes step 203.Or, the electronic equipment is after the work prompt tone is finished, immediately Corresponding function is performed, without jump instruction storehouse.For example, the work prompt tone is to play music, then the electronic equipment exists After the work prompt tone is finished, music playback function is immediately performed.
Step 208, speech data corresponding with the voice signal is searched in preset instructions storehouse, the voice number is determined According to whether with dormancy instruction Data Matching, during with dormancy instruction Data Matching, perform step 209, with dormancy instruction data not Timing, performs step 202;
Here, dormancy instruction data refer to allow the voice APP in the electronic equipment to enter the instruction of resting state.Example Such as, dormancy instruction data are " heronsbill rests ".
Step 209, dormancy prompt tone corresponding with dormancy instruction data is sent;
Here, the electronic equipment determines the speech data collected and the dormancy instruction data in preset instructions storehouse During matching, dormancy prompt tone corresponding with dormancy instruction data is played, and after the dormancy prompt tone is finished, institute's predicate Sound APP enters resting state, while control voice collection progress icon enters park mode, and performs step 210.
Step 210, wait and wake up instruction;
Here, the voice APP of the electronic equipment in a dormant state when, only receive wake up instruction, it is other instruction without exception Do not receive.
Step 211, if receive wake-up instruction, when having been received by wake-up instruction, performs step 212, exits dormancy shape State, re-executes step 202, when not receiving wake-up instruction, re-executes step 210.
Step 212, resting state is exited.
Fig. 3 is the view of voice APP installations on mobile terminals;As shown in figure 3, the mobile terminal is hand Machine, and the entitled heronsbill voice assistant of the voice APP on mobile phone, including heronsbill voice assistant work shape State schematic diagram 301a and heronsbill voice assistant resting state schematic diagram 301b.Wherein, Figure 30 1a in the operating condition, progress chart Mode of operation is designated as, corresponding instruction group is shown.For example, parlor, bedroom, amusement, navigation, return.And it can receive any Phonetic order;And Figure 30 1b are in the dormant state, progress chart is designated as resting state, and system then only receives and wakes up instruction, do not receive Other any phonetic orders.
In the embodiment of the present invention, each node in multiway tree can regard the corresponding instruction of a work instruction data as Group.As shown in Fig. 3-301a, " happy cabin ", " bedroom ", " parlor ", " navigation ", " amusement " are considered as an instruction group, And each instruction group includes multiple sub-instructions again.For example, instruction group " happy cabin " include " bedroom ", " parlor ", " navigation ", " amusement " four sub-instructions, instruction group " bedroom " includes:Curtain, lamp, three sub-instructions of bedroom air-conditioning;In instruction group " parlor " Including:Parlor monitoring, parlor air-conditioning, robot, television set, five sub-instructions of video recorder;Instruction group " amusement " includes:Electricity Shadow, music, three sub-instructions of camera;Then without sub-instructions in instruction group " navigation ".
Instruction in the embodiment of the present invention includes two types, is respectively:Jump instruction and execute instruction.Wherein, it is described Jump instruction refers to the instruction that turn function is performed between each instruction.For example, the instruction group for being currently at working condition is " fast Happy cabin ", then when the phonetic order received is " bedroom ", be currently at instruction group " happy cabin " switching of working condition To instruction group " bedroom ".
The execute instruction refers to the instruction for performing specific function.For example, the instruction group for being currently at working condition is " sound It is happy ", then when it is " music " to receive phonetic order, then music playback function is performed, not cutting between execute instruction group Change operation.
In the embodiment of the present invention, only allow an instruction group in running order in the same time, such as, currently, instruction When group " amusement " is in running order, system currently only supports " film ", " music " and " camera " corresponding phonetic order.
Shown in Fig. 3-301, in running order instruction group is " happy cabin ", and the rightmost side of current display interface is arranged List all instructions of present instruction group " happy cabin " in table, including " parlor ", " bedroom ", " amusement ", " navigation ", " return Return " five instructions.Wherein, in five instructions, " parlor ", " bedroom ", " amusement ", " navigation " are jump instructions, are respectively used to Execute instruction turn function, for example, " parlor " instruction performs and jumps to instruction group " parlor " from present instruction group " happy cabin ". And " return " instruction is then used to jump to upper level instruction from present instruction group.For example, being currently at the instruction group of working condition It is " music ", then when performing " return " instruction, then jumps to father's instruction group " amusement " of " music " instruction.
Fig. 3 also realizes schematic diagram 302a including phonetic order (one);As shown in Figure 30 2a:
User inputs phonetic order " amusement ", electronic equipment identification when instruction group " happy cabin " is in running order When to go out the phonetic order be " amusement ", output with after the corresponding voice message of " amusement " instruction, it is " happy small from instruction group immediately Room " jumps to instruction group " amusement ", and the control voice APP sub-instructions information that includes of display screen idsplay order group " amusement ". For example, the command information that instruction group " amusement " is included is:Music, film, camera, return.
Fig. 3 also realizes schematic diagram 302b including phonetic order (two);As shown in Figure 30 2b:
User's input speech signal " music ", electronic equipment identifies that the corresponding phonetic order of the voice signal is " sound It is happy " when, export with after the corresponding voice message of " music " instruction, jumping to instruction group " sound from present instruction group " amusement " immediately It is happy ", and the control voice APP command information that includes of display screen idsplay order group " music ".For example, the instruction group " music " Including command information be:" next ", " pause ", " broadcasting ", " upper one is first ", " end ", " song of Little Bear ", " English song Song ", " national language song ", " Music on Demand " and " return ".Wherein, " next ", " pause ", " broadcasting ", " upper one is first ", " knot Beam ", " song of Little Bear ", " English songs ", " national language song " are execute instructions." Music on Demand " and " return " is jump instruction.
Fig. 3 also realizes schematic diagram 302c including phonetic order (three), as shown in Figure 30 2c:
User's input phonetic order " broadcastings ", electronic equipment identifies the phonetic order when being " broadcasting ", and output is with " broadcasting Put " instruct after corresponding voice message, the music in current music storehouse is played immediately, and here, the music of broadcasting can be upper one A song or the song of system shuffle for the last broadcasting of subsystem record, can also be and set according to user The song for the music sequential selection put.
Fig. 3 also realizes schematic diagram 302d including phonetic order (four).As shown in Figure 30 2d:
User's input phonetic order " Aladdin rest ", electronic equipment identifies that the phonetic order is that " Aladdin is stopped During breath ", after output voice message " I rests, busy to be me " corresponding with " Aladdin rest " instruction, system Immediately enter resting state.
Fig. 4 is the view that voice APP is arranged on wearable device;As shown in figure 4, the wearable device is hand Table, including working state schematic representation 401a and resting state schematic diagram 401b, when voice APP is in running order, such as scheme Shown in 401a, progress icon is in mode of operation, and display multiple instruction group, and can receive any phonetic order;Work as language Sound APP in a dormant state when, as shown in Figure 40 1b, progress icon be in park mode, except wake up instruction in addition to do not receive appoint What phonetic order.Voice APP recognizes that the method for phonetic order is consistent with Fig. 3 in described Fig. 4, its voice APP's distinguished Carrier is installed different, in the embodiment of the present invention, method reference picture 1, Fig. 2 of the voice APP identifications and execution phonetic order With described by Fig. 3, it will not be repeated here.
Fig. 5 is Fig. 3 and Fig. 4 workflow schematic diagram;As shown in figure 5, including:
Step 501, voice APP loads default instruction database;
Here, when the voice APP in a dormant state when, user can send to electronic equipment and wake up instruction to start The voice APP, and after the electronic equipment opens the voice APP, load user-defined local instruction database, example Such as, the local instruction database includes:Voice message sound data " please say ", work instruction data " bedroom, parlor, navigation, amusement, Return ", dormancy instruction data " Aladdin rest ", dormancy prompt tone data " I rests, busy to be me ", wake-up instruction Data " calling Latin " and voice collecting progress icon " progress circle " (referring to shown in Figure 30 1a).
Step 502, voice message sound " please say " is played;
Here, user starts phonetic entry after the voice message sound is heard.
Step 503, voice collecting progress icon is moved along direction initialization, and timing of starting from scratch, and voice APP opens language Sound identification function;
Step 504, user's input phonetic order " amusement " (referring to shown in Figure 30 2a);
Step 505, when electronic equipment collects voice signal, the voice signal is parsed into speech data and to described Speech data is identified, if recognizing successfully, performs step 506, if recognition failures, re-executes step 502;
Step 506, whether the speech data matches with work instruction data.When being matched with work instruction data, perform Step 507, when being mismatched with work instruction data, step 509 is performed;
Step 507, instruction group " amusement " matching corresponding with work instruction data, plays voice message sound " amusement ";
Here, the voice message sound is used to point out user to identify phonetic order " amusement ".
Step 508, phonetic order " amusement " is performed;
Here, the electronic equipment jumps to instruction group " amusement " node from current instruction group node, and described Instruction group " amusement " node includes sub-instructions " film, music, camera, return ", afterwards, re-executes step 503.
In the embodiment of the present invention, the voice APP circulations perform step 502 to step 505, it is determined that collecting user's transmission During voice signal, the voice signal collected is identified electronic equipment, for example, identifying that the voice signal correspondingly refers to When making group " music ", instruction group " music " is jumped to from present instruction group.It can refer in the instruction group " music " including son Make " next, pause, play, upper one, end, the song of Little Bear, English songs, national language song, Music on Demand, return " (join As shown in Figure 30 2b), afterwards, re-execute step 503.
In the embodiment of the present invention, the voice APP circulations perform step 502 to step 505, collect user and send voice During signal, the voice signal is identified.For example, (referring to figure when determining voice signal corresponding instruction " broadcasting " Shown in 302c), music playback function is performed, afterwards, step 503 is re-executed.
Step 509, the speech data whether with dormancy instruction Data Matching, determine the speech data and dormancy instruction During Data Matching, step 510 is performed, when determining that the speech data is mismatched with dormancy instruction data, step is re-executed 502;
Here, the voice APP that the electronic equipment is installed identifies the voice letter that user sends in default instruction database It is number corresponding when being dormancy instruction " Aladdin rest ", perform step 510, it is unidentified go out dormancy instruction " Aladdin rest " When, re-execute step 502.
Step 510, dormancy prompt tone " I rests, busy to be me " is sent;
Here, voice APP is sent after the dormancy prompt tone, and the voice APP enters resting state (referring to Figure 30 2d institutes Show), perform step 511;
Step 511, wait and wake up instruction " calling Latin ", and perform step 512;
Step 512, if receive wake-up instruction " calling Latin ", when having been received by wake-up instruction " calling Latin ", holds Row step 513, when not receiving wake-up instruction " calling Latin ", re-executes step 511;
Step 513, resting state is exited, step 502 is re-executed.
Fig. 6 is a kind of composition schematic diagram of voice interaction device of the embodiment of the present invention:As shown in fig. 6, described device includes: Output unit 601, collecting unit 602, resolution unit 603, judging unit 604 and execution unit 605;
Wherein, the output unit 601, for receiving during the first control instruction, voice enabled acquisition function, output the One voice message, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to The director data for determining in speech data and local instruction database exports the second language corresponding with the director data when the match is successful Sound is pointed out;
The collecting unit 602, for when the voice collecting progress icon is moved along direction initialization, to current environment In voice signal be acquired;
The resolution unit 603, for before the voice collecting progress icon moves to along direction initialization and places restrictions on position When collecting voice signal, the voice signal is parsed, speech data is obtained;
The judging unit 604, for the speech data to be matched with the director data in local instruction database;
The execution unit 605, for determining the speech data, the match is successful with the director data in local instruction database When, perform the corresponding phonetic order of the speech data.
In the embodiment of the present invention, described device can be specifically the electronic equipment for being provided with voice APP.The electronic equipment Including:Mobile terminal, Wearable terminal, fixed terminal, car-mounted terminal, bank transaction terminal, supermarket's transaction terminal and express delivery The delivery terminal of mailbag.Wherein mobile terminal can at least include mobile phone, tablet personal computer, PDA, navigator, game machine, intelligence object for appreciation Tool etc., Wearable terminal can at least include intelligent watch, intelligent glasses, intelligent running shoes etc., and fixed terminal can at least be wrapped Include in desktop computer, desktop computer, integral computer, television set, projecting apparatus, sound equipment etc., above intelligent toy, intelligent watch Intelligence refers to that equipment includes processor and storage medium, so as to automatically or according to the setting of operator such as user hold The instruction of some sequencing of row.
In the embodiment of the present invention, described device also includes display unit 606, for referring in first control received When order is the open command of speech recognition mode, the voice collecting progress icon is shown;Or, first control received When system instruction is the wake-up instruction in the local instruction database, the voice collecting progress icon is shown, to enable voice collecting Function, and the first voice message is exported from the output unit 601 to user, first voice message is used to inform that user is System immediately enters speech recognition state, reminds user to start phonetic entry.And start voice collecting progress icon, make the voice Collection progress icon is moved along direction initialization.Here, the voice collecting progress icon can be voice progress bar, progress circle or Progress percentage.In addition, first voice message can be the prompt tone defined by user oneself, such as:" please say " or " please The voice messages such as instruction ".The electronic equipment immediately enters speech recognition shape when first voice message output is finished State.
In the embodiment of the present invention, voice APP in said device is installed after speech recognition state is entered, the voice Collection progress icon moves with uniform velocity along direction initialization to position is placed restrictions on, and timing of starting from scratch.Trigger the collecting unit Voice signal in 602 pairs of current environments is acquired.The collecting unit 602 is additionally operable to the gatherer process in voice signal In, voice collecting progress chart target progress described in real-time update, i.e., described voice collecting progress chart target progress gradually increases.Example Such as, when the voice collecting progress icon is voice progress bar, the voice collecting progress icon at least includes tempo instructions frame, The progress indicator strip of uniform motion is provided with the tempo instructions frame;The progress indicator strip is by the one of the tempo instructions frame Stop motion when holding to another end motion, and reaching the other end of the tempo instructions frame.Here, the voice collecting progress chart It is 1-15 seconds to mark the time for moving to the other end from one end.In order to prevent the voice collecting progress chart target movement velocity too It hurry up, described device does not collect the voice signal of user's input, or, the voice collecting progress chart target movement velocity is too Slowly, influence in the acquisition time and recognition accuracy of described device, the embodiment of the present invention, it is preferable that enter the voice collecting The motion duration of degree icon is set to 5 seconds.Specifically, it is within described 5 seconds the upper of the single acquisition Speech time of collecting unit 602 Limit, if the collecting unit 602 completed voice collecting in 3 seconds, the voice collecting progress icon can also stop immediately Motion, then the time of this interactive voice is exactly 3 seconds.
In the embodiment of the present invention, user is hearing that first voice message or display interface in the voice APP see To the voice collecting progress chart timestamp, phonetic entry is proceeded by, and ensuring the voice collecting progress icon arrival Place restrictions on before position, complete the phonetic entry.Here, it is the voice collecting that the voice collecting progress chart target, which places restrictions on position, Progress chart target maximum progress threshold value, that is to say, that described device single allows the maximum time value of phonetic entry, i.e., when overtime Between.In this way, limiting the time of described device single acquisition voice signal according to voice collecting progress icon, the dress can be shortened The recognition time to voice signal is put, while improving the signal identification efficiency of described device.
In the embodiment of the present invention, the collecting unit 602 is moved in the voice collecting progress icon along direction initialization Place restrictions on before position, it is determined that when collecting voice signal, showing this phonetic entry success, the resolution unit 603 being triggered, by institute State resolution unit 603 and the voice signal is divided into the certain speech frame of length, then each frame speech data is asked for average Pitch period, obtains speech data corresponding with the voice signal.
If on the contrary, voice collecting progress icon described in the collecting unit 602 moves to along direction initialization and places restrictions on position Before, when not collecting the voice signal, show that phonetic entry fails, then terminate this interactive voice, trigger the output single Member 601 exports the first voice message again.
In the embodiment of the present invention, the resolution unit 603 triggers the judging unit after the speech data is obtained 604, director data corresponding with the speech data is searched in preset instructions storehouse by the judging unit 604, and looked into Look for result.Here, the preset instructions storehouse is the instruction database according to oneself requirement definition by user.Specifically, user passes through electricity Sub- equipment sends instruction database request to create, institute's predicate to the corresponding voice servers of the voice APP installed on the electronic equipment Sound server is responded after the request to create, controls the establishment interface in the display interface idsplay order storehouse of the voice APP, user The instruction database is created according to the demand of oneself at the interface that creates of the instruction database.For example, user can be created by described Speech voice input function in interface carries out phonetic entry, to complete the establishment of the instruction database, directly can also be created described Interface is inputted by word, completes the establishment of the instruction database.Wherein, father's instruction can be included in the instruction database of establishment and son refers to Order, wherein, father's instruction can be:Music, video, smart home, the sub-instructions of music can be:A next bent, upper song, Chinese song, English song;The sub-instructions of video can be:Tengxun's video, QQ videos, youku.com's video;The sub-instructions of smart home can To be:Curtain, bedroom air-conditioning, electric light etc..In this way, electronic equipment by user-defined instruction database to the voice signal that identifies Corresponding phonetic order is identified, and can effectively ensure the accuracy rate of speech recognition.
Further, since the default instruction database is limited instruction set, alone word identification technology is used, Therefore, the embodiment of the present invention, without carrying out semantic analysis by voice cloud service, can be provided the user in speech recognition process Offline service.
In embodiments of the present invention, user can be to be configured in preset instructions storehouse to signal acquisition periods.Specifically, User sends signal acquisition periods by electronic equipment to the corresponding voice servers of the voice APP installed on the electronic equipment Setting request, the voice server received after the setting request of the signal acquisition periods, controls the voice APP's Display interface shows the setting interface of the signal acquisition periods, user's setting in the signal acquisition periods according to oneself demand Put interface to be configured the signal acquisition periods, and after the setup, send and set successfully to the voice server Request, the voice server receiving the setting successfully after request, preserving the setting of the signal acquisition periods, And in interactive voice next time, the collecting unit 602 is according to the voice signal of the signal acquisition periods of preservation to user It is acquired.
In embodiments of the present invention, user can also be configured to voice collecting progress icon, and specifically, user passes through Electronic equipment sends voice collecting progress chart target to voice server and sets request, and the voice server receives institute's predicate Sound collection progress chart target is set after request, controls the display interface of the voice APP to show that voice collecting progress chart target is set Interface is put, user sets the voice collecting progress that interface selects oneself to need according to oneself demand in voice collecting progress chart target Icon, and after the setup, sent to the voice server and successfully request is set, the voice server is being received The setting successfully after request, preserves the voice collecting progress chart target and sets, and the display unit 606 is next time In interactive voice, the voice collecting progress icon preserved is shown.
In the embodiment of the present invention, include in the preset instructions storehouse work instruction data, dormancy instruction data, wake up refer to Data are made, the output unit 601 determines the work in the speech data and preset instructions storehouse that the collecting unit 602 is collected When director data is matched, show that the speech data is recognized successfully, then export the second language corresponding with the work instruction data Sound is pointed out, for example, the work instruction data is:" music ", then second voice message is " music ", for reporting to user The phonetic order of input is recognized successfully;Or, when the output unit 601 is determined in the speech data and preset instructions storehouse Work instruction data is mismatched, and during with the dormancy instruction Data Matching, is then exported corresponding with the dormancy instruction data Dormancy prompt tone.For example, the dormancy instruction data are:" heronsbill rest ", then the dormancy prompt tone is:" I rests , it is busy to be me ".Afterwards, the speech recognition mode is changed to park mode;Conversely, described in being determined when the output unit 601 When speech data is mismatched with all director datas in preset instructions storehouse, show this phonetic order recognition failures, terminate This interactive voice, and first voice message is exported again.Interactive voice is realized in the way of continuously circulating.
In the embodiment of the present invention, the output unit 601 is after the second voice message output is finished, and triggering is described to perform list Member 605, immediately hops to sub-instructions storehouse corresponding with the phonetic order data, and trigger described by the execution unit 605 Collecting unit 602 continues to gather the voice signal that user sends in the sub-instructions storehouse.For example, the phonetic order data are: Parlor, then described device is when successfully identifying " parlor ", after the playing alert tones of output unit 601 " parlor " finish, institute State execution unit 605 and immediately hop to parlor sub-instructions storehouse corresponding with parlor.For example, the parlor sub-instructions storehouse includes:Window Curtain, lamp, bedroom air-conditioning.Fig. 2 descriptions in specific interactive voice implementation process reference method embodiment, will not be repeated here.
It should be understood by those skilled in the art that, embodiments of the invention can be provided as method, system or computer program Product.Therefore, the shape of the embodiment in terms of the present invention can use hardware embodiment, software implementation or combine software and hardware Formula.Moreover, the present invention can be used can use storage in one or more computers for wherein including computer usable program code The form for the computer program product that medium is implemented on (including but is not limited to magnetic disk storage and optical memory etc.).
The present invention is the flow with reference to method according to embodiments of the present invention, equipment (system) and computer program product Figure and/or block diagram are described.It should be understood that can be by every first-class in computer program instructions implementation process figure and/or block diagram Journey and/or the flow in square frame and flow chart and/or block diagram and/or the combination of square frame.These computer programs can be provided The processor of all-purpose computer, special-purpose computer, Embedded Processor or other programmable data processing devices is instructed to produce A raw machine so that produced by the instruction of computer or the computing device of other programmable data processing devices for real The device for the function of being specified in present one flow of flow chart or one square frame of multiple flows and/or block diagram or multiple square frames.
These computer program instructions, which may be alternatively stored in, can guide computer or other programmable data processing devices with spy Determine in the computer-readable memory that mode works so that the instruction being stored in the computer-readable memory, which is produced, to be included referring to The manufacture set by dress is made, the command device is realized in one flow of flow chart or multiple flows and/or one side of block diagram The function of being specified in frame or multiple square frames.
These computer program instructions can be also loaded into computer or other programmable data processing devices so that in meter Series of operation steps is performed on calculation machine or other programmable devices to produce computer implemented processing, thus in computer or The instruction performed on other programmable devices is provided for realizing in one flow of flow chart or multiple flows and/or block diagram one The step of function of being specified in individual square frame or multiple square frames.
The foregoing is only a preferred embodiment of the present invention, is not intended to limit the scope of the present invention.

Claims (12)

1. a kind of voice interactive method, it is characterised in that methods described includes:
When receiving the first control instruction, voice enabled acquisition function exports the first voice message, and start voice collecting progress Icon, makes the voice collecting progress icon be moved along direction initialization;
When the voice collecting progress icon is moved along direction initialization, the voice signal in current environment is acquired;
The voice collecting progress icon moved to along direction initialization place restrictions on collect voice signal before position when, parsing is described Voice signal, obtains speech data;
The speech data is matched with the director data in local instruction database;
The director data for determining in the speech data and local instruction database is exported corresponding with the director data when the match is successful The second voice message;
Perform the corresponding phonetic order of the speech data.
2. according to the method described in claim 1, it is characterised in that the voice collecting progress icon at least includes tempo instructions The progress indicator strip of uniform motion is provided with frame, the tempo instructions frame;
The progress indicator strip reaches the another of the tempo instructions frame from one end of the tempo instructions frame to another end motion Stop motion during one end.
3. according to the method described in claim 1, it is characterised in that the voice enabled acquisition function, including:
When first control instruction received is the open command of speech recognition mode, the voice collecting progress chart is shown Mark, and start counting up;
Or, when first control instruction received is the wake-up instruction in the local instruction database, show the voice Collection progress icon, and start counting up.
4. according to the method described in claim 1, it is characterised in that methods described also includes:
The voice collecting progress icon moved to along direction initialization place restrictions on do not collect the voice signal before position when, weight Newly export first voice message.
5. according to the method described in claim 1, it is characterised in that output the second voice corresponding with the director data is carried Show, including:
When determining that the speech data is matched with the dormancy instruction in local instruction database, stop corresponding with the dormancy instruction is exported Dormancy prompt tone;
Or, when determining that the speech data is matched with the work order in local instruction database, output and the work order pair The work prompt tone answered.
6. according to the method described in claim 1, it is characterised in that determine the speech data and the instruction in local instruction database When data are mismatched, first voice message is exported again.
7. a kind of voice interaction device, it is characterised in that described device includes:Output unit, collecting unit, resolution unit, sentence Disconnected unit and execution unit;
Wherein, the output unit, for receiving during the first control instruction, voice enabled acquisition function exports the first voice Prompting, and start voice collecting progress icon, the voice collecting progress icon is moved along direction initialization;It is additionally operable to determine language Director data in sound data and local instruction database exports the second voice corresponding with the director data and carried when the match is successful Show;
The collecting unit, for when the voice collecting progress icon is moved along direction initialization, to the language in current environment Message number is acquired;
The resolution unit, for collecting language before the voice collecting progress icon moves to along direction initialization and places restrictions on position During message, the voice signal is parsed, speech data is obtained;
The judging unit, for the speech data to be matched with the director data in local instruction database;
The execution unit, when the match is successful for determining the director data in the speech data and local instruction database, is performed The corresponding phonetic order of the speech data.
8. device according to claim 7, it is characterised in that the voice collecting progress icon at least includes tempo instructions The progress indicator strip of uniform motion is provided with frame, the tempo instructions frame;The progress indicator strip is by the tempo instructions frame One end to another end motion, and stop motion when reaching the other end of the tempo instructions frame.
9. device according to claim 7, it is characterised in that described device also includes:
Display unit, when first control instruction for receiving is the open command of speech recognition mode, display is described Voice collecting progress icon, and start counting up;Or, during first control instruction received is the local instruction database When waking up instruction, the voice collecting progress icon is shown, and start counting up.
10. device according to claim 7, it is characterised in that the output unit, is additionally operable to enter in the voice collecting Degree icon is moved to along direction initialization is placed restrictions on when not collecting the voice signal before position, and first voice is exported again and is carried Show.
11. device according to claim 7, it is characterised in that the output unit, specifically for determining the voice number During according to being matched with the dormancy instruction in local instruction database, dormancy prompt tone corresponding with the dormancy instruction is exported;Or, it is determined that When the speech data is matched with the work order in local instruction database, work prompting corresponding with the work order is exported Sound.
12. device according to claim 7, it is characterised in that the output unit, is additionally operable to determine the speech data When being mismatched with the director data in local instruction database, first voice message is exported again.
CN201710372523.2A 2017-05-24 2017-05-24 Voice interaction method and device Pending CN107180631A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201710372523.2A CN107180631A (en) 2017-05-24 2017-05-24 Voice interaction method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201710372523.2A CN107180631A (en) 2017-05-24 2017-05-24 Voice interaction method and device

Publications (1)

Publication Number Publication Date
CN107180631A true CN107180631A (en) 2017-09-19

Family

ID=59831498

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201710372523.2A Pending CN107180631A (en) 2017-05-24 2017-05-24 Voice interaction method and device

Country Status (1)

Country Link
CN (1) CN107180631A (en)

Cited By (15)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108873713A (en) * 2018-06-25 2018-11-23 广州市锐尚展柜制作有限公司 A kind of man-machine interaction method and system applied in smart home
CN108877791A (en) * 2018-05-23 2018-11-23 百度在线网络技术(北京)有限公司 Voice interactive method, device, server, terminal and medium based on view
CN109176537A (en) * 2018-08-09 2019-01-11 北京云迹科技有限公司 content displaying method and device for robot
CN109360570A (en) * 2018-10-19 2019-02-19 歌尔科技有限公司 Audio recognition method, speech ciphering equipment and the readable storage medium storing program for executing of speech ciphering equipment
CN109903758A (en) * 2017-12-08 2019-06-18 阿里巴巴集团控股有限公司 Audio-frequency processing method, device and terminal device
CN109960537A (en) * 2019-03-29 2019-07-02 北京金山安全软件有限公司 Interaction method and device and electronic equipment
CN110767222A (en) * 2019-06-19 2020-02-07 北京嘀嘀无限科技发展有限公司 Order receiving method and device
CN111583923A (en) * 2020-04-28 2020-08-25 北京小米松果电子有限公司 Information control method and device, and storage medium
CN111833858A (en) * 2019-04-17 2020-10-27 百度在线网络技术(北京)有限公司 Voice interaction state display method and device based on loudspeaker box
TWI739067B (en) * 2019-02-13 2021-09-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform and operation method thereof
CN113539252A (en) * 2020-04-22 2021-10-22 庄连豪 Barrier-free intelligent voice system and control method thereof
CN113658601A (en) * 2021-08-18 2021-11-16 开放智能机器(上海)有限公司 Voice interaction method, device, terminal equipment, storage medium and program product
TWI767499B (en) * 2019-02-13 2022-06-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform integrating online custom service system and its operation method
TWI767498B (en) * 2019-02-13 2022-06-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform integrating machine learning and operation method thereof
TWI769653B (en) * 2019-02-13 2022-07-01 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform capable of reassembling voice segment and its operation method

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102262890A (en) * 2010-05-31 2011-11-30 鸿富锦精密工业(深圳)有限公司 Electronic device and marking method thereof
US20120081530A1 (en) * 2009-06-13 2012-04-05 Rolestar, Inc. System for Juxtaposition of Separately Recorded Scenes
CN105244025A (en) * 2015-10-29 2016-01-13 惠州Tcl移动通信有限公司 Voice identification method and system based on intelligent wearable device
CN106356059A (en) * 2015-07-17 2017-01-25 中兴通讯股份有限公司 Voice control method, device and projector

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20120081530A1 (en) * 2009-06-13 2012-04-05 Rolestar, Inc. System for Juxtaposition of Separately Recorded Scenes
CN102262890A (en) * 2010-05-31 2011-11-30 鸿富锦精密工业(深圳)有限公司 Electronic device and marking method thereof
CN106356059A (en) * 2015-07-17 2017-01-25 中兴通讯股份有限公司 Voice control method, device and projector
CN105244025A (en) * 2015-10-29 2016-01-13 惠州Tcl移动通信有限公司 Voice identification method and system based on intelligent wearable device

Cited By (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109903758A (en) * 2017-12-08 2019-06-18 阿里巴巴集团控股有限公司 Audio-frequency processing method, device and terminal device
CN109903758B (en) * 2017-12-08 2023-06-23 阿里巴巴集团控股有限公司 Audio processing method and device and terminal equipment
CN108877791B (en) * 2018-05-23 2021-10-08 百度在线网络技术(北京)有限公司 Voice interaction method, device, server, terminal and medium based on view
CN108877791A (en) * 2018-05-23 2018-11-23 百度在线网络技术(北京)有限公司 Voice interactive method, device, server, terminal and medium based on view
US11727927B2 (en) 2018-05-23 2023-08-15 Baidu Online Network Technology (Beijing) Co., Ltd. View-based voice interaction method, apparatus, server, terminal and medium
CN108873713A (en) * 2018-06-25 2018-11-23 广州市锐尚展柜制作有限公司 A kind of man-machine interaction method and system applied in smart home
CN109176537A (en) * 2018-08-09 2019-01-11 北京云迹科技有限公司 content displaying method and device for robot
CN109176537B (en) * 2018-08-09 2022-05-10 北京云迹科技股份有限公司 Content display method and device for robot
CN109360570A (en) * 2018-10-19 2019-02-19 歌尔科技有限公司 Audio recognition method, speech ciphering equipment and the readable storage medium storing program for executing of speech ciphering equipment
TWI767499B (en) * 2019-02-13 2022-06-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform integrating online custom service system and its operation method
TWI739067B (en) * 2019-02-13 2021-09-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform and operation method thereof
TWI767498B (en) * 2019-02-13 2022-06-11 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform integrating machine learning and operation method thereof
TWI769653B (en) * 2019-02-13 2022-07-01 華南商業銀行股份有限公司 Cross-channel artificial intelligence dialogue platform capable of reassembling voice segment and its operation method
CN109960537A (en) * 2019-03-29 2019-07-02 北京金山安全软件有限公司 Interaction method and device and electronic equipment
CN111833858A (en) * 2019-04-17 2020-10-27 百度在线网络技术(北京)有限公司 Voice interaction state display method and device based on loudspeaker box
CN110767222B (en) * 2019-06-19 2021-03-09 北京嘀嘀无限科技发展有限公司 Order receiving method and device
CN110767222A (en) * 2019-06-19 2020-02-07 北京嘀嘀无限科技发展有限公司 Order receiving method and device
CN113539252A (en) * 2020-04-22 2021-10-22 庄连豪 Barrier-free intelligent voice system and control method thereof
CN111583923A (en) * 2020-04-28 2020-08-25 北京小米松果电子有限公司 Information control method and device, and storage medium
CN111583923B (en) * 2020-04-28 2023-11-14 北京小米松果电子有限公司 Information control method and device and storage medium
CN113658601A (en) * 2021-08-18 2021-11-16 开放智能机器(上海)有限公司 Voice interaction method, device, terminal equipment, storage medium and program product

Similar Documents

Publication Publication Date Title
CN107180631A (en) Voice interaction method and device
JP6977169B2 (en) Digital Voice Assistant Coordinating signal processing between computing devices
CN105657535B (en) A kind of audio identification methods and device
CN104714981B (en) Voice message searching method, device and system
CN109147779A (en) Voice data processing method and device
US11457061B2 (en) Creating a cinematic storytelling experience using network-addressable devices
CN108351872A (en) Equipment selection for providing response
CN106356059A (en) Voice control method, device and projector
CN107948672B (en) Method and system for storing video data, server and wearable device
CN104866275B (en) Method and device for acquiring image information
CN108449493A (en) Voice communication data processing method, device, storage medium and mobile terminal
CN107680614B (en) Audio signal processing method, apparatus and storage medium
CN106210266B (en) A kind of acoustic signal processing method and audio signal processor
CN108874904A (en) Speech message searching method, device, computer equipment and storage medium
KR20160106075A (en) Method and device for identifying a piece of music in an audio stream
CN108694947A (en) Sound control method, device, storage medium and electronic equipment
WO2019045816A1 (en) Graphical data selection and presentation of digital content
CN109509472A (en) Method, apparatus and system based on voice platform identification background music
CN104092809A (en) Communication sound recording method and recorded communication sound playing method and device
CN112270918A (en) Information processing method, device, system, electronic equipment and storage medium
CN106601242A (en) Executing method and device of operation event and terminal
CN109686370A (en) The method and device of fighting landlord game is carried out based on voice control
CN109686372B (en) Resource playing control method and device
JP2023526285A (en) Test method and apparatus for full-duplex voice interaction system
JP2022036953A (en) Adjustment of signal processing between digital voice assistant computing devices

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20170919

RJ01 Rejection of invention patent application after publication