CN108962259A - Processing method and the first electronic equipment - Google Patents
Processing method and the first electronic equipment Download PDFInfo
- Publication number
- CN108962259A CN108962259A CN201810825087.4A CN201810825087A CN108962259A CN 108962259 A CN108962259 A CN 108962259A CN 201810825087 A CN201810825087 A CN 201810825087A CN 108962259 A CN108962259 A CN 108962259A
- Authority
- CN
- China
- Prior art keywords
- sound
- condition
- electronic equipment
- voice control
- voice
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000003672 processing method Methods 0.000 title claims abstract description 18
- 238000004458 analytical method Methods 0.000 claims abstract description 32
- 238000012544 monitoring process Methods 0.000 claims abstract description 12
- 230000006870 function Effects 0.000 claims description 69
- 238000012545 processing Methods 0.000 claims description 37
- 238000000034 method Methods 0.000 claims description 21
- 238000012360 testing method Methods 0.000 claims description 15
- 238000012512 characterization method Methods 0.000 claims description 13
- 230000008054 signal transmission Effects 0.000 claims description 3
- 238000004891 communication Methods 0.000 description 16
- 230000005540 biological transmission Effects 0.000 description 15
- 230000002618 waking effect Effects 0.000 description 8
- 230000008569 process Effects 0.000 description 6
- 230000009467 reduction Effects 0.000 description 6
- 238000011946 reduction process Methods 0.000 description 6
- 230000000630 rising effect Effects 0.000 description 4
- 238000010586 diagram Methods 0.000 description 3
- 238000004590 computer program Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 230000005611 electricity Effects 0.000 description 2
- 238000003032 molecular docking Methods 0.000 description 2
- 230000011664 signaling Effects 0.000 description 2
- 244000062793 Sorghum vulgare Species 0.000 description 1
- 235000013399 edible fruits Nutrition 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
- 235000019713 millet Nutrition 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000004044 response Effects 0.000 description 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/30—Distributed recognition, e.g. in client-server systems, for mobile phones or network applications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/28—Constructional details of speech recognition systems
- G10L15/34—Adaptation of a single recogniser for parallel processing, e.g. by use of multiple processors or cloud computing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/22—Procedures used during a speech recognition process, e.g. man-machine dialogue
- G10L2015/223—Execution procedure of a spoken command
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Computing Systems (AREA)
- Mathematical Physics (AREA)
- Theoretical Computer Science (AREA)
- Telephonic Communication Services (AREA)
Abstract
The embodiment of the present application discloses a kind of processing method and the first electronic equipment, and monitoring voice input starts voice control function if detecting that the first sound meets the first condition in multiple preset conditions first;Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are associated with different servers;Second sound input after acquiring first voice input;Obtain the analysis result that the associated first server of the first condition is directed to the second sound.It is directed to same first electronic equipment, different servers can be accessed according to the sound for meeting different preset conditions.To realize the function that first electronic equipment accesses multiple servers.
Description
Technical field
This application involves technical field of data transmission, and more specifically, it relates to processing method and the first electronic equipments.
Background technique
Voice control is widely used, for example, the electronic equipments such as intelligent sound box all have voice control function.
Currently, the electronic equipment with voice control function is all the single access for realizing a server, for example, sub- horse
When inferior Alexa intelligent sound box detects voice input, voice can be sent to the corresponding server of Amazon;The intelligence of Google
When speaker detects voice input, voice can be sent to the corresponding server of Google.
Summary of the invention
In view of this, this application provides a kind of processing method and the first electronic equipments.
To achieve the above object, the application provides the following technical solutions:
A kind of processing method is applied to the first electronic equipment, the treating method comprises:
Monitor voice input;
If detecting that the first sound meets the first condition in multiple preset conditions, start voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers;
Second sound input after acquiring first voice input;
Obtain the analysis result that the associated first server of the first condition is directed to the second sound.
Wherein, if described detect that the first sound meets the first condition in multiple preset conditions, start voice control
Function includes:
It detects whether first sound meets multiple preset conditions respectively, is preset with obtaining first sound with multiple
The corresponding testing result of condition;
If the testing result includes that first sound meets the first condition, start voice control function.
Wherein, further includes:
From multiple voice control modes, the first voice control mode is determined;
Wherein, a voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction, different voice control modes characterize different preset conditions;
The first voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction is the first condition.
Wherein, if described detect that the first sound meets first condition, starting voice control function includes:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
Wherein, the starting voice control function includes:
Determine that the server for analyzing the voice of subsequent input is the associated first server of the first condition.
A kind of first electronic equipment, comprising:
Pronunciation receiver, for monitoring voice input;
Chip is handled, if detecting that the first sound meets the in multiple preset conditions for the pronunciation receiver
One condition starts voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers;
The pronunciation receiver is also used to: the second sound input after acquisition first voice input;
The processor chips are also used to: obtaining the associated first server of the first condition for the second sound
Analysis result;
Output device, for exporting the analysis result.
Wherein, further includes:
Transmitting device, for carrying out signal transmission with the second electronic equipment;
If the processing chip detects that the first sound meets the first condition in multiple preset conditions in execution, start
When voice control function, it is specifically used for:
It detects whether first sound meets multiple preset conditions respectively, is preset with obtaining first sound with multiple
The corresponding testing result of condition;
If the testing result includes that first sound meets the first condition, start voice control function.
Wherein, further includes:
The processor chips are also used to: from multiple voice control modes, determining the first voice control mode;
Wherein, a voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction, different voice control modes characterize different preset conditions;
The first voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction is the first condition.
Wherein, if the processing chip detects that the first sound meets first in multiple preset conditions in execution
Part is specifically used for when starting voice control function:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
A kind of first electronic equipment, comprising:
Memory, for storing program;
Processor, for executing described program, described program is specifically used for:
Monitor voice input;
If detecting that the first sound meets the first condition in multiple preset conditions, start voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers;
Second sound input after acquiring first voice input;
Obtain the analysis result that the associated first server of the first condition is directed to the second sound.
It can be seen via above technical scheme that compared with prior art, this application discloses a kind of processing methods, supervise first
Voice input is surveyed, if detecting that the first sound meets the first condition in multiple preset conditions, starts voice control function;Its
In, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are associated with different
Server;Second sound input after acquiring first voice input;Obtain the associated first server of the first condition
For the analysis result of the second sound.It is directed to same first electronic equipment, it can be according to meeting different preset conditions
Sound accesses different servers.To realize the function that first electronic equipment accesses multiple servers.
Detailed description of the invention
In order to illustrate the technical solutions in the embodiments of the present application or in the prior art more clearly, to embodiment or will show below
There is attached drawing needed in technical description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this
The embodiment of application for those of ordinary skill in the art without creative efforts, can also basis
The attached drawing of offer obtains other attached drawings.
Fig. 1 is a kind of structure chart of implementation of processing system provided by the embodiments of the present application;
Fig. 2 is the structure chart of another implementation of processing system provided by the embodiments of the present application;
Fig. 3 provides a kind of signaling diagram of implementation of processing method for the embodiment of the present application;
Fig. 4 provides the schematic diagram that voice control mode selects a kind of implementation of the page for the embodiment of the present application;
Fig. 5 is a kind of structure chart of implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 6 is the structure chart of another implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 7 is the structure chart of another implementation of the first electronic equipment provided by the embodiments of the present application;
Fig. 8 is the structure chart of another implementation of processing system provided by the embodiments of the present application;
Fig. 9 is the structure chart of another implementation of the first electronic equipment provided by the embodiments of the present application.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present application, technical solutions in the embodiments of the present application carries out clear, complete
Site preparation description, it is clear that described embodiments are only a part of embodiments of the present application, instead of all the embodiments.It is based on
Embodiment in the application, it is obtained by those of ordinary skill in the art without making creative efforts every other
Embodiment shall fall in the protection scope of this application.
Processing method provided by the embodiments of the present application can be applied to processing system, as shown in Figure 1, being the embodiment of the present application
A kind of structure chart of implementation of the processing system of offer.
Processing system includes: the first electronic equipment 11, and, multiple servers 12.
First electronic equipment 11 can be PAD or smart phone or laptop or desktop computer or intelligent sound box or intelligence
The equipment such as household.
Each server 12 can be a background server, or, the server cluster being made of several servers, or,
Cloud computing service center.
In an alternative embodiment, different servers corresponds to different suppliers.The corresponding server of different suppliers
Achieved service function may be different, for example, shopping service function may be implemented in the corresponding server of Amazon supplier,
For example the corresponding server of Amazon supplier can analyze the sound of characterization shopping;For another example the corresponding clothes of supplier of Google
Music service function may be implemented in business device, for example the corresponding server of supplier of Google can play the sound of music with response representation
Sound.
Optionally, service function achieved by the corresponding server of different suppliers may be identical.
Optionally, server can be divided according to supplier in the embodiment of the present application, different server is corresponding not
Same supplier.
Electronic equipment (by taking intelligent sound box as an example) can only realize the access of a server at present, for example, if intelligent sound box
The access of the server of Amazon supplier can only be realized, if intelligent sound box receives server corresponding for supplier of Google
Or the sound of the corresponding server of supplier of Baidu, since intelligent sound box cannot access the corresponding server of supplier of Google or hundred
The corresponding server of supplier is spent, therefore, intelligent sound box cannot respond to the sound.
The access of multiple servers may be implemented in the first electronic equipment 11 in the embodiment of the present application.It is directed to so as to respond
The phonetic control command of different type server.
As shown in Fig. 2, the structure chart of another implementation for processing system provided by the embodiments of the present application.
Processing system includes: the first electronic equipment 21, the second electronic equipment 22 and multiple servers 12.
First electronic equipment 21 can be equipment such as docking station (Docking Station).
Second electronic equipment 22 can be PAD or smart phone or the equipment such as laptop or desktop computer.
Wireless connection or wired connection can be carried out between first electronic equipment 21 and the second electronic equipment 22.
First electronic equipment 21 and the second electronic equipment 22 can have wireless data transmission device, pass through wireless data
Transmitting device is wirelessly connected.
Wireless data transmission device can be low speed long distance transmission device or high speed short haul device, and high speed short distance passes
Defeated device includes: that (Near Field Communication, near field are logical based on ultra-high frequency wireless signal transmitted data device, NFC
Letter) device;Low speed long distance transmission device includes: wifi (WIreless-FIdelity) device, blue-tooth device.
Message transmission rate based on ultra-high frequency wireless signal transmitted data device can be up to 6GB/S.
The frequency range of ultra-high frequency wireless signal is 3GHz to 30GHz.
In an alternative embodiment, the first electronic equipment 21 can be as shown in Fig. 2, the first electronic equipment 21 may include holding
It carries and sets, bogey can carry the second electronic equipment 22.In an alternative embodiment, the first electronic equipment 21 can not also
Including bogey, i.e. the first electronic equipment 21 does not carry the second electronic equipment.First electronic equipment 21 and the second electronic equipment
It can not be bonded between 22, can there is a certain distance.
Since the first electronic equipment 21 and the second electronic equipment 22 may have certain distance, so wireless data transmission fills
Setting can be wifi device or blue-tooth device.In practical applications, if the first electronic equipment 21 and the second electronic equipment 22 away from
From pre-determined distance, such as 10cm is less than, then wireless data transmission device can be high speed short haul device.
Each server 12 can be server, or, the server cluster being made of several servers, or, cloud computing takes
Business center.It specifically may refer to the explanation that server 12 is directed in Fig. 1, which is not described herein again.
The first electronic equipment 21 can control the second electronic equipment 22 and realize connecing for multiple servers in the embodiment of the present application
Enter.So as to respond the phonetic control command for being directed to different type server.
Integrally processing method provided by the embodiments of the present application is illustrated in conjunction with Fig. 1 or Fig. 2, as shown in figure 3, being this Shen
Please embodiment provide processing method a kind of implementation signaling diagram, this method comprises:
The S301: the first electronic equipment of step 31 monitors voice input.
First electronic equipment 31 can set for the first electronics described in the first electronic equipment 11 described in Fig. 1 or Fig. 2
Standby 21.
In an alternative embodiment, 31 speech monitoring function of the first electronic equipment can be constantly in open state.I.e.
Extraneous sound can be monitored in real time in one electronic equipment.
In an alternative embodiment, the first electronic equipment 31 can be handled the first sound, for example, to the first sound
Noise reduction process is carried out, and/or, coded treatment;Or, carrying out noise reduction process and/or decoding process to the first sound.
Step S302: if the first electronic equipment 31 detects that the first sound meets first in multiple preset conditions
Part starts voice control function.
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers.
In an alternative embodiment, predetermined condition can be with are as follows: the first sound includes waking up word, or, the first sound is
The control instruction of control starting voice control function.
First condition is either condition in multiple preset conditions.
The corresponding wake-up word of different preset conditions or control instruction are different;For example, multiple preset conditions be respectively as follows: it is default
Condition 1, preset condition 2 and preset condition 4;Preset condition 1 can be with are as follows: the first sound includes waking up word 1, or, the first sound is
First control instruction of control starting voice control function;Preset condition 2 can be with are as follows: the first sound includes waking up word 2, or, the
One sound is the second control instruction of control starting voice control function;Preset condition 3 can be with are as follows: the first sound includes waking up word
3, or, the first sound is the third control instruction of control starting voice control function.
Wherein, the first control instruction, the second control instruction, third control instruction are different control instructions;Wake-up word 1,
Wake up word 2, wake-up word 3 is different wake-up word.
If the first sound meets either condition in multiple predetermined conditions, the voice control function of the first electronic equipment is just
It is activated.In the embodiment of the present application, the first electronic equipment enters after monitoring to meet the first voice input of first condition
Subsequent sound can be sent to and first after receiving subsequent voice input by the state of the subsequent voice input of waiting
The corresponding server of part, so that server carries out speech recognition to subsequent sound.By the first electronic equipment in the embodiment of the present application
The function of monitoring that the first voice input for meeting first condition can be realized later is known as voice control function.
In an alternative embodiment, the first electronic equipment before monitoring to meet the first voice input of first condition,
May monitor multiple sound for being unsatisfactory for all predetermined conditions, it is assumed that meet any preset condition the first sound be comprising
Any the first sound for waking up word in multiple wake-up words, it is assumed that multiple wake-up words, which are respectively as follows:, to be waken up word 1, wakes up word 2, wake up word
3;First electronic equipment may be monitored also before the first sound for monitoring comprising wake-up word 1 or waking up word 2 or wake-up word 3
Sound, these sound such as " I eats up ", " very nice " cannot start the voice control function of the first electronic equipment.If the
The voice control function of one electronic equipment is not activated, then the first electronic equipment, which can be constantly in the current voice input of monitoring, is
The no state for meeting either condition in multiple predetermined conditions.It is constantly in searching and " meets either condition in multiple predetermined conditions
The first voice input " state.
The S303: the first electronic equipment of step 31 acquires the second sound input after first voice input.
Second sound is sent to the corresponding first server 12 of first condition by the S304: the first electronic equipment of step 31.
Different preset conditions characterize different servers, and the condition that the first sound meets is different, and first server 12 is just not
Together, optionally, the condition that the first sound meets is different, and the corresponding supplier of first server is just different.
Second sound can be sent to first server by the first electronic equipment, and first server divides second sound
Analysis processing, and analysis result is fed back into the first electronic equipment.
In an alternative embodiment, the first electronic equipment 31 can be handled second sound, for example, to second sound
Noise reduction process is carried out, and/or, coded treatment;Or, carrying out noise reduction process and/or decoding process to second sound, can will locate
Sound after reason is sent to first server 12.
In an alternative embodiment, the first electronic equipment 31 can not be handled second sound, directly by the rising tone
Sound is sent to first server 12.
Step S305: first server 12 is analyzed for second sound, is analyzed as a result, and feeding back to the first electricity
Sub- equipment 31.
In an alternative embodiment, each server can be constantly in speech recognition state;It is set when receiving the first electronics
When the second sound that preparation is sent, just the second sound is analyzed and processed, when not receiving the of the transmission of the first electronic equipment
When two sound, the sound status to be received such as it is at.
In an alternative embodiment, each server may be at non-speech recognition state, and it is more to detect that the first sound meets
In a preset condition when first condition, since first condition corresponds to first server, at this point, first server can just be in voice
Identification state;Since the first sound is unsatisfactory for other preset conditions in addition to first condition, so in addition to first server
Other servers be in non-speech recognition state.
The S306: the first electronic equipment of step 31 exports the analysis result.
In an alternative embodiment, the first electronic equipment 31 can be with the voice output analysis as a result, or, passing through display screen display
Show the analysis result.
This application discloses a kind of processing methods, first monitoring voice input, if it is multiple to detect that the first sound meets
First condition in preset condition starts voice control function;Wherein, a preset condition is at least associated with the subsequent input of analysis
Voice server, different preset conditions are associated with different servers;The rising tone after acquiring first voice input
Sound input;Obtain the analysis result that the associated first server of the first condition is directed to the second sound.I.e. for same
First electronic equipment can access different servers according to the sound for meeting different preset conditions.To realize one the
One electronic equipment accesses the function of multiple servers.
In an alternative embodiment, " starting voice control function " may include: to determine the voice for analyzing subsequent input
Server is the associated first server of the first condition.
In conjunction with Fig. 1, to " server for determining the voice of the subsequent input of analysis is the first condition associated described first
The implementation of server " is illustrated, and the embodiment of the present application provides but is not limited to following methods.
First way establishes the communication connection of first electronic equipment and the first server.
In an alternative embodiment, receive meet first condition in multiple preset conditions the first voice input it
Before, the server that multiple preset conditions are respectively associated is not connected with the first electronic equipment.Optionally, the first sound is being detected
When meeting first condition in multiple preset conditions, the communication connection with the associated first server of first condition is established.Optionally,
Still it is not connected with the first electronic equipment with other servers of first condition onrelevant.
Optionally, under the first technique, after the first electronic equipment and first server establish communication connection, first service
Device can be automatically into speech recognition state.
The second way, control first server enter speech recognition state.
In an alternative embodiment, receive meet first condition in multiple preset conditions the first voice input it
Before, the server that multiple preset conditions are respectively associated has been connected with the first electronic equipment.But multiple servers may be not
Into speech recognition state.Optionally, detect meet the first voice input of first condition in multiple preset conditions when, control
The associated first server of first condition processed enters speech recognition state.Optionally, it is serviced with other of first condition onrelevant
Device does not still enter speech recognition state.
In conjunction with Fig. 2, to " server for determining the voice of the subsequent input of analysis is the first condition associated described first
The implementation of server " is illustrated, and the embodiment of the present application provides but is not limited to following methods.
First way, generates the first instruction, and first instruction is used to indicate the second electronic equipment and first clothes
Business device establishes communication connection.
In an alternative embodiment, receive meet first condition in multiple preset conditions the first voice input it
Before, the server that multiple preset conditions are respectively associated is not connected with the second electronic equipment 22.Optionally, the first sound is being detected
When sound meets first condition in multiple preset conditions, the first instruction can be generated, the second electronic equipment 22 of instruction is established and first
The communication connection of the first server of conditions relevant.Optionally, with other servers of first condition onrelevant still not with
Two electronic equipments 22 are connected.
Optionally, under the first technique, after the first electronic equipment and first server establish communication connection, first service
Device can be automatically into speech recognition state.
The second way, generates the second instruction, and second instruction is used to indicate the second electronic equipment triggering first condition
Associated first server enters speech recognition state.
In an alternative embodiment, receive meet first condition in multiple preset conditions the first voice input it
Before, the server that multiple preset conditions are respectively associated has been connected with the second electronic equipment.But multiple servers may be not
Into speech recognition state.Optionally, detect meet the first voice input of first condition in multiple preset conditions when, it is raw
At the second instruction, instruction the second electronic equipment 22 triggering associated first server of first condition enters speech recognition state.It can
Choosing, speech recognition state is not still entered with other servers of first condition onrelevant.
The mode of " the first sound of detection meets the first condition in multiple preset conditions, starts voice control function " has more
Kind, the embodiment of the present application is provided but is not limited to following several:
The first: whether detection first sound meets multiple preset conditions respectively, with obtain first sound with
The corresponding testing result of multiple preset conditions;If the testing result includes that first sound meets described first
Part starts voice control function.
It can detect whether the first sound meets multiple preset conditions respectively.The power consumption of the first electronic equipment is increased, if
First electronic equipment does not connect to power supply in real time, then can also reduce the cruising ability of the first electronic equipment.
In order to reduce the power consumption of the first electronic equipment, the embodiment of the present application also provides second of implementations.
Second of implementation:
It is understood that user is when using the first electronic equipment, in a period (for example, one or more stars
Phase) in, the first sound for meeting first condition in multiple preset conditions may be only issued, for example, user is in one or more stars
Music only all is played using the first electronic equipment in phase, for example, passing through the service corresponding with supplier of Google of the first electronic equipment
Device provides music for user, at this point, the first condition can include this wake-up word of Google for the first sound.
According to the first implementation, user issues the first sound, and " after hi, Google ", the first electronic equipment still can
It detects whether the first sound meets multiple preset conditions respectively, causes processing speed slower, increase the power consumption of the first electronic equipment.
Second of implementation include:
From multiple voice control modes, the first voice control mode is determined;
Wherein, a voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction, different voice control modes characterize different preset conditions;
The first voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction is the first condition.
If detecting that the first sound meets first condition described in corresponding, starting voice control function includes:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
To sum up, after receiving the first sound, it is only necessary to detect whether the first sound meets first condition, without inspection
Surveying the other conditions whether the first sound meets in addition to first condition reduces the first electronics to improve processing speed
The power consumption of equipment.
There are many implementations of " from multiple voice control modes, determining the first voice control mode ", and the application is real
Example is applied to provide but be not limited to following several: the first, at least one key is provided on the first electronic equipment.User can pass through
At least one described key more becomes the voice control mode of the first electronic equipment.Second, the first electronic equipment can show language
Sound control model selects the page, and the voice control mode selection page presentation has multiple voice control modes;From multiple voices
In control model, the first voice control mode is determined.
In an alternative embodiment, voice control mode selects the page can be as shown in figure 4, optional, voice control mould
Formula can be the title of supplier, and the voice control mode selection page may include: Google, Baidu, Amazon, millet in Fig. 4
Etc..
Method is described in detail in above-mentioned disclosed embodiments, diversified forms can be used for the present processes
Device realize that therefore disclosed herein as well is a kind of devices, and specific embodiment is given below and is described in detail.
As shown in figure 5, a kind of structure chart of implementation for the first electronic equipment provided by the embodiments of the present application, this
One electronic equipment may include:
Pronunciation receiver 51, for monitoring voice input.
Optionally, pronunciation receiver 51 can be microphone.
Chip 52 is handled, if detecting that the first sound meets in multiple preset conditions for the pronunciation receiver
First condition starts voice control function.
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers.
Optionally, processing chip 52 can be CPU (central processing unit) or Bluetooth chip.
The pronunciation receiver 51 is also used to: the second sound input after acquisition first voice input.
The processor chips 52 are also used to: obtaining the associated first server of the first condition for the rising tone
The analysis result of sound.
Output device 53, for exporting the analysis result.
Optionally, output device can be voice playing device or display, and voice playing device can be loudspeaker.It can
Choosing, pronunciation receiver is in running order simultaneously with voice playing device.That is the first electronic equipment is playing the same of voice
When can monitor the input of sound, or, voice can be played while monitoring the input of sound.
Optionally, the first electronic equipment can also include: noise treatment device, for acquiring the pronunciation receiver
To sound handled, reduce the sound noise that includes.
Optionally, further includes: transmitting device, for carrying out signal transmission with the second electronic equipment;
If the processing chip detects that the first sound meets the first condition in multiple preset conditions in execution, start
When voice control function, it is specifically used for:
It detects whether first sound meets multiple preset conditions respectively, is preset with obtaining first sound with multiple
The corresponding testing result of condition;
If the testing result includes that first sound meets the first condition, start voice control function.
Transmitting device can be wireless data transmission device or wired transmission device.
Optionally, further includes:
The processor chips are also used to: from multiple voice control modes, determining the first voice control mode;
Wherein, a voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction, different voice control modes characterize different preset conditions;
The first voice control mode characterization can start the sound of the voice control function of first electronic equipment
The preset condition of satisfaction is the first condition.
Optionally, if the processing chip detects that the first sound meets first in multiple preset conditions in execution
Part is specifically used for when starting voice control function:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
Optionally, processing chip is specifically used for when executing starting voice control function:
Determine that the server for analyzing the voice of subsequent input is the associated first server of the first condition.
There are many specific implementations of first electronic equipment shown in fig. 5, the embodiment of the present application provide but be not limited to
Under it is several.
The first implementation is illustrated in conjunction with a kind of implementation of the Fig. 1 to the first electronic equipment;As shown in fig. 6,
For the structure chart of another implementation of the first electronic equipment provided by the embodiments of the present application.
First electronic equipment 11 includes at least one microphone 51, at least one loudspeaker 53, processing chip 52;Optionally
It can also include codec 61 and/or at least one amplifier 62.The connection relationship of above-mentioned component can be as shown in Figure 6.
Optionally, codec 61 can also include DSP (DigitalSignalProcessing, Digital Signal Processing)
63, the sound of at least one described microphone 51 acquisition is handled, the noise that sound includes is reduced.Optionally, 63 DSP
It can be independent from each other with codec 61, or, DSP is integrated in codec 61.
Optionally, processing chip 52 can also be integrated with storage unit, for example, DRAM (Dynamic RandomAccess
Memory, i.e. dynamic random access memory) and/or FLASH.
The corresponding wake-up word of multiple preset conditions or control instruction can be stored in the storage unit.
The working principle of the first electronic equipment 11 shown in fig. 6 is illustrated below.
Any microphone monitors voice input at least one described microphone 51, and the first sound that will test is sent to
Codec 61, optionally, the DSP in codec 61 handle the first sound, the first sound after obtaining noise reduction, compile
Decoder 61 encodes the first sound after noise reduction, and the first sound after coding is sent to processing chip 52;Handle core
Piece 52 detects whether the first sound meets either condition in multiple preset conditions, if detecting, the first sound meets multiple default items
First condition in part establishes the communication connection with first server 12, and/or, control first server 12 enters voice and knows
Other state.
Any microphone continues to acquire second sound at least one described microphone 51;The second sound hair that will test
It send to codec 61, optionally, the DSP in codec 61 handles second sound, the rising tone after obtaining noise reduction
Sound, codec 61 encode the second sound after noise reduction, and the second sound after coding is sent to processing chip 52, place
It manages chip 52 and second sound is sent to the associated first server 12 of first condition.
First server 12 carries out speech analysis processing to second sound, is analyzed as a result, analysis result is sent to
Handle chip 52;Processing chip 52 is sent to codec 61 for result is analyzed, and the DSP 63 in optional codec 61 is right
It analyzes result and carries out noise reduction process;Codec 61 is decoded the analysis result after noise reduction, and it is corresponding to obtain analysis result
Voice data;It sends voice data in loudspeaker 53 by amplifier 62, so that the first electronic equipment voice plays
Analyze result.
Optionally, the first electronic equipment 11 can have display screen, can show analysis result.
In an alternative embodiment, if the memory capacity for the storage unit that processing chip 52 integrates is smaller, it can handle
The external storage unit of chip 52 (for example, DRAM and/or FLASH), by the corresponding wake-up word of multiple preset conditions or control
System instruction is stored in external storage unit 71.As shown in fig. 7, again for the first electronic equipment provided by the embodiments of the present application
A kind of structure chart of implementation.
The difference of Fig. 7 and Fig. 6 is, corresponding word or the control instruction of waking up of multiple preset conditions is stored in Fig. 7
Storage unit be it is external, the corresponding storage unit for waking up word or control instruction of multiple preset conditions is stored in Fig. 6 is
It is integrated in processing chip 52.
Second of implementation is illustrated in conjunction with another implementation of the Fig. 2 to the first electronic equipment;Such as Fig. 8 institute
Show, is the structure chart of another implementation of processing system provided by the embodiments of the present application.
First electronic equipment 21 shown in Fig. 8 includes: microphone 51, processing chip 52;Optionally, further include DSP 81,
Amplifier 82 and loudspeaker 84.It is illustrated for handling chip 52 and being Bluetooth chip in Fig. 8.
Optionally, Bluetooth chip 52 includes: wireless data transmission device 83 and processing unit 84, optionally, no line number
It include: Wireless data receiving device 831 and wireless data sending device 832 according to transmitting device 83.
Optionally, Bluetooth chip 52 can integrate storage unit (for example, DRAM and/or FLASH), by multiple default items
The corresponding wake-up word of part or control instruction are stored in external storage unit.
Working principle shown in Fig. 8 is illustrated below.
Monitor voice input in microphone 51, the first sound that will test is sent to DSP 81, DSP to the first sound into
Row noise reduction process, first sound that obtains that treated will treated that the first sound is sent to Bluetooth chip 52;Bluetooth chip 52
In processing unit 84 detect the first sound whether meet either condition in multiple preset conditions, if detect the first sound meet
First condition in multiple preset conditions generates control instruction (can be the first instruction or the second instruction), control instruction is led to
The wireless data sending device 832 crossed in wireless data transmission device 83 is sent to the second electronic equipment 22.
Second electronic equipment 22 based on the control instruction establish with the communication connection of first server 12, and/or, control the
One server 12 enters speech recognition state.
Microphone 51 continues to acquire second sound, second sound is sent to DSP 81, DSP carries out noise reduction to second sound
Processing, the second sound that obtains that treated will treated that second sound is sent to Bluetooth chip 52;Place in Bluetooth chip 52
Second sound is sent to the second electronics by the wireless data sending device 832 that reason device 84 controls in wireless data transmission device 83
Equipment 22.
Second sound is sent to first server 12 by the second electronic equipment 22.
First server 12 carries out speech analysis processing to second sound, is analyzed as a result, analysis result is sent to
Second electronic equipment 22;Second electronic equipment 22 is sent to the first electronic equipment 21 for result is analyzed.
First electronic equipment 21 receives analysis knot by the Wireless data receiving device 831 in wireless data transmission device 83
Fruit;Processing unit 84 in first electronic equipment 21, which controls wireless data sending device 832, will analyze the corresponding voice number of result
It is sent to loudspeaker 84 according to by amplifier 82, to play the corresponding voice data of analysis result.
Optionally, the first electronic equipment 21 can have display screen, can show analysis result.
As shown in figure 9, the structure chart of another implementation for the first electronic equipment provided by the embodiments of the present application, it should
First electronic equipment includes:
Memory 91, for storing program;
Processor 92, for executing described program, described program is specifically used for:
Monitor voice input;
If detecting that the first sound meets the first condition in multiple preset conditions, start voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are closed
It is associated with different servers;
Second sound input after acquiring first voice input;
Obtain the analysis result that the associated first server of the first condition is directed to the second sound.
Memory 91 may include high speed RAM memory, it is also possible to further include nonvolatile memory (non-volatile
Memory), a for example, at least magnetic disk storage.
Processor 92 may be a central processor CPU or specific integrated circuit ASIC (Application
Specific Integrated Circuit), or be arranged to implement the integrated electricity of one or more of the embodiment of the present application
Road.
Optionally, electronic equipment can also include communication bus 93 and communication interface 94, wherein memory 91, processing
Device 92, completes mutual communication by communication bus 93 at communication interface 94;
Optionally, communication interface 94 can be the interface of communication module, such as the interface of gsm module.
The embodiment of the present application also provides a kind of readable storage medium storing program for executing, are stored thereon with computer program, which is characterized in that
When the computer program is executed by processor, each step that any of the above-described processing method includes is realized.
It should be noted that all the embodiments in this specification are described in a progressive manner, each embodiment weight
Point explanation is the difference from other embodiments, and the same or similar parts between the embodiments can be referred to each other.
For device or system class embodiment, since it is basically similar to the method embodiment, so be described relatively simple, it is related
Place illustrates referring to the part of embodiment of the method.
It should also be noted that, herein, relational terms such as first and second and the like are used merely to one
Entity or operation are distinguished with another entity or operation, without necessarily requiring or implying between these entities or operation
There are any actual relationship or orders.Moreover, the terms "include", "comprise" or its any other variant are intended to contain
Lid non-exclusive inclusion, so that the process, method, article or equipment including a series of elements is not only wanted including those
Element, but also including other elements that are not explicitly listed, or further include for this process, method, article or equipment
Intrinsic element.In the absence of more restrictions, the element limited by sentence "including a ...", it is not excluded that
There is also other identical elements in process, method, article or equipment including the element.
The step of method described in conjunction with the examples disclosed in this document or algorithm, can directly be held with hardware, processor
The combination of capable software module or the two is implemented.Software module can be placed in random access memory (RAM), memory, read-only deposit
Reservoir (ROM), electrically programmable ROM, electrically erasable ROM, register, hard disk, moveable magnetic disc, CD-ROM or technology
In any other form of storage medium well known in field.
The foregoing description of the disclosed embodiments makes professional and technical personnel in the field can be realized or use the application.
Various modifications to these embodiments will be readily apparent to those skilled in the art, as defined herein
General Principle can be realized in other embodiments without departing from the spirit or scope of the application.Therefore, the application
It is not intended to be limited to the embodiments shown herein, and is to fit to and the principles and novel features disclosed herein phase one
The widest scope of cause.
Claims (10)
1. a kind of processing method, which is characterized in that be applied to the first electronic equipment, the treating method comprises:
Monitor voice input;
If detecting that the first sound meets the first condition in multiple preset conditions, start voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are associated with
Different servers;
Second sound input after acquiring first voice input;
Obtain the analysis result that the associated first server of the first condition is directed to the second sound.
2. processing method according to claim 1, which is characterized in that if described detect that the first sound meets multiple preset
First condition in condition, starting voice control function include:
Detect whether first sound meets multiple preset conditions respectively, to obtain first sound and multiple preset conditions
Corresponding testing result;
If the testing result includes that first sound meets the first condition, start voice control function.
3. processing method according to claim 1, which is characterized in that further include:
From multiple voice control modes, the first voice control mode is determined;
Wherein, the sound for the voice control function that a voice control mode characterization can start first electronic equipment meets
Preset condition, different voice control modes characterizes different preset conditions;
The sound for the voice control function that the first voice control mode characterization can start first electronic equipment meets
Preset condition be the first condition.
4. processing method according to claim 3, which is characterized in that if described detect that the first sound meets first
Part, starting voice control function include:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
5. according to the processing method of claim 2 or 4, which is characterized in that the starting voice control function includes:
Determine that the server for analyzing the voice of subsequent input is the associated first server of the first condition.
6. a kind of first electronic equipment characterized by comprising
Pronunciation receiver, for monitoring voice input;
Chip is handled, if detecting that the first sound meets first in multiple preset conditions for the pronunciation receiver
Part starts voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are associated with
Different servers;
The pronunciation receiver is also used to: the second sound input after acquisition first voice input;
The processor chips are also used to: obtaining the associated first server of the first condition for point of the second sound
Analyse result;
Output device, for exporting the analysis result.
7. the first electronic equipment according to claim 6, which is characterized in that further include:
Transmitting device, for carrying out signal transmission with the second electronic equipment;
If the processing chip detects that the first sound meets the first condition in multiple preset conditions in execution, start voice
When control function, it is specifically used for:
Detect whether first sound meets multiple preset conditions respectively, to obtain first sound and multiple preset conditions
Corresponding testing result;
If the testing result includes that first sound meets the first condition, start voice control function.
8. the first electronic equipment according to claim 6, which is characterized in that further include:
The processor chips are also used to: from multiple voice control modes, determining the first voice control mode;
Wherein, the sound for the voice control function that a voice control mode characterization can start first electronic equipment meets
Preset condition, different voice control modes characterizes different preset conditions;
The sound for the voice control function that the first voice control mode characterization can start first electronic equipment meets
Preset condition be the first condition.
9. the first electronic equipment according to claim 8, which is characterized in that if the processing chip detects the executing
One sound meets the first condition in multiple preset conditions, when starting voice control function, is specifically used for:
Detect whether first sound meets first condition;
If first sound meets the first condition, start voice control function.
10. a kind of first electronic equipment characterized by comprising
Memory, for storing program;
Processor, for executing described program, described program is specifically used for:
Monitor voice input;
If detecting that the first sound meets the first condition in multiple preset conditions, start voice control function;
Wherein, a preset condition is at least associated with the server for analyzing the voice of subsequent input, and different preset conditions are associated with
Different servers;
Second sound input after acquiring first voice input;
Obtain the analysis result that the associated first server of the first condition is directed to the second sound.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810825087.4A CN108962259B (en) | 2018-07-25 | 2018-07-25 | Processing method and first electronic device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201810825087.4A CN108962259B (en) | 2018-07-25 | 2018-07-25 | Processing method and first electronic device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN108962259A true CN108962259A (en) | 2018-12-07 |
CN108962259B CN108962259B (en) | 2021-06-15 |
Family
ID=64464137
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201810825087.4A Active CN108962259B (en) | 2018-07-25 | 2018-07-25 | Processing method and first electronic device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN108962259B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111096680A (en) * | 2019-12-31 | 2020-05-05 | 广东美的厨房电器制造有限公司 | Cooking equipment, electronic equipment, voice server, voice control method and device |
CN112104949A (en) * | 2020-09-02 | 2020-12-18 | 北京字节跳动网络技术有限公司 | Method and device for detecting pickup assembly and electronic equipment |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN106960667A (en) * | 2017-03-08 | 2017-07-18 | 杭州联络互动信息科技股份有限公司 | Position reminding methods, devices and systems |
US20180040324A1 (en) * | 2016-08-05 | 2018-02-08 | Sonos, Inc. | Multiple Voice Services |
CN107704275A (en) * | 2017-09-04 | 2018-02-16 | 百度在线网络技术(北京)有限公司 | Smart machine awakening method, device, server and smart machine |
US9934777B1 (en) * | 2016-07-01 | 2018-04-03 | Amazon Technologies, Inc. | Customized speech processing language models |
-
2018
- 2018-07-25 CN CN201810825087.4A patent/CN108962259B/en active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US9934777B1 (en) * | 2016-07-01 | 2018-04-03 | Amazon Technologies, Inc. | Customized speech processing language models |
US20180040324A1 (en) * | 2016-08-05 | 2018-02-08 | Sonos, Inc. | Multiple Voice Services |
CN106960667A (en) * | 2017-03-08 | 2017-07-18 | 杭州联络互动信息科技股份有限公司 | Position reminding methods, devices and systems |
CN107704275A (en) * | 2017-09-04 | 2018-02-16 | 百度在线网络技术(北京)有限公司 | Smart machine awakening method, device, server and smart machine |
Non-Patent Citations (2)
Title |
---|
WENHUA XU等: "Cancelable Voiceprint Templates Based on Knowledge Signatures", 《INTERNATIONAL SYMPOSIUM ON ELECTRONIC COMMERCE AND SECURITY》 * |
梁烽: "应用自动语音识别技术实现通信增值业务", 《广西科学院学报》 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111096680A (en) * | 2019-12-31 | 2020-05-05 | 广东美的厨房电器制造有限公司 | Cooking equipment, electronic equipment, voice server, voice control method and device |
CN112104949A (en) * | 2020-09-02 | 2020-12-18 | 北京字节跳动网络技术有限公司 | Method and device for detecting pickup assembly and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN108962259B (en) | 2021-06-15 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109166593B (en) | Audio data processing method, device and storage medium | |
JP7160967B2 (en) | Keyphrase detection with audio watermark | |
EP3127116B1 (en) | Attention-based dynamic audio level adjustment | |
Rossi et al. | AmbientSense: A real-time ambient sound recognition system for smartphones | |
CN104254884B (en) | Low-power integrated-circuit for analyzing digitized audio stream | |
US9167520B2 (en) | Controlling applications in a mobile device based on environmental context | |
CN107147618A (en) | A kind of user registering method, device and electronic equipment | |
CN111083678B (en) | Playing control method and system of Bluetooth sound box and intelligent device | |
CN105408953A (en) | Voice recognition client device for local voice recognition | |
CN104247280A (en) | Voice-controlled communication connections | |
KR102580408B1 (en) | Portable Audio DEVICE with Voice Capabilities | |
US12014732B2 (en) | Energy efficient custom deep learning circuits for always-on embedded applications | |
KR20160106075A (en) | Method and device for identifying a piece of music in an audio stream | |
CN111433737A (en) | Electronic device and control method thereof | |
CN105975063B (en) | A kind of method and apparatus controlling intelligent terminal | |
CN110097895B (en) | Pure music detection method, pure music detection device and storage medium | |
CN108962259A (en) | Processing method and the first electronic equipment | |
CN113157240A (en) | Voice processing method, device, equipment, storage medium and computer program product | |
CN107016996B (en) | Audio data processing method and device | |
CN110933345A (en) | Method for reducing television standby power consumption, television and storage medium | |
CN112259076B (en) | Voice interaction method, voice interaction device, electronic equipment and computer readable storage medium | |
CN108231074A (en) | A kind of data processing method, voice assistant equipment and computer readable storage medium | |
CN110958348B (en) | Voice processing method and device, user equipment and intelligent sound box | |
CN116705033A (en) | System on chip for wireless intelligent audio equipment and wireless processing method | |
KR102071865B1 (en) | Device and method for recognizing wake-up word using server recognition result |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |