JP2019204112A5

JP2019204112A5 -

Info

Publication number: JP2019204112A5
Application number: JP2019137200A
Authority: JP
Filing date: 2019-07-25
Publication date: 2020-11-12
Anticipated expiration: 2035-04-10

Claims

Is applied to terminal including a voice waking device and speech recognition system,
The step of listening to the first voice information in the surrounding environment by the voice wake-up device, the first voice information includes the wake-up information and the first part of the command word, and the wake. The up information is used to enable the voice recognition device, the step and
The step of enabling the voice recognition device according to the wakeup information by the voice wakeup device, and
A step of listening to a second voice information by the voice recognition device, wherein the second voice information includes a second part of the command word .
In a step of acquiring voice instruction information according to the first voice information and the second voice information by the voice recognition device , the voice instruction information matches the command word, and the command word is wherein said first portion of the command word and a second portion of the command word, characterized in that it comprises a step, voice control method.

The step of enabling the voice recognition device according to the wakeup information by the voice wakeup device is
The method of claim 1 , comprising the step of generating a trigger signal to enable the voice recognition device when the voice wakeup device determines that the wakeup information matches the voice wakeup model. ..

Determining that the wakeup information matches the voice wakeup model can be determined.
Including that, when the wake-up information matches a predetermined wake-up voice information, it is determined that the wake-up information matches the voice wake-up model.
The method according to claim 2.

Determining that the wakeup information matches the voice wakeup model is
The wake-up information sound when the wake-up information was extracted voiceprint features within the wake-up information when matching the predetermined wake-up sound information, extracted the voiceprint feature matches a predetermined voiceprint feature The method of claim 2 , comprising determining that the wakeup model is consistent .

The voiceprint features the following characteristics, i.e., pitch curve, the linear prediction coefficients, the spectral envelope parameters, harmonic energy ratio, the resonance peak frequency and its bandwidth, cepstrum, or one or more of the mel-frequency cepstral coefficients The method according to claim 4 , which includes the above.

By the speech recognition device, according to the first audio information and the second audio information, the step of obtaining audio instruction information,
A step of acquiring a recognition result by the voice recognition device according to the first voice information and the second voice information, wherein the recognition result includes a command word information.
The voice recognition device includes a step of acquiring the voice instruction information that matches the recognition result by matching between the recognition result acquired and the voice instruction information stored in advance .
The method according to claim 1 .

The wake-up information is heard by the voice wake-up device within the first period, and the first portion of the command word is heard by the voice wake-up device within the second period.
The second voice information is heard by the voice recognition device within the third period.
The method according to any one of claims 1 to 6.

The step of listening to the first voice information in the surrounding environment by the voice wakeup device is
The step of listening to the first audio information in the surrounding environment in the standby state, or
The step of listening to the first audio information in the surrounding environment in the non-standby state, or
The step of listening to the first audio information in the surrounding environment in the screen locked state.
including,
The method according to any one of claims 1 to 6.

The voice wakeup device further comprises a step of transmitting the trigger signal to the voice recognition device to enable the voice recognition device.
The method according to claim 2.

The voice recognition device further includes a step of controlling the execution of the operation corresponding to the matched voice instruction information.
The method according to any one of claims 1 to 6.

Further including a step of automatically disabling the voice recognition device when it is determined that the voice information will not be received again within a preset period of time after the voice recognition device is enabled.
The method according to any one of claims 1 to 6.

The voice wakeup device is a digital signal processor DSP.
The method according to any one of claims 1 to 6.

The voice recognition device is an application processor AP.
The method according to any one of claims 1 to 6.

It ’s a terminal,
With one or more processors
A memory for storing an instruction, which causes the terminal to execute the method specified in any one of claims 1 to 13 when the instruction is executed by the one or more processors. Features memory and
A terminal equipped with.

The non-transitory computer-readable media having been a computer usable instructions stored thereon for execution by a processor, the instructions cause the processor according to any one of claims 1 to 13 The method is carried out.
Non-temporary computer-readable media.

Including voice wake-up device and voice recognition device
The voice wake-up device is to listen to the first voice information in the surrounding environment, and the first voice information includes the wake-up information and the first part of the command word, and the wake-up device. The up information is configured to do what is used to enable the voice recognition device.
The voice wakeup device is configured to enable the voice recognition device according to the wakeup information.
The voice recognition device is configured to listen to a second voice information, the second voice information including a second portion of the command word.
The voice recognition device acquires voice instruction information according to the first voice information and the second voice information, the voice instruction information matches the command word, and the command word is Containing said first part of the command word and said second part of the command word, configured to do.
A terminal characterized by that.

The voice wakeup device is configured to determine that the wakeup information matches a voice wakeup model when the wakeup information matches a predetermined wakeup voice information.
The terminal according to claim 16.

The voice wakeup device extracts the voiceprint feature in the wakeup information when the wakeup information matches the predetermined wakeup voice information, and when the extracted voiceprint feature matches the predetermined voiceprint feature. Is configured to determine that the wakeup information matches the voice wakeup model.
The terminal according to claim 16.

The voiceprint feature is one or more of the following features: pitch curve, linear prediction factor, spectral envelope parameter, harmonic energy ratio, resonance peak frequency and its bandwidth, cepstrum, or mel frequency cepstrum coefficient. Including
The terminal according to claim 18.

The voice recognition device is
Acquiring the recognition result according to the first voice information and the second voice information, and the recognition result includes the command word information.
By matching between the acquired recognition result and the voice instruction information stored in advance, the voice instruction information matching the recognition result is acquired.
The terminal according to claim 16.

The wake-up information is heard by the voice wake-up device within the first period, and the first portion of the command word is heard by the voice wake-up device within the second period.
The second voice information is heard by the voice recognition device within the third period.
The terminal according to any one of claims 16 to 20.

The voice wake-up device is
Listen to the first audio information in the surrounding environment in the standby state, or
Listening to the first audio information in the surrounding environment in the non-standby state, or
Listen to the first audio information in the surrounding environment in the screen locked state
Is configured as
The terminal according to any one of claims 16 to 20.

The voice recognition device is
It is configured to be automatically disabled when it determines that voice information will not be received again within a preset period of time after enabling the voice recognition device.
The terminal according to any one of claims 16 to 20.

The voice recognition device further includes an execution module.
The voice recognition device is also configured to send an execution instruction matching the voice instruction information to the execution module.
The execution module is configured to execute an operation corresponding to the execution instruction.
The terminal according to any one of claims 16 to 20.

The voice wakeup device is a digital signal processor DSP.
The voice recognition device is an application processor AP.
The terminal according to any one of claims 16 to 20.