WO2016067644A1 - Dispositif de réglage de système vocal - Google Patents

Dispositif de réglage de système vocal Download PDF

Info

Publication number
WO2016067644A1
WO2016067644A1 PCT/JP2015/055093 JP2015055093W WO2016067644A1 WO 2016067644 A1 WO2016067644 A1 WO 2016067644A1 JP 2015055093 W JP2015055093 W JP 2015055093W WO 2016067644 A1 WO2016067644 A1 WO 2016067644A1
Authority
WO
WIPO (PCT)
Prior art keywords
sound
signal
time
unit
voice
Prior art date
Application number
PCT/JP2015/055093
Other languages
English (en)
Japanese (ja)
Inventor
中村 圭介
Original Assignee
シャープ株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by シャープ株式会社 filed Critical シャープ株式会社
Publication of WO2016067644A1 publication Critical patent/WO2016067644A1/fr

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0324Details of processing therefor
    • G10L21/034Automatic adjustment
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise

Definitions

  • the present invention relates to a sound adjustment device.
  • voice recognition if the microphone sensitivity is too low, the necessary voice signal cannot be obtained sufficiently and voice recognition cannot be performed correctly.
  • speech recognition even if the sensitivity of the microphone is too high, distortion of the speech signal and ambient noise are picked up and speech recognition cannot be performed correctly.
  • the speech recognition apparatus disclosed in Patent Document 1 detects an average speech speech included in an input speech signal in a speech speech interval after detecting a time interval in which the user speech speech is included in the input speech data as a speech speech interval.
  • the level and the level of noise included in the input voice signal in the time interval other than the utterance voice interval are calculated, the input voice signal level is estimated from the noise level and the average utterance voice level, and the input voice
  • the gain of the input amplifier is set so that the level obtained by amplifying the signal level with the input amplifier becomes a level suitable for speech recognition.
  • the voice recognition device of Patent Document 1 includes a talk switch, and determines that the period from when the user presses the talk switch down to the end of the utterance voice section is the section including the user's utterance voice. Depressing the talk switch for each input has the problem of complicating user operations.
  • voice input data includes a user's speech signal and a noise signal, and it is difficult to determine when only the user's speech signal has ended.
  • the present invention has been made in view of the above problems, and an object of the present invention is to appropriately adjust the input gain and output gain of a sound signal in accordance with the use environment and improve the sound recognition rate. Is to provide.
  • the voice adjustment device of the present invention includes a microphone signal input unit that converts an electrical signal input from a microphone into a voice signal, a voice signal strength measurement unit that measures the strength of the voice signal and compares it with a preset voice strength threshold. , A silent time measurement unit that measures a silent period in which a silent state where the voice signal is smaller than a voice intensity threshold continues, and a voiced time that measures a voiced time in which a voice signal whose voice signal is greater than the voice intensity threshold continues, A measurement unit and an audio adjustment unit that adjusts the audio signal by comparing the silent time or the sound time with a preset time threshold value are provided.
  • the audio adjustment unit includes a microphone sensitivity adjustment unit for adjusting the input gain of the microphone signal input unit, and the input gain is set when the silence time is longer than a preset silence time threshold. The input gain is lowered when the sound duration is longer than a preset sound duration threshold.
  • the audio adjustment unit includes a microphone sensitivity adjustment unit for adjusting the input gain of the microphone signal input unit, a speaker signal output unit that outputs an audio signal to the speaker, and a speaker signal output unit.
  • a speaker volume adjustment unit for adjusting the output gain of the sound, and when the silence time is longer than a preset silent time threshold, the input gain is increased and the output gain is lowered to set the voice time with a preset voice time. When it is longer than the threshold value, the input gain is decreased and the output gain is increased.
  • the input gain and output gain of the audio signal are appropriately adjusted according to the usage environment, and the voice recognition rate is increased.
  • An audio adjustment device that can be improved can be provided.
  • FIG. 1 is a basic configuration diagram of an audio adjustment device of the present invention. It is a block diagram of the audio
  • FIG. It is a principle figure which shows the adjustment method of the microphone sensitivity using a silence time. It is a principle figure which shows the adjustment method of the microphone sensitivity using sound time.
  • FIG. 6 is a flowchart illustrating a microphone sensitivity adjustment step according to the first embodiment. It is a block diagram of the audio
  • FIG. It is a flowchart which shows the adjustment step of the speaker volume of Example 2. It is a block diagram of the audio
  • FIG. 1 shows a basic configuration of the sound adjustment device 20 of the present invention.
  • the voice adjustment device 20 of the present invention includes a microphone signal input unit 31 that converts an electric signal input from the microphone 10 into a voice signal, and a voice signal strength measurement that measures the strength of the voice signal and compares it with a preset voice strength threshold.
  • the unit 32 is provided.
  • the sound adjusting device 20 includes a silent time measuring unit 33 that measures a silent time during which a silent state in which the voice signal is smaller than the voice strength threshold continues, and a voiced state in which the voice signal is greater than the voice strength threshold.
  • a voiced time measuring unit 34 for measuring the sound time and a voice adjusting unit 40 for adjusting the voice signal by comparing the silent time or the voiced time with a preset time threshold value are provided.
  • the time that humans can speak continuously by conversation is considered to be about 5 seconds at most because of the restriction of breathing, and it is almost impossible to continue speaking without breathing for about 15 seconds. Therefore, in the audio adjusting device 20 of the present invention, when the audio signal continues in a state (sounding state) larger than the predetermined audio intensity threshold for an arbitrary fixed time (5 to 10 seconds), the audio The signal is regarded not as human speech but ambient noise, and the speech signal is adjusted so as to reduce the influence of such noise on speech recognition.
  • FIG. 2 shows a configuration of the sound adjustment device 21 according to the first embodiment.
  • the voice adjustment device 21 according to the first embodiment is a voice adjustment device mounted on a robot or an information terminal having a voice output function such as a voice recognition function or voice synthesis, and a microphone according to the magnitude of ambient noise in the usage environment.
  • the voice recognition function is improved by adjusting the sensitivity (input gain).
  • the sound adjustment device 21 receives a microphone signal input unit 31 that inputs an electrical signal from the microphone 10 and converts it into a sound signal, and measures the strength of the sound signal and compares it with a preset sound intensity threshold. And an audio signal intensity measuring unit 32.
  • the sound adjustment device 21 includes a silence time measuring unit 33 that measures a silence period in which a silence state in which the sound signal is smaller than the sound intensity threshold continues, and a sound condition in which the sound signal is greater than the sound intensity threshold continues.
  • a sound duration measuring unit 34 for measuring the sound duration is provided.
  • a microphone 10 that converts external sound into an electrical signal is connected to the sound adjustment device 21, and a sound signal that has been digitized via the microphone signal input unit 31 is passed to the sound signal intensity measurement unit 32. .
  • the audio signal intensity measuring unit 32 has, for example, a preset audio intensity threshold value, and compares the audio intensity threshold value with the current audio signal value to determine the presence or absence of audio.
  • FIG. 3 is an explanatory diagram for adjusting the microphone sensitivity (input gain) when the audio signal intensity measurement unit 32 determines that the audio signal is silent.
  • the silent time measuring unit 33 measures the silent time (elapsed time in the silent state).
  • the microphone sensitivity of the microphone 10 is increased by the microphone sensitivity adjustment unit 41.
  • the silent time threshold is specifically about 10 to 60 seconds, and the microphone sensitivity increase rate is preferably about 1 to 5%.
  • the microphone sensitivity adjustment unit 41 adjusts the microphone sensitivity gradually in a quiet environment by setting the silent time threshold and the microphone sensitivity increase rate as described above.
  • the microphone sensitivity is increased by several percent. If the silent time further continues from that time by the silent time threshold (T12), the microphone sensitivity is increased by several percent again. This is repeated throughout the silence period.
  • silence time threshold (T11) and the silence time threshold (T12) are not necessarily equal. Further, it is not necessary to make the rate of increase of each microphone sensitivity equal.
  • the microphone sensitivity rises the silence time will not continue for a long time, and the microphone sensitivity will settle down to a certain level suitable for the usage environment.
  • FIG. 4 is an explanatory diagram for adjusting the microphone sensitivity (input gain) when the audio signal intensity measurement unit 32 determines that the audio signal is sound.
  • the sound time measuring unit 34 measures the sound time (elapsed time of the sound state).
  • the microphone sensitivity of the microphone 10 is lowered by the microphone sensitivity adjustment unit 41.
  • the sound duration threshold is specifically about 5 to 20 seconds, and the rate of decrease in sensitivity is preferably about 10 to 50%, and is usually set based on the time during which a person can speak with one breath.
  • the microphone sensitivity adjustment unit 41 adjusts the microphone sensitivity of the microphone to be quickly lowered under a noisy environment by setting the sound duration threshold and the microphone sensitivity increase rate as described above.
  • the microphone sensitivity is reduced by several percent to several tens of percent. Further, if the sound duration continues from that time for the sound duration threshold (T22), the microphone sensitivity is lowered again by several percent to several tens percent. This is repeated throughout the duration of the sound.
  • the sound time threshold (T21) and the sound time threshold (T22) are not necessarily equal. Further, it is not necessary to make the rate of decrease of the input gain equal in each.
  • the microphone sensitivity decreases, the sound duration will not continue as long as before, and the microphone sensitivity will settle to a certain level suitable for the usage environment.
  • FIG. 5 shows a processing flow for adjusting the microphone sensitivity (input gain) in the sound adjustment device 21 of the first embodiment.
  • the processing flow will be described with reference to FIG. S11.
  • S12. In the audio signal intensity measurement unit 32, the intensity of the audio signal is compared with an arbitrary audio intensity threshold (or based on whether the audio is being recognized), and if the intensity of the audio signal is greater than the audio intensity threshold, If the intensity of the audio signal is smaller than the audio intensity threshold, it is determined that there is no sound.
  • the soundless time measuring unit 33 measures the soundless time (the duration of the section without the sound signal).
  • the process returns to the start of the sound adjustment processing (S11) without changing the microphone sensitivity. If the silent time is longer than the silent time threshold, the process proceeds to the next process (S15). S15. When the silent time is longer than the silent time threshold, it is determined that there is a margin in microphone sensitivity in a quiet environment with no noise around the microphone, and the microphone sensitivity of the microphone signal input unit 31 is increased by an arbitrary ratio. The process returns to the start of the sound adjustment process (S11). S16. When the voice signal strength measuring unit 32 determines that there is sound (there is a voice signal), the voiced time measuring unit 34 measures the voiced time (the duration of a section where the voice signal is present). S17.
  • the process returns to the start of the sound adjustment processing (S11) without changing the microphone sensitivity. If the sound time is longer than the sound time threshold, the process proceeds to the next process (S18). S18. If the sound time is longer than the sound time threshold, it is determined that there is constant noise around the sound, and the microphone sensitivity of the microphone 10 is lowered by an arbitrary ratio by the microphone signal input unit 31, and then the sound adjustment processing Return to the start (S11).
  • the present invention is characterized in that the length of the silence state and the sound state is measured from the voice signal and adjusted to a voice signal suitable for voice recognition. That is, according to the sound adjustment device 21 of the first embodiment, the ambient noise and the human conversation are distinguished by the length of time during which the silent time or the sound time continues, and the optimum microphone sensitivity is obtained according to the use environment. Can be set.
  • FIG. 6 shows the configuration of the audio adjustment device 22 according to the second embodiment.
  • the audio adjusting device 22 according to the second embodiment includes a speaker volume adjusting unit 42 and a speaker signal output unit 43 as the audio adjusting unit 40, and the volume of the speaker 50 (depending on the usage environment such as a quiet environment or a noisy environment with a lot of noise). This is an audio adjustment device that adjusts the output gain easily.
  • an audio signal is input through the microphone 10 and the microphone signal input unit 31, the intensity of the audio signal is measured by the audio signal intensity measurement unit 32, and the silent time measurement unit 33 and the sound time measurement unit 34 are measured.
  • the process until the silent time and the voiced time are measured is the same as that of the sound adjustment device 21 of the first embodiment.
  • the speaker volume adjustment unit 42 controls the volume of the speaker 50.
  • the silent time threshold is preferably about 10 to 60 seconds
  • the volume decrease rate is preferably about 1 to 5%.
  • the speaker volume adjustment unit 42 adjusts the volume of the speaker 50 to gradually decrease in a quiet environment by setting the silent time threshold and the volume decrease rate.
  • the speaker volume adjustment unit 42 increases the volume (output gain) of the speaker 50 by the speaker volume adjustment unit 42 when the sound duration is longer than an arbitrary sound duration threshold set in advance.
  • the sound duration threshold is specifically about 5 to 20 seconds, and the rate of increase in volume is preferably about 10 to 50%, and is usually set based on the time during which a person can speak with one breath.
  • the speaker volume adjustment unit 42 adjusts the volume of the speaker 50 to be quickly increased in a noisy environment by setting the silent time threshold and the volume decrease rate.
  • FIG. 7 shows a processing flow for adjusting the speaker volume (output gain) in the sound adjustment device 22 of the second embodiment.
  • the processing flow will be described with reference to FIG. S11.
  • S12. In the audio signal intensity measurement unit 32, the intensity of the audio signal is compared with an arbitrary audio intensity threshold (or based on whether the audio is being recognized), and if the intensity of the audio signal is greater than the audio intensity threshold, If the intensity of the audio signal is smaller than the audio intensity threshold, it is determined that there is no sound.
  • the soundless time measuring unit 33 measures the soundless time (the duration of the section without the sound signal).
  • the process returns to the start of the sound adjustment processing (S11) without changing the volume of the speaker. If the silent time is longer than the silent time threshold, the process proceeds to the next process (S21). S21. If the silent time is longer than the silent time threshold, it is determined that the environment is quiet and there is no noise, the volume (output gain) of the speaker 50 is lowered at an arbitrary rate, and then the voice adjustment process is started. Return to (S11). S16. When the voice signal strength measuring unit 32 determines that there is sound (there is a voice signal), the voiced time measuring unit 34 measures the voiced time (the duration of a section in which the voice signal is present). S17.
  • the process proceeds to the next process (S22). S22. If the sound time is longer than the sound time threshold, it is determined that the sound is ambient noise, the sound volume (output gain) of the speaker 50 is increased by an arbitrary ratio, and then the sound adjustment process is started (S11). Return to.
  • the volume (output gain) of the speaker 50 is adjusted according to the intensity of ambient noise, the volume of human conversation is easily adjusted according to the usage environment. be able to.
  • FIG. 8 shows the configuration of the sound adjustment device 23 according to the third embodiment.
  • the sound adjustment device 23 according to the third embodiment is a combination of the sound adjustment device 21 according to the first embodiment and the sound adjustment device 22 according to the second embodiment.
  • the voice adjustment device 23 according to the third embodiment adjusts the sensitivity (input gain) of the microphone so that the voice recognition function is in a good state in a quiet environment or a noisy environment where there is a lot of noise.
  • This is an audio adjustment device that adjusts the volume (output gain) of a speaker so that it can be easily heard.
  • the voice adjustment device 23 of the third embodiment has the configuration of the voice adjustment device 21 of the first embodiment and the configuration of the voice adjustment device 22 of the second embodiment.
  • the details of each component of the sound adjustment device 23 are the same as those in the first and second embodiments, and thus the description thereof is omitted.
  • FIG. 9 shows a processing flow of sound adjustment of the microphone and the speaker by the sound adjustment device 23 of the third embodiment.
  • the processing flow will be described with reference to FIG. S11.
  • S12. In the audio signal intensity measurement unit 32, the intensity of the audio signal is compared with an arbitrary audio intensity threshold (or based on whether the audio is being recognized), and if the intensity of the audio signal is greater than the audio intensity threshold, If the intensity of the audio signal is smaller than the audio intensity threshold, it is determined that there is no sound. S13.
  • the soundless time measuring unit 33 measures the soundless time (the duration of the section without the sound signal).
  • the process returns to the start of the sound adjustment processing (S11) without changing the microphone sensitivity and the speaker volume. If the silent time is longer than the silent time threshold, the process proceeds to the next process (S15). S15. If the silent time is longer than the silent time threshold, it is determined that the microphone sensitivity is sufficient in a quiet environment with no surrounding noise, and the microphone sensitivity (input sensitivity) is increased by an arbitrary ratio. After lowering the speaker volume (output gain) at an arbitrary rate, the process returns to the start of the audio adjustment process (S11). S16.
  • the voiced time measuring unit 34 measures the voiced time (the duration of a section in which the voice signal is present). S17. If the sound duration is shorter than an arbitrary sound duration threshold, it is determined that the conversation is normal, and the process returns to the start of the sound adjustment processing (S11) without changing the microphone sensitivity and the speaker volume. If the sound time is longer than the sound time threshold, the process proceeds to the next process (S18). S18. If the sound time is longer than the sound time threshold, it is determined that there is constant noise around the sound, and the microphone sensitivity (input sensitivity) is reduced by an arbitrary ratio. ) At an arbitrary rate, and then the process returns to the start of the audio adjustment process (S11).
  • the ambient noise and the human conversation are distinguished based on the length of time during which the silent time or the voiced time lasts. Since the speaker volume is set, it is possible to easily adjust the volume of human conversation according to the usage environment. In addition, by providing the voice adjustment device 23 of the present invention, it is possible to improve the voice recognition rate in a device having a voice recognition function.
  • the present invention can be used in any device having a voice recognition function and a voice response function.

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Quality & Reliability (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Telephone Function (AREA)

Abstract

L'invention concerne un dispositif de réglage de système vocal, capable d'améliorer un taux de reconnaissance vocale par un réglage approprié du gain à l'entrée et du gain à la sortie d'un signal vocal en réponse à environnement d'utilisation. Ce dispositif de réglage de système vocal (20) est pourvu d'une unité de réglage de système vocal (40) pour régler un signal vocal en comparant une période silencieuse ou une période sonore à un seuil temporel prédéterminé, la période silencieuse étant la continuation d'un état silencieux où l'intensité du signal vocal est inférieure à un seuil d'intensité vocale, et la période sonore étant la continuation d'un état sonore où l'intensité du signal vocal est supérieure au seuil d'intensité vocale.
PCT/JP2015/055093 2014-10-29 2015-02-23 Dispositif de réglage de système vocal WO2016067644A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2014-219802 2014-10-29
JP2014219802A JP5863928B1 (ja) 2014-10-29 2014-10-29 音声調整装置

Publications (1)

Publication Number Publication Date
WO2016067644A1 true WO2016067644A1 (fr) 2016-05-06

Family

ID=55346919

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/055093 WO2016067644A1 (fr) 2014-10-29 2015-02-23 Dispositif de réglage de système vocal

Country Status (2)

Country Link
JP (1) JP5863928B1 (fr)
WO (1) WO2016067644A1 (fr)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108735207A (zh) * 2017-04-25 2018-11-02 丰田自动车株式会社 声音对话系统、声音对话方法及计算机可读存储介质
WO2021040834A1 (fr) * 2019-08-29 2021-03-04 Microsoft Technology Licensing, Llc Fonction de réglage automatique de la sensibilité au langage

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP6887315B2 (ja) * 2017-06-05 2021-06-16 キヤノン株式会社 音声処理装置およびその制御方法、プログラム並びに記憶媒体
JP7404664B2 (ja) 2019-06-07 2023-12-26 ヤマハ株式会社 音声処理装置及び音声処理方法

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58190993A (ja) * 1982-05-01 1983-11-08 日産自動車株式会社 車両用音声検出装置
JPH1091184A (ja) * 1996-09-12 1998-04-10 Oki Electric Ind Co Ltd 音声検出装置
JP2006209069A (ja) * 2004-12-28 2006-08-10 Advanced Telecommunication Research Institute International 音声区間検出装置および音声区間検出プログラム
WO2008114448A1 (fr) * 2007-03-20 2008-09-25 Fujitsu Limited Système de reconnaissance vocale, programme de reconnaissance vocale, et procédé de reconnaissance vocale
JP2009175179A (ja) * 2008-01-21 2009-08-06 Denso Corp 音声認識装置、プログラム、及び発話信号抽出方法
JP2014075674A (ja) * 2012-10-03 2014-04-24 Oki Electric Ind Co Ltd 音声信号処理装置、方法及びプログラム

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS58190993A (ja) * 1982-05-01 1983-11-08 日産自動車株式会社 車両用音声検出装置
JPH1091184A (ja) * 1996-09-12 1998-04-10 Oki Electric Ind Co Ltd 音声検出装置
JP2006209069A (ja) * 2004-12-28 2006-08-10 Advanced Telecommunication Research Institute International 音声区間検出装置および音声区間検出プログラム
WO2008114448A1 (fr) * 2007-03-20 2008-09-25 Fujitsu Limited Système de reconnaissance vocale, programme de reconnaissance vocale, et procédé de reconnaissance vocale
JP2009175179A (ja) * 2008-01-21 2009-08-06 Denso Corp 音声認識装置、プログラム、及び発話信号抽出方法
JP2014075674A (ja) * 2012-10-03 2014-04-24 Oki Electric Ind Co Ltd 音声信号処理装置、方法及びプログラム

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108735207A (zh) * 2017-04-25 2018-11-02 丰田自动车株式会社 声音对话系统、声音对话方法及计算机可读存储介质
CN108735207B (zh) * 2017-04-25 2023-05-02 丰田自动车株式会社 声音对话系统、声音对话方法及计算机可读存储介质
WO2021040834A1 (fr) * 2019-08-29 2021-03-04 Microsoft Technology Licensing, Llc Fonction de réglage automatique de la sensibilité au langage

Also Published As

Publication number Publication date
JP2016085420A (ja) 2016-05-19
JP5863928B1 (ja) 2016-02-17

Similar Documents

Publication Publication Date Title
US10631087B2 (en) Method and device for voice operated control
US10579327B2 (en) Speech recognition device, speech recognition method and storage medium using recognition results to adjust volume level threshold
EP2860730B1 (fr) Traitement de la parole
US7171357B2 (en) Voice-activity detection using energy ratios and periodicity
JP5863928B1 (ja) 音声調整装置
US10320967B2 (en) Signal processing device, non-transitory computer-readable storage medium, signal processing method, and telephone apparatus
JP2009178783A (ja) コミュニケーションロボット及びその制御方法
EP2743923B1 (fr) Dispositif et procédé de traitement vocal
US9749741B1 (en) Systems and methods for reducing intermodulation distortion
JP4876245B2 (ja) 子音加工装置、音声情報伝達装置及び子音加工方法
KR20200026896A (ko) 음성 신호 레벨링
WO2016017229A1 (fr) Dispositif de détection de segments de parole, système de traitement de la voix, procédé de détection de segments de parole, et programme associé
US20200152185A1 (en) Method and Device for Voice Operated Control
JP2012168296A (ja) 音声による抑圧状態検出装置およびプログラム
JPS6257040B2 (fr)
KR101602298B1 (ko) 음량측정기를 이용한 오디오시스템
JP3284968B2 (ja) 話速変換機能を有する補聴器
WO2020217605A1 (fr) Dispositif de traitement audio
US7664635B2 (en) Adaptive voice detection method and system
Kyriakides et al. Isolated word endpoint detection using time-frequency variance kernels
JP2005010652A (ja) 音声検出装置
JP2020161884A (ja) 音声処理装置、音声処理方法、および音声処理システム
JP2870421B2 (ja) 話速変換機能を有する補聴器
JP3257379B2 (ja) 話速変換機能付補聴器
KR20160149736A (ko) 음성 인식 장치 및 그 동작 방법

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 15853653

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 15853653

Country of ref document: EP

Kind code of ref document: A1