JP7012917B2 - 情報処理装置、検出方法、及び検出プログラム - Google Patents

情報処理装置、検出方法、及び検出プログラム Download PDF

Info

Publication number
JP7012917B2
JP7012917B2 JP2021559189A JP2021559189A JP7012917B2 JP 7012917 B2 JP7012917 B2 JP 7012917B2 JP 2021559189 A JP2021559189 A JP 2021559189A JP 2021559189 A JP2021559189 A JP 2021559189A JP 7012917 B2 JP7012917 B2 JP 7012917B2
Authority
JP
Japan
Prior art keywords
sound signal
section
sections
value
power
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2021559189A
Other languages
English (en)
Japanese (ja)
Other versions
JPWO2021117219A1 (https=
Inventor
利行 花澤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Publication of JPWO2021117219A1 publication Critical patent/JPWO2021117219A1/ja
Application granted granted Critical
Publication of JP7012917B2 publication Critical patent/JP7012917B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Forklifts And Lifting Vehicles (AREA)
JP2021559189A 2019-12-13 2019-12-13 情報処理装置、検出方法、及び検出プログラム Active JP7012917B2 (ja)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/048921 WO2021117219A1 (ja) 2019-12-13 2019-12-13 情報処理装置、検出方法、及び検出プログラム

Publications (2)

Publication Number Publication Date
JPWO2021117219A1 JPWO2021117219A1 (https=) 2021-06-17
JP7012917B2 true JP7012917B2 (ja) 2022-01-28

Family

ID=76330100

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2021559189A Active JP7012917B2 (ja) 2019-12-13 2019-12-13 情報処理装置、検出方法、及び検出プログラム

Country Status (5)

Country Link
US (1) US20220262392A1 (https=)
EP (1) EP4060662B1 (https=)
JP (1) JP7012917B2 (https=)
CN (1) CN114746939B (https=)
WO (1) WO2021117219A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7653311B2 (ja) * 2021-06-21 2025-03-28 アルインコ株式会社 無線通信装置及び無線通信システム
KR102516391B1 (ko) * 2022-09-02 2023-04-03 주식회사 액션파워 음성 구간 길이를 고려하여 오디오에서 음성 구간을 검출하는 방법
CN120677526A (zh) * 2023-02-07 2025-09-19 杜比实验室特许公司 用于语音分类器的鲁棒处理的方法和系统

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3812887B2 (ja) 2001-12-21 2006-08-23 富士通株式会社 信号処理システムおよび方法
WO2009078093A1 (ja) 2007-12-18 2009-06-25 Fujitsu Limited 非音声区間検出方法及び非音声区間検出装置
WO2012036305A1 (ja) 2010-09-17 2012-03-22 日本電気株式会社 音声認識装置、音声認識方法、及びプログラム

Family Cites Families (20)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1090019A (en) * 1976-11-23 1980-11-18 Federico Vagliani Method and apparatus for detecting the presence of a speech signal on a voice channel signal
JPS62265699A (ja) * 1986-05-14 1987-11-18 富士通株式会社 単語音声認識装置
US5442712A (en) * 1992-11-25 1995-08-15 Matsushita Electric Industrial Co., Ltd. Sound amplifying apparatus with automatic howl-suppressing function
BE1007355A3 (nl) * 1993-07-26 1995-05-23 Philips Electronics Nv Spraaksignaaldiscriminatieschakeling alsmede een audio-inrichting voorzien van een dergelijke schakeling.
US6175634B1 (en) * 1995-08-28 2001-01-16 Intel Corporation Adaptive noise reduction technique for multi-point communication system
JP3607775B2 (ja) * 1996-04-15 2005-01-05 オリンパス株式会社 音声状態判別装置
JP3888727B2 (ja) 1997-04-15 2007-03-07 三菱電機株式会社 音声区間検出方法、音声認識方法、音声区間検出装置及び音声認識装置
JPH1124692A (ja) * 1997-07-01 1999-01-29 Nippon Telegr & Teleph Corp <Ntt> 音声波の有音/休止区間判定方法およびその装置
JP2000250568A (ja) * 1999-02-26 2000-09-14 Kobe Steel Ltd 音声区間検出装置
JP2001067092A (ja) * 1999-08-26 2001-03-16 Matsushita Electric Ind Co Ltd 音声検出装置
DE602005006536D1 (de) * 2004-03-01 2008-06-19 Gn Resound As Hörgerät mit automatischer umschaltung zwischen betriebsarten
JP4791857B2 (ja) * 2006-03-02 2011-10-12 日本放送協会 発話区間検出装置及び発話区間検出プログラム
WO2011111091A1 (ja) * 2010-03-09 2011-09-15 三菱電機株式会社 雑音抑圧装置
CN103380457B (zh) * 2011-12-02 2016-05-18 松下电器(美国)知识产权公司 声音处理装置、方法及集成电路
JP5971047B2 (ja) * 2012-09-12 2016-08-17 沖電気工業株式会社 音声信号処理装置、方法及びプログラム
FR3014237B1 (fr) * 2013-12-02 2016-01-08 Adeunis R F Procede de detection de la voix
WO2016116961A1 (ja) * 2015-01-21 2016-07-28 三菱電機株式会社 情報処理装置および情報処理方法
CN106571146B (zh) * 2015-10-13 2019-10-15 阿里巴巴集团控股有限公司 噪音信号确定方法、语音去噪方法及装置
WO2018217059A1 (en) * 2017-05-25 2018-11-29 Samsung Electronics Co., Ltd. Method and electronic device for managing loudness of audio signal
JP2021113835A (ja) * 2018-04-19 2021-08-05 ソニーグループ株式会社 音声処理装置および音声処理方法

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3812887B2 (ja) 2001-12-21 2006-08-23 富士通株式会社 信号処理システムおよび方法
WO2009078093A1 (ja) 2007-12-18 2009-06-25 Fujitsu Limited 非音声区間検出方法及び非音声区間検出装置
WO2012036305A1 (ja) 2010-09-17 2012-03-22 日本電気株式会社 音声認識装置、音声認識方法、及びプログラム

Also Published As

Publication number Publication date
EP4060662A4 (en) 2023-03-08
US20220262392A1 (en) 2022-08-18
CN114746939A (zh) 2022-07-12
CN114746939B (zh) 2025-09-30
JPWO2021117219A1 (https=) 2021-06-17
WO2021117219A1 (ja) 2021-06-17
EP4060662A1 (en) 2022-09-21
EP4060662B1 (en) 2025-12-03

Similar Documents

Publication Publication Date Title
JP7012917B2 (ja) 情報処理装置、検出方法、及び検出プログラム
CN106531172B (zh) 基于环境噪声变化检测的说话人语音回放鉴别方法及系统
RU2417456C2 (ru) Системы, способы и устройства для обнаружения изменения сигналов
US9484036B2 (en) Method and apparatus for detecting synthesized speech
US8005675B2 (en) Apparatus and method for audio analysis
CA3031819C (en) Systems and methods for cluster-based voice verification
US20190279298A1 (en) Information auditing method, apparatus, electronic device and computer readable storage medium
JP2020525817A (ja) 声紋認識方法、装置、端末機器および記憶媒体
US20230401338A1 (en) Method for detecting an audio adversarial attack with respect to a voice input processed by an automatic speech recognition system, corresponding device, computer program product and computer-readable carrier medium
CN112992153B (zh) 音频处理方法、声纹识别方法、装置、计算机设备
US11545159B1 (en) Computerized monitoring of digital audio signals
US20200082830A1 (en) Speaker recognition
Lee et al. Dual attention in time and frequency domain for voice activity detection
HUE034664T2 (hu) Eljárás és berendezés pitch periódus helyességének detektálására
JP4102745B2 (ja) 音声区間検出装置および方法
EP2328143B1 (en) Human voice distinguishing method and device
US8831763B1 (en) Intelligent interest point pruning for audio matching
US20130297311A1 (en) Information processing apparatus, information processing method and information processing program
JP7380188B2 (ja) 更新プログラム、更新方法および情報処理装置
TW202526912A (zh) 藉助於偵測自定義詞的語音特徵對聲控裝置進行喚醒控制之方法及處理電路
KR101804787B1 (ko) 음질특징을 이용한 화자인식장치 및 방법
US20260052342A1 (en) Method and system for managing speaker damage in electronic device
Barguil et al. Anomaly Detection Algorithm for Acoustics Phenomena
Xia Deep Neural Network Based Representation Learning and Modeling for Robust Speaker Recognition
KR20250028053A (ko) 분절모델 기반 소리 이벤트 검출장치 및 그 방법

Legal Events

Date Code Title Description
A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20211004

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20211004

A871 Explanation of circumstances concerning accelerated examination

Free format text: JAPANESE INTERMEDIATE CODE: A871

Effective date: 20211004

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20211221

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20220118

R150 Certificate of patent or registration of utility model

Ref document number: 7012917

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R150

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250

R250 Receipt of annual fees

Free format text: JAPANESE INTERMEDIATE CODE: R250