CN114746939B - 信息处理装置、检测方法和记录介质 - Google Patents

信息处理装置、检测方法和记录介质

Info

Publication number
CN114746939B
CN114746939B CN201980102693.6A CN201980102693A CN114746939B CN 114746939 B CN114746939 B CN 114746939B CN 201980102693 A CN201980102693 A CN 201980102693A CN 114746939 B CN114746939 B CN 114746939B
Authority
CN
China
Prior art keywords
sound signal
intervals
power
value
interval
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN201980102693.6A
Other languages
English (en)
Chinese (zh)
Other versions
CN114746939A (zh
Inventor
花泽利行
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric Corp filed Critical Mitsubishi Electric Corp
Publication of CN114746939A publication Critical patent/CN114746939A/zh
Application granted granted Critical
Publication of CN114746939B publication Critical patent/CN114746939B/zh
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/22Procedures used during a speech recognition process, e.g. man-machine dialogue
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/21Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L25/84Detection of presence or absence of voice signals for discriminating voice from noise

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Telephone Function (AREA)
  • Forklifts And Lifting Vehicles (AREA)
CN201980102693.6A 2019-12-13 2019-12-13 信息处理装置、检测方法和记录介质 Active CN114746939B (zh)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/JP2019/048921 WO2021117219A1 (ja) 2019-12-13 2019-12-13 情報処理装置、検出方法、及び検出プログラム

Publications (2)

Publication Number Publication Date
CN114746939A CN114746939A (zh) 2022-07-12
CN114746939B true CN114746939B (zh) 2025-09-30

Family

ID=76330100

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201980102693.6A Active CN114746939B (zh) 2019-12-13 2019-12-13 信息处理装置、检测方法和记录介质

Country Status (5)

Country Link
US (1) US20220262392A1 (https=)
EP (1) EP4060662B1 (https=)
JP (1) JP7012917B2 (https=)
CN (1) CN114746939B (https=)
WO (1) WO2021117219A1 (https=)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP7653311B2 (ja) * 2021-06-21 2025-03-28 アルインコ株式会社 無線通信装置及び無線通信システム
KR102516391B1 (ko) * 2022-09-02 2023-04-03 주식회사 액션파워 음성 구간 길이를 고려하여 오디오에서 음성 구간을 검출하는 방법
CN120677526A (zh) * 2023-02-07 2025-09-19 杜比实验室特许公司 用于语音分类器的鲁棒处理的方法和系统

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001067092A (ja) * 1999-08-26 2001-03-16 Matsushita Electric Ind Co Ltd 音声検出装置
CN103380457A (zh) * 2011-12-02 2013-10-30 松下电器产业株式会社 声音处理装置、方法、程序及集成电路

Family Cites Families (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CA1090019A (en) * 1976-11-23 1980-11-18 Federico Vagliani Method and apparatus for detecting the presence of a speech signal on a voice channel signal
JPS62265699A (ja) * 1986-05-14 1987-11-18 富士通株式会社 単語音声認識装置
US5442712A (en) * 1992-11-25 1995-08-15 Matsushita Electric Industrial Co., Ltd. Sound amplifying apparatus with automatic howl-suppressing function
BE1007355A3 (nl) * 1993-07-26 1995-05-23 Philips Electronics Nv Spraaksignaaldiscriminatieschakeling alsmede een audio-inrichting voorzien van een dergelijke schakeling.
US6175634B1 (en) * 1995-08-28 2001-01-16 Intel Corporation Adaptive noise reduction technique for multi-point communication system
JP3607775B2 (ja) * 1996-04-15 2005-01-05 オリンパス株式会社 音声状態判別装置
JP3888727B2 (ja) 1997-04-15 2007-03-07 三菱電機株式会社 音声区間検出方法、音声認識方法、音声区間検出装置及び音声認識装置
JPH1124692A (ja) * 1997-07-01 1999-01-29 Nippon Telegr & Teleph Corp <Ntt> 音声波の有音/休止区間判定方法およびその装置
JP2000250568A (ja) * 1999-02-26 2000-09-14 Kobe Steel Ltd 音声区間検出装置
JP3812887B2 (ja) 2001-12-21 2006-08-23 富士通株式会社 信号処理システムおよび方法
DE602005006536D1 (de) * 2004-03-01 2008-06-19 Gn Resound As Hörgerät mit automatischer umschaltung zwischen betriebsarten
JP4791857B2 (ja) * 2006-03-02 2011-10-12 日本放送協会 発話区間検出装置及び発話区間検出プログラム
JP5229234B2 (ja) 2007-12-18 2013-07-03 富士通株式会社 非音声区間検出方法及び非音声区間検出装置
WO2011111091A1 (ja) * 2010-03-09 2011-09-15 三菱電機株式会社 雑音抑圧装置
WO2012036305A1 (ja) 2010-09-17 2012-03-22 日本電気株式会社 音声認識装置、音声認識方法、及びプログラム
JP5971047B2 (ja) * 2012-09-12 2016-08-17 沖電気工業株式会社 音声信号処理装置、方法及びプログラム
FR3014237B1 (fr) * 2013-12-02 2016-01-08 Adeunis R F Procede de detection de la voix
WO2016116961A1 (ja) * 2015-01-21 2016-07-28 三菱電機株式会社 情報処理装置および情報処理方法
CN106571146B (zh) * 2015-10-13 2019-10-15 阿里巴巴集团控股有限公司 噪音信号确定方法、语音去噪方法及装置
WO2018217059A1 (en) * 2017-05-25 2018-11-29 Samsung Electronics Co., Ltd. Method and electronic device for managing loudness of audio signal
JP2021113835A (ja) * 2018-04-19 2021-08-05 ソニーグループ株式会社 音声処理装置および音声処理方法

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2001067092A (ja) * 1999-08-26 2001-03-16 Matsushita Electric Ind Co Ltd 音声検出装置
CN103380457A (zh) * 2011-12-02 2013-10-30 松下电器产业株式会社 声音处理装置、方法、程序及集成电路

Also Published As

Publication number Publication date
JP7012917B2 (ja) 2022-01-28
EP4060662A4 (en) 2023-03-08
US20220262392A1 (en) 2022-08-18
CN114746939A (zh) 2022-07-12
JPWO2021117219A1 (https=) 2021-06-17
WO2021117219A1 (ja) 2021-06-17
EP4060662A1 (en) 2022-09-21
EP4060662B1 (en) 2025-12-03

Similar Documents

Publication Publication Date Title
CN114746939B (zh) 信息处理装置、检测方法和记录介质
Zhou et al. Deep Speaker Embedding Extraction with Channel-Wise Feature Responses and Additive Supervision Softmax Loss Function.
CN106531172A (zh) 基于环境噪声变化检测的说话人语音回放鉴别方法及系统
Chang et al. Temporal modeling using dilated convolution and gating for voice-activity-detection
RU2417456C2 (ru) Системы, способы и устройства для обнаружения изменения сигналов
KR101158291B1 (ko) 음성 활동 검출 디바이스 및 방법
US7120576B2 (en) Low-complexity music detection algorithm and system
US20200395042A1 (en) Learning device, voice activity detector, and method for detecting voice activity
US20230401338A1 (en) Method for detecting an audio adversarial attack with respect to a voice input processed by an automatic speech recognition system, corresponding device, computer program product and computer-readable carrier medium
CN112992153B (zh) 音频处理方法、声纹识别方法、装置、计算机设备
JP2022519391A (ja) 話者認識システムおよびその使用方法
KR100800873B1 (ko) 음성 신호 검출 시스템 및 방법
EP3254282A1 (en) Determining features of harmonic signals
JP7471139B2 (ja) 話者ダイアライゼーション装置、及び話者ダイアライゼーション方法
GB2576960A (en) Speaker recognition
CN113628248B (zh) 行人驻留时长确定方法、装置以及计算机可读存储介质
CN112955954A (zh) 用于音频场景分类的音频处理装置及其方法
US11087746B2 (en) Information processing device, information processing method, and program
EP2328143B1 (en) Human voice distinguishing method and device
US20090150164A1 (en) Tri-model audio segmentation
US8831763B1 (en) Intelligent interest point pruning for audio matching
JP7273078B2 (ja) 話者埋め込みに基づく音声活動検出を利用した話者ダイアライゼーション方法、システム、およびコンピュータプログラム
US20130297311A1 (en) Information processing apparatus, information processing method and information processing program
JP2021103202A (ja) 更新プログラム、更新方法および情報処理装置
US8837263B1 (en) Automatic on-drive sync-mark search and threshold adjustment

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant