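CN104221018A - Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program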

Info

Publication number
CN104221018A
Authority
CN
China
Prior art keywords
time
unit
sound
frequency
feature value
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201380019489.0A
Other languages
English (en)
Chinese (zh)
Inventor
安部素嗣
西口正之
仓田宜典
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sony Corp
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sony Corp
Publication of CN104221018A
Legal status: Pending

Classifications

    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/18: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being spectral information of each sub-band
    • G: PHYSICS
    • G06: COMPUTING OR CALCULATING; COUNTING
    • G06F: ELECTRIC DIGITAL DATA PROCESSING
    • G06F16/00: Information retrieval; Database structures therefor; File system structures therefor
    • G06F16/60: Information retrieval; Database structures therefor; File system structures therefor of audio data
    • G06F16/68: Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually
    • G06F16/686: Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually using information manually generated, e.g. tags, keywords, comments, title or artist information, time, location or usage information, user ratings
    • G: PHYSICS
    • G10: MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L: SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48: Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Theoretical Computer Science (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Library & Information Science (AREA)
  • Databases & Information Systems (AREA)
  • General Engineering & Computer Science (AREA)
  • General Physics & Mathematics (AREA)
  • Data Mining & Analysis (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
  • Electrophonic Musical Instruments (AREA)
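CN201380019489.0A 2012-04-18 2013-04-16 Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program Pending CN104221018A (zh)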

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
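JP2012094395A JP5998603B2 (ja) 2012-04-18 2012-04-18 Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program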
JP2012-094395 2012-04-18
PCT/JP2013/002581 WO2013157254A1 (en) 2012-04-18 2013-04-16 Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program

Publications (1)

Publication Number Publication Date
CN104221018A (zh) 2014-12-17

Family

ID=48652284

Family Applications (1)

Application Number Title Priority Date Filing Date
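CN201380019489.0A Pending CN104221018A (zh) 2012-04-18 2013-04-16 Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program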

Country Status (5)

Country Link
US (1) US20150043737A1
JP (1) JP5998603B2
CN (1) CN104221018A
IN (1) IN2014DN08472A
WO (1) WO2013157254A1

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
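CN106251860A (zh) * 2016-08-09 2016-12-21 张爱英 Unsupervised novelty audio event detection method and system for the security field
CN108291837A (zh) * 2015-12-09 2018-07-17 三菱电机株式会社 Degraded-part estimation device, degraded-part estimation method, and diagnosis system for a moving body
CN111199749A (zh) * 2018-11-20 2020-05-26 松下电器(美国)知识产权公司 Behavior recognition method and device, machine learning method and device, and recording medium
CN112071333A (zh) * 2019-06-11 2020-12-11 纳宝株式会社 Electronic device for dynamic note matching and operating method thereof
CN112230113A (zh) * 2019-06-28 2021-01-15 瑞萨电子株式会社 Abnormality detection system and abnormality detection program
CN115931358A (zh) * 2023-02-24 2023-04-07 沈阳工业大学 Method for diagnosing bearing-fault acoustic emission signals at low signal-to-noise ratio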

Families Citing this family (22)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20150179167A1 (en) * 2013-12-19 2015-06-25 Kirill Chekhter Phoneme signature candidates for speech recognition
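CN103793190A (zh) * 2014-02-07 2014-05-14 北京京东方视讯科技有限公司 Information display method, information display device, and display apparatus
JP6362358B2 (ja) * 2014-03-05 2018-07-25 大阪瓦斯株式会社 Work completion notification device
CN104217722B (zh) * 2014-08-22 2017-07-11 哈尔滨工程大学 Method for extracting the time-frequency spectrum contour of dolphin whistle signals
CN104810025B (zh) * 2015-03-31 2018-04-20 天翼爱音乐文化科技有限公司 Audio similarity detection method and device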
US10178474B2 (en) * 2015-04-21 2019-01-08 Google Llc Sound signature database for initialization of noise reduction in recordings
US10079012B2 (en) 2015-04-21 2018-09-18 Google Llc Customizing speech-recognition dictionaries in a smart-home environment
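JP6524814B2 (ja) * 2015-06-18 2019-06-05 Tdk株式会社 Conversation detection device and conversation detection method
JP6448477B2 (ja) * 2015-06-19 2019-01-09 株式会社東芝 Behavior determination device and behavior determination method
CN105391501B (zh) * 2015-10-13 2017-11-21 哈尔滨工程大学 Dolphin-whistle-mimicking underwater acoustic communication method based on time-frequency spectrum shifting
CN105871475B (zh) * 2016-05-25 2018-05-18 哈尔滨工程大学 Whale-call-mimicking covert underwater acoustic communication method based on adaptive interference cancellation
JP6640702B2 (ja) * 2016-12-08 2020-02-05 日本電信電話株式会社 Time-series signal feature estimation device and program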
US9870719B1 (en) * 2017-04-17 2018-01-16 Hz Innovations Inc. Apparatus and method for wireless sound recognition to notify users of detected sounds
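JP7017488B2 (ja) * 2018-09-14 2022-02-08 株式会社日立製作所 Sound inspection system and sound inspection method
JP6759479B1 (ja) * 2020-03-24 2020-09-23 株式会社 日立産業制御ソリューションズ Acoustic analysis support system and acoustic analysis support method
JP7417732B2 (ja) * 2020-06-15 2024-01-18 株式会社日立製作所 Automatic inspection system and wireless slave unit
KR102260466B1 (ko) * 2020-06-19 2021-06-03 주식회사 코클리어닷에이아이 Lifelog device utilizing audio recognition and method therefor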
US11410676B2 (en) * 2020-11-18 2022-08-09 Haier Us Appliance Solutions, Inc. Sound monitoring and user assistance methods for a microwave oven
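CN112885374A (zh) * 2021-01-27 2021-06-01 吴怡然 Method and system for judging sound pitch accuracy based on spectrum analysis
CN113724734B (zh) * 2021-08-31 2023-07-25 上海师范大学 Sound event detection method and device, storage medium, and electronic device
CN115854269B (zh) * 2021-09-24 2025-04-04 中国石油化工股份有限公司 Leak-hole jet noise identification method and device, electronic apparatus, and storage medium
JP7748058B1 (ja) * 2025-05-23 2025-10-02 Foonz株式会社 Program, information processing device, method, and system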

Family Cites Families (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5765127A (en) * 1992-03-18 1998-06-09 Sony Corp High efficiency encoding method
JPH0926354A (ja) * 1995-07-13 1997-01-28 Sharp Corp Audio/video device
US5956674A (en) * 1995-12-01 1999-09-21 Digital Theater Systems, Inc. Multi-channel predictive subband audio coder using psychoacoustic adaptive bit allocation in frequency, time and over the multiple channels
BRPI0608270A2 (pt) * 2005-04-01 2009-10-06 Qualcomm Inc Systems, methods, and apparatus for anti-sparseness filtering
CN101199003B (zh) * 2005-04-22 2012-01-11 高通股份有限公司 Systems, methods, and apparatus for gain factor attenuation
WO2007087824A1 (de) * 2006-01-31 2007-08-09 Siemens Enterprise Communications Gmbh & Co. Kg Methods and arrangements for audio signal encoding
US20100332222A1 (en) * 2006-09-29 2010-12-30 National Chiao Tung University Intelligent classification method of vocal signal
US20080300702A1 (en) * 2007-05-29 2008-12-04 Universitat Pompeu Fabra Music similarity systems and methods using descriptors
JP2009008823A (ja) * 2007-06-27 2009-01-15 Fujitsu Ltd Acoustic recognition device, acoustic recognition method, and acoustic recognition program
US20090198500A1 (en) * 2007-08-24 2009-08-06 Qualcomm Incorporated Temporal masking in audio coding based on spectral dynamics in frequency sub-bands
JP4788810B2 (ja) 2009-08-17 2011-10-05 ソニー株式会社 Music identification device and method, and music identification and distribution device and method

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
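CN108291837A (zh) * 2015-12-09 2018-07-17 三菱电机株式会社 Degraded-part estimation device, degraded-part estimation method, and diagnosis system for a moving body
CN106251860A (zh) * 2016-08-09 2016-12-21 张爱英 Unsupervised novelty audio event detection method and system for the security field
CN106251860B (zh) * 2016-08-09 2020-02-11 张爱英 Unsupervised novelty audio event detection method and system for the security field
CN111199749A (zh) * 2018-11-20 2020-05-26 松下电器(美国)知识产权公司 Behavior recognition method and device, machine learning method and device, and recording medium
CN111199749B (zh) * 2018-11-20 2024-05-24 松下电器(美国)知识产权公司 Behavior recognition method and device, machine learning method and device, and recording medium
CN112071333A (zh) * 2019-06-11 2020-12-11 纳宝株式会社 Electronic device for dynamic note matching and operating method thereof
CN112230113A (zh) * 2019-06-28 2021-01-15 瑞萨电子株式会社 Abnormality detection system and abnormality detection program
CN115931358A (zh) * 2023-02-24 2023-04-07 沈阳工业大学 Method for diagnosing bearing-fault acoustic emission signals at low signal-to-noise ratio
CN115931358B (zh) * 2023-02-24 2023-09-12 沈阳工业大学 Method for diagnosing bearing-fault acoustic emission signals at low signal-to-noise ratio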

Also Published As

Publication number Publication date
IN2014DN08472A 2015-05-08
US20150043737A1 (en) 2015-02-12
JP5998603B2 (ja) 2016-09-28
WO2013157254A1 (en) 2013-10-24
JP2013222113A (ja) 2013-10-28

Similar Documents

Publication Publication Date Title
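CN104221018A (zh) Sound detecting apparatus, sound detecting method, sound feature value detecting apparatus, sound feature value detecting method, sound section detecting apparatus, sound section detecting method, and program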
US10127922B2 (en) Sound source identification apparatus and sound source identification method
RU2373584C2 Method and apparatus for improving speech intelligibility using multiple sensors
US9536547B2 (en) Speaker change detection device and speaker change detection method
JP6454916B2 Speech processing device, speech processing method, and program
US10748544B2 (en) Voice processing device, voice processing method, and program
KR100905586B1 System and method for evaluating microphone performance for far-field speech recognition in robots
US20060031066A1 (en) Isolating speech signals utilizing neural networks
CA2847689A1 (en) System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
JP6371516B2 Acoustic signal processing device and method
JP2022547525A System and method for generating audio signals
JPWO2020013296A1 Device for estimating mental and nervous system diseases
US20230230599A1 (en) Data augmentation system and method for multi-microphone systems
JP5803125B2 Device and program for detecting a suppressed state from voice
Poorjam et al. A parametric approach for classification of distortions in pathological voices
KR20190140780A Music genre classification device and method
JP6724290B2 Sound processing device, sound processing method, and program
US12387747B2 (en) Voice activity detection apparatus, learning apparatus, and storage medium
JP6891736B2 Speech processing program, speech processing method, and speech processing device
US12456456B2 (en) Data augmentation system and method for multi-microphone systems
JP7176325B2 Speech processing program, speech processing method, and speech processing device
Kumar et al. Machine learning for audio processing: From feature extraction to model selection
US12243514B2 (en) Data augmentation system and method for multi-microphone systems
US20230230581A1 (en) Data augmentation system and method for multi-microphone systems
Kumar et al. Machine learning for audio processing: from feature

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C02 Deemed withdrawal of patent application after publication (patent law 2001)
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20141217