JP7079160B2 - 集音装置、集音装置の制御方法 - Google Patents

集音装置、集音装置の制御方法 Download PDF

Info

Publication number
JP7079160B2
JP7079160B2 JP2018125290A JP2018125290A JP7079160B2 JP 7079160 B2 JP7079160 B2 JP 7079160B2 JP 2018125290 A JP2018125290 A JP 2018125290A JP 2018125290 A JP2018125290 A JP 2018125290A JP 7079160 B2 JP7079160 B2 JP 7079160B2
Authority
JP
Japan
Prior art keywords
sound
noise
captured image
sound collecting
human body
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
JP2018125290A
Other languages
English (en)
Japanese (ja)
Other versions
JP2020003724A5 (enExample
JP2020003724A (ja
Inventor
智彦 黒木
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Canon Inc filed Critical Canon Inc
Priority to JP2018125290A priority Critical patent/JP7079160B2/ja
Priority to US16/447,104 priority patent/US10812898B2/en
Publication of JP2020003724A publication Critical patent/JP2020003724A/ja
Publication of JP2020003724A5 publication Critical patent/JP2020003724A5/ja
Application granted granted Critical
Publication of JP7079160B2 publication Critical patent/JP7079160B2/ja
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/04Segmentation; Word boundary detection
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/20Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/28Constructional details of speech recognition systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/22Source localisation; Inverse modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0272Voice signal separating
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2201/00Details of transducers, loudspeakers or microphones covered by H04R1/00 but not provided for in any of its subgroups
    • H04R2201/40Details of arrangements for obtaining desired directional characteristic by combining a number of identical transducers covered by H04R1/40 but not provided for in any of its subgroups
    • H04R2201/4012D or 3D arrays of transducers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R2499/00Aspects covered by H04R or H04S not otherwise provided for in their subgroups
    • H04R2499/10General applications
    • H04R2499/13Acoustic transducers and sound field adaptation in vehicles

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Otolaryngology (AREA)
  • Quality & Reliability (AREA)
  • Image Analysis (AREA)
  • Circuit For Audible Band Transducer (AREA)
  • Studio Devices (AREA)
JP2018125290A 2018-06-29 2018-06-29 集音装置、集音装置の制御方法 Active JP7079160B2 (ja)

Priority Applications (2)

Application Number Priority Date Filing Date Title
JP2018125290A JP7079160B2 (ja) 2018-06-29 2018-06-29 集音装置、集音装置の制御方法
US16/447,104 US10812898B2 (en) 2018-06-29 2019-06-20 Sound collection apparatus, method of controlling sound collection apparatus, and non-transitory computer-readable storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP2018125290A JP7079160B2 (ja) 2018-06-29 2018-06-29 集音装置、集音装置の制御方法

Publications (3)

Publication Number Publication Date
JP2020003724A JP2020003724A (ja) 2020-01-09
JP2020003724A5 JP2020003724A5 (enExample) 2021-08-05
JP7079160B2 true JP7079160B2 (ja) 2022-06-01

Family

ID=69054836

Family Applications (1)

Application Number Title Priority Date Filing Date
JP2018125290A Active JP7079160B2 (ja) 2018-06-29 2018-06-29 集音装置、集音装置の制御方法

Country Status (2)

Country Link
US (1) US10812898B2 (enExample)
JP (1) JP7079160B2 (enExample)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2620960A (en) * 2022-07-27 2024-01-31 Nokia Technologies Oy Pair direction selection based on dominant audio direction
WO2025220283A1 (ja) * 2024-04-16 2025-10-23 ソニーグループ株式会社 撮像システム、表示方法及びプログラム

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005250397A (ja) 2004-03-08 2005-09-15 Nec Corp ロボット
US20060104454A1 (en) 2004-11-17 2006-05-18 Siemens Aktiengesellschaft Method for selectively picking up a sound signal
JP2009296232A (ja) 2008-06-04 2009-12-17 Casio Hitachi Mobile Communications Co Ltd 音入力装置、音入力方法およびプログラム
JP2017153065A (ja) 2016-02-25 2017-08-31 パナソニック株式会社 音声認識方法、音声認識装置及びプログラム

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7778425B2 (en) * 2003-12-24 2010-08-17 Nokia Corporation Method for generating noise references for generalized sidelobe canceling
US9197974B1 (en) * 2012-01-06 2015-11-24 Audience, Inc. Directional audio capture adaptation based on alternative sensory input
WO2015162645A1 (ja) * 2014-04-25 2015-10-29 パナソニックIpマネジメント株式会社 音声処理装置、音声処理システム、及び音声処理方法
JP2016046769A (ja) 2014-08-26 2016-04-04 パナソニックIpマネジメント株式会社 集音装置

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005250397A (ja) 2004-03-08 2005-09-15 Nec Corp ロボット
US20060104454A1 (en) 2004-11-17 2006-05-18 Siemens Aktiengesellschaft Method for selectively picking up a sound signal
JP2009296232A (ja) 2008-06-04 2009-12-17 Casio Hitachi Mobile Communications Co Ltd 音入力装置、音入力方法およびプログラム
JP2017153065A (ja) 2016-02-25 2017-08-31 パナソニック株式会社 音声認識方法、音声認識装置及びプログラム

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
中臺一博,"世界に飛び出す日本のソフトウェア ロボット聴覚用オープンソースソフトウェアHARKの展開",情報処理学会デジタルプラクティス,Vol.2, No.2,2011年04月15日,pp.133-140

Also Published As

Publication number Publication date
US20200007979A1 (en) 2020-01-02
US10812898B2 (en) 2020-10-20
JP2020003724A (ja) 2020-01-09

Similar Documents

Publication Publication Date Title
US11043231B2 (en) Speech enhancement method and apparatus for same
CN107534725B (zh) 一种语音信号处理方法及装置
JP5456832B2 (ja) 入力された発話の関連性を判定するための装置および方法
US9500739B2 (en) Estimating and tracking multiple attributes of multiple objects from multi-sensor data
CN109151442B (zh) 一种图像拍摄方法及终端
JP2012040655A (ja) ロボット制御方法、プログラム、及びロボット
JP6705656B2 (ja) 視覚補助装置及びオブジェクトの分類の検出方法
CN108989672B (zh) 一种拍摄方法及移动终端
CN107623778B (zh) 来电接听方法及移动终端
CN111091845A (zh) 音频处理方法、装置、终端设备及计算机存储介质
CN107592459A (zh) 一种拍照方法及移动终端
WO2017113937A1 (zh) 移动终端和降噪方法
KR20210017229A (ko) 오디오 줌 기능을 갖는 전자 장치 및 이의 동작 방법
EP4135314A1 (en) Camera-view acoustic fence
US20200365168A1 (en) Method for acquiring noise-refined voice signal, and electronic device for performing same
CN107749046A (zh) 一种图像处理方法及移动终端
JP2021105808A (ja) 発話者認識システム、発話者認識方法、及び発話者認識プログラム
CN112543295A (zh) 基于声源定位的车载视频通话方法、系统及设备
JP7079160B2 (ja) 集音装置、集音装置の制御方法
CN113506582A (zh) 声音信号识别方法、装置及系统
US10665243B1 (en) Subvocalized speech recognition
CN113014844A (zh) 一种音频处理方法、装置、存储介质及电子设备
CN109671034B (zh) 一种图像处理方法及终端设备
CN110942064A (zh) 图像处理方法、装置和电子设备
US12033654B2 (en) Sound pickup device and sound pickup method

Legal Events

Date Code Title Description
RD01 Notification of change of attorney

Free format text: JAPANESE INTERMEDIATE CODE: A7421

Effective date: 20210103

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20210113

A521 Request for written amendment filed

Free format text: JAPANESE INTERMEDIATE CODE: A523

Effective date: 20210625

A621 Written request for application examination

Free format text: JAPANESE INTERMEDIATE CODE: A621

Effective date: 20210625

A977 Report on retrieval

Free format text: JAPANESE INTERMEDIATE CODE: A971007

Effective date: 20220323

TRDD Decision of grant or rejection written
A01 Written decision to grant a patent or to grant a registration (utility model)

Free format text: JAPANESE INTERMEDIATE CODE: A01

Effective date: 20220422

A61 First payment of annual fees (during grant procedure)

Free format text: JAPANESE INTERMEDIATE CODE: A61

Effective date: 20220520

R151 Written notification of patent or utility model registration

Ref document number: 7079160

Country of ref document: JP

Free format text: JAPANESE INTERMEDIATE CODE: R151