CN110933254B - 一种基于图像分析的声音过滤系统及其声音过滤方法 - Google Patents
一种基于图像分析的声音过滤系统及其声音过滤方法 Download PDFInfo
- Publication number
- CN110933254B CN110933254B CN201911264104.2A CN201911264104A CN110933254B CN 110933254 B CN110933254 B CN 110933254B CN 201911264104 A CN201911264104 A CN 201911264104A CN 110933254 B CN110933254 B CN 110933254B
- Authority
- CN
- China
- Prior art keywords
- module
- sound
- image
- data
- person
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000001914 filtration Methods 0.000 title claims abstract description 35
- 238000000034 method Methods 0.000 title claims abstract description 23
- 238000010191 image analysis Methods 0.000 title claims abstract description 17
- 238000012544 monitoring process Methods 0.000 claims abstract description 51
- 238000004364 calculation method Methods 0.000 claims abstract description 34
- 238000001514 detection method Methods 0.000 claims abstract description 34
- 238000006243 chemical reaction Methods 0.000 claims description 6
- 230000001360 synchronised effect Effects 0.000 claims description 6
- 230000000694 effects Effects 0.000 claims description 4
- 230000003321 amplification Effects 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 claims description 3
- 239000000284 extract Substances 0.000 claims description 3
- 238000007689 inspection Methods 0.000 claims description 3
- 238000003199 nucleic acid amplification method Methods 0.000 claims description 3
- 238000000926 separation method Methods 0.000 claims description 3
- 230000000007 visual effect Effects 0.000 claims description 3
- 230000002093 peripheral effect Effects 0.000 description 3
- 238000009434 installation Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000006978 adaptation Effects 0.000 description 1
- 230000009286 beneficial effect Effects 0.000 description 1
- 238000005034 decoration Methods 0.000 description 1
- 238000010586 diagram Methods 0.000 description 1
- 230000014509 gene expression Effects 0.000 description 1
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/04—Synchronising
-
- G—PHYSICS
- G01—MEASURING; TESTING
- G01S—RADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
- G01S5/00—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
- G01S5/18—Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using ultrasonic, sonic, or infrasonic waves
- G01S5/20—Position of source determined by a plurality of spaced direction-finders
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V10/00—Arrangements for image or video recognition or understanding
- G06V10/40—Extraction of image or video features
- G06V10/56—Extraction of image or video features relating to colour
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
- G10L25/51—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N23/00—Cameras or camera modules comprising electronic image sensors; Control thereof
- H04N23/60—Control of cameras or camera modules
- H04N23/698—Control of cameras or camera modules for achieving an enlarged field of view, e.g. panoramic image capture
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N5/00—Details of television systems
- H04N5/76—Television signal recording
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/18—Closed-circuit television [CCTV] systems, i.e. systems in which the video signal is not broadcast
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
- G10L21/0216—Noise filtering characterised by the method used for estimating noise
- G10L2021/02161—Number of inputs available containing the signal or the noise to be suppressed
- G10L2021/02166—Microphone arrays; Beamforming
Abstract
Description
Claims (4)
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911264104.2A CN110933254B (zh) | 2019-12-11 | 2019-12-11 | 一种基于图像分析的声音过滤系统及其声音过滤方法 |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201911264104.2A CN110933254B (zh) | 2019-12-11 | 2019-12-11 | 一种基于图像分析的声音过滤系统及其声音过滤方法 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN110933254A CN110933254A (zh) | 2020-03-27 |
CN110933254B true CN110933254B (zh) | 2021-09-07 |
Family
ID=69858877
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201911264104.2A Active CN110933254B (zh) | 2019-12-11 | 2019-12-11 | 一种基于图像分析的声音过滤系统及其声音过滤方法 |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN110933254B (zh) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112423191B (zh) * | 2020-11-18 | 2022-12-27 | 青岛海信商用显示股份有限公司 | 一种视频通话设备和音频增益方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN105049807A (zh) * | 2015-07-31 | 2015-11-11 | 小米科技有限责任公司 | 监控画面声音采集方法及装置 |
CN107534725A (zh) * | 2015-05-19 | 2018-01-02 | 华为技术有限公司 | 一种语音信号处理方法及装置 |
CN109474797A (zh) * | 2019-01-04 | 2019-03-15 | 北京快鱼电子股份公司 | 基于全景摄像头和麦克风阵列的会议转录系统 |
CN109506568A (zh) * | 2018-12-29 | 2019-03-22 | 苏州思必驰信息科技有限公司 | 一种基于图像识别和语音识别的声源定位方法及装置 |
Family Cites Families (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US8073157B2 (en) * | 2003-08-27 | 2011-12-06 | Sony Computer Entertainment Inc. | Methods and apparatus for targeted sound detection and characterization |
-
2019
- 2019-12-11 CN CN201911264104.2A patent/CN110933254B/zh active Active
Patent Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107534725A (zh) * | 2015-05-19 | 2018-01-02 | 华为技术有限公司 | 一种语音信号处理方法及装置 |
CN105049807A (zh) * | 2015-07-31 | 2015-11-11 | 小米科技有限责任公司 | 监控画面声音采集方法及装置 |
CN109506568A (zh) * | 2018-12-29 | 2019-03-22 | 苏州思必驰信息科技有限公司 | 一种基于图像识别和语音识别的声源定位方法及装置 |
CN109474797A (zh) * | 2019-01-04 | 2019-03-15 | 北京快鱼电子股份公司 | 基于全景摄像头和麦克风阵列的会议转录系统 |
Also Published As
Publication number | Publication date |
---|---|
CN110933254A (zh) | 2020-03-27 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US9595259B2 (en) | Sound source-separating device and sound source-separating method | |
US9749738B1 (en) | Synthesizing audio corresponding to a virtual microphone location | |
CN100546367C (zh) | 信号处理装置,信号处理方法 | |
JP5060565B2 (ja) | 信号の信頼できる識別をするためのビデオ及びオーディオ信号内容の特徴の抽出 | |
CN105554443B (zh) | 视频图像中异响来源的定位方法及装置 | |
Donley et al. | Easycom: An augmented reality dataset to support algorithms for easy communication in noisy environments | |
CN107820037B (zh) | 音频信号、图像处理的方法、装置和系统 | |
US20090154896A1 (en) | Video-Audio Recording Apparatus and Video-Audio Reproducing Apparatus | |
JP5618043B2 (ja) | 映像音響処理システム、映像音響処理方法及びプログラム | |
CN107888973B (zh) | 一种脑电控制的视频输入听觉显示导盲装置及方法 | |
US20160360150A1 (en) | Method an apparatus for isolating an active participant in a group of participants | |
CN104378635B (zh) | 基于麦克风阵列辅助的视频感兴趣区域的编码方法 | |
US20140086551A1 (en) | Information processing apparatus and information processing method | |
JPH09275533A (ja) | 信号処理装置 | |
CN110933254B (zh) | 一种基于图像分析的声音过滤系统及其声音过滤方法 | |
CN111551921A (zh) | 一种声像联动的声源定向系统及方法 | |
CN112015364A (zh) | 拾音灵敏度的调整方法、装置 | |
JP2004171490A (ja) | 画像検出装置及び画像検出方法 | |
CA2756165A1 (en) | System and method for time series filtering and data reduction | |
JP6818445B2 (ja) | 音データ処理装置および音データ処理方法 | |
CN110708600A (zh) | 识别电视的有效观看者的方法和设备 | |
JP2000152109A (ja) | テレビ受像機 | |
US20230186654A1 (en) | Systems and methods for detection and display of whiteboard text and/or an active speaker | |
EP3101839A1 (en) | Method and apparatus for isolating an active participant in a group of participants using light field information | |
JPH05227531A (ja) | カメラ監視システム |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A sound filtering system based on image analysis and its sound filtering method Effective date of registration: 20211202 Granted publication date: 20210907 Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd. Pledgor: HANGZHOU XUJIAN SCIENCE AND TECHNOLOGY Co.,Ltd. Registration number: Y2021980013922 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20220322 Granted publication date: 20210907 Pledgee: Hangzhou High-tech Financing Guarantee Co.,Ltd. Pledgor: HANGZHOU XUJIAN SCIENCE AND TECHNOLOGY Co.,Ltd. Registration number: Y2021980013922 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right | ||
PE01 | Entry into force of the registration of the contract for pledge of patent right |
Denomination of invention: A sound filtering system based on image analysis and its sound filtering method Effective date of registration: 20220322 Granted publication date: 20210907 Pledgee: Shanghai Guotai Junan Securities Asset Management Co.,Ltd. Pledgor: HANGZHOU XUJIAN SCIENCE AND TECHNOLOGY Co.,Ltd. Registration number: Y2022990000162 |
|
PE01 | Entry into force of the registration of the contract for pledge of patent right | ||
PC01 | Cancellation of the registration of the contract for pledge of patent right |
Date of cancellation: 20230131 Granted publication date: 20210907 Pledgee: Shanghai Guotai Junan Securities Asset Management Co.,Ltd. Pledgor: HANGZHOU XUJIAN SCIENCE AND TECHNOLOGY Co.,Ltd. Registration number: Y2022990000162 |
|
PC01 | Cancellation of the registration of the contract for pledge of patent right |