CN103680513A - Voice signal processing method, device and server
- Publication number: CN103680513A (application CN201310681217.9A)
- Authority: CN (China)
- Prior art keywords: signal, voice signal, weight, cross, talk
- Legal status: Granted (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/20—Speech recognition techniques specially adapted for robustness in adverse environments, e.g. in noise, of stress induced speech
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/03—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
- G10L25/21—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being power information
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04M—TELEPHONIC COMMUNICATION
- H04M3/00—Automatic or semi-automatic exchanges
- H04M3/42—Systems providing special services or facilities to subscribers
- H04M3/56—Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
Landscapes
- Engineering & Computer Science (AREA)
- Computational Linguistics (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Signal Processing (AREA)
- Telephone Function (AREA)
- Circuit For Audible Band Transducer (AREA)
- Telephonic Communication Services (AREA)
Claims (11)
Priority Applications (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310681217.9A CN103680513B (zh) | 2013-12-13 | 2013-12-13 | Voice signal processing method, device and server |
PCT/CN2014/093656 WO2015085946A1 (zh) | 2014-12-12 | 2014-12-12 | Voice signal processing method, device and server |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201310681217.9A CN103680513B (zh) | 2013-12-13 | 2013-12-13 | Voice signal processing method, device and server |
Publications (2)
Publication Number | Publication Date |
---|---|
CN103680513A (zh) | 2014-03-26 |
CN103680513B (zh) | 2016-11-02 |
Family
ID=50317866
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201310681217.9A Active CN103680513B (zh) | Voice signal processing method, device and server | 2013-12-13 | 2013-12-13 |
Country Status (2)
Country | Link |
---|---|
CN (1) | CN103680513B (zh) |
WO (1) | WO2015085946A1 (zh) |
Cited By (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
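CN104409079A (zh) * | 2014-11-03 | 2015-03-11 | Beijing Youheng Sikang Communication Technology Co., Ltd. | Audio superposition method and device |
WO2015085946A1 (zh) * | 2013-12-13 | 2015-06-18 | Guangzhou Huaduo Network Technology Co., Ltd. | Voice signal processing method, device and server |
CN105469806A (zh) * | 2014-09-12 | 2016-04-06 | Lenovo (Beijing) Co., Ltd. | Sound processing method, device and system |
CN108417208A (zh) * | 2018-03-26 | 2018-08-17 | Yulong Computer Telecommunication Scientific (Shenzhen) Co., Ltd. | Voice input method and device |
WO2020073564A1 (zh) * | 2018-10-12 | 2020-04-16 | Beijing ByteDance Network Technology Co., Ltd. | Method and device for detecting the loudness of an audio signal |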
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN113596771B (zh) * | 2021-08-23 | 2023-11-17 | Guoneng Baoshen Railway Group Co., Ltd. | Locomotive wireless communication equipment and control method and device thereof |
Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7039203B2 (en) * | 1995-09-06 | 2006-05-02 | Apple Computer, Inc. | Reduced complexity audio mixing apparatus |
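CN1946029A (zh) * | 2006-10-30 | 2007-04-11 | Beijing Vimicro Electronics Co., Ltd. | Method and system for processing audio signals |
CN1953488A (zh) * | 2006-11-01 | 2007-04-25 | Huawei Technologies Co., Ltd. | Method and device for mixing multi-channel voice signals |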
US7379961B2 (en) * | 1997-04-30 | 2008-05-27 | Computer Associates Think, Inc. | Spatialized audio in a three-dimensional computer-based scene |
US20080304673A1 (en) * | 2007-06-11 | 2008-12-11 | Fujitsu Limited | Multipoint communication apparatus |
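CN101356571A (zh) * | 2005-10-12 | 2009-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal and spatial shaping of multi-channel audio signals |
CN101674450A (zh) * | 2008-09-10 | 2010-03-17 | Shenzhen Bangyan Information Technology Co., Ltd. | Audio mixing method in a video command and dispatch system |
CN103188595A (zh) * | 2011-12-31 | 2013-07-03 | Spreadtrum Communications (Shanghai) Co., Ltd. | Method and system for processing multi-channel audio signals |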
Family Cites Families (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
GB2284968A (en) * | 1993-12-18 | 1995-06-21 | Ibm | Audio conferencing system |
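JPH1013556A (ja) * | 1996-06-21 | 1998-01-16 | Oki Electric Ind Co Ltd | Video conference system |
CN1322488C (zh) * | 2004-04-14 | 2007-06-20 | Huawei Technologies Co., Ltd. | Speech enhancement method |
CN103680513B (zh) * | 2013-12-13 | 2016-11-02 | Guangzhou Huaduo Network Technology Co., Ltd. | Voice signal processing method, device and server |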
- 2013-12-13: CN application CN201310681217.9A (patent CN103680513B, zh), status: Active
- 2014-12-12: WO application PCT/CN2014/093656 (WO2015085946A1, zh), status: Application Filing
Patent Citations (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7039203B2 (en) * | 1995-09-06 | 2006-05-02 | Apple Computer, Inc. | Reduced complexity audio mixing apparatus |
US7379961B2 (en) * | 1997-04-30 | 2008-05-27 | Computer Associates Think, Inc. | Spatialized audio in a three-dimensional computer-based scene |
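CN101356571A (zh) * | 2005-10-12 | 2009-01-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Temporal and spatial shaping of multi-channel audio signals |
CN1946029A (zh) * | 2006-10-30 | 2007-04-11 | Beijing Vimicro Electronics Co., Ltd. | Method and system for processing audio signals |
CN1953488A (zh) * | 2006-11-01 | 2007-04-25 | Huawei Technologies Co., Ltd. | Method and device for mixing multi-channel voice signals |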
US20080304673A1 (en) * | 2007-06-11 | 2008-12-11 | Fujitsu Limited | Multipoint communication apparatus |
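CN101674450A (zh) * | 2008-09-10 | 2010-03-17 | Shenzhen Bangyan Information Technology Co., Ltd. | Audio mixing method in a video command and dispatch system |
CN103188595A (zh) * | 2011-12-31 | 2013-07-03 | Spreadtrum Communications (Shanghai) Co., Ltd. | Method and system for processing multi-channel audio signals |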
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
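WO2015085946A1 (zh) * | 2013-12-13 | 2015-06-18 | Guangzhou Huaduo Network Technology Co., Ltd. | Voice signal processing method, device and server |
CN105469806A (zh) * | 2014-09-12 | 2016-04-06 | Lenovo (Beijing) Co., Ltd. | Sound processing method, device and system |
CN105469806B (zh) * | 2014-09-12 | 2020-02-21 | Lenovo (Beijing) Co., Ltd. | Sound processing method, device and system |
CN104409079A (zh) * | 2014-11-03 | 2015-03-11 | Beijing Youheng Sikang Communication Technology Co., Ltd. | Audio superposition method and device |
CN108417208A (zh) * | 2018-03-26 | 2018-08-17 | Yulong Computer Telecommunication Scientific (Shenzhen) Co., Ltd. | Voice input method and device |
CN108417208B (zh) * | 2018-03-26 | 2020-09-11 | Yulong Computer Telecommunication Scientific (Shenzhen) Co., Ltd. | Voice input method and device |
WO2020073564A1 (zh) * | 2018-10-12 | 2020-04-16 | Beijing ByteDance Network Technology Co., Ltd. | Method and device for detecting the loudness of an audio signal |
CN111045633A (zh) * | 2018-10-12 | 2020-04-21 | Beijing Microlive Vision Technology Co., Ltd. | Method and device for detecting the loudness of an audio signal |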
Also Published As
Publication number | Publication date |
---|---|
CN103680513B (zh) | 2016-11-02 |
WO2015085946A1 (zh) | 2015-06-18 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
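CN103680513A (zh) | | Voice signal processing method, device and server |
CN103871421B (zh) | | Adaptive noise reduction method and system based on sub-band noise analysis |
EP0919096B1 (fr) | | Multi-channel acoustic echo cancellation method and multi-channel acoustic echo canceller |
CN102800323B (zh) | | Method and device for voice noise reduction in a mobile terminal |
KR101552750B1 (ko) | | Parametric stereo conversion system and method |
CN108028049A (zh) | | Microphone signal fusion |
CN101370322A (zh) | | Microphone gain adjustment method and communication device |
CN104980337A (zh) | | Method and device for improving audio processing performance |
CN102811267B (zh) | | Near-end voice interference cancellation system and mobile communication terminal |
CN105228056B (zh) | | Method and system for eliminating microphone howling |
CN104796836B (zh) | | Binaural sound source enhancement |
CN109817238A (zh) | | Audio signal acquisition device, and audio signal processing method and device |
CN107426651B (zh) | | Multi-channel audio mixing method and device |
US20200365174A1 (en) | | Method and system for generating mixed voice data |
US10602275B2 (en) | | Audio enhancement via beamforming and multichannel filtering of an input audio signal |
CN112309414A (zh) | | Active noise reduction method based on audio codec, earphone and electronic device |
CN108494952A (zh) | | Voice call processing method and related device |
CN103812462A (zh) | | Loudness control method and device |
CN103077725B (zh) | | Speech processing method and device |
EP3414889B1 (en) | | Bi-magnitude processing framework for nonlinear echo cancellation in mobile devices |
CN101867853B (zh) | | Voice signal processing method and device based on a microphone array |
CN106796782A (zh) | | Information processing device, information processing method and computer program |
CN101859567B (zh) | | Method and device for eliminating voice background noise |
CN101699837A (zh) | | Method, device and communication terminal for adjusting telephone voice output gain |
CN113870871A (zh) | | Audio processing method and device, storage medium, and electronic device |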
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
EE01 | Entry into force of recordation of patent licensing contract |
Application publication date: 2014-03-26; Assignee: All kinds of fruits garden, Guangzhou network technology company limited; Assignor: Guangzhou Huaduo Network Technology Co., Ltd.; Contract record no.: 2015990000265; Denomination of invention: Method and device for processing voice signal with noise, and server; License type: Exclusive License; Record date: 2015-05-04 |
LICC | Enforcement, change and cancellation of record of contracts on the licence for exploitation of a patent or utility model | ||
CB02 | Change of applicant information |
Address after: 511446 Guangzhou City, Guangdong Province, Panyu District, South Village, Huambo Business District Wanda Plaza, block B1, floor 28; Applicant after: Guangzhou Huaduo Network Technology Co., Ltd.; Address before: 510655, Guangzhou, Whampoa Avenue, No. 2, creative industrial park, building 3-08; Applicant before: Guangzhou Huaduo Network Technology Co., Ltd. |
COR | Change of bibliographic data | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
EE01 | Entry into force of recordation of patent licensing contract | ||
Application publication date: 2014-03-26; Assignee: GUANGZHOU CUBESILI INFORMATION TECHNOLOGY Co.,Ltd.; Assignor: GUANGZHOU HUADUO NETWORK TECHNOLOGY Co.,Ltd.; Contract record no.: X2021980000101; Denomination of invention: Voice signal processing method, device and server; Granted publication date: 2016-11-02; License type: Common License; Record date: 2021-01-06 |