CN101405717B - 使用频道间振幅谱的音频频道提取的方法和设备 - Google Patents
使用频道间振幅谱的音频频道提取的方法和设备 Download PDFInfo
- Publication number
- CN101405717B CN101405717B CN2006800459938A CN200680045993A CN101405717B CN 101405717 B CN101405717 B CN 101405717B CN 2006800459938 A CN2006800459938 A CN 2006800459938A CN 200680045993 A CN200680045993 A CN 200680045993A CN 101405717 B CN101405717 B CN 101405717B
- Authority
- CN
- China
- Prior art keywords
- channel
- audio frequency
- output
- inputting
- inter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/005—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo five- or more-channel type, e.g. virtual surround
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Quality & Reliability (AREA)
- Mathematical Physics (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/296,730 US20070135952A1 (en) | 2005-12-06 | 2005-12-06 | Audio channel extraction using inter-channel amplitude spectra |
| US11/296,730 | 2005-12-06 | ||
| PCT/US2006/046017 WO2007067429A2 (en) | 2005-12-06 | 2006-12-01 | Audio channel extraction using inter-channel amplitude spectra |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN101405717A CN101405717A (zh) | 2009-04-08 |
| CN101405717B true CN101405717B (zh) | 2010-12-15 |
Family
ID=38123391
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2006800459938A Expired - Fee Related CN101405717B (zh) | 2005-12-06 | 2006-12-01 | 使用频道间振幅谱的音频频道提取的方法和设备 |
Country Status (14)
| Country | Link |
|---|---|
| US (1) | US20070135952A1 (https=) |
| EP (1) | EP1958086A4 (https=) |
| JP (1) | JP2009518684A (https=) |
| KR (1) | KR20080091099A (https=) |
| CN (1) | CN101405717B (https=) |
| AU (1) | AU2006322079A1 (https=) |
| BR (1) | BRPI0619468A2 (https=) |
| CA (1) | CA2632496A1 (https=) |
| IL (1) | IL191701A0 (https=) |
| MX (1) | MX2008007226A (https=) |
| NZ (1) | NZ568402A (https=) |
| RU (1) | RU2432607C2 (https=) |
| TW (1) | TW200739366A (https=) |
| WO (1) | WO2007067429A2 (https=) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP5082327B2 (ja) * | 2006-08-09 | 2012-11-28 | ソニー株式会社 | 音声信号処理装置、音声信号処理方法および音声信号処理プログラム |
| CN102138176B (zh) * | 2008-07-11 | 2013-11-06 | 日本电气株式会社 | 信号分析装置、信号控制装置及其方法 |
| US9064499B2 (en) * | 2009-02-13 | 2015-06-23 | Nec Corporation | Method for processing multichannel acoustic signal, system therefor, and program |
| US8954323B2 (en) * | 2009-02-13 | 2015-02-10 | Nec Corporation | Method for processing multichannel acoustic signal, system thereof, and program |
| KR20120132342A (ko) * | 2011-05-25 | 2012-12-05 | 삼성전자주식회사 | 보컬 신호 제거 장치 및 방법 |
| US20150036827A1 (en) * | 2012-02-13 | 2015-02-05 | Franck Rosset | Transaural Synthesis Method for Sound Spatialization |
| US10321252B2 (en) | 2012-02-13 | 2019-06-11 | Axd Technologies, Llc | Transaural synthesis method for sound spatialization |
| FR2996043B1 (fr) * | 2012-09-27 | 2014-10-24 | Univ Bordeaux 1 | Procede et dispositif pour separer des signaux par filtrage spatial a variance minimum sous contrainte lineaire |
| KR101620173B1 (ko) | 2013-07-10 | 2016-05-13 | 주식회사 엘지화학 | 적층 형태 안정성이 우수한 단차를 갖는 전극 조립체 및 그 제조방법 |
| US10037750B2 (en) * | 2016-02-17 | 2018-07-31 | RMXHTZ, Inc. | Systems and methods for analyzing components of audio tracks |
| EP3246923A1 (en) | 2016-05-20 | 2017-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a multichannel audio signal |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN113611323B (zh) * | 2021-05-07 | 2024-02-20 | 北京至芯开源科技有限责任公司 | 一种基于双通道卷积注意力网络的语音增强方法及系统 |
| CN117198313B (zh) * | 2023-08-17 | 2024-07-02 | 珠海全视通信息技术有限公司 | 侧音消除方法、装置、电子设备、存储介质 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6526148B1 (en) * | 1999-05-18 | 2003-02-25 | Siemens Corporate Research, Inc. | Device and method for demixing signal mixtures using fast blind source separation technique based on delay and attenuation compensation, and for selecting channels for the demixed signals |
| US20040062401A1 (en) * | 2002-02-07 | 2004-04-01 | Davis Mark Franklin | Audio channel translation |
| US20050180579A1 (en) * | 2004-02-12 | 2005-08-18 | Frank Baumgarte | Late reverberation-based synthesis of auditory scenes |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE4217276C1 (https=) * | 1992-05-25 | 1993-04-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung Ev, 8000 Muenchen, De | |
| US6321200B1 (en) * | 1999-07-02 | 2001-11-20 | Mitsubish Electric Research Laboratories, Inc | Method for extracting features from a mixture of signals |
| US6430528B1 (en) * | 1999-08-20 | 2002-08-06 | Siemens Corporate Research, Inc. | Method and apparatus for demixing of degenerate mixtures |
| US7660424B2 (en) * | 2001-02-07 | 2010-02-09 | Dolby Laboratories Licensing Corporation | Audio channel spatial translation |
| US7292901B2 (en) * | 2002-06-24 | 2007-11-06 | Agere Systems Inc. | Hybrid multi-channel/cue coding/decoding of audio signals |
| JP3950930B2 (ja) * | 2002-05-10 | 2007-08-01 | 財団法人北九州産業学術推進機構 | 音源の位置情報を利用した分割スペクトルに基づく目的音声の復元方法 |
| US7039204B2 (en) * | 2002-06-24 | 2006-05-02 | Agere Systems Inc. | Equalization for audio mixing |
| JP2006163178A (ja) * | 2004-12-09 | 2006-06-22 | Mitsubishi Electric Corp | 符号化装置及び復号装置 |
-
2005
- 2005-12-06 US US11/296,730 patent/US20070135952A1/en not_active Abandoned
-
2006
- 2006-10-05 TW TW095137143A patent/TW200739366A/zh unknown
- 2006-12-01 KR KR1020087014637A patent/KR20080091099A/ko not_active Withdrawn
- 2006-12-01 WO PCT/US2006/046017 patent/WO2007067429A2/en not_active Ceased
- 2006-12-01 RU RU2008127329/09A patent/RU2432607C2/ru not_active IP Right Cessation
- 2006-12-01 CN CN2006800459938A patent/CN101405717B/zh not_active Expired - Fee Related
- 2006-12-01 CA CA002632496A patent/CA2632496A1/en not_active Abandoned
- 2006-12-01 AU AU2006322079A patent/AU2006322079A1/en not_active Abandoned
- 2006-12-01 EP EP06838794.3A patent/EP1958086A4/en not_active Withdrawn
- 2006-12-01 JP JP2008544391A patent/JP2009518684A/ja active Pending
- 2006-12-01 MX MX2008007226A patent/MX2008007226A/es not_active Application Discontinuation
- 2006-12-01 BR BRPI0619468-0A patent/BRPI0619468A2/pt not_active Application Discontinuation
- 2006-12-01 NZ NZ568402A patent/NZ568402A/en not_active IP Right Cessation
-
2008
- 2008-05-26 IL IL191701A patent/IL191701A0/en unknown
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6526148B1 (en) * | 1999-05-18 | 2003-02-25 | Siemens Corporate Research, Inc. | Device and method for demixing signal mixtures using fast blind source separation technique based on delay and attenuation compensation, and for selecting channels for the demixed signals |
| US20040062401A1 (en) * | 2002-02-07 | 2004-04-01 | Davis Mark Franklin | Audio channel translation |
| US20050180579A1 (en) * | 2004-02-12 | 2005-08-18 | Frank Baumgarte | Late reverberation-based synthesis of auditory scenes |
Also Published As
| Publication number | Publication date |
|---|---|
| US20070135952A1 (en) | 2007-06-14 |
| WO2007067429B1 (en) | 2008-10-30 |
| JP2009518684A (ja) | 2009-05-07 |
| CN101405717A (zh) | 2009-04-08 |
| IL191701A0 (en) | 2008-12-29 |
| RU2008127329A (ru) | 2010-01-20 |
| EP1958086A4 (en) | 2013-07-17 |
| AU2006322079A1 (en) | 2007-06-14 |
| MX2008007226A (es) | 2008-11-19 |
| RU2432607C2 (ru) | 2011-10-27 |
| WO2007067429A3 (en) | 2008-09-12 |
| NZ568402A (en) | 2011-05-27 |
| HK1128786A1 (en) | 2009-11-06 |
| EP1958086A2 (en) | 2008-08-20 |
| CA2632496A1 (en) | 2007-06-14 |
| BRPI0619468A2 (pt) | 2011-10-04 |
| WO2007067429A2 (en) | 2007-06-14 |
| KR20080091099A (ko) | 2008-10-09 |
| TW200739366A (en) | 2007-10-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101405717B (zh) | 使用频道间振幅谱的音频频道提取的方法和设备 | |
| CN101960516B (zh) | 语音增强 | |
| Grais et al. | Raw multi-channel audio source separation using multi-resolution convolutional auto-encoders | |
| Smaragdis | Non-negative matrix factor deconvolution; extraction of multiple sound sources from monophonic inputs | |
| Liutkus et al. | Informed source separation through spectrogram coding and data embedding | |
| Necciari et al. | The ERBlet transform: An auditory-based time-frequency representation with perfect reconstruction | |
| US20070076902A1 (en) | Method and Apparatus for Removing or Isolating Voice or Instruments on Stereo Recordings | |
| KR101280253B1 (ko) | 음원 분리 방법 및 그 장치 | |
| CN113921022B (zh) | 音频信号分离方法、装置、存储介质和电子设备 | |
| CN115699171B (zh) | 使用最少的训练分离一般化立体声背景与平移源 | |
| DE102007048973A1 (de) | Vorrichtung und Verfahren zum Erzeugen eines Multikanalsignals mit einer Sprachsignalverarbeitung | |
| Venkataramani et al. | Adaptive front-ends for end-to-end source separation | |
| WO2009046225A2 (en) | Correlation-based method for ambience extraction from two-channel audio signals | |
| CN102222508A (zh) | 一种基于矩阵变换的欠定盲分离方法 | |
| KR20140074918A (ko) | 직접-산란 분해 | |
| WO2023172852A1 (en) | Target mid-side signals for audio applications | |
| Oh et al. | Spectrogram-channels u-net: a source separation model viewing each channel as the spectrogram of each source | |
| Perez‐Gonzalez et al. | Automatic mixing | |
| Feng et al. | Hybrid model and structured sparsity for under-determined convolutive audio source separation | |
| HK1128786B (en) | Method and equipment for audio channel extraction using inter-channel amplitude spectra | |
| CN116631428A (zh) | 一种语音增强方法、装置、设备及介质 | |
| RU2805124C1 (ru) | Отделение панорамированных источников от обобщенных стереофонов с использованием минимального обучения | |
| Nesbit et al. | Audio source separation with a signal-adaptive local cosine transform | |
| Jiang et al. | A Complex Neural Network Adaptive Beamforming for Multi-channel Speech Enhancement in Time Domain | |
| Xie et al. | Equalizer Network-based Adaptive Time-Frequency Source Separation in Highly Reverberant Environments |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1128786 Country of ref document: HK |
|
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1128786 Country of ref document: HK |
|
| CF01 | Termination of patent right due to non-payment of annual fee | ||
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20101215 Termination date: 20181201 |