CN101405717B - 使用频道间振幅谱的音频频道提取的方法和设备 - Google Patents
使用频道间振幅谱的音频频道提取的方法和设备 Download PDFInfo
- Publication number
- CN101405717B CN101405717B CN2006800459938A CN200680045993A CN101405717B CN 101405717 B CN101405717 B CN 101405717B CN 2006800459938 A CN2006800459938 A CN 2006800459938A CN 200680045993 A CN200680045993 A CN 200680045993A CN 101405717 B CN101405717 B CN 101405717B
- Authority
- CN
- China
- Prior art keywords
- channel
- audio frequency
- output
- inputting
- inter
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 238000001228 spectrum Methods 0.000 title claims abstract description 78
- 238000000605 extraction Methods 0.000 title description 13
- 238000000926 separation method Methods 0.000 claims abstract description 7
- 238000000034 method Methods 0.000 claims description 29
- 230000003595 spectral effect Effects 0.000 claims description 29
- 238000013507 mapping Methods 0.000 claims description 27
- 230000009466 transformation Effects 0.000 claims description 8
- 230000015572 biosynthetic process Effects 0.000 claims description 3
- 238000006243 chemical reaction Methods 0.000 claims description 2
- 238000012880 independent component analysis Methods 0.000 description 8
- 239000000203 mixture Substances 0.000 description 7
- 238000010586 diagram Methods 0.000 description 4
- 239000000284 extract Substances 0.000 description 3
- 238000013528 artificial neural network Methods 0.000 description 1
- 230000001419 dependent effect Effects 0.000 description 1
- 238000001914 filtration Methods 0.000 description 1
- 238000009432 framing Methods 0.000 description 1
- 238000009499 grossing Methods 0.000 description 1
- 230000000750 progressive effect Effects 0.000 description 1
- 230000005236 sound signal Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding using interchannel correlation to reduce redundancy, e.g. joint-stereo, intensity-coding or matrixing
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S5/00—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation
- H04S5/005—Pseudo-stereo systems, e.g. in which additional channel signals are derived from monophonic signals by means of phase shifting, time delay or reverberation of the pseudo five- or more-channel type, e.g. virtual surround
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S2420/00—Techniques used stereophonic systems covered by H04S but not provided for in its groups
- H04S2420/03—Application of parametric coding in stereophonic audio systems
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Multimedia (AREA)
- Mathematical Physics (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Stereophonic System (AREA)
Applications Claiming Priority (3)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US11/296,730 US20070135952A1 (en) | 2005-12-06 | 2005-12-06 | Audio channel extraction using inter-channel amplitude spectra |
| US11/296,730 | 2005-12-06 | ||
| PCT/US2006/046017 WO2007067429A2 (en) | 2005-12-06 | 2006-12-01 | Audio channel extraction using inter-channel amplitude spectra |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| CN101405717A CN101405717A (zh) | 2009-04-08 |
| CN101405717B true CN101405717B (zh) | 2010-12-15 |
Family
ID=38123391
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2006800459938A Expired - Fee Related CN101405717B (zh) | 2005-12-06 | 2006-12-01 | 使用频道间振幅谱的音频频道提取的方法和设备 |
Country Status (14)
| Country | Link |
|---|---|
| US (1) | US20070135952A1 (enExample) |
| EP (1) | EP1958086A4 (enExample) |
| JP (1) | JP2009518684A (enExample) |
| KR (1) | KR20080091099A (enExample) |
| CN (1) | CN101405717B (enExample) |
| AU (1) | AU2006322079A1 (enExample) |
| BR (1) | BRPI0619468A2 (enExample) |
| CA (1) | CA2632496A1 (enExample) |
| IL (1) | IL191701A0 (enExample) |
| MX (1) | MX2008007226A (enExample) |
| NZ (1) | NZ568402A (enExample) |
| RU (1) | RU2432607C2 (enExample) |
| TW (1) | TW200739366A (enExample) |
| WO (1) | WO2007067429A2 (enExample) |
Families Citing this family (14)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP5082327B2 (ja) * | 2006-08-09 | 2012-11-28 | ソニー株式会社 | 音声信号処理装置、音声信号処理方法および音声信号処理プログラム |
| JPWO2010005050A1 (ja) * | 2008-07-11 | 2012-01-05 | 日本電気株式会社 | 信号分析装置、信号制御装置及びその方法と、プログラム |
| US8954323B2 (en) * | 2009-02-13 | 2015-02-10 | Nec Corporation | Method for processing multichannel acoustic signal, system thereof, and program |
| JP5605575B2 (ja) * | 2009-02-13 | 2014-10-15 | 日本電気株式会社 | 多チャンネル音響信号処理方法、そのシステム及びプログラム |
| KR20120132342A (ko) * | 2011-05-25 | 2012-12-05 | 삼성전자주식회사 | 보컬 신호 제거 장치 및 방법 |
| US20150036827A1 (en) * | 2012-02-13 | 2015-02-05 | Franck Rosset | Transaural Synthesis Method for Sound Spatialization |
| US10321252B2 (en) | 2012-02-13 | 2019-06-11 | Axd Technologies, Llc | Transaural synthesis method for sound spatialization |
| FR2996043B1 (fr) * | 2012-09-27 | 2014-10-24 | Univ Bordeaux 1 | Procede et dispositif pour separer des signaux par filtrage spatial a variance minimum sous contrainte lineaire |
| KR101620173B1 (ko) | 2013-07-10 | 2016-05-13 | 주식회사 엘지화학 | 적층 형태 안정성이 우수한 단차를 갖는 전극 조립체 및 그 제조방법 |
| US10037750B2 (en) * | 2016-02-17 | 2018-07-31 | RMXHTZ, Inc. | Systems and methods for analyzing components of audio tracks |
| EP3246923A1 (en) | 2016-05-20 | 2017-11-22 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Apparatus and method for processing a multichannel audio signal |
| US9820073B1 (en) | 2017-05-10 | 2017-11-14 | Tls Corp. | Extracting a common signal from multiple audio signals |
| CN113611323B (zh) * | 2021-05-07 | 2024-02-20 | 北京至芯开源科技有限责任公司 | 一种基于双通道卷积注意力网络的语音增强方法及系统 |
| CN117198313B (zh) * | 2023-08-17 | 2024-07-02 | 珠海全视通信息技术有限公司 | 侧音消除方法、装置、电子设备、存储介质 |
Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6526148B1 (en) * | 1999-05-18 | 2003-02-25 | Siemens Corporate Research, Inc. | Device and method for demixing signal mixtures using fast blind source separation technique based on delay and attenuation compensation, and for selecting channels for the demixed signals |
| US20040062401A1 (en) * | 2002-02-07 | 2004-04-01 | Davis Mark Franklin | Audio channel translation |
| US20050180579A1 (en) * | 2004-02-12 | 2005-08-18 | Frank Baumgarte | Late reverberation-based synthesis of auditory scenes |
Family Cites Families (8)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE4217276C1 (enExample) * | 1992-05-25 | 1993-04-08 | Fraunhofer-Gesellschaft Zur Foerderung Der Angewandten Forschung Ev, 8000 Muenchen, De | |
| US6321200B1 (en) * | 1999-07-02 | 2001-11-20 | Mitsubish Electric Research Laboratories, Inc | Method for extracting features from a mixture of signals |
| US6430528B1 (en) * | 1999-08-20 | 2002-08-06 | Siemens Corporate Research, Inc. | Method and apparatus for demixing of degenerate mixtures |
| US7660424B2 (en) * | 2001-02-07 | 2010-02-09 | Dolby Laboratories Licensing Corporation | Audio channel spatial translation |
| US7292901B2 (en) * | 2002-06-24 | 2007-11-06 | Agere Systems Inc. | Hybrid multi-channel/cue coding/decoding of audio signals |
| JP3950930B2 (ja) * | 2002-05-10 | 2007-08-01 | 財団法人北九州産業学術推進機構 | 音源の位置情報を利用した分割スペクトルに基づく目的音声の復元方法 |
| US7039204B2 (en) * | 2002-06-24 | 2006-05-02 | Agere Systems Inc. | Equalization for audio mixing |
| JP2006163178A (ja) * | 2004-12-09 | 2006-06-22 | Mitsubishi Electric Corp | 符号化装置及び復号装置 |
-
2005
- 2005-12-06 US US11/296,730 patent/US20070135952A1/en not_active Abandoned
-
2006
- 2006-10-05 TW TW095137143A patent/TW200739366A/zh unknown
- 2006-12-01 NZ NZ568402A patent/NZ568402A/en not_active IP Right Cessation
- 2006-12-01 BR BRPI0619468-0A patent/BRPI0619468A2/pt not_active Application Discontinuation
- 2006-12-01 CA CA002632496A patent/CA2632496A1/en not_active Abandoned
- 2006-12-01 CN CN2006800459938A patent/CN101405717B/zh not_active Expired - Fee Related
- 2006-12-01 AU AU2006322079A patent/AU2006322079A1/en not_active Abandoned
- 2006-12-01 KR KR1020087014637A patent/KR20080091099A/ko not_active Withdrawn
- 2006-12-01 JP JP2008544391A patent/JP2009518684A/ja active Pending
- 2006-12-01 EP EP06838794.3A patent/EP1958086A4/en not_active Withdrawn
- 2006-12-01 RU RU2008127329/09A patent/RU2432607C2/ru not_active IP Right Cessation
- 2006-12-01 MX MX2008007226A patent/MX2008007226A/es not_active Application Discontinuation
- 2006-12-01 WO PCT/US2006/046017 patent/WO2007067429A2/en not_active Ceased
-
2008
- 2008-05-26 IL IL191701A patent/IL191701A0/en unknown
Patent Citations (3)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6526148B1 (en) * | 1999-05-18 | 2003-02-25 | Siemens Corporate Research, Inc. | Device and method for demixing signal mixtures using fast blind source separation technique based on delay and attenuation compensation, and for selecting channels for the demixed signals |
| US20040062401A1 (en) * | 2002-02-07 | 2004-04-01 | Davis Mark Franklin | Audio channel translation |
| US20050180579A1 (en) * | 2004-02-12 | 2005-08-18 | Frank Baumgarte | Late reverberation-based synthesis of auditory scenes |
Also Published As
| Publication number | Publication date |
|---|---|
| WO2007067429A3 (en) | 2008-09-12 |
| EP1958086A4 (en) | 2013-07-17 |
| KR20080091099A (ko) | 2008-10-09 |
| CA2632496A1 (en) | 2007-06-14 |
| IL191701A0 (en) | 2008-12-29 |
| US20070135952A1 (en) | 2007-06-14 |
| WO2007067429A2 (en) | 2007-06-14 |
| MX2008007226A (es) | 2008-11-19 |
| RU2008127329A (ru) | 2010-01-20 |
| RU2432607C2 (ru) | 2011-10-27 |
| AU2006322079A1 (en) | 2007-06-14 |
| EP1958086A2 (en) | 2008-08-20 |
| CN101405717A (zh) | 2009-04-08 |
| HK1128786A1 (en) | 2009-11-06 |
| NZ568402A (en) | 2011-05-27 |
| BRPI0619468A2 (pt) | 2011-10-04 |
| JP2009518684A (ja) | 2009-05-07 |
| WO2007067429B1 (en) | 2008-10-30 |
| TW200739366A (en) | 2007-10-16 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| CN101405717B (zh) | 使用频道间振幅谱的音频频道提取的方法和设备 | |
| Makino | Audio source separation | |
| KR101280253B1 (ko) | 음원 분리 방법 및 그 장치 | |
| US11610593B2 (en) | Methods and systems for processing and mixing signals using signal decomposition | |
| EP2731359B1 (en) | Audio processing device, method and program | |
| JP2002510930A (ja) | 多重非相関化法を用いた未知の混在ソースの分離 | |
| CN101253809B (zh) | 用于编码和解码音频信号的装置及其方法 | |
| KR20230008815A (ko) | 최소한의 트레이닝을 사용하여 일반화된 스테레오 배경들로부터 패닝된 소스들의 분리 | |
| US20120300941A1 (en) | Apparatus and method for removing vocal signal | |
| Oh et al. | Spectrogram-channels u-net: a source separation model viewing each channel as the spectrogram of each source | |
| Tachibana et al. | Comparative evaluations of various harmonic/percussive sound separation algorithms based on anisotropic continuity of spectrogram | |
| Grais et al. | Single channel speech music separation using nonnegative matrix factorization with sliding windows and spectral masks | |
| Yang et al. | Stereophonic music source separation with spatially-informed bridging band-split network | |
| HK1128786B (en) | Method and equipment for audio channel extraction using inter-channel amplitude spectra | |
| US11087733B1 (en) | Method and system for designing a modal filter for a desired reverberation | |
| Cappell et al. | A simple construction of Atiyah-Singer classes and piecewise linear transformation groups | |
| Leveau et al. | Convolutive common audio signal extraction | |
| GB2560391A (en) | Extracting audio characteristics from audio signals | |
| Härmä | Estimation of the energy ratio between primary and ambience components in stereo audio data | |
| Jiang et al. | A Complex Neural Network Adaptive Beamforming for Multi-channel Speech Enhancement in Time Domain | |
| Soliman | A New Non-Maximally Decimated UEPS for Blind Source Separation T | |
| Grigis et al. | Improved recognition performance for orthogonal sources | |
| Zantalis et al. | Semi-Blind Audio Source Separation of Linearly Mixed Two-Channel Recordings via Guided Matching Pursuit. | |
| CN117409771A (zh) | 基于语义分析的审讯音频检测方法、装置、设备及介质 | |
| Takada et al. | A WAVELET APPROACH TO CONVOLUTIVE BLIND SEPARATION OF NON-STATIONARY SOUND SOURCES |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: DE Ref document number: 1128786 Country of ref document: HK |
|
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| REG | Reference to a national code |
Ref country code: HK Ref legal event code: GR Ref document number: 1128786 Country of ref document: HK |
|
| CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20101215 Termination date: 20181201 |
|
| CF01 | Termination of patent right due to non-payment of annual fee |