CN101366078A - Neural network classifier for separating audio sources from a monophonic audio signal - Google Patents
Neural network classifier for separating audio sources from a monophonic audio signal
- Publication number: CN101366078A (application CNA2006800414053A, CN200680041405A)
- Authority: CN (China)
- Prior art keywords: audio, frame, sources, signal, audio sources
- Prior art date
- Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Classifications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0272—Voice signal separating
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
- G10L25/30—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
Landscapes
- Engineering & Computer Science (AREA)
- Acoustics & Sound (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Computational Linguistics (AREA)
- Human Computer Interaction (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Evolutionary Computation (AREA)
- Artificial Intelligence (AREA)
- Auxiliary Devices For Music (AREA)
- Stereophonic System (AREA)
- Cable Transmission Systems, Equalization Of Radio And Reduction Of Echo (AREA)
- Burglar Alarm Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
Abstract
Description
Claims (27)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US11/244,554 (published as US20070083365A1) | 2005-10-06 | 2005-10-06 | Neural network classifier for separating audio sources from a monophonic audio signal |
US11/244,554 | 2005-10-06 |
Publications (1)
Publication Number | Publication Date |
---|---|
CN101366078A (zh) | 2009-02-11 |
Family
ID=37911912
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CNA2006800414053A (Pending), published as CN101366078A (zh) | 2005-10-06 | 2006-10-03 | Neural network classifier for separating audio sources from a monophonic audio signal |
Country Status (13)
Country | Link |
---|---|
US (1) | US20070083365A1 (zh) |
EP (1) | EP1941494A4 (zh) |
JP (1) | JP2009511954A (zh) |
KR (1) | KR101269296B1 (zh) |
CN (1) | CN101366078A (zh) |
AU (1) | AU2006302549A1 (zh) |
BR (1) | BRPI0616903A2 (zh) |
CA (1) | CA2625378A1 (zh) |
IL (1) | IL190445A0 (zh) |
NZ (1) | NZ566782A (zh) |
RU (1) | RU2418321C2 (zh) |
TW (1) | TWI317932B (zh) |
WO (1) | WO2007044377A2 (zh) |
Cited By (21)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
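CN102446504A (zh) * | 2010-10-08 | 2012-05-09 | Huawei Technologies Co., Ltd. | Voice/music identification method and device |
CN103038823A (zh) * | 2010-01-29 | 2013-04-10 | University of Maryland, College Park | Systems and methods for speech extraction |
CN103456311A (zh) * | 2012-05-29 | 2013-12-18 | Samsung Electronics Co., Ltd. | Method and apparatus for processing an audio signal |
CN103839551A (zh) * | 2012-11-22 | 2014-06-04 | Hongfujin Precision Industry (Shenzhen) Co., Ltd. | Audio processing system and audio processing method |
CN103854644A (zh) * | 2012-12-05 | 2014-06-11 | Communication University of China | Automatic transcription method and device for single-channel polyphonic music signals |
CN104318929A (zh) * | 2010-01-19 | 2015-01-28 | Dolby International AB | Subband processing unit and method for generating a synthesized subband signal |
CN104464727A (zh) * | 2014-12-11 | 2015-03-25 | Fuzhou University | Singing voice separation method for single-channel music based on a deep belief network |
CN104575507A (zh) * | 2013-10-23 | 2015-04-29 | China Mobile Communications Corporation | Voice communication method and device |
CN105070301A (zh) * | 2015-07-14 | 2015-11-18 | Fuzhou University | Enhanced separation method for multiple specific instruments in single-channel music vocal separation |
CN106847302A (zh) * | 2017-02-17 | 2017-06-13 | Dalian University of Technology | Time-domain separation method for single-channel mixed speech based on a convolutional neural network |
CN107507621A (zh) * | 2017-07-28 | 2017-12-22 | Vivo Mobile Communication Co., Ltd. | Noise suppression method and mobile terminal |
CN108229659A (zh) * | 2017-12-29 | 2018-06-29 | Shaanxi University of Science and Technology | Piano single-key sound recognition method based on deep learning |
CN108922556A (zh) * | 2018-07-16 | 2018-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Sound processing method, device and equipment |
CN108922517A (zh) * | 2018-07-03 | 2018-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method, device and storage medium for training a blind source separation model |
CN109166593A (zh) * | 2018-08-17 | 2019-01-08 | Tencent Music Entertainment Technology (Shenzhen) Co., Ltd. | Audio data processing method, device and storage medium |
CN111566732A (zh) * | 2018-01-15 | 2020-08-21 | Mitsubishi Electric Corporation | Sound signal separation device and sound signal separation method |
CN112115821A (zh) * | 2020-09-04 | 2020-12-22 | Northwestern Polytechnical University | Multi-signal intelligent modulation mode recognition method based on wavelet approximation coefficient entropy |
CN113366861A (zh) * | 2019-01-25 | 2021-09-07 | Sonova AG | Signal processing device, system and method for processing audio signals |
CN113647119A (zh) * | 2019-01-25 | 2021-11-12 | Sonova AG | Signal processing device, system and method for processing audio signals |
CN113674756A (zh) * | 2021-10-22 | 2021-11-19 | Qingdao University of Science and Technology | Frequency-domain blind source separation method based on short-time Fourier transform and BP neural network |
CN116828385A (zh) * | 2023-08-31 | 2023-09-29 | Shenzhen Fibocom Wireless Communication Software Co., Ltd. | Audio data processing method based on artificial intelligence analysis and related device |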
Families Citing this family (71)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
EP1605439B1 (en) * | 2004-06-04 | 2007-06-27 | Honda Research Institute Europe GmbH | Unified treatment of resolved and unresolved harmonics |
EP1605437B1 (en) * | 2004-06-04 | 2007-08-29 | Honda Research Institute Europe GmbH | Determination of the common origin of two harmonic components |
EP1686561B1 (en) | 2005-01-28 | 2012-01-04 | Honda Research Institute Europe GmbH | Determination of a common fundamental frequency of harmonic signals |
EP1853092B1 (en) * | 2006-05-04 | 2011-10-05 | LG Electronics, Inc. | Enhancing stereo audio with remix capability |
CN101652810B (zh) * | 2006-09-29 | 2012-04-11 | LG Electronics Inc. | Apparatus for processing a mix signal and method thereof |
EP2084901B1 (en) | 2006-10-12 | 2015-12-09 | LG Electronics Inc. | Apparatus for processing a mix signal and method thereof |
KR100891665B1 (ko) | 2006-10-13 | 2009-04-02 | LG Electronics Inc. | Method and apparatus for processing a mix signal |
WO2008060111A1 (en) * | 2006-11-15 | 2008-05-22 | Lg Electronics Inc. | A method and an apparatus for decoding an audio signal |
KR101062353B1 (ko) | 2006-12-07 | 2011-09-05 | LG Electronics Inc. | Method for decoding an audio signal and apparatus therefor |
JP5450085B2 (ja) * | 2006-12-07 | 2014-03-26 | LG Electronics Inc. | Audio processing method and apparatus |
CN101627425A (zh) * | 2007-02-13 | 2010-01-13 | LG Electronics Inc. | Apparatus and method for processing an audio signal |
US20100121470A1 (en) * | 2007-02-13 | 2010-05-13 | Lg Electronics Inc. | Method and an apparatus for processing an audio signal |
TWI356399B (en) * | 2007-12-14 | 2012-01-11 | Ind Tech Res Inst | Speech recognition system and method with cepstral |
JP5277887B2 (ja) * | 2008-11-14 | 2013-08-28 | Yamaha Corporation | Signal processing device and program |
US8200489B1 (en) * | 2009-01-29 | 2012-06-12 | The United States Of America As Represented By The Secretary Of The Navy | Multi-resolution hidden markov model using class specific features |
BRPI1008915A2 (pt) * | 2009-02-27 | 2018-01-16 | Panasonic Corp | Tone determination device and tone determination method |
JP5375400B2 (ja) * | 2009-07-22 | 2013-12-25 | Sony Corporation | Audio processing device, audio processing method and program |
US8682669B2 (en) * | 2009-08-21 | 2014-03-25 | Synchronoss Technologies, Inc. | System and method for building optimal state-dependent statistical utterance classifiers in spoken dialog systems |
US8762154B1 (en) * | 2011-08-15 | 2014-06-24 | West Corporation | Method and apparatus of estimating optimum dialog state timeout settings in a spoken dialog system |
US9210506B1 (en) * | 2011-09-12 | 2015-12-08 | Audyssey Laboratories, Inc. | FFT bin based signal limiting |
US20140046670A1 (en) * | 2012-06-04 | 2014-02-13 | Samsung Electronics Co., Ltd. | Audio encoding method and apparatus, audio decoding method and apparatus, and multimedia device employing the same |
US9147157B2 (en) | 2012-11-06 | 2015-09-29 | Qualcomm Incorporated | Methods and apparatus for identifying spectral peaks in neuronal spiking representation of a signal |
US10203839B2 (en) | 2012-12-27 | 2019-02-12 | Avaya Inc. | Three-dimensional generalized space |
US9892743B2 (en) * | 2012-12-27 | 2018-02-13 | Avaya Inc. | Security surveillance via three-dimensional audio space presentation |
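CN104078050A (zh) * | 2013-03-26 | 2014-10-01 | Dolby Laboratories Licensing Corporation | Device and method for audio classification and audio processing |
CN104347067B (zh) | 2013-08-06 | 2017-04-12 | Huawei Technologies Co., Ltd. | Audio signal classification method and device |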
US10564923B2 (en) | 2014-03-31 | 2020-02-18 | Sony Corporation | Method, system and artificial neural network |
US9620105B2 (en) | 2014-05-15 | 2017-04-11 | Apple Inc. | Analyzing audio input for efficient speech and music recognition |
RU2718999C2 (ru) * | 2014-07-23 | 2020-04-15 | Schlumberger Technology B.V. | Cepstral analysis of the health of oilfield pumping equipment |
RU2666631C2 (ru) * | 2014-09-12 | 2018-09-11 | Microsoft Technology Licensing, LLC | Training a student DNN by means of output distribution |
US20160162473A1 (en) * | 2014-12-08 | 2016-06-09 | Microsoft Technology Licensing, Llc | Localization complexity of arbitrary language assets and resources |
US9407989B1 (en) | 2015-06-30 | 2016-08-02 | Arthur Woodrow | Closed audio circuit |
US11062228B2 (en) | 2015-07-06 | 2021-07-13 | Microsoft Technology Licensing, LLC | Transfer learning techniques for disparate label sets |
US10902043B2 (en) | 2016-01-03 | 2021-01-26 | Gracenote, Inc. | Responding to remote media classification queries using classifier models and context parameters |
KR102151682B1 (ko) | 2016-03-23 | 2020-09-04 | Google LLC | Adaptive audio enhancement for multichannel speech recognition |
US10249305B2 (en) | 2016-05-19 | 2019-04-02 | Microsoft Technology Licensing, Llc | Permutation invariant training for talker-independent multi-talker speech separation |
US11373672B2 (en) | 2016-06-14 | 2022-06-28 | The Trustees Of Columbia University In The City Of New York | Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments |
WO2017218492A1 (en) * | 2016-06-14 | 2017-12-21 | The Trustees Of Columbia University In The City Of New York | Neural decoding of attentional selection in multi-speaker environments |
US10614827B1 (en) * | 2017-02-21 | 2020-04-07 | Oben, Inc. | System and method for speech enhancement using dynamic noise profile estimation |
US10825445B2 (en) | 2017-03-23 | 2020-11-03 | Samsung Electronics Co., Ltd. | Method and apparatus for training acoustic model |
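KR20180111271A (ko) * | 2017-03-31 | 2018-10-11 | Samsung Electronics Co., Ltd. | Method and apparatus for removing noise using a neural network model |
KR102395472B1 (ko) * | 2017-06-08 | 2022-05-10 | Electronics and Telecommunications Research Institute | Method and apparatus for sound source separation based on variable window size |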
US10878144B2 (en) | 2017-08-10 | 2020-12-29 | Allstate Insurance Company | Multi-platform model processing and execution management engine |
US11755949B2 (en) | 2017-08-10 | 2023-09-12 | Allstate Insurance Company | Multi-platform machine learning systems |
US10885900B2 (en) | 2017-08-11 | 2021-01-05 | Microsoft Technology Licensing, Llc | Domain adaptation in speech recognition via teacher-student learning |
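CN107680611B (zh) * | 2017-09-13 | 2020-06-16 | University of Electronic Science and Technology of China | Single-channel sound separation method based on a convolutional neural network |
CN107749299B (zh) * | 2017-09-28 | 2021-07-09 | Rockchip Electronics Co., Ltd. | Multi-audio output method and device |
KR102128153B1 (ko) * | 2017-12-28 | 2020-06-29 | Industry-University Cooperation Foundation Hanyang University | Apparatus and method for music source retrieval using machine learning |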
WO2019133765A1 (en) * | 2017-12-28 | 2019-07-04 | Knowles Electronics, Llc | Direction of arrival estimation for multiple audio content streams |
WO2019133732A1 (en) * | 2017-12-28 | 2019-07-04 | Knowles Electronics, Llc | Content-based audio stream separation |
US10283140B1 (en) | 2018-01-12 | 2019-05-07 | Alibaba Group Holding Limited | Enhancing audio signals using sub-band deep neural networks |
FR3079706B1 (fr) * | 2018-03-29 | 2021-06-04 | Inst Mines Telecom | Method and system for broadcasting a multichannel audio stream to terminals of spectators attending a sporting event |
US10957337B2 (en) | 2018-04-11 | 2021-03-23 | Microsoft Technology Licensing, Llc | Multi-microphone speech separation |
EP3576088A1 (en) | 2018-05-30 | 2019-12-04 | Fraunhofer Gesellschaft zur Förderung der Angewand | Audio similarity evaluator, audio encoder, methods and computer program |
US11756564B2 (en) | 2018-06-14 | 2023-09-12 | Pindrop Security, Inc. | Deep neural network based speech enhancement |
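CN109272987A (zh) * | 2018-09-25 | 2019-01-25 | Henan Polytechnic University | Sound recognition method for sorting coal and gangue |
KR102691543B1 (ko) | 2018-11-16 | 2024-08-02 | Samsung Electronics Co., Ltd. | Electronic device for recognizing an audio scene and method thereof |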
US11017774B2 (en) | 2019-02-04 | 2021-05-25 | International Business Machines Corporation | Cognitive audio classifier |
RU2720359C1 (ru) * | 2019-04-16 | 2020-04-29 | Huawei Technologies Co., Ltd. | Method and apparatus for recognizing emotions in speech |
US11315585B2 (en) | 2019-05-22 | 2022-04-26 | Spotify Ab | Determining musical style using a variational autoencoder |
US11355137B2 (en) | 2019-10-08 | 2022-06-07 | Spotify Ab | Systems and methods for jointly estimating sound sources and frequencies from audio |
CN110782915A (zh) * | 2019-10-31 | 2020-02-11 | Guangzhou Aisong Intelligent Technology Co., Ltd. | Waveform music component separation method based on deep learning |
US11366851B2 (en) | 2019-12-18 | 2022-06-21 | Spotify Ab | Karaoke query processing system |
WO2021148342A1 (en) * | 2020-01-21 | 2021-07-29 | Dolby International Ab | Noise floor estimation and noise reduction |
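CN111370023A (zh) * | 2020-02-17 | 2020-07-03 | Xiamen Kuaishangtong Technology Co., Ltd. | GRU-based musical instrument recognition method and system |
CN111370019B (zh) * | 2020-03-02 | 2023-08-29 | ByteDance Ltd. | Sound source separation method and device, and neural network model training method and device |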
US11558699B2 (en) | 2020-03-11 | 2023-01-17 | Sonova Ag | Hearing device component, hearing device, computer-readable medium and method for processing an audio-signal for a hearing device |
CN111787462B (zh) * | 2020-09-04 | 2021-01-26 | Mogu Chelian Information Technology Co., Ltd. | Audio stream processing method and system, device and medium |
US11839815B2 (en) | 2020-12-23 | 2023-12-12 | Advanced Micro Devices, Inc. | Adaptive audio mixing |
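CN112488092B (zh) * | 2021-02-05 | 2021-08-24 | National University of Defense Technology of the Chinese People's Liberation Army | Method and system for identifying navigation frequency band signal types based on a deep neural network |
CN114792529B (zh) * | 2022-02-24 | 2024-09-27 | The 54th Research Institute of China Electronics Technology Group Corporation | Short-wave communication voice detection method based on HOG+SVM |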
Family Cites Families (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
JP2807457B2 (ja) * | 1987-07-17 | 1998-10-08 | Ricoh Co., Ltd. | Voice section detection method |
JP3521844B2 (ja) | 1992-03-30 | 2004-04-26 | Seiko Epson Corporation | Recognition device using a neural network |
US5960391A (en) * | 1995-12-13 | 1999-09-28 | Denso Corporation | Signal extraction system, system and method for speech restoration, learning method for neural network model, constructing method of neural network model, and signal processing system |
US6542866B1 (en) * | 1999-09-22 | 2003-04-01 | Microsoft Corporation | Speech recognition method and apparatus utilizing multiple feature streams |
US7295977B2 (en) * | 2001-08-27 | 2007-11-13 | Nec Laboratories America, Inc. | Extracting classifying data in music from an audio bitstream |
US7243060B2 (en) * | 2002-04-02 | 2007-07-10 | University Of Washington | Single channel sound separation |
FR2842014B1 (fr) * | 2002-07-08 | 2006-05-05 | Lyon Ecole Centrale | Method and apparatus for assigning a sound class to a sound signal |
JP4104626B2 (ja) * | 2003-02-07 | 2008-06-18 | Nippon Telegraph and Telephone Corporation | Sound pickup method and sound pickup device |
US7091409B2 (en) * | 2003-02-14 | 2006-08-15 | University Of Rochester | Music feature extraction using wavelet coefficient histograms |
DE10313875B3 (de) * | 2003-03-21 | 2004-10-28 | Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. | Device and method for analyzing an information signal |
KR100486736B1 (ko) * | 2003-03-31 | 2005-05-03 | Samsung Electronics Co., Ltd. | Method and apparatus for separating signals by target source using two sensors |
US20040260550A1 (en) * | 2003-06-20 | 2004-12-23 | Burges Chris J.C. | Audio processing system and method for classifying speakers in audio data |
US7232948B2 (en) * | 2003-07-24 | 2007-06-19 | Hewlett-Packard Development Company, L.P. | System and method for automatic classification of music |
US7340398B2 (en) * | 2003-08-21 | 2008-03-04 | Hewlett-Packard Development Company, L.P. | Selective sampling for sound signal classification |
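JP3949150B2 (ja) * | 2003-09-02 | 2007-07-25 | Nippon Telegraph and Telephone Corporation | Signal separation method, signal separation device, signal separation program and recording medium |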
US7295607B2 (en) * | 2004-05-07 | 2007-11-13 | Broadcom Corporation | Method and system for receiving pulse width keyed signals |
2005
- 2005-10-06: US, application US11/244,554, publication US20070083365A1 (en), status: Abandoned (not active)
2006
- 2006-10-03: CN, application CNA2006800414053A, publication CN101366078A (zh), status: Pending (active)
- 2006-10-03: JP, application JP2008534637A, publication JP2009511954A (ja), status: Pending (active)
- 2006-10-03: CA, application CA002625378A, publication CA2625378A1 (en), status: Abandoned (not active)
- 2006-10-03: NZ, application NZ566782A, publication NZ566782A (en), status: IP Right Cessation (not active)
- 2006-10-03: AU, application AU2006302549A, publication AU2006302549A1 (en), status: Abandoned (not active)
- 2006-10-03: RU, application RU2008118004/09A, publication RU2418321C2 (ru), status: IP Right Cessation (not active)
- 2006-10-03: WO, application PCT/US2006/038742, publication WO2007044377A2 (en), status: Search and Examination (active)
- 2006-10-03: EP, application EP06816186A, publication EP1941494A4 (en), status: Withdrawn (not active)
- 2006-10-03: BR, application BRPI0616903-1A, publication BRPI0616903A2 (pt), status: Application Discontinuation (not active)
- 2006-10-05: TW, application TW095137147A, publication TWI317932B (zh), status: IP Right Cessation (not active)
2008
- 2008-03-26: IL, application IL190445A, publication IL190445A0 (en), status: unknown
- 2008-04-23: KR, application KR1020087009683A, publication KR101269296B1 (ko), status: IP Right Cessation (not active)
Cited By (38)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN104318929B (zh) * | 2010-01-19 | 2017-05-31 | Dolby International AB | Subband processing unit and method for generating a synthesized subband signal |
US10109296B2 (en) | 2010-01-19 | 2018-10-23 | Dolby International Ab | Subband block based harmonic transposition |
US11935555B2 (en) | 2010-01-19 | 2024-03-19 | Dolby International Ab | Subband block based harmonic transposition |
US11646047B2 (en) | 2010-01-19 | 2023-05-09 | Dolby International Ab | Subband block based harmonic transposition |
US10699728B2 (en) | 2010-01-19 | 2020-06-30 | Dolby International Ab | Subband block based harmonic transposition |
CN104318929A (zh) * | 2010-01-19 | 2015-01-28 | Dolby International AB | Subband processing unit and method for generating a synthesized subband signal |
US9741362B2 (en) | 2010-01-19 | 2017-08-22 | Dolby International Ab | Subband block based harmonic transposition |
US9858945B2 (en) | 2010-01-19 | 2018-01-02 | Dolby International Ab | Subband block based harmonic transposition |
US11341984B2 (en) | 2010-01-19 | 2022-05-24 | Dolby International Ab | Subband block based harmonic transposition |
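CN103038823B (zh) * | 2010-01-29 | 2017-09-12 | University of Maryland, College Park | Systems and methods for speech extraction |
CN103038823A (zh) * | 2010-01-29 | 2013-04-10 | University of Maryland, College Park | Systems and methods for speech extraction |
CN102446504B (zh) * | 2010-10-08 | 2013-10-09 | Huawei Technologies Co., Ltd. | Voice/music identification method and device |
CN102446504A (zh) * | 2010-10-08 | 2012-05-09 | Huawei Technologies Co., Ltd. | Voice/music identification method and device |
CN103456311A (zh) * | 2012-05-29 | 2013-12-18 | Samsung Electronics Co., Ltd. | Method and apparatus for processing an audio signal |
TWI478151B (zh) * | 2012-11-22 | 2015-03-21 | Hon Hai Prec Ind Co Ltd | Audio processing system and audio processing method |
CN103839551A (zh) * | 2012-11-22 | 2014-06-04 | Hongfujin Precision Industry (Shenzhen) Co., Ltd. | Audio processing system and audio processing method |
CN103854644B (zh) * | 2012-12-05 | 2016-09-28 | Communication University of China | Automatic transcription method and device for single-channel polyphonic music signals |
CN103854644A (zh) * | 2012-12-05 | 2014-06-11 | Communication University of China | Automatic transcription method and device for single-channel polyphonic music signals |
CN104575507A (zh) * | 2013-10-23 | 2015-04-29 | China Mobile Communications Corporation | Voice communication method and device |
CN104575507B (zh) * | 2013-10-23 | 2018-06-01 | China Mobile Communications Corporation | Voice communication method and device |
CN104464727A (zh) * | 2014-12-11 | 2015-03-25 | Fuzhou University | Singing voice separation method for single-channel music based on a deep belief network |
CN105070301B (zh) * | 2015-07-14 | 2018-11-27 | Fuzhou University | Enhanced separation method for multiple specific instruments in single-channel music vocal separation |
CN105070301A (zh) * | 2015-07-14 | 2015-11-18 | Fuzhou University | Enhanced separation method for multiple specific instruments in single-channel music vocal separation |
CN106847302A (zh) * | 2017-02-17 | 2017-06-13 | Dalian University of Technology | Time-domain separation method for single-channel mixed speech based on a convolutional neural network |
CN106847302B (zh) * | 2017-02-17 | 2020-04-14 | Dalian University of Technology | Time-domain separation method for single-channel mixed speech based on a convolutional neural network |
CN107507621A (zh) * | 2017-07-28 | 2017-12-22 | Vivo Mobile Communication Co., Ltd. | Noise suppression method and mobile terminal |
CN108229659A (zh) * | 2017-12-29 | 2018-06-29 | Shaanxi University of Science and Technology | Piano single-key sound recognition method based on deep learning |
CN111566732B (zh) * | 2018-01-15 | 2023-04-04 | Mitsubishi Electric Corporation | Sound signal separation device and sound signal separation method |
CN111566732A (zh) * | 2018-01-15 | 2020-08-21 | Mitsubishi Electric Corporation | Sound signal separation device and sound signal separation method |
CN108922517A (zh) * | 2018-07-03 | 2018-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Method, device and storage medium for training a blind source separation model |
CN108922556B (zh) * | 2018-07-16 | 2019-08-27 | Baidu Online Network Technology (Beijing) Co., Ltd. | Sound processing method, device and equipment |
CN108922556A (zh) * | 2018-07-16 | 2018-11-30 | Baidu Online Network Technology (Beijing) Co., Ltd. | Sound processing method, device and equipment |
CN109166593A (zh) * | 2018-08-17 | 2019-01-08 | Tencent Music Entertainment Technology (Shenzhen) Co., Ltd. | Audio data processing method, device and storage medium |
CN113366861A (zh) * | 2019-01-25 | 2021-09-07 | Sonova AG | Signal processing device, system and method for processing audio signals |
CN113647119A (zh) * | 2019-01-25 | 2021-11-12 | Sonova AG | Signal processing device, system and method for processing audio signals |
CN112115821A (zh) * | 2020-09-04 | 2020-12-22 | Northwestern Polytechnical University | Multi-signal intelligent modulation mode recognition method based on wavelet approximation coefficient entropy |
CN113674756A (zh) * | 2021-10-22 | 2021-11-19 | Qingdao University of Science and Technology | Frequency-domain blind source separation method based on short-time Fourier transform and BP neural network |
CN116828385A (zh) * | 2023-08-31 | 2023-09-29 | Shenzhen Fibocom Wireless Communication Software Co., Ltd. | Audio data processing method based on artificial intelligence analysis and related device |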
Also Published As
Publication number | Publication date |
---|---|
JP2009511954A (ja) | 2009-03-19 |
WO2007044377B1 (en) | 2008-11-27 |
RU2418321C2 (ru) | 2011-05-10 |
KR20080059246A (ko) | 2008-06-26 |
KR101269296B1 (ko) | 2013-05-29 |
EP1941494A2 (en) | 2008-07-09 |
TWI317932B (en) | 2009-12-01 |
WO2007044377A3 (en) | 2008-10-02 |
EP1941494A4 (en) | 2011-08-10 |
TW200739517A (en) | 2007-10-16 |
IL190445A0 (en) | 2008-11-03 |
RU2008118004A (ru) | 2009-11-20 |
US20070083365A1 (en) | 2007-04-12 |
WO2007044377A2 (en) | 2007-04-19 |
BRPI0616903A2 (pt) | 2011-07-05 |
CA2625378A1 (en) | 2007-04-19 |
AU2006302549A1 (en) | 2007-04-19 |
NZ566782A (en) | 2010-07-30 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN101366078A (zh) | | Neural network classifier for separating audio sources from a monophonic audio signal |
Cano et al. | | Musical source separation: An introduction |
Reddy et al. | | A scalable noisy speech dataset and online subjective test framework |
Marchi et al. | | Multi-resolution linear prediction based features for audio onset detection with bidirectional LSTM neural networks |
Harb et al. | | Gender identification using a general audio classifier |
CN108447495B (zh) | | Deep learning speech enhancement method based on a comprehensive feature set |
CN108417228A (zh) | | Method for measuring human voice timbre similarity under musical instrument timbre transfer |
CN1192309A (zh) | | Assessment of signal quality |
CN106997765B (zh) | | Quantitative characterization method for human voice timbre |
Dubey et al. | | Non-intrusive speech quality assessment using several combinations of auditory features |
CN102723079A (zh) | | Automatic music chord recognition method based on sparse representation |
CN103258537A (zh) | | Method and device for recognizing speech emotion using feature combination |
Chu et al. | | A noise-robust FFT-based auditory spectrum with application in audio classification |
Shifas et al. | | A non-causal FFTNet architecture for speech enhancement |
Ravindran et al. | | Improving the noise-robustness of mel-frequency cepstral coefficients for speech processing |
CN115620731A (zh) | | Speech feature extraction and detection method |
Uhle et al. | | Speech enhancement of movie sound |
Barbedo et al. | | A robust and computationally efficient speech/music discriminator |
CN114678039A (zh) | | Singing evaluation method based on deep learning |
Chen et al. | | Impairment Representation Learning for Speech Quality Assessment. |
Gemello et al. | | Multi-source neural networks for speech recognition: a review of recent results |
CN113506583B (zh) | | Disguised speech detection method using a residual network |
MX2008004572A (en) | | Neural network classifier for separating audio sources from a monophonic audio signal |
Bharti et al. | | Speech Enhancement And Noise Reduction In Forensic Applications |
CN116682445A (zh) | | Intelligent speech noise reduction system and method based on feature recognition |
Legal Events
Code | Title | Description |
---|---|---|
C06 | Publication | |
PB01 | Publication | |
C10 | Entry into substantive examination | |
SE01 | Entry into force of request for substantive examination | |
ASS | Succession or assignment of patent right | Owner name: DTS(BVI) CO., LTD.; Free format text: FORMER OWNER: DTS CO.,LTD.; Effective date: 20090403 |
C41 | Transfer of patent application or patent right or utility model | |
TA01 | Transfer of patent application right | Effective date of registration: 20090403; Address after: Virgin Islands (British); Applicant after: DTS, Inc.; Address before: California, USA; Applicant before: DTS, Inc. |
REG | Reference to a national code | Ref country code: HK; Ref legal event code: DE; Ref document number: 1125216; Country of ref document: HK |
C02 | Deemed withdrawal of patent application after publication (patent law 2001) | |
WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20090211 |
REG | Reference to a national code | Ref country code: HK; Ref legal event code: WD; Ref document number: 1125216; Country of ref document: HK |