DE60126811D1 - Kodierung von audiosignalen - Google Patents
Kodierung von audiosignalenInfo
- Publication number
- DE60126811D1 DE60126811D1 DE60126811T DE60126811T DE60126811D1 DE 60126811 D1 DE60126811 D1 DE 60126811D1 DE 60126811 T DE60126811 T DE 60126811T DE 60126811 T DE60126811 T DE 60126811T DE 60126811 D1 DE60126811 D1 DE 60126811D1
- Authority
- DE
- Germany
- Prior art keywords
- input signal
- psychoacoustic
- norm
- modeled
- frame
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
- 230000000873 masking effect Effects 0.000 abstract 1
- 238000000034 method Methods 0.000 abstract 1
- 238000003786 synthesis reaction Methods 0.000 abstract 1
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0316—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
- G10L21/0364—Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L2019/0001—Codebooks
- G10L2019/0013—Codebook search algorithms
- G10L2019/0014—Selection criteria for distances
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Quality & Reliability (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
EP00203856 | 2000-11-03 | ||
EP00203856 | 2000-11-03 | ||
EP01201685 | 2001-05-08 | ||
EP01201685 | 2001-05-08 | ||
PCT/EP2001/012721 WO2002037476A1 (en) | 2000-11-03 | 2001-10-31 | Sinusoidal model based coding of audio signals |
Publications (2)
Publication Number | Publication Date |
---|---|
DE60126811D1 true DE60126811D1 (de) | 2007-04-05 |
DE60126811T2 DE60126811T2 (de) | 2007-12-06 |
Family
ID=26072835
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
DE60126811T Expired - Fee Related DE60126811T2 (de) | 2000-11-03 | 2001-10-31 | Kodierung von audiosignalen |
Country Status (8)
Country | Link |
---|---|
US (1) | US7120587B2 (de) |
EP (1) | EP1338001B1 (de) |
JP (1) | JP2004513392A (de) |
KR (1) | KR20020070373A (de) |
CN (1) | CN1216366C (de) |
AT (1) | ATE354850T1 (de) |
DE (1) | DE60126811T2 (de) |
WO (1) | WO2002037476A1 (de) |
Families Citing this family (16)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US7079986B2 (en) * | 2003-12-31 | 2006-07-18 | Sieracki Jeffrey M | Greedy adaptive signature discrimination system and method |
US8271200B2 (en) * | 2003-12-31 | 2012-09-18 | Sieracki Jeffrey M | System and method for acoustic signature extraction, detection, discrimination, and localization |
US8478539B2 (en) | 2003-12-31 | 2013-07-02 | Jeffrey M. Sieracki | System and method for neurological activity signature determination, discrimination, and detection |
WO2005091275A1 (en) * | 2004-03-17 | 2005-09-29 | Koninklijke Philips Electronics N.V. | Audio coding |
US7751572B2 (en) | 2005-04-15 | 2010-07-06 | Dolby International Ab | Adaptive residual audio coding |
KR100788706B1 (ko) * | 2006-11-28 | 2007-12-26 | 삼성전자주식회사 | 광대역 음성 신호의 부호화/복호화 방법 |
KR101299155B1 (ko) * | 2006-12-29 | 2013-08-22 | 삼성전자주식회사 | 오디오 부호화 및 복호화 장치와 그 방법 |
KR101149448B1 (ko) * | 2007-02-12 | 2012-05-25 | 삼성전자주식회사 | 오디오 부호화 및 복호화 장치와 그 방법 |
KR101346771B1 (ko) * | 2007-08-16 | 2013-12-31 | 삼성전자주식회사 | 심리 음향 모델에 따른 마스킹 값보다 작은 정현파 신호를효율적으로 인코딩하는 방법 및 장치, 그리고 인코딩된오디오 신호를 디코딩하는 방법 및 장치 |
KR101441898B1 (ko) * | 2008-02-01 | 2014-09-23 | 삼성전자주식회사 | 주파수 부호화 방법 및 장치와 주파수 복호화 방법 및 장치 |
US8805083B1 (en) | 2010-03-21 | 2014-08-12 | Jeffrey M. Sieracki | System and method for discriminating constituents of image by complex spectral signature extraction |
US9691395B1 (en) | 2011-12-31 | 2017-06-27 | Reality Analytics, Inc. | System and method for taxonomically distinguishing unconstrained signal data segments |
US9558762B1 (en) | 2011-07-03 | 2017-01-31 | Reality Analytics, Inc. | System and method for distinguishing source from unconstrained acoustic signals emitted thereby in context agnostic manner |
US9886945B1 (en) | 2011-07-03 | 2018-02-06 | Reality Analytics, Inc. | System and method for taxonomically distinguishing sample data captured from biota sources |
JP5799707B2 (ja) * | 2011-09-26 | 2015-10-28 | ソニー株式会社 | オーディオ符号化装置およびオーディオ符号化方法、オーディオ復号装置およびオーディオ復号方法、並びにプログラム |
EP3617904A4 (de) * | 2017-04-28 | 2020-04-29 | Sony Corporation | Informationsverarbeitungsvorrichtung und informationsverarbeitungsverfahren |
Family Cites Families (5)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1062963C (zh) * | 1990-04-12 | 2001-03-07 | 多尔拜实验特许公司 | 用于产生高质量声音信号的解码器和编码器 |
JP3446216B2 (ja) * | 1992-03-06 | 2003-09-16 | ソニー株式会社 | 音声信号処理方法 |
US5651090A (en) * | 1994-05-06 | 1997-07-22 | Nippon Telegraph And Telephone Corporation | Coding method and coder for coding input signals of plural channels using vector quantization, and decoding method and decoder therefor |
JP3707153B2 (ja) * | 1996-09-24 | 2005-10-19 | ソニー株式会社 | ベクトル量子化方法、音声符号化方法及び装置 |
FI973873A (fi) * | 1997-10-02 | 1999-04-03 | Nokia Mobile Phones Ltd | Puhekoodaus |
-
2001
- 2001-10-31 AT AT01980541T patent/ATE354850T1/de not_active IP Right Cessation
- 2001-10-31 US US10/169,345 patent/US7120587B2/en not_active Expired - Fee Related
- 2001-10-31 EP EP01980541A patent/EP1338001B1/de not_active Expired - Lifetime
- 2001-10-31 KR KR1020027008652A patent/KR20020070373A/ko not_active Application Discontinuation
- 2001-10-31 CN CN018059643A patent/CN1216366C/zh not_active Expired - Fee Related
- 2001-10-31 WO PCT/EP2001/012721 patent/WO2002037476A1/en active IP Right Grant
- 2001-10-31 JP JP2002540143A patent/JP2004513392A/ja not_active Withdrawn
- 2001-10-31 DE DE60126811T patent/DE60126811T2/de not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
US7120587B2 (en) | 2006-10-10 |
ATE354850T1 (de) | 2007-03-15 |
DE60126811T2 (de) | 2007-12-06 |
EP1338001A1 (de) | 2003-08-27 |
CN1216366C (zh) | 2005-08-24 |
CN1408110A (zh) | 2003-04-02 |
EP1338001B1 (de) | 2007-02-21 |
WO2002037476A1 (en) | 2002-05-10 |
KR20020070373A (ko) | 2002-09-06 |
JP2004513392A (ja) | 2004-04-30 |
US20030009332A1 (en) | 2003-01-09 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
DE60126811D1 (de) | Kodierung von audiosignalen | |
CN107220235A (zh) | 基于人工智能的语音识别纠错方法、装置及存储介质 | |
TW357313B (en) | Methods and apparatus for handwriting recognition | |
SG128406A1 (en) | Character recognizing and translating system and voice recognizing and translating system | |
CN107403619A (zh) | 一种应用于自行车环境的语音控制方法及系统 | |
HK1080556A1 (en) | Systems and methods of building and using custom word lists | |
TW200612392A (en) | Multi-channel encoder | |
DE50001467D1 (de) | Verfahren und vorrichtung zum einbringen von informationen in einen datenstrom sowie verfahren und vorrichtung zum codieren eines audiosignals | |
CN111696580B (zh) | 一种语音检测方法、装置、电子设备及存储介质 | |
CN111354343B (zh) | 语音唤醒模型的生成方法、装置和电子设备 | |
EP1569199A4 (de) | Datenerzeugungseinrichtung und verfahren für musikkompositionen | |
CN106098078A (zh) | 一种可过滤扬声器噪音的语音识别方法及其系统 | |
CN104091592A (zh) | 一种基于隐高斯随机场的语音转换系统 | |
WO2004092919A3 (en) | System facilitating communications and financial contributions involving facilities and residents thereof | |
CN115394287A (zh) | 混合语种语音识别方法、装置、系统及存储介质 | |
CN104952446A (zh) | 基于语音交互的数字楼盘展示系统 | |
WO2003014961A3 (en) | Methods for efficient filtering of data | |
CN107403620A (zh) | 一种语音识别方法及装置 | |
CN105161096A (zh) | 基于垃圾模型的语音识别处理方法及装置 | |
CN106228976A (zh) | 语音识别方法和装置 | |
CN117524259A (zh) | 音频处理方法及系统 | |
CN112242134B (zh) | 语音合成方法及装置 | |
DE3570784D1 (en) | Improved phonemic classification in speech recognition system | |
Ouisaadane et al. | English Spoken Digits Database under noise conditions for research: SDDN | |
Srinivas et al. | Detection of vowel-like speech: an efficient hardware architecture and it's FPGA prototype |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
8364 | No opposition during term of opposition | ||
8339 | Ceased/non-payment of the annual fee |