ATE354850T1 - Kodierung von audiosignalen - Google Patents

Kodierung von audiosignalen

Info

Publication number
ATE354850T1
ATE354850T1 AT01980541T AT01980541T ATE354850T1 AT E354850 T1 ATE354850 T1 AT E354850T1 AT 01980541 T AT01980541 T AT 01980541T AT 01980541 T AT01980541 T AT 01980541T AT E354850 T1 ATE354850 T1 AT E354850T1
Authority
AT
Austria
Prior art keywords
input signal
psychoacoustic
norm
modeled
frame
Prior art date
Application number
AT01980541T
Other languages
English (en)
Inventor
Richard Heusdens
Renat Vafin
Willem B Kleijn
Original Assignee
Koninkl Philips Electronics Nv
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Koninkl Philips Electronics Nv filed Critical Koninkl Philips Electronics Nv
Application granted granted Critical
Publication of ATE354850T1 publication Critical patent/ATE354850T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0013Codebook search algorithms
    • G10L2019/0014Selection criteria for distances

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
AT01980541T 2000-11-03 2001-10-31 Kodierung von audiosignalen ATE354850T1 (de)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
EP00203856 2000-11-03
EP01201685 2001-05-08

Publications (1)

Publication Number Publication Date
ATE354850T1 true ATE354850T1 (de) 2007-03-15

Family

ID=26072835

Family Applications (1)

Application Number Title Priority Date Filing Date
AT01980541T ATE354850T1 (de) 2000-11-03 2001-10-31 Kodierung von audiosignalen

Country Status (8)

Country Link
US (1) US7120587B2 (de)
EP (1) EP1338001B1 (de)
JP (1) JP2004513392A (de)
KR (1) KR20020070373A (de)
CN (1) CN1216366C (de)
AT (1) ATE354850T1 (de)
DE (1) DE60126811T2 (de)
WO (1) WO2002037476A1 (de)

Families Citing this family (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8478539B2 (en) 2003-12-31 2013-07-02 Jeffrey M. Sieracki System and method for neurological activity signature determination, discrimination, and detection
US7079986B2 (en) * 2003-12-31 2006-07-18 Sieracki Jeffrey M Greedy adaptive signature discrimination system and method
US8271200B2 (en) * 2003-12-31 2012-09-18 Sieracki Jeffrey M System and method for acoustic signature extraction, detection, discrimination, and localization
WO2005091275A1 (en) * 2004-03-17 2005-09-29 Koninklijke Philips Electronics N.V. Audio coding
US7751572B2 (en) 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
KR100788706B1 (ko) * 2006-11-28 2007-12-26 삼성전자주식회사 광대역 음성 신호의 부호화/복호화 방법
KR101299155B1 (ko) * 2006-12-29 2013-08-22 삼성전자주식회사 오디오 부호화 및 복호화 장치와 그 방법
KR101149448B1 (ko) * 2007-02-12 2012-05-25 삼성전자주식회사 오디오 부호화 및 복호화 장치와 그 방법
KR101346771B1 (ko) * 2007-08-16 2013-12-31 삼성전자주식회사 심리 음향 모델에 따른 마스킹 값보다 작은 정현파 신호를효율적으로 인코딩하는 방법 및 장치, 그리고 인코딩된오디오 신호를 디코딩하는 방법 및 장치
KR101441898B1 (ko) * 2008-02-01 2014-09-23 삼성전자주식회사 주파수 부호화 방법 및 장치와 주파수 복호화 방법 및 장치
US8805083B1 (en) 2010-03-21 2014-08-12 Jeffrey M. Sieracki System and method for discriminating constituents of image by complex spectral signature extraction
US9886945B1 (en) 2011-07-03 2018-02-06 Reality Analytics, Inc. System and method for taxonomically distinguishing sample data captured from biota sources
US9558762B1 (en) 2011-07-03 2017-01-31 Reality Analytics, Inc. System and method for distinguishing source from unconstrained acoustic signals emitted thereby in context agnostic manner
US9691395B1 (en) 2011-12-31 2017-06-27 Reality Analytics, Inc. System and method for taxonomically distinguishing unconstrained signal data segments
JP5799707B2 (ja) * 2011-09-26 2015-10-28 ソニー株式会社 オーディオ符号化装置およびオーディオ符号化方法、オーディオ復号装置およびオーディオ復号方法、並びにプログラム
WO2018198454A1 (ja) * 2017-04-28 2018-11-01 ソニー株式会社 情報処理装置、および情報処理方法

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1062963C (zh) * 1990-04-12 2001-03-07 多尔拜实验特许公司 用于产生高质量声音信号的解码器和编码器
JP3446216B2 (ja) * 1992-03-06 2003-09-16 ソニー株式会社 音声信号処理方法
US5651090A (en) * 1994-05-06 1997-07-22 Nippon Telegraph And Telephone Corporation Coding method and coder for coding input signals of plural channels using vector quantization, and decoding method and decoder therefor
JP3707153B2 (ja) * 1996-09-24 2005-10-19 ソニー株式会社 ベクトル量子化方法、音声符号化方法及び装置
FI973873A (fi) * 1997-10-02 1999-04-03 Nokia Mobile Phones Ltd Puhekoodaus

Also Published As

Publication number Publication date
JP2004513392A (ja) 2004-04-30
DE60126811D1 (de) 2007-04-05
US20030009332A1 (en) 2003-01-09
US7120587B2 (en) 2006-10-10
EP1338001A1 (de) 2003-08-27
CN1216366C (zh) 2005-08-24
EP1338001B1 (de) 2007-02-21
WO2002037476A1 (en) 2002-05-10
CN1408110A (zh) 2003-04-02
DE60126811T2 (de) 2007-12-06
KR20020070373A (ko) 2002-09-06

Similar Documents

Publication Publication Date Title
DE60126811D1 (de) Kodierung von audiosignalen
CN110085251B (zh) 人声提取方法、人声提取装置及相关产品
GB2159997A (en) Speech recognition
CN107220235A (zh) 基于人工智能的语音识别纠错方法、装置及存储介质
ATE449382T1 (de) Systeme und verfahren zur erstellung und verwendung von angepassten wörterlisten
SG128406A1 (en) Character recognizing and translating system and voice recognizing and translating system
DE69908360D1 (de) Rechnersystem und verfahren zur erklärung des verhaltens eines modelles das eingangsdaten auf ausgangdaten abbildet
CN110910283A (zh) 生成法律文书的方法、装置、设备和存储介质
TW200612392A (en) Multi-channel encoder
DE50001467D1 (de) Verfahren und vorrichtung zum einbringen von informationen in einen datenstrom sowie verfahren und vorrichtung zum codieren eines audiosignals
CN111696580B (zh) 一种语音检测方法、装置、电子设备及存储介质
Lee et al. Improved tone concatenation rules in a formant-based Chinese text-to-speech system
CN111354343B (zh) 语音唤醒模型的生成方法、装置和电子设备
EP1569199A4 (de) Datenerzeugungseinrichtung und verfahren für musikkompositionen
CN106098078A (zh) 一种可过滤扬声器噪音的语音识别方法及其系统
CN104091592A (zh) 一种基于隐高斯随机场的语音转换系统
CN115394287A (zh) 混合语种语音识别方法、装置、系统及存储介质
WO2003014961A3 (en) Methods for efficient filtering of data
CN106228976A (zh) 语音识别方法和装置
CN105161096A (zh) 基于垃圾模型的语音识别处理方法及装置
CN112242134A (zh) 语音合成方法及装置
Ouisaadane et al. English Spoken Digits Database under noise conditions for research: SDDN
Srinivas et al. Detection of vowel-like speech: an efficient hardware architecture and it's FPGA prototype
Lashkari et al. NMF-based cepstral features for speech emotion recognition
CN102201232A (zh) 一种用于嵌入式语音合成系统的音库结构压缩及使用方法

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties