NO20082403L - Fremgangsmate og anordning for taledata - Google Patents

Fremgangsmate og anordning for taledata

Info

Publication number
NO20082403L
NO20082403L NO20082403A NO20082403A NO20082403L NO 20082403 L NO20082403 L NO 20082403L NO 20082403 A NO20082403 A NO 20082403A NO 20082403 A NO20082403 A NO 20082403A NO 20082403 L NO20082403 L NO 20082403L
Authority
NO
Norway
Prior art keywords
speech
class
prediction
target
sound
Prior art date
Application number
NO20082403A
Other languages
English (en)
Inventor
Tetsujiro Kondo
Tsutomu Watanabe
Hiroto Kimura
Masaaki Hattori
Yasuhiro Fujimori
Original Assignee
Sony Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from JP2000251969A external-priority patent/JP2002062899A/ja
Priority claimed from JP2000346675A external-priority patent/JP4517262B2/ja
Publication of NO20082403L publication Critical patent/NO20082403L/no
Application filed by Sony Corp filed Critical Sony Corp

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/12Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a code excitation, e.g. in code excited linear prediction [CELP] vocoders
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)

Abstract

Det er beskrevet en talebehandlingsanordning, der forutsigelsesutgang for å finne forutsigelsesverdier for talen som har høy lydkvalitet, blir trukket ut fra den syntetiserte lyd som er fremkommet ved å føre lineære forutsigelseskoeffisienter og restsignaler, frembragt fra en forhåndsstilt kode, til et talesyntesefilter der talen med høy lydkvalitet har høyere lydkvalitet enn den syntetiserte lyd, og der forutsigelsesuttakene blir benyttet sammen med forhåndsstilte uttakskoeffisienter for å utføre forhåndsstilte forutsigelsesberegninger for å finne forutsigelsesverdiene for talen som har høy lydkvalitet. Lyden som har høy lydkvalitet har høyere lydkvalitet enn den syntetiserte lyd. Anordningen omfatter en enhet (45) til uttrekning av forutsigelsesuttak fra den syntetiserte lyd, der forutsigelsesuttakene benyttes til forutsigelse av talen som har høy kvalitet, som måltale, for hvilken forutsigelsesverdi og en enhet (46) for uttrekning av klasseuttak, benyttet til klassifisering av måltalen i en av et flertall klasser fra den ovenstående kode. Anordningen omfatter også en klassifiseringsenhet (47) for å finne klassen for måltalen basert på klasseuttakene, uthentningsenhet og uthentning av uttakskoeffisienter som er knyttet til klassen for måltalen fra blant uttakskoeffisientene som er funnet ved opplæring fra klasse til klasse, og en forutsigelsesenhet (49) for å finne forutsigelsesverdiene for måltalen ved bruk av forutsigelsesuttak og uttakskoeffisientene som er knyttet til klassen for måltalen.
NO20082403A 2000-08-09 2008-05-26 Fremgangsmate og anordning for taledata NO20082403L (no)

Applications Claiming Priority (4)

Application Number Priority Date Filing Date Title
JP2000241062 2000-08-09
JP2000251969A JP2002062899A (ja) 2000-08-23 2000-08-23 データ処理装置およびデータ処理方法、学習装置および学習方法、並びに記録媒体
JP2000346675A JP4517262B2 (ja) 2000-11-14 2000-11-14 音声処理装置および音声処理方法、学習装置および学習方法、並びに記録媒体
PCT/JP2001/006708 WO2002013183A1 (fr) 2000-08-09 2001-08-03 Procede et dispositif de traitement de donnees vocales

Publications (1)

Publication Number Publication Date
NO20082403L true NO20082403L (no) 2002-06-07

Family

ID=27344301

Family Applications (3)

Application Number Title Priority Date Filing Date
NO20021631A NO326880B1 (no) 2000-08-09 2002-04-05 Fremgangsmate og anordning for taledata
NO20082403A NO20082403L (no) 2000-08-09 2008-05-26 Fremgangsmate og anordning for taledata
NO20082401A NO20082401L (no) 2000-08-09 2008-05-26 Fremgangsmate og anordning for taledata

Family Applications Before (1)

Application Number Title Priority Date Filing Date
NO20021631A NO326880B1 (no) 2000-08-09 2002-04-05 Fremgangsmate og anordning for taledata

Family Applications After (1)

Application Number Title Priority Date Filing Date
NO20082401A NO20082401L (no) 2000-08-09 2008-05-26 Fremgangsmate og anordning for taledata

Country Status (7)

Country Link
US (1) US7912711B2 (no)
EP (3) EP1944760B1 (no)
KR (1) KR100819623B1 (no)
DE (3) DE60140020D1 (no)
NO (3) NO326880B1 (no)
TW (1) TW564398B (no)
WO (1) WO2002013183A1 (no)

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP4857468B2 (ja) * 2001-01-25 2012-01-18 ソニー株式会社 データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体
JP4857467B2 (ja) 2001-01-25 2012-01-18 ソニー株式会社 データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体
JP4711099B2 (ja) 2001-06-26 2011-06-29 ソニー株式会社 送信装置および送信方法、送受信装置および送受信方法、並びにプログラムおよび記録媒体
DE102006022346B4 (de) 2006-05-12 2008-02-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Informationssignalcodierung
US8504090B2 (en) * 2010-03-29 2013-08-06 Motorola Solutions, Inc. Enhanced public safety communication system
US9363068B2 (en) 2010-08-03 2016-06-07 Intel Corporation Vector processor having instruction set with sliding window non-linear convolutional function
WO2013063440A1 (en) 2011-10-27 2013-05-02 Lsi Corporation Vector processor having instruction set with vector convolution funciton for fir filtering
RU2012102842A (ru) 2012-01-27 2013-08-10 ЭлЭсАй Корпорейшн Инкрементное обнаружение преамбулы
EP2704142B1 (en) * 2012-08-27 2015-09-02 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for reproducing an audio signal, apparatus and method for generating a coded audio signal, computer program and coded audio signal
US9923595B2 (en) 2013-04-17 2018-03-20 Intel Corporation Digital predistortion for dual-band power amplifiers

Family Cites Families (45)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPS6011360B2 (ja) * 1981-12-15 1985-03-25 ケイディディ株式会社 音声符号化方式
JP2797348B2 (ja) 1988-11-28 1998-09-17 松下電器産業株式会社 音声符号化・復号化装置
US5293448A (en) * 1989-10-02 1994-03-08 Nippon Telegraph And Telephone Corporation Speech analysis-synthesis method and apparatus therefor
US5261027A (en) * 1989-06-28 1993-11-09 Fujitsu Limited Code excited linear prediction speech coding system
CA2031965A1 (en) 1990-01-02 1991-07-03 Paul A. Rosenstrach Sound synthesizer
JP2736157B2 (ja) 1990-07-17 1998-04-02 シャープ株式会社 符号化装置
JPH05158495A (ja) 1991-05-07 1993-06-25 Fujitsu Ltd 音声符号化伝送装置
ES2166355T3 (es) * 1991-06-11 2002-04-16 Qualcomm Inc Vocodificador de velocidad variable.
JP3076086B2 (ja) * 1991-06-28 2000-08-14 シャープ株式会社 音声合成装置用ポストフィルタ
US5233660A (en) * 1991-09-10 1993-08-03 At&T Bell Laboratories Method and apparatus for low-delay celp speech coding and decoding
US5371853A (en) * 1991-10-28 1994-12-06 University Of Maryland At College Park Method and system for CELP speech coding and codebook for use therewith
US5327520A (en) * 1992-06-04 1994-07-05 At&T Bell Laboratories Method of use of voice message coder/decoder
JP2779886B2 (ja) * 1992-10-05 1998-07-23 日本電信電話株式会社 広帯域音声信号復元方法
US5455888A (en) * 1992-12-04 1995-10-03 Northern Telecom Limited Speech bandwidth extension method and apparatus
US5491771A (en) * 1993-03-26 1996-02-13 Hughes Aircraft Company Real-time implementation of a 8Kbps CELP coder on a DSP pair
JP3043920B2 (ja) * 1993-06-14 2000-05-22 富士写真フイルム株式会社 ネガクリップ
US5717823A (en) * 1994-04-14 1998-02-10 Lucent Technologies Inc. Speech-rate modification for linear-prediction based analysis-by-synthesis speech coders
JPH08202399A (ja) 1995-01-27 1996-08-09 Kyocera Corp 復号音声の後処理方法
SE504010C2 (sv) * 1995-02-08 1996-10-14 Ericsson Telefon Ab L M Förfarande och anordning för prediktiv kodning av tal- och datasignaler
JP3235703B2 (ja) * 1995-03-10 2001-12-04 日本電信電話株式会社 ディジタルフィルタのフィルタ係数決定方法
EP0732687B2 (en) * 1995-03-13 2005-10-12 Matsushita Electric Industrial Co., Ltd. Apparatus for expanding speech bandwidth
JP2993396B2 (ja) * 1995-05-12 1999-12-20 三菱電機株式会社 音声加工フィルタ及び音声合成装置
FR2734389B1 (fr) * 1995-05-17 1997-07-18 Proust Stephane Procede d'adaptation du niveau de masquage du bruit dans un codeur de parole a analyse par synthese utilisant un filtre de ponderation perceptuelle a court terme
GB9512284D0 (en) * 1995-06-16 1995-08-16 Nokia Mobile Phones Ltd Speech Synthesiser
JPH0990997A (ja) * 1995-09-26 1997-04-04 Mitsubishi Electric Corp 音声符号化装置、音声復号化装置、音声符号化復号化方法および複合ディジタルフィルタ
JP3248668B2 (ja) * 1996-03-25 2002-01-21 日本電信電話株式会社 ディジタルフィルタおよび音響符号化/復号化装置
US6014622A (en) * 1996-09-26 2000-01-11 Rockwell Semiconductor Systems, Inc. Low bit rate speech coder using adaptive open-loop subframe pitch lag estimation and vector quantization
JP3095133B2 (ja) * 1997-02-25 2000-10-03 日本電信電話株式会社 音響信号符号化方法
JP3946812B2 (ja) * 1997-05-12 2007-07-18 ソニー株式会社 オーディオ信号変換装置及びオーディオ信号変換方法
US5995923A (en) 1997-06-26 1999-11-30 Nortel Networks Corporation Method and apparatus for improving the voice quality of tandemed vocoders
JP4132154B2 (ja) * 1997-10-23 2008-08-13 ソニー株式会社 音声合成方法及び装置、並びに帯域幅拡張方法及び装置
US6014618A (en) * 1998-08-06 2000-01-11 Dsp Software Engineering, Inc. LPAS speech coder using vector quantized, multi-codebook, multi-tap pitch predictor and optimized ternary source excitation codebook derivation
JP2000066700A (ja) * 1998-08-17 2000-03-03 Oki Electric Ind Co Ltd 音声信号符号器、音声信号復号器
JP4099879B2 (ja) 1998-10-26 2008-06-11 ソニー株式会社 帯域幅拡張方法及び装置
US6539355B1 (en) * 1998-10-15 2003-03-25 Sony Corporation Signal band expanding method and apparatus and signal synthesis method and apparatus
US6260009B1 (en) 1999-02-12 2001-07-10 Qualcomm Incorporated CELP-based to CELP-based vocoder packet translation
US6434519B1 (en) * 1999-07-19 2002-08-13 Qualcomm Incorporated Method and apparatus for identifying frequency bands to compute linear phase shifts between frame prototypes in a speech coder
JP4752088B2 (ja) 2000-05-09 2011-08-17 ソニー株式会社 データ処理装置およびデータ処理方法、並びに記録媒体
EP1282236B1 (en) * 2000-05-09 2012-10-03 Sony Corporation Data processing device and data processing method, and recorded medium
JP4517448B2 (ja) 2000-05-09 2010-08-04 ソニー株式会社 データ処理装置およびデータ処理方法、並びに記録媒体
US7283961B2 (en) * 2000-08-09 2007-10-16 Sony Corporation High-quality speech synthesis device and method by classification and prediction processing of synthesized sound
JP4857468B2 (ja) * 2001-01-25 2012-01-18 ソニー株式会社 データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体
JP4857467B2 (ja) * 2001-01-25 2012-01-18 ソニー株式会社 データ処理装置およびデータ処理方法、並びにプログラムおよび記録媒体
JP3876781B2 (ja) * 2002-07-16 2007-02-07 ソニー株式会社 受信装置および受信方法、記録媒体、並びにプログラム
JP4554561B2 (ja) * 2006-06-20 2010-09-29 株式会社シマノ 釣り用グローブ

Also Published As

Publication number Publication date
DE60143327D1 (de) 2010-12-02
NO20021631D0 (no) 2002-04-05
KR100819623B1 (ko) 2008-04-04
NO20082401L (no) 2002-06-07
DE60134861D1 (de) 2008-08-28
EP1308927B1 (en) 2008-07-16
NO20021631L (no) 2002-06-07
EP1944760A2 (en) 2008-07-16
EP1944759A2 (en) 2008-07-16
EP1308927A1 (en) 2003-05-07
KR20020040846A (ko) 2002-05-30
DE60140020D1 (de) 2009-11-05
NO326880B1 (no) 2009-03-09
EP1308927A4 (en) 2005-09-28
EP1944760A3 (en) 2008-07-30
US20080027720A1 (en) 2008-01-31
US7912711B2 (en) 2011-03-22
WO2002013183A1 (fr) 2002-02-14
EP1308927B9 (en) 2009-02-25
EP1944759A3 (en) 2008-07-30
TW564398B (en) 2003-12-01
EP1944759B1 (en) 2010-10-20
EP1944760B1 (en) 2009-09-23

Similar Documents

Publication Publication Date Title
NO20082403L (no) Fremgangsmate og anordning for taledata
DE60126149T2 (de) Verfahren, einrichtung und programm zum codieren und decodieren eines akustischen parameters und verfahren, einrichtung und programm zum codieren und decodieren von klängen
KR100795727B1 (ko) Celp기반의 음성 코더에서 고정 코드북 검색 장치 및방법
US5241649A (en) Voice recognition method
JPH04270398A (ja) 音声符号化方式
JP3628268B2 (ja) 音響信号符号化方法、復号化方法及び装置並びにプログラム及び記録媒体
CN101133442B (zh) 生成音频信号的印迹的方法
CA2483607A1 (en) Syllabic nuclei extracting apparatus and program product thereof
KR100766170B1 (ko) 다중 레벨 양자화를 이용한 음악 요약 장치 및 방법
JPH0764600A (ja) 音声のピッチ符号化装置
JP4357852B2 (ja) 時系列信号の圧縮解析装置および変換装置
JP3010654B2 (ja) 圧縮符号化装置及び方法
JPH04261591A (ja) 自動採譜装置
JP3095758B2 (ja) ベクトル量子化のコードベクトル検索方法
JP3346200B2 (ja) 音声認識装置
JP3010655B2 (ja) 圧縮符号化装置及び方法、並びに復号装置及び方法
JP3024467B2 (ja) 音声符号化装置
JPS61128300A (ja) ピツチ抽出装置
JPH0736119B2 (ja) 区分的最適関数近似方法
JPH07271397A (ja) 音声符号化装置
CN113312456A (zh) 短视频文本生成方法、装置、设备及存储介质
JPH0667696A (ja) 音声符号化方法
JPH07306699A (ja) ベクトル量子化装置
Jean et al. Optimal transform coding for speech line spectrum pair parameters based on spectral-weighted error criterion
JPH0754438B2 (ja) 音声処理装置

Legal Events

Date Code Title Description
FC2A Withdrawal, rejection or dismissal of laid open patent application