DE60213653D1 - Method and system for real-time speech synthesis

Method and system for real-time speech synthesis

Info

Publication number
DE60213653D1
Authority
DE
Germany
Prior art keywords
synthesis engine
real
time language
dsp
language synthesis
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
DE60213653T
Other languages
English (en)
Other versions
DE60213653T2 (de)
Inventor
Hamid Sheikhzadeh-Nadjar
Etienne Cornu
L Brennan
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Emma Mixed Signal CV
Original Assignee
Emma Mixed Signal CV
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Emma Mixed Signal CV
Application granted
Publication of DE60213653D1
Publication of DE60213653T2
Anticipated expiration
Expired - Lifetime

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L13/00 Speech synthesis; Text to speech systems
    • G10L13/08 Text analysis or generation of parameters for speech synthesis out of text, e.g. grapheme to phoneme translation, prosody generation or stress or intonation determination
    • G10L13/10 Prosody rules derived from text; Stress or intonation
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00 Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02 Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/038 Speech enhancement, e.g. noise reduction or echo cancellation using band spreading techniques

Landscapes

  • Engineering & Computer Science (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Signal Processing (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Telephonic Communication Services (AREA)
  • Machine Translation (AREA)
DE60213653T 2001-10-22 2002-10-22 Method and system for real-time speech synthesis Expired - Lifetime DE60213653T2 (de)

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
CA2359771 2001-10-22
CA002359771A CA2359771A1 (en) 2001-10-22 2001-10-22 Low-resource real-time audio synthesis system and method
PCT/CA2002/001579 WO2003036616A1 (en) 2001-10-22 2002-10-22 Method and system for real time speech synthesis

Publications (2)

Publication Number Publication Date
DE60213653D1 (de) 2006-09-14
DE60213653T2 (de) 2007-09-27

Family

ID=4170332

Family Applications (1)

Application Number Title Priority Date Filing Date
DE60213653T Expired - Lifetime DE60213653T2 (de) 2001-10-22 2002-10-22 Method and system for real-time speech synthesis

Country Status (7)

Country Link
US (1) US7120584B2 (de)
EP (1) EP1454312B1 (de)
AT (1) ATE335271T1 (de)
CA (1) CA2359771A1 (de)
DE (1) DE60213653T2 (de)
DK (1) DK1454312T3 (de)
WO (1) WO2003036616A1 (de)

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7928310B2 (en) * 2002-11-12 2011-04-19 MediaLab Solutions Inc. Systems and methods for portable audio synthesis
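JP4256189B2 (ja) * 2003-03-28 2009-04-22 株式会社ケンウッド Audio signal compression device, audio signal compression method, and program
JP2004304536A (ja) * 2003-03-31 2004-10-28 Ricoh Co Ltd Semiconductor device and mobile phone device using the semiconductor device
JP4264030B2 (ja) * 2003-06-04 2009-05-13 株式会社ケンウッド Voice data selection device, voice data selection method, and program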
US8666746B2 (en) * 2004-05-13 2014-03-04 At&T Intellectual Property Ii, L.P. System and method for generating customized text-to-speech voices
KR100608062B1 (ko) * 2004-08-04 2006-08-02 삼성전자주식회사 Method and apparatus for high-frequency reconstruction of audio data
US7869999B2 (en) * 2004-08-11 2011-01-11 Nuance Communications, Inc. Systems and methods for selecting from multiple phonectic transcriptions for text-to-speech synthesis
US7587441B2 (en) * 2005-06-29 2009-09-08 L-3 Communications Integrated Systems L.P. Systems and methods for weighted overlap and add processing
US20070106513A1 (en) * 2005-11-10 2007-05-10 Boillot Marc A Method for facilitating text to speech synthesis using a differential vocoder
GB2433150B (en) * 2005-12-08 2009-10-07 Toshiba Res Europ Ltd Method and apparatus for labelling speech
US7645929B2 (en) * 2006-09-11 2010-01-12 Hewlett-Packard Development Company, L.P. Computational music-tempo estimation
CN101542593B (zh) * 2007-03-12 2013-04-17 富士通株式会社 Speech waveform interpolation device and method
US8471743B2 (en) * 2010-11-04 2013-06-25 Mediatek Inc. Quantization circuit having VCO-based quantizer compensated in phase domain and related quantization method and continuous-time delta-sigma analog-to-digital converter
US8649523B2 (en) 2011-03-25 2014-02-11 Nintendo Co., Ltd. Methods and systems using a compensation signal to reduce audio decoding errors at block boundaries
CN104349260B (zh) * 2011-08-30 2017-06-30 中国科学院微电子研究所 Low-power WOLA filter bank and its synthesis stage circuit
EP2757558A1 (de) 2013-01-18 2014-07-23 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Time domain level adjustment for audio signal decoding or encoding
JP6305694B2 (ja) * 2013-05-31 2018-04-04 クラリオン株式会社 Signal processing device and signal processing method
US9554207B2 (en) 2015-04-30 2017-01-24 Shure Acquisition Holdings, Inc. Offset cartridge microphones
US9565493B2 (en) 2015-04-30 2017-02-07 Shure Acquisition Holdings, Inc. Array microphone system and method of assembling the same
WO2019232235A1 (en) 2018-05-31 2019-12-05 Shure Acquisition Holdings, Inc. Systems and methods for intelligent voice activation for auto-mixing
CN112335261B (zh) 2018-06-01 2023-07-18 舒尔获得控股公司 Pattern-forming microphone array
US11297423B2 (en) 2018-06-15 2022-04-05 Shure Acquisition Holdings, Inc. Endfire linear array microphone
WO2020061353A1 (en) 2018-09-20 2020-03-26 Shure Acquisition Holdings, Inc. Adjustable lobe shape for array microphones
WO2020191380A1 (en) 2019-03-21 2020-09-24 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition functionality
CN113841419A (zh) 2019-03-21 2021-12-24 舒尔获得控股公司 Housings and associated design features for ceiling array microphones
US11558693B2 (en) 2019-03-21 2023-01-17 Shure Acquisition Holdings, Inc. Auto focus, auto focus within regions, and auto placement of beamformed microphone lobes with inhibition and voice activity detection functionality
CN114051738B (zh) 2019-05-23 2024-10-01 舒尔获得控股公司 Steerable speaker array, system, and method thereof
US11302347B2 (en) 2019-05-31 2022-04-12 Shure Acquisition Holdings, Inc. Low latency automixer integrated with voice and noise activity detection
WO2021041275A1 (en) 2019-08-23 2021-03-04 Shure Acquisition Holdings, Inc. Two-dimensional microphone array with improved directivity
US12028678B2 (en) 2019-11-01 2024-07-02 Shure Acquisition Holdings, Inc. Proximity microphone
US11552611B2 (en) 2020-02-07 2023-01-10 Shure Acquisition Holdings, Inc. System and method for automatic adjustment of reference gain
CN113452464B (zh) * 2020-03-24 2022-11-15 中移(成都)信息通信科技有限公司 Time calibration method, apparatus, device, and medium
WO2021243368A2 (en) 2020-05-29 2021-12-02 Shure Acquisition Holdings, Inc. Transducer steering and configuration systems and methods using a local positioning system
EP4285605A1 (de) 2021-01-28 2023-12-06 Shure Acquisition Holdings, Inc. Hybrid audio beamforming system
CN113840328B (zh) * 2021-09-09 2023-10-20 锐捷网络股份有限公司 Data compression method and apparatus, electronic device, and storage medium

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
BE1010336A3 (fr) * 1996-06-10 1998-06-02 Faculte Polytechnique De Mons Sound synthesis method.
GB2317537B (en) * 1996-09-19 2000-05-17 Matra Marconi Space Digital signal processing apparatus for frequency demultiplexing or multiplexing
US5991787A (en) * 1997-12-31 1999-11-23 Intel Corporation Reducing peak spectral error in inverse Fast Fourier Transform using MMX™ technology
US6081780A (en) * 1998-04-28 2000-06-27 International Business Machines Corporation TTS and prosody based authoring system
US6173263B1 (en) * 1998-08-31 2001-01-09 At&T Corp. Method and system for performing concatenative speech synthesis using half-phonemes
JP4792613B2 (ja) 1999-09-29 2011-10-12 ソニー株式会社 Information processing apparatus and method, and recording medium

Also Published As

Publication number Publication date
WO2003036616A1 (en) 2003-05-01
CA2359771A1 (en) 2003-04-22
DK1454312T3 (da) 2006-11-27
DE60213653T2 (de) 2007-09-27
ATE335271T1 (de) 2006-08-15
US7120584B2 (en) 2006-10-10
US20030130848A1 (en) 2003-07-10
EP1454312A1 (de) 2004-09-08
EP1454312B1 (de) 2006-08-02

Similar Documents

Publication Publication Date Title
DE60213653D1 (de) Method and system for real-time speech synthesis
US9799323B2 (en) System and method for low-latency web-based text-to-speech without plugins
BR9911315B1 (pt) Intelligent text-to-speech synthesis.
ATE343267T1 (de) Electronic converter of an acoustic signal into a pseudo-digital signal and bidirectional communication method using sound waves
TW347619B (en) A communication system and method using a speaker dependent time-scaling technique: a method for time-scale modification of speech using a modified version of the Waveform Similarity based Overlap-Add technique (WSOLA).
SG135951A1 (en) Presentation of data based on user input
ATE496496T1 (de) Directional audio signal processing using an oversampled filterbank
ATE220473T1 (de) System, method, and program data carrier for representing complex information as sound
DE602004006641D1 (de) Audio dialogue system and voice-controlled browsing method
ATE348455T1 (de) FIFO as a transition between clock domains
FR2847376B1 (fr) Method for processing sound data and sound acquisition device implementing this method
DE60101148D1 (de) Device and method for speech signal modification
DE602006019099D1 (de) Speech analysis system
ATE288615T1 (de) Method and processor system for audio signal processing
WO2004012183A3 (en) Concatenative text-to-speech conversion
DE59902143D1 (de) Method and device for outputting information and/or messages by speech
US4459674A (en) Voice input/output apparatus
ATE291772T1 (de) Tactile communication system
JP2003015681A (ja) Signal combining device, signal combining method, and program
US7249020B2 (en) Voice synthesizing method using independent sampling frequencies and apparatus therefor
SE9303902D0 (sv) Device and method for speech synthesis
Schnell et al. Text-to-speech for low-resource systems
CN117079659B (zh) Audio processing method and related apparatus
DE69637326D1 (de) System and method for speaker-independent real-time speech recognition
CA2397080A1 (en) Sub-band adaptive signal processing in an oversampled filterbank

Legal Events

Date Code Title Description
8364 No opposition during term of opposition