ATE329345T1 - Verfahren und vorrichtung zur grundfrequenzermittlung - Google Patents

Verfahren und vorrichtung zur grundfrequenzermittlung

Info

Publication number
ATE329345T1
ATE329345T1 AT99959072T AT99959072T ATE329345T1 AT E329345 T1 ATE329345 T1 AT E329345T1 AT 99959072 T AT99959072 T AT 99959072T AT 99959072 T AT99959072 T AT 99959072T AT E329345 T1 ATE329345 T1 AT E329345T1
Authority
AT
Austria
Prior art keywords
window
pitch
speech signal
basic frequency
determining basic
Prior art date
Application number
AT99959072T
Other languages
English (en)
Inventor
Alejandro Acero
James G Droppo Iii
Original Assignee
Microsoft Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Microsoft Corp filed Critical Microsoft Corp
Application granted granted Critical
Publication of ATE329345T1 publication Critical patent/ATE329345T1/de

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90Pitch determination of speech signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/03Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters
    • G10L25/06Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the type of extracted parameters the extracted parameters being correlation coefficients

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Acoustics & Sound (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Computational Linguistics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Electrically Operated Instructional Devices (AREA)
  • Electrical Discharge Machining, Electrochemical Machining, And Combined Machining (AREA)
  • Measuring Frequencies, Analyzing Spectra (AREA)
  • Color Television Systems (AREA)
  • Stabilization Of Oscillater, Synchronisation, Frequency Synthesizers (AREA)
  • Measurement Of Mechanical Vibrations Or Ultrasonic Waves (AREA)
AT99959072T 1998-11-24 1999-11-22 Verfahren und vorrichtung zur grundfrequenzermittlung ATE329345T1 (de)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
US09/198,476 US6226606B1 (en) 1998-11-24 1998-11-24 Method and apparatus for pitch tracking

Publications (1)

Publication Number Publication Date
ATE329345T1 true ATE329345T1 (de) 2006-06-15

Family

ID=22733544

Family Applications (1)

Application Number Title Priority Date Filing Date
AT99959072T ATE329345T1 (de) 1998-11-24 1999-11-22 Verfahren und vorrichtung zur grundfrequenzermittlung

Country Status (8)

Country Link
US (1) US6226606B1 (de)
EP (1) EP1145224B1 (de)
JP (1) JP4354653B2 (de)
CN (1) CN1152365C (de)
AT (1) ATE329345T1 (de)
AU (1) AU1632100A (de)
DE (1) DE69931813T2 (de)
WO (1) WO2000031721A1 (de)

Families Citing this family (48)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7315815B1 (en) * 1999-09-22 2008-01-01 Microsoft Corporation LPC-harmonic vocoder with superframe structure
US6418407B1 (en) * 1999-09-30 2002-07-09 Motorola, Inc. Method and apparatus for pitch determination of a low bit rate digital voice message
US6510413B1 (en) * 2000-06-29 2003-01-21 Intel Corporation Distributed synthetic speech generation
US6535852B2 (en) * 2001-03-29 2003-03-18 International Business Machines Corporation Training of text-to-speech systems
US6917912B2 (en) * 2001-04-24 2005-07-12 Microsoft Corporation Method and apparatus for tracking pitch in audio analysis
US7366712B2 (en) * 2001-05-31 2008-04-29 Intel Corporation Information retrieval center gateway
US6907367B2 (en) * 2001-08-31 2005-06-14 The United States Of America As Represented By The Secretary Of The Navy Time-series segmentation
JP3823804B2 (ja) * 2001-10-22 2006-09-20 ソニー株式会社 信号処理方法及び装置、信号処理プログラム、並びに記録媒体
JP3997749B2 (ja) * 2001-10-22 2007-10-24 ソニー株式会社 信号処理方法及び装置、信号処理プログラム、並びに記録媒体
JP3750583B2 (ja) * 2001-10-22 2006-03-01 ソニー株式会社 信号処理方法及び装置、並びに信号処理プログラム
US7124075B2 (en) * 2001-10-26 2006-10-17 Dmitry Edward Terez Methods and apparatus for pitch determination
US6721699B2 (en) 2001-11-12 2004-04-13 Intel Corporation Method and system of Chinese speech pitch extraction
TW589618B (en) * 2001-12-14 2004-06-01 Ind Tech Res Inst Method for determining the pitch mark of speech
US20030139929A1 (en) * 2002-01-24 2003-07-24 Liang He Data transmission system and method for DSR application over GPRS
US7062444B2 (en) * 2002-01-24 2006-06-13 Intel Corporation Architecture for DSR client and server development platform
US7219059B2 (en) * 2002-07-03 2007-05-15 Lucent Technologies Inc. Automatic pronunciation scoring for language learning
US20040049391A1 (en) * 2002-09-09 2004-03-11 Fuji Xerox Co., Ltd. Systems and methods for dynamic reading fluency proficiency assessment
KR100552693B1 (ko) * 2003-10-25 2006-02-20 삼성전자주식회사 피치검출방법 및 장치
US7668712B2 (en) * 2004-03-31 2010-02-23 Microsoft Corporation Audio encoding and decoding with intra frames and adaptive forward error correction
KR100590561B1 (ko) * 2004-10-12 2006-06-19 삼성전자주식회사 신호의 피치를 평가하는 방법 및 장치
US7707034B2 (en) * 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
US7831421B2 (en) * 2005-05-31 2010-11-09 Microsoft Corporation Robust decoder
US7177804B2 (en) * 2005-05-31 2007-02-13 Microsoft Corporation Sub-band voice codec with multi-stage codebooks and redundant coding
WO2007046267A1 (ja) * 2005-10-20 2007-04-26 Nec Corporation 音声判別システム、音声判別方法及び音声判別用プログラム
EP1958341B1 (de) * 2005-12-05 2015-01-21 Telefonaktiebolaget L M Ericsson (PUBL) Echoerkennung
SE0600243L (sv) * 2006-02-06 2007-02-27 Mats Hillborg Melodigenerator
US8364492B2 (en) * 2006-07-13 2013-01-29 Nec Corporation Apparatus, method and program for giving warning in connection with inputting of unvoiced speech
JP5093108B2 (ja) * 2006-07-21 2012-12-05 日本電気株式会社 音声合成装置、方法、およびプログラム
CN101009096B (zh) * 2006-12-15 2011-01-26 清华大学 子带清浊音模糊判决的方法
US7925502B2 (en) * 2007-03-01 2011-04-12 Microsoft Corporation Pitch model for noise estimation
ATE504010T1 (de) * 2007-06-01 2011-04-15 Univ Graz Tech Gemeinsame positions-tonhöhenschätzung akustischer quellen zu ihrer verfolgung und trennung
DE102007030209A1 (de) * 2007-06-27 2009-01-08 Siemens Audiologische Technik Gmbh Glättungsverfahren
JP2009047831A (ja) * 2007-08-17 2009-03-05 Toshiba Corp 特徴量抽出装置、プログラムおよび特徴量抽出方法
JP4599420B2 (ja) * 2008-02-29 2010-12-15 株式会社東芝 特徴量抽出装置
JP5593608B2 (ja) * 2008-12-05 2014-09-24 ソニー株式会社 情報処理装置、メロディーライン抽出方法、ベースライン抽出方法、及びプログラム
US9947340B2 (en) * 2008-12-10 2018-04-17 Skype Regeneration of wideband speech
GB2466201B (en) * 2008-12-10 2012-07-11 Skype Ltd Regeneration of wideband speech
GB0822537D0 (en) 2008-12-10 2009-01-14 Skype Ltd Regeneration of wideband speech
US8626497B2 (en) * 2009-04-07 2014-01-07 Wen-Hsin Lin Automatic marking method for karaoke vocal accompaniment
WO2011048815A1 (ja) * 2009-10-21 2011-04-28 パナソニック株式会社 オーディオ符号化装置、復号装置、方法、回路およびプログラム
AT509512B1 (de) * 2010-03-01 2012-12-15 Univ Graz Tech Verfahren zur ermittlung von grundfrequenz-verläufen mehrerer signalquellen
US8447596B2 (en) * 2010-07-12 2013-05-21 Audience, Inc. Monaural noise suppression based on computational auditory scene analysis
US9082416B2 (en) * 2010-09-16 2015-07-14 Qualcomm Incorporated Estimating a pitch lag
JP5747562B2 (ja) * 2010-10-28 2015-07-15 ヤマハ株式会社 音響処理装置
US8645128B1 (en) * 2012-10-02 2014-02-04 Google Inc. Determining pitch dynamics of an audio signal
JP6131574B2 (ja) * 2012-11-15 2017-05-24 富士通株式会社 音声信号処理装置、方法、及びプログラム
CN107871492B (zh) * 2016-12-26 2020-12-15 珠海市杰理科技股份有限公司 音乐合成方法和系统
CN111223491B (zh) * 2020-01-22 2022-11-15 深圳市倍轻松科技股份有限公司 一种提取音乐信号主旋律的方法、装置及终端设备

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4731846A (en) 1983-04-13 1988-03-15 Texas Instruments Incorporated Voice messaging system with pitch tracking based on adaptively filtered LPC residual signal
US5007093A (en) * 1987-04-03 1991-04-09 At&T Bell Laboratories Adaptive threshold voiced detector
US5680508A (en) 1991-05-03 1997-10-21 Itt Corporation Enhancement of speech coding in background noise for low-rate speech coder
JPH06332492A (ja) 1993-05-19 1994-12-02 Matsushita Electric Ind Co Ltd 音声検出方法および検出装置
US5704000A (en) 1994-11-10 1997-12-30 Hughes Electronics Robust pitch estimation method and device for telephone speech

Also Published As

Publication number Publication date
CN1338095A (zh) 2002-02-27
US6226606B1 (en) 2001-05-01
JP2003521721A (ja) 2003-07-15
DE69931813D1 (de) 2006-07-20
CN1152365C (zh) 2004-06-02
WO2000031721A1 (en) 2000-06-02
EP1145224A1 (de) 2001-10-17
DE69931813T2 (de) 2006-10-12
AU1632100A (en) 2000-06-13
EP1145224B1 (de) 2006-06-07
JP4354653B2 (ja) 2009-10-28

Similar Documents

Publication Publication Date Title
ATE329345T1 (de) Verfahren und vorrichtung zur grundfrequenzermittlung
DE60336239D1 (de) Signalsuchverfahren für ein positionierungssystem
ATE352836T1 (de) Detektion von emotionen in sprachsignalen mittels analyse einer vielzahl von sprachsignalparametern
ATE338333T1 (de) Zeitskalenmodifikation von signalen mit spezifischem verfahren je nach ermitteltem signaltyp
ATE524746T1 (de) System und verfahren zur verzögerungsleitungsprüfung
ATE498838T1 (de) Elektronische verfahren und vorrichtung zum nachweis von analyten
DE60309142D1 (de) Vorrichtung zur bestimmung von parametern eines gauss'schen mischungmodells (gmm) oder eines gmm basierten hidden markov modells
BR0302683A (pt) Métodos de processamento de sinal para determinar a velocidade acústica como uma função da frequência em um sistema para registro acústico de uma formação geológica, para determinar a lentidão da formação geológica como uma função da frequência em um sistema para registro acústico de formações geológicas, e, para determinar a velocidade acústica e a dispersão de frequência de uma formação geológica
ATE232559T1 (de) Verfahren zur signalverstärkung
DE60128479D1 (de) Verfahren und vorrichtung zur bestimmung eines synthetischen höheren bandsignals in einem sprachkodierer
DE60331475D1 (de) Verfahren und vorrichtung zur analyse von audiosignalen
DE3582503D1 (de) Verfahren, schaltungsanordnung und einrichtung zur beruehrungslosen real-time-bestimmung von geschwindigkeiten sowie deren verwendung.
DE602004028219D1 (de) Verfahren und einrichtung zur erkennung von diskontinuitäten in einem medium
DE69513197D1 (de) Verfahren zur erlangung von information
ATE15563T1 (de) Verfahren und vorrichtung zur redundanzvermindernden digitalen sprachverarbeitung.
ATE234148T1 (de) Verfahren zur feststellung des einsetzens von kolloidbildung insbesondere für schwefelfällung
ATE415023T1 (de) Verfahren und vorrichtung zur bereitstellung von taktinformation in einem drahtlosen nachrichtenübertragungsnetzwerk
EA199800989A1 (ru) Способ и система для установления медицинского состояния
NL7803622A (nl) Werkwijze voor het bepalen van de grondperiode van een spraaksignaal.
ATE525864T1 (de) Verfahren und system zur tondetektion
ATE374990T1 (de) Verfahren zum synthetisieren von sprache
DE50112581D1 (de) Verfahren zur Rekonstruktion tieffrequenter Sprachanteile aus mittelhohen Frequenzanteilen
DE59810386D1 (de) Verfahren und Einrichtung zur Spracherkennung von verwirrenden Wörtern
ATE279816T1 (de) Verfahren und vorrichtung zur detektion von cdma- kodierten signalen
Tufts et al. Measuring pitch and formant frequencies for a speech understanding system

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties