IT1270438B - Method and device for speech signal pitch period estimation and classification in digital speech coders

Method and device for speech signal pitch period estimation and classification in digital speech coders

Info

Publication number
IT1270438B
Authority
IT
Italy
Prior art keywords
classification
voice
vocalized
fundamental tone
numerical
Prior art date
Application number
ITTO930419A
Other languages
English (en)
Inventor
Luca Cellario
Original Assignee
Sip
Priority date
1993-06-10
Filing date
1993-06-10
Publication date
1997-05-05
Application filed by Sip

Classifications

    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93 Discriminating between voiced and unvoiced parts of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/012 Comfort noise or silence coding
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08 Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00 Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/90 Pitch determination of speech signals
    • G PHYSICS
    • G10 MUSICAL INSTRUMENTS; ACOUSTICS
    • G10L SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00 Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001 Codebooks
    • G10L2019/0011 Long term prediction filters, i.e. pitch estimation

Abstract

A method and an apparatus for digital speech coding are provided in which, at each frame, a long-term analysis is carried out to determine the pitch period d and a long-term prediction coefficient b and gain g, together with an a-priori classification of the signal as active/inactive and, for active signals, as voiced/unvoiced. The period determination circuits (LT1) compute the pitch period from a suitably weighted covariance function, and the classification circuits (RV) distinguish voiced from unvoiced sounds by comparing the long-term prediction coefficient and gain with thresholds that vary frame by frame. (Fig. 2).
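The per-frame long-term analysis and the voiced/unvoiced decision described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The abstract does not give the weighting function, the lag range, the threshold values or the rule that updates the thresholds from frame to frame, so the lag-decaying weight `d ** log2(kw)`, the constant `kw`, the function names and the threshold values below are all assumptions chosen only for illustration. The sketch also assumes that a frame is voiced when both b and g reach their thresholds, and it does not cover the active/inactive classification.

```python
import math
import random


def long_term_analysis(s, start, length, d_min, d_max, kw=0.7):
    """Estimate pitch period d, coefficient b and prediction gain g for one frame.

    s            : list of samples; the frame is s[start:start + length]
    d_min, d_max : lag search range in samples (requires start >= d_max)
    kw           : hypothetical weighting constant, 0 < kw < 1 (assumption)
    """
    frame = range(start, start + length)
    energy = sum(s[n] * s[n] for n in frame)
    best = None  # (score, d, cov, past)
    for d in range(d_min, d_max + 1):
        cov = sum(s[n] * s[n - d] for n in frame)        # covariance at lag d
        past = sum(s[n - d] * s[n - d] for n in frame)   # energy of delayed frame
        if cov <= 0.0 or past == 0.0:
            continue
        # Assumed weight: decays with lag, so multiples of the true period
        # score lower than the period itself.
        weight = d ** math.log2(kw)
        score = weight * cov * cov / past
        if best is None or score > best[0]:
            best = (score, d, cov, past)
    if best is None:
        return d_min, 0.0, 1.0  # no positive correlation: no prediction gain
    _, d, cov, past = best
    b = cov / past                          # optimal one-tap predictor coefficient
    residual = energy - cov * cov / past    # energy left after prediction
    g = energy / residual if residual > 0.0 else math.inf
    return d, b, g


def is_voiced(b, g, b_thr, g_thr):
    """Assumed rule: voiced when both b and g reach the current frame's thresholds."""
    return b >= b_thr and g >= g_thr


# Demo signals: a tone with a 40-sample period and uniform noise.
tone = [math.sin(2.0 * math.pi * n / 40.0) for n in range(280)]
rng = random.Random(0)
noise = [rng.uniform(-1.0, 1.0) for _ in range(280)]

for name, sig in (("tone", tone), ("noise", noise)):
    d, b, g = long_term_analysis(sig, 120, 160, 20, 120)
    print(name, d, round(b, 3), is_voiced(b, g, b_thr=0.5, g_thr=1.5))
```

In a coder following the abstract, `b_thr` and `g_thr` would be recomputed for every frame. Here they are passed in as arguments because the abstract does not state the update rule.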

Priority Applications (11)

Application Number Publication Number Priority Date Filing Date Title
ITTO930419A IT1270438B (it) 1993-06-10 1993-06-10 Method and device for speech signal pitch period estimation and classification in digital speech coders
US08/243,295 US5548680A (en) 1993-06-10 1994-05-17 Method and device for speech signal pitch period estimation and classification in digital speech coders
CA002124643A CA2124643C (en) 1993-06-10 1994-05-30 Method and device for speech signal pitch period estimation and classification in digital speech coders
DE0628947T DE628947T1 (de) 1993-06-10 1994-06-09 Method and device for digital speech coding with speech signal pitch estimation and classification.
ES94108874T ES2065871T3 (es) 1993-06-10 1994-06-09 Method and device for evaluating the pitch period and classifying the speech signal in digital speech coders.
JP15057194A JP3197155B2 (ja) 1993-06-10 1994-06-09 Method and apparatus for estimation and classification of speech signal pitch period in digital speech coders
DE69412913T DE69412913T2 (de) 1993-06-10 1994-06-09 Method and device for digital speech coding with speech signal pitch estimation and classification in digital speech coders
EP94108874A EP0628947B1 (en) 1993-06-10 1994-06-09 Method and device for speech signal pitch period estimation and classification in digital speech coders
AT94108874T ATE170656T1 (de) 1993-06-10 1994-06-09 Method and device for digital speech coding with speech signal pitch estimation and classification in digital speech coders
FI942761A FI111486B (fi) 1993-06-10 1994-06-10 Method and device for speech signal pitch period estimation and classification in digital speech coders
GR950300013T GR950300013T1 (en) 1993-06-10 1995-03-31 Method and device for speech signal pitch period estimation and classification in digital speech coders.

Applications Claiming Priority (1)

Application Number Publication Number Priority Date Filing Date Title
ITTO930419A IT1270438B (it) 1993-06-10 1993-06-10 Method and device for speech signal pitch period estimation and classification in digital speech coders

Publications (3)

Publication Number Publication Date
ITTO930419A0 (it) 1993-06-10
ITTO930419A1 (it) 1994-12-10
IT1270438B (it) 1997-05-05

Family

ID=11411549

Family Applications (1)

Application Number Publication Number Priority Date Filing Date Title
ITTO930419A IT1270438B (it) 1993-06-10 1993-06-10 Method and device for speech signal pitch period estimation and classification in digital speech coders

Country Status (10)

Country Link
US (1) US5548680A (it)
EP (1) EP0628947B1 (it)
JP (1) JP3197155B2 (it)
AT (1) ATE170656T1 (it)
CA (1) CA2124643C (it)
DE (2) DE628947T1 (it)
ES (1) ES2065871T3 (it)
FI (1) FI111486B (it)
GR (1) GR950300013T1 (it)
IT (1) IT1270438B (it)

Families Citing this family (53)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2729246A1 (fr) * 1995-01-06 1996-07-12 Matra Communication Analysis-by-synthesis speech coding method
KR970017456A (ko) * 1995-09-30 1997-04-30 김광호 Method and apparatus for discriminating silence and unvoiced sound in a speech signal
US5659622A (en) * 1995-11-13 1997-08-19 Motorola, Inc. Method and apparatus for suppressing noise in a communication system
FI114248B (fi) * 1997-03-14 2004-09-15 Nokia Corp Method and apparatus for audio coding and audio decoding
FI971679A (fi) * 1997-04-18 1998-10-19 Nokia Telecommunications Oy Speech detection in a telecommunication system
FI113903B (fi) * 1997-05-07 2004-06-30 Nokia Corp Speech coding
US5970441A (en) * 1997-08-25 1999-10-19 Telefonaktiebolaget Lm Ericsson Detection of periodicity information from an audio signal
US5999897A (en) * 1997-11-14 1999-12-07 Comsat Corporation Method and apparatus for pitch estimation using perception based analysis by synthesis
US6023674A (en) * 1998-01-23 2000-02-08 Telefonaktiebolaget L M Ericsson Non-parametric voice activity detection
EP0993674B1 (en) * 1998-05-11 2006-08-16 Philips Electronics N.V. Pitch detection
US6415252B1 (en) * 1998-05-28 2002-07-02 Motorola, Inc. Method and apparatus for coding and decoding speech
US6507814B1 (en) * 1998-08-24 2003-01-14 Conexant Systems, Inc. Pitch determination using speech classification and prior pitch estimation
US7072832B1 (en) * 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
JP3180786B2 (ja) * 1998-11-27 2001-06-25 日本電気株式会社 Speech coding method and speech coding apparatus
US6691084B2 (en) * 1998-12-21 2004-02-10 Qualcomm Incorporated Multiple mode variable rate speech coding
FI116992B (fi) 1999-07-05 2006-04-28 Nokia Corp Methods, system and devices for enhancing audio signal coding and transmission
US6959274B1 (en) * 1999-09-22 2005-10-25 Mindspeed Technologies, Inc. Fixed rate speech compression system and method
US6782360B1 (en) * 1999-09-22 2004-08-24 Mindspeed Technologies, Inc. Gain quantization for a CELP speech coder
KR100388488B1 (ko) * 2000-12-27 2003-06-25 한국전자통신연구원 Fast pitch search method in voiced intervals
US6876965B2 (en) 2001-02-28 2005-04-05 Telefonaktiebolaget Lm Ericsson (Publ) Reduced complexity voice activity detector
FR2825505B1 (fr) * 2001-06-01 2003-09-05 France Telecom Method for extracting the fundamental frequency of a sound signal by means of a device implementing an autocorrelation algorithm
US7177304B1 (en) * 2002-01-03 2007-02-13 Cisco Technology, Inc. Devices, softwares and methods for prioritizing between voice data packets for discard decision purposes
USH2172H1 (en) * 2002-07-02 2006-09-05 The United States Of America As Represented By The Secretary Of The Air Force Pitch-synchronous speech processing
AU2003248029B2 (en) * 2002-09-17 2005-12-08 Canon Kabushiki Kaisha Audio Object Classification Based on Statistically Derived Semantic Information
DE102005002195A1 (de) * 2005-01-17 2006-07-27 Siemens Ag Method and arrangement for regenerating an optical data signal
US7707034B2 (en) * 2005-05-31 2010-04-27 Microsoft Corporation Audio codec post-filter
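KR100717396B1 (ko) 2006-02-09 2007-05-11 삼성전자주식회사 Method and apparatus for determining voiced sound for speech recognition using local spectral information
JP4827661B2 (ja) * 2006-08-30 2011-11-30 富士通株式会社 Signal processing method and apparatus
WO2009078093A1 (ja) * 2007-12-18 2009-06-25 Fujitsu Limited Non-speech section detection method and non-speech section detection device
CN101599272B (zh) * 2008-12-30 2011-06-08 华为技术有限公司 Pitch search method and device
CN101604525B (zh) * 2008-12-31 2011-04-06 华为技术有限公司 Pitch gain obtaining method and device, encoder and decoder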
GB2466671B (en) 2009-01-06 2013-03-27 Skype Speech encoding
GB2466675B (en) * 2009-01-06 2013-03-06 Skype Speech coding
GB2466673B (en) 2009-01-06 2012-11-07 Skype Quantization
US9142220B2 (en) 2011-03-25 2015-09-22 The Intellisis Corporation Systems and methods for reconstructing an audio signal from transformed audio information
US8548803B2 (en) 2011-08-08 2013-10-01 The Intellisis Corporation System and method of processing a sound signal including transforming the sound signal into a frequency-chirp domain
US9183850B2 (en) 2011-08-08 2015-11-10 The Intellisis Corporation System and method for tracking sound pitch across an audio signal
US8620646B2 (en) 2011-08-08 2013-12-31 The Intellisis Corporation System and method for tracking sound pitch across an audio signal using harmonic envelope
US10423650B1 (en) * 2014-03-05 2019-09-24 Hrl Laboratories, Llc System and method for identifying predictive keywords based on generalized eigenvector ranks
US9842611B2 (en) 2015-02-06 2017-12-12 Knuedge Incorporated Estimating pitch using peak-to-peak distances
US9922668B2 (en) 2015-02-06 2018-03-20 Knuedge Incorporated Estimating fractional chirp rate with multiple frequency representations
US9870785B2 (en) 2015-02-06 2018-01-16 Knuedge Incorporated Determining features of harmonic signals
US10390589B2 (en) 2016-03-15 2019-08-27 Nike, Inc. Drive mechanism for automated footwear platform
FR3056813B1 (fr) * 2016-09-29 2019-11-08 Dolphin Integration Audio circuit and activity detection method
EP3306609A1 (en) 2016-10-04 2018-04-11 Fraunhofer Gesellschaft zur Förderung der Angewand Apparatus and method for determining a pitch information
EP3483882A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Controlling bandwidth in encoders and/or decoders
EP3483886A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selecting pitch lag
EP3483880A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Temporal noise shaping
EP3483884A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Signal filtering
EP3483878A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder supporting a set of different loss concealment tools
WO2019091576A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits
EP3483879A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation
EP3483883A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio coding and decoding with selective postfiltering

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5359696A (en) * 1988-06-28 1994-10-25 Motorola Inc. Digital speech coder having improved sub-sample resolution long-term predictor
EP0443548B1 (en) * 1990-02-22 2003-07-23 Nec Corporation Speech coder
CA2051304C (en) * 1990-09-18 1996-03-05 Tomohiko Taniguchi Speech coding and decoding system
JPH04264600A (ja) * 1991-02-20 1992-09-21 Fujitsu Ltd Speech coding apparatus and speech decoding apparatus
US5233660A (en) * 1991-09-10 1993-08-03 At&T Bell Laboratories Method and apparatus for low-delay celp speech coding and decoding

Also Published As

Publication number Publication date
ES2065871T1 (es) 1995-03-01
CA2124643A1 (en) 1994-12-11
FI942761A (fi) 1994-12-11
ITTO930419A1 (it) 1994-12-10
ITTO930419A0 (it) 1993-06-10
CA2124643C (en) 1998-07-21
US5548680A (en) 1996-08-20
FI111486B (fi) 2003-07-31
JP3197155B2 (ja) 2001-08-13
JPH0728499A (ja) 1995-01-31
ATE170656T1 (de) 1998-09-15
EP0628947B1 (en) 1998-09-02
DE628947T1 (de) 1995-08-03
GR950300013T1 (en) 1995-03-31
EP0628947A1 (en) 1994-12-14
DE69412913T2 (de) 1999-02-18
FI942761A0 (fi) 1994-06-10
DE69412913D1 (de) 1998-10-08
ES2065871T3 (es) 1998-10-16

Similar Documents

Publication Publication Date Title
IT1270438B (it) Method and device for speech signal pitch period estimation and classification in digital speech coders
US4672669A (en) Voice activity detection process and means for implementing said process
KR960029798A (ko) Method and apparatus for measuring signal characteristics. Method for measuring speech signal quality and method for measuring signal properties
SE9800776D0 (sv) Audio coding method and apparatus
US4791670A (en) Method of and device for speech signal coding and decoding by vector quantization techniques
TW326070B (en) The estimation method of the impulse gain for coding vocoder
Wu et al. Fully vector-quantized neural network-based code-excited nonlinear predictive speech coding
MXPA00001875A (es) Speech recognition system and method.
ATE172317T1 (de) Speech conversion method
NZ266908A (en) Discriminating between stationary and non-stationary signals in mobile radio
DK0819303T3 (da) Predictive split-matrix quantization of spectral parameters for efficient coding of speech
KR20030031936A (ko) Single-speech multi-voice synthesizer using a pitch modification method
Mumolo et al. Adaptive predictive coding of speech by means of Volterra predictors
Malathi et al. Enhancement of electrolaryngeal speech using Frequency Auditory Masking and GMM based voice conversion
KR20000063265A (ko) Snoring sound identification method based on sound identification using a neural network
Itoh et al. A new artificial speech signal for objective quality evaluation of speech coding systems
KR0175250B1 (ko) Tone detection circuit and method for a vocoder
Cuperman Speech coding
Ramamoorthy Voice/unvoice detection based on a composite-Gaussian source model of speech
Wilgus et al. Data rate reduction of gain and pitch parameters in an LPC vocoder
IT1249940B (it) Improvements to speech coders based on analysis-by-synthesis techniques.
DE69908396D1 (de) Speech processing
Prasad et al. A 2.4 Kilobits Per Second Linear Prediction Vocoder
Giacobello Study and Evaluation of Innovative Algorithms for Voice Quality Enhancement in Speech Signals Encoded Using ACELP (Algebraic Code Excited Linear Prediction)
Hall Objective quality evaluation of parallel-formant synthesised speech

Legal Events

Date Code Title Description
0001 Granted