ATE206841T1 - METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS - Google Patents

METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS

Info

Publication number
ATE206841T1
ATE206841T1 AT96104213T AT96104213T ATE206841T1 AT E206841 T1 ATE206841 T1 AT E206841T1 AT 96104213 T AT96104213 T AT 96104213T AT 96104213 T AT96104213 T AT 96104213T AT E206841 T1 ATE206841 T1 AT E206841T1
Authority
AT
Austria
Prior art keywords
speech
frames
divided
segments
wavelet transformation
Prior art date
Application number
AT96104213T
Other languages
German (de)
Inventor
Joachim Dipl-Ing Stegmann
Original Assignee
Deutsche Telekom Ag
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from DE19538852A external-priority patent/DE19538852A1/en
Application filed by Deutsche Telekom Ag filed Critical Deutsche Telekom Ag
Application granted granted Critical
Publication of ATE206841T1 publication Critical patent/ATE206841T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • G10L2025/783Detection of presence or absence of voice signals based on threshold decision
    • G10L2025/786Adaptive threshold
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/93Discriminating between voiced and unvoiced parts of speech signals

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)
  • Time-Division Multiplex Systems (AREA)
  • Machine Translation (AREA)
  • Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)

Abstract

The method classifies speech, in particular speech signals, for the adaptive control of a speech encoding process. This encoding reduces the bit rate while keeping the speech quality the same, or increases the quality while keeping the bit rate the same. After segmenting the speech signal for each frame, a wavelet transformation is calculated. Using adaptive thresholds, a set of parameters is derived which control a state model. The speech frames are divided into sub-frames. Each sub-frame is divided into one of several typical classes for the speech encoding. The speech signal may be divided into segments of constant length. To reduce the edge effects with the wavelet transformation, either the segment at the boundaries is reflected or the wavelet transformation is calculated at smaller intervals. The frames are preferably shifted such that the segments overlap, or at the edges the segments are filled with previous or predicted sample values.
AT96104213T 1995-06-30 1996-03-16 METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS ATE206841T1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
DE19523598 1995-06-30
DE19538852A DE19538852A1 (en) 1995-06-30 1995-10-19 Method and arrangement for classifying speech signals

Publications (1)

Publication Number Publication Date
ATE206841T1 true ATE206841T1 (en) 2001-10-15

Family

ID=26016384

Family Applications (1)

Application Number Title Priority Date Filing Date
AT96104213T ATE206841T1 (en) 1995-06-30 1996-03-16 METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS

Country Status (4)

Country Link
EP (1) EP0751495B1 (en)
AT (1) ATE206841T1 (en)
ES (1) ES2165933T3 (en)
NO (1) NO309831B1 (en)

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
DE19716862A1 (en) * 1997-04-22 1998-10-29 Deutsche Telekom Ag Voice activity detection

Also Published As

Publication number Publication date
NO961636D0 (en) 1996-04-24
NO309831B1 (en) 2001-04-02
ES2165933T3 (en) 2002-04-01
NO961636L (en) 1997-01-02
EP0751495A2 (en) 1997-01-02
EP0751495A3 (en) 1998-04-15
EP0751495B1 (en) 2001-10-10

Similar Documents

Publication Publication Date Title
DE60117144D1 (en) LANGUAGE TRANSMISSION SYSTEM AND METHOD FOR TREATING LOST DATA FRAMES
DE69431622D1 (en) METHOD AND DEVICE FOR ENCODING DIGITAL SOUND ENCODED WITH MULTIPLE BITS BY SUBTRACTING AN ADAPTIVE SHAKE SIGNAL, INSERTING HIDDEN CHANNEL BITS AND FILTERING, AND ENCODING DEVICE FOR USE IN THIS PROCESS
DE60219351D1 (en) SIGNAL MODIFICATION METHOD FOR EFFICIENT CODING OF LANGUAGE SIGNALS
CA2343661A1 (en) Method and apparatus for improving the intelligibility of digitally compressed speech
DE602004006206D1 (en) System and method for high quality extension and shortening of a digital audio signal
ATE302991T1 (en) METHOD FOR SIGNAL-CONTROLLED SWITCHING BETWEEN DIFFERENT AUDIO CODING SYSTEMS
ATE364220T1 (en) METHOD AND APPARATUS FOR CONCEALING FRAME LOSING OF PREDICTION CODED LANGUAGE USING WAVEFORM EXTRAPOLATION
DE69614782T2 (en) Method and device for reproducing voice signals and method for its transmission
ATE202232T1 (en) METHOD FOR VOICE CODING
CA2188369A1 (en) Method and an arrangement for classifying speech signals
CA2102099A1 (en) Variable rate vocoder
ATE15415T1 (en) METHOD AND DEVICE FOR REDUNDANCY-REDUCING DIGITAL SPEECH PROCESSING.
NO982393D0 (en) Process for quality control of seismic data processing and method for processing vertical, seismic profile data
DE69423692T2 (en) Speech coding device and method using classification rules
ATE362634T1 (en) METHOD AND APPARATUS FOR DETERMINING A SYNTHETIC HIGHER BAND SIGNAL IN A VOICE ENCODER
AU4490296A (en) Speech coding method using synthesis analysis
DE59806874D1 (en) METHOD FOR CODING AND / OR DECODING VOICE SIGNALS USING A LONG-TERM PREDICTION AND A MULTI-PULSE EXCITATION SIGNAL
DE59809897D1 (en) Voice Activity Detection
MY111784A (en) Method and apparatus for encoding/decoding of background sounds
ATE206841T1 (en) METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS
DE69719024D1 (en) METHOD FOR DECODING DATA SIGNALS BY MEANS OF A FIXED LENGTH DECISION WINDOW
DE59410189D1 (en) Methods and devices for the detection and control of mass flows and related values
DE68901376D1 (en) METHOD AND DEVICE FOR THE AUTOMATIC CUTTING OF WINE GRAPES.
ATA207094A (en) METHOD FOR PRODUCING XANTHAN GUM BY FERMENTATION
ATE174546T1 (en) APPARATUS AND METHOD FOR PRODUCING PRECAST CONCRETE PARTS ATTACHED OVER CONCRETE BARS OR OTHER INDEPENDENT PARTS

Legal Events

Date Code Title Description
REN Ceased due to non-payment of the annual fee