AU9404098A - Scalable and embedded codec for speech and audio signals - Google Patents

Scalable and embedded codec for speech and audio signals

Info

Publication number
AU9404098A
AU9404098A AU94040/98A AU9404098A AU9404098A AU 9404098 A AU9404098 A AU 9404098A AU 94040/98 A AU94040/98 A AU 94040/98A AU 9404098 A AU9404098 A AU 9404098A AU 9404098 A AU9404098 A AU 9404098A
Authority
AU
Australia
Prior art keywords
scalable
speech
audio signals
embedded codec
codec
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
AU94040/98A
Inventor
Joseph G Aguilar
David A. Campana
Raymond Chen
Robert B. Dunn
Robert J. Mcaulay
Xiaoquin Sun
Wei Wang
Craig Watkins
Robert W. Zopf
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Voxware Inc
Original Assignee
Voxware Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Voxware Inc filed Critical Voxware Inc
Publication of AU9404098A publication Critical patent/AU9404098A/en
Abandoned legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/16Vocoder architecture
    • G10L19/18Vocoders using multiple modes
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0316Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude
    • G10L21/0364Speech enhancement, e.g. noise reduction or echo cancellation by changing the amplitude for improving intelligibility
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0212Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using orthogonal transformation
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • G10L19/09Long term prediction, i.e. removing periodical redundancies, e.g. by using adaptive codebook or pitch predictor
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L2019/0001Codebooks
    • G10L2019/0004Design or structure of the codebook
    • G10L2019/0005Multi-stage vector quantisation
AU94040/98A 1997-09-23 1998-09-23 Scalable and embedded codec for speech and audio signals Abandoned AU9404098A (en)

Applications Claiming Priority (5)

Application Number Priority Date Filing Date Title
US5961097P 1997-09-23 1997-09-23
US60059610 1997-09-23
US7056798P 1998-01-06 1998-01-06
US60070567 1998-01-06
PCT/US1998/019858 WO1999016050A1 (en) 1997-09-23 1998-09-23 Scalable and embedded codec for speech and audio signals

Publications (1)

Publication Number Publication Date
AU9404098A true AU9404098A (en) 1999-04-12

Family

ID=26738975

Family Applications (1)

Application Number Title Priority Date Filing Date
AU94040/98A Abandoned AU9404098A (en) 1997-09-23 1998-09-23 Scalable and embedded codec for speech and audio signals

Country Status (2)

Country Link
AU (1) AU9404098A (en)
WO (1) WO1999016050A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11810545B2 (en) 2011-05-20 2023-11-07 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US11837253B2 (en) 2016-07-27 2023-12-05 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Families Citing this family (21)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7072832B1 (en) 1998-08-24 2006-07-04 Mindspeed Technologies, Inc. System for speech encoding having an adaptive encoding arrangement
EP1158494B1 (en) * 2000-05-26 2002-05-29 Lucent Technologies Inc. Method and apparatus for performing audio coding and decoding by interleaving smoothed critical band evelopes at higher frequencies
KR20020084201A (en) * 2001-01-16 2002-11-04 코닌클리케 필립스 일렉트로닉스 엔.브이. Parametric encoder and method for encoding an audio or speech signal
WO2006075269A1 (en) 2005-01-11 2006-07-20 Koninklijke Philips Electronics N.V. Scalable encoding/decoding of audio signals
KR101390051B1 (en) * 2007-10-12 2014-04-29 파나소닉 주식회사 Vector quantizer, vector inverse quantizer, and the methods
US8050932B2 (en) * 2008-02-20 2011-11-01 Research In Motion Limited Apparatus, and associated method, for selecting speech COder operational rates
WO2010092827A1 (en) 2009-02-13 2010-08-19 パナソニック株式会社 Vector quantization device, vector inverse-quantization device, and methods of same
EP2702764B1 (en) 2011-04-25 2016-08-31 Dolby Laboratories Licensing Corporation Non-linear vdr residual quantizer
CN104054317B (en) 2012-01-20 2017-03-29 索诺瓦公司 Wireless voice Transmission system and method
US10531099B2 (en) * 2016-09-30 2020-01-07 The Mitre Corporation Systems and methods for distributed quantization of multimodal images
CN108074588B (en) * 2016-11-15 2020-12-01 北京唱吧科技股份有限公司 Pitch calculation method and pitch calculation device
EP3483884A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Signal filtering
EP3483886A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Selecting pitch lag
EP3483878A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio decoder supporting a set of different loss concealment tools
WO2019091576A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Audio encoders, audio decoders, methods and computer programs adapting an encoding and decoding of least significant bits
EP3483879A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Analysis/synthesis windowing function for modulated lapped transformation
WO2019091573A1 (en) 2017-11-10 2019-05-16 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Apparatus and method for encoding and decoding an audio signal using downsampling or interpolation of scale parameters
EP3483882A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Controlling bandwidth in encoders and/or decoders
EP3483880A1 (en) 2017-11-10 2019-05-15 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Temporal noise shaping
CN110503661A (en) * 2018-05-16 2019-11-26 武汉智云星达信息技术有限公司 A kind of target image method for tracing based on deeply study and space-time context
CN111613241B (en) * 2020-05-22 2023-03-24 厦门理工学院 High-precision high-stability stringed instrument fundamental wave frequency detection method

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4622680A (en) * 1984-10-17 1986-11-11 General Electric Company Hybrid subband coder/decoder method and apparatus
US4924480A (en) * 1988-03-11 1990-05-08 American Telephone And Telegraph Company Codecs with suppression of multiple encoding/decodings across a connection
EP0464839B1 (en) * 1990-07-05 2000-09-27 Fujitsu Limited Digitally multiplexed transmission system
ATE138238T1 (en) * 1991-01-08 1996-06-15 Dolby Lab Licensing Corp ENCODER/DECODER FOR MULTI-DIMENSIONAL SOUND FIELDS
US5253326A (en) * 1991-11-26 1993-10-12 Codex Corporation Prioritization method and device for speech frames coded by a linear predictive coder
JP3475446B2 (en) * 1993-07-27 2003-12-08 ソニー株式会社 Encoding method

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US11810545B2 (en) 2011-05-20 2023-11-07 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US11817078B2 (en) 2011-05-20 2023-11-14 Vocollect, Inc. Systems and methods for dynamically improving user intelligibility of synthesized speech in a work environment
US11837253B2 (en) 2016-07-27 2023-12-05 Vocollect, Inc. Distinguishing user speech from background speech in speech-dense environments

Also Published As

Publication number Publication date
WO1999016050A1 (en) 1999-04-01

Similar Documents

Publication Publication Date Title
AU9404098A (en) Scalable and embedded codec for speech and audio signals
AU3549299A (en) Coding an audio signal
HK1122639A1 (en) Voice signal encoder and voice signal decoder
AU1978797A (en) Simultaneous transmission of ancillary and audio signals by means of perceptual coding
AU4630597A (en) Multimedia call centre
AU9315998A (en) Improved directional microphone audio system
AU2743597A (en) Audio enhancement system for use in a surround sound environment
AU6270099A (en) Audio encoding apparatus and methods
DE69836785D1 (en) Audio signal compression, speech signal compression and speech recognition
AU6231196A (en) Speech synthesis
AU2427099A (en) Speech coding
AU5093796A (en) Human transcription factors and binding assays
IL136546A0 (en) Audio codec with agc controlled by a vocoder
AU9164998A (en) Speech coding
AU3739195A (en) Global sound microphone system
AU2916901A (en) Digital audio decoder and related methods
AU2756299A (en) Speech coding including soft adaptability feature
EP1087378A4 (en) Voice/music signal encoder and decoder
AU5547100A (en) Speech and voice signal processing
AU3702497A (en) Speech coding
AU6254798A (en) Speech and sound synthesizing
AU3483800A (en) Mouthpiece and speaker assemblies for underwater speech
AU4325796A (en) Embedded higher order microphone
AU5314398A (en) Postfiltering audio signals, especially speech signals
AU9498298A (en) Production of formaldehdye from CH4 and H2S

Legal Events

Date Code Title Description
MK6 Application lapsed section 142(2)(f)/reg. 8.3(3) - pct applic. not entering national phase