CN112133318A - Digital voice coding device - Google Patents

Digital voice coding device Download PDF

Info

Publication number
CN112133318A
CN112133318A CN202011018258.6A CN202011018258A CN112133318A CN 112133318 A CN112133318 A CN 112133318A CN 202011018258 A CN202011018258 A CN 202011018258A CN 112133318 A CN112133318 A CN 112133318A
Authority
CN
China
Prior art keywords
module
coding
decoder
speech
band
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
CN202011018258.6A
Other languages
Chinese (zh)
Inventor
曾伍龙
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Individual filed Critical Individual
Priority to CN202011018258.6A priority Critical patent/CN112133318A/en
Publication of CN112133318A publication Critical patent/CN112133318A/en
Withdrawn legal-status Critical Current

Links

Images

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/02Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
    • G10L19/0204Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/08Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
    • G10L19/04Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
    • G10L19/26Pre-filtering or post-filtering

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Spectroscopy & Molecular Physics (AREA)
  • Compression, Expansion, Code Conversion, And Decoders (AREA)

Abstract

The invention discloses a digital voice coding device, which comprises a coder and a decoder, wherein the coder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band voiced and unvoiced decision module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module. The invention solves the problems that the residual signal of the very low-rate speech coding is difficult to compress, the speech quality after decompression is too low, the compression algorithm is too complex and the engineering is difficult to realize.

Description

Digital voice coding device
Technical Field
The invention relates to a voice coding technology, in particular to a digital voice coding device.
Background
Speech coding is currently of considerable importance in communication systems, and to a large extent determines the received speech quality and the system capacity. Conventionally, speech coding techniques can be divided into three categories, waveform coding, parametric coding and hybrid coding.
Waveform coding is to directly convert a time domain signal into a digital code in an attempt to maintain a reconstructed speech waveform in the waveform shape of an original speech signal. The basic principle of waveform coding is to sample analog speech at a certain rate on the time axis, then quantize amplitude samples hierarchically, and represent them with codes. Decoding is the reverse process, and the received digital sequence is decoded and filtered to recover an analog signal. Pulse Code Modulation (PCM) and Delta Modulation (DM), as well as various modified Adaptive Delta Modulation (ADM), adaptive differential coding (ADPCM), etc., thereof, are all waveform coding techniques. However, for network communication with increasingly strained bandwidth resources, the coding method is obviously not suitable, because the communication rate of waveform coding is generally above the rate of 16 Kbit/s. The parameter coding is used for extracting and coding the characteristic parameters of the signal envelope, and at a decoding end, the original voice signal is reconstructed through the characteristic parameters and the residual error, so that the compression rate is high. Typical speech compression coding methods are LPC-10, MELP, SELP, etc. Parameter coding is widely used in mobile communication at present because the communication rate can reach 1.2-2.4 Kbit/s. Hybrid coding, i.e. a process of coding using two or more coding methods simultaneously. Since the hybrid coding combines the high quality of waveform coding and the low data rate of parametric coding, it achieves better effect in practical application. Typical hybrid coding schemes are MPLPC, KPELPC, CELP, etc.
In mobile communication systems, broadband resources are at a premium. Low bit rate speech coding provides one approach to this problem. On the premise that the encoder can transmit high-quality voice, if the bit rate is lower, more channels of high-quality voice can be transmitted in a certain broadband. Speech coding is source coding that converts an analog speech signal into a digital signal for transmission in a channel. The purpose of speech coding is to transmit speech of the highest possible quality while occupying as little communication capacity as possible while maintaining a certain algorithm complexity and communication delay.
Disclosure of Invention
The present invention has been made to solve the above-mentioned problems occurring in the prior art by providing a digital speech encoding apparatus.
The technical problem solved by the invention is realized by adopting the following technical scheme:
a digital voice coding device comprises a coder and a decoder, wherein the coder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band unvoiced and voiced sound judgment module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module.
The invention solves the problems that the residual signal of the very low-rate speech coding is difficult to compress, the speech quality after decompression is too low, the compression algorithm is too complex and the engineering is difficult to realize.
Drawings
FIG. 1 is a block diagram of an encoder structure of a digital speech encoder according to the present invention.
FIG. 2 is a block diagram of a decoder structure of a digital speech coder according to the present invention.
Detailed Description
In order to make the technical means, the creation characteristics, the achievement purposes and the effects of the invention easy to understand, the invention is further described with the specific embodiments.
As shown in fig. 1 and 2, a digital speech encoding apparatus includes: an encoder and a decoder, wherein the encoder receives an original voice, encodes the original voice, and transmits; the decoder receives the data sent by the encoder, decodes the data and synthesizes voice; the encoder comprises a pulse code modulation module 02, a linear programming analysis module 04, a margin frame energy module 05, an excitation spectrum analysis module 06, a fundamental tone estimation and smoothing algorithm module 07 and a molecular band unvoiced and voiced sound judgment module 08; the decoder includes: a sine sum generator 09, a noise generator 10, a band pass filter bank 11, a synthesis filter module 12, a post filter module 13.
The foregoing shows and describes the general principles and broad features of the present invention and advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.

Claims (1)

1. A digital speech encoding apparatus comprising an encoder and a decoder, characterized in that: the encoder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band unvoiced and voiced sound judgment module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module.
CN202011018258.6A 2020-09-24 2020-09-24 Digital voice coding device Withdrawn CN112133318A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011018258.6A CN112133318A (en) 2020-09-24 2020-09-24 Digital voice coding device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011018258.6A CN112133318A (en) 2020-09-24 2020-09-24 Digital voice coding device

Publications (1)

Publication Number Publication Date
CN112133318A true CN112133318A (en) 2020-12-25

Family

ID=73840108

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011018258.6A Withdrawn CN112133318A (en) 2020-09-24 2020-09-24 Digital voice coding device

Country Status (1)

Country Link
CN (1) CN112133318A (en)

Similar Documents

Publication Publication Date Title
KR100837451B1 (en) Method and apparatus for improved quality voice transcoding
KR101105353B1 (en) Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems
CN101577605B (en) Speech LPC hiding and extraction algorithm based on filter similarity
EP1785984A1 (en) Audio encoding apparatus, audio decoding apparatus, communication apparatus and audio encoding method
JP4489960B2 (en) Low bit rate coding of unvoiced segments of speech.
CN1302459C (en) A low-bit-rate coding method and apparatus for unvoiced speed
US20110142126A1 (en) Low bit rate codec
CN1432176A (en) Method and appts. for predictively quantizing voice speech
KR19990037291A (en) Speech synthesis method and apparatus and speech band extension method and apparatus
CN108231083A (en) A kind of speech coder code efficiency based on SILK improves method
JPH0850500A (en) Voice encoder and voice decoder as well as voice coding method and voice encoding method
EP1597721B1 (en) 600 bps mixed excitation linear prediction transcoding
CA2293165A1 (en) Method for transmitting data in wireless speech channels
US20020095284A1 (en) System of dynamic pulse position tracks for pulse-like excitation in speech coding
US20040102966A1 (en) Apparatus and method for transcoding between CELP type codecs having different bandwidths
CN106098072B (en) A kind of 600bps very low speed rate encoding and decoding speech method based on mixed excitation linear prediction
Moriya et al. Speech coder using phase equalization and vector quantization
CN110415713B (en) Encoding method and device of DMR system, storage medium and digital interphone
CN112133318A (en) Digital voice coding device
CN112102845A (en) Digital voice coding method
CN1873777B (en) Mobile communication terminal with speech decode function and action method of the same
Bhatt Implementation and overall performance evaluation of CELP based GSM AMR NB coder over ABE
CN101533639A (en) Voice signal processing method and device
Milkovic et al. Speech coding methods in mobile radiocommunication systems
CN111294147B (en) Encoding method and device of DMR system, storage medium and digital interphone

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WW01 Invention patent application withdrawn after publication

Application publication date: 20201225

WW01 Invention patent application withdrawn after publication