CN112133318A - Digital voice coding device - Google Patents
Digital voice coding device Download PDFInfo
- Publication number
- CN112133318A CN112133318A CN202011018258.6A CN202011018258A CN112133318A CN 112133318 A CN112133318 A CN 112133318A CN 202011018258 A CN202011018258 A CN 202011018258A CN 112133318 A CN112133318 A CN 112133318A
- Authority
- CN
- China
- Prior art keywords
- module
- coding
- decoder
- speech
- band
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Links
- 230000015572 biosynthetic process Effects 0.000 claims abstract description 4
- 238000000695 excitation spectrum Methods 0.000 claims abstract description 4
- 238000009499 grossing Methods 0.000 claims abstract description 4
- 238000003786 synthesis reaction Methods 0.000 claims abstract description 4
- 230000006835 compression Effects 0.000 abstract description 4
- 238000007906 compression Methods 0.000 abstract description 4
- 230000006837 decompression Effects 0.000 abstract description 2
- 238000000034 method Methods 0.000 description 7
- 238000004891 communication Methods 0.000 description 6
- 230000003044 adaptive effect Effects 0.000 description 2
- 238000010586 diagram Methods 0.000 description 2
- 230000000694 effects Effects 0.000 description 2
- 238000010295 mobile communication Methods 0.000 description 2
- 101000622137 Homo sapiens P-selectin Proteins 0.000 description 1
- 102100023472 P-selectin Human genes 0.000 description 1
- 101000873420 Simian virus 40 SV40 early leader protein Proteins 0.000 description 1
- 230000005540 biological transmission Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/26—Pre-filtering or post-filtering
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
The invention discloses a digital voice coding device, which comprises a coder and a decoder, wherein the coder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band voiced and unvoiced decision module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module. The invention solves the problems that the residual signal of the very low-rate speech coding is difficult to compress, the speech quality after decompression is too low, the compression algorithm is too complex and the engineering is difficult to realize.
Description
Technical Field
The invention relates to a voice coding technology, in particular to a digital voice coding device.
Background
Speech coding is currently of considerable importance in communication systems, and to a large extent determines the received speech quality and the system capacity. Conventionally, speech coding techniques can be divided into three categories, waveform coding, parametric coding and hybrid coding.
Waveform coding is to directly convert a time domain signal into a digital code in an attempt to maintain a reconstructed speech waveform in the waveform shape of an original speech signal. The basic principle of waveform coding is to sample analog speech at a certain rate on the time axis, then quantize amplitude samples hierarchically, and represent them with codes. Decoding is the reverse process, and the received digital sequence is decoded and filtered to recover an analog signal. Pulse Code Modulation (PCM) and Delta Modulation (DM), as well as various modified Adaptive Delta Modulation (ADM), adaptive differential coding (ADPCM), etc., thereof, are all waveform coding techniques. However, for network communication with increasingly strained bandwidth resources, the coding method is obviously not suitable, because the communication rate of waveform coding is generally above the rate of 16 Kbit/s. The parameter coding is used for extracting and coding the characteristic parameters of the signal envelope, and at a decoding end, the original voice signal is reconstructed through the characteristic parameters and the residual error, so that the compression rate is high. Typical speech compression coding methods are LPC-10, MELP, SELP, etc. Parameter coding is widely used in mobile communication at present because the communication rate can reach 1.2-2.4 Kbit/s. Hybrid coding, i.e. a process of coding using two or more coding methods simultaneously. Since the hybrid coding combines the high quality of waveform coding and the low data rate of parametric coding, it achieves better effect in practical application. Typical hybrid coding schemes are MPLPC, KPELPC, CELP, etc.
In mobile communication systems, broadband resources are at a premium. Low bit rate speech coding provides one approach to this problem. On the premise that the encoder can transmit high-quality voice, if the bit rate is lower, more channels of high-quality voice can be transmitted in a certain broadband. Speech coding is source coding that converts an analog speech signal into a digital signal for transmission in a channel. The purpose of speech coding is to transmit speech of the highest possible quality while occupying as little communication capacity as possible while maintaining a certain algorithm complexity and communication delay.
Disclosure of Invention
The present invention has been made to solve the above-mentioned problems occurring in the prior art by providing a digital speech encoding apparatus.
The technical problem solved by the invention is realized by adopting the following technical scheme:
a digital voice coding device comprises a coder and a decoder, wherein the coder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band unvoiced and voiced sound judgment module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module.
The invention solves the problems that the residual signal of the very low-rate speech coding is difficult to compress, the speech quality after decompression is too low, the compression algorithm is too complex and the engineering is difficult to realize.
Drawings
FIG. 1 is a block diagram of an encoder structure of a digital speech encoder according to the present invention.
FIG. 2 is a block diagram of a decoder structure of a digital speech coder according to the present invention.
Detailed Description
In order to make the technical means, the creation characteristics, the achievement purposes and the effects of the invention easy to understand, the invention is further described with the specific embodiments.
As shown in fig. 1 and 2, a digital speech encoding apparatus includes: an encoder and a decoder, wherein the encoder receives an original voice, encodes the original voice, and transmits; the decoder receives the data sent by the encoder, decodes the data and synthesizes voice; the encoder comprises a pulse code modulation module 02, a linear programming analysis module 04, a margin frame energy module 05, an excitation spectrum analysis module 06, a fundamental tone estimation and smoothing algorithm module 07 and a molecular band unvoiced and voiced sound judgment module 08; the decoder includes: a sine sum generator 09, a noise generator 10, a band pass filter bank 11, a synthesis filter module 12, a post filter module 13.
The foregoing shows and describes the general principles and broad features of the present invention and advantages thereof. It will be understood by those skilled in the art that the present invention is not limited to the embodiments described above, which are described in the specification and illustrated only to illustrate the principle of the present invention, but that various changes and modifications may be made therein without departing from the spirit and scope of the present invention, which fall within the scope of the invention as claimed. The scope of the invention is defined by the appended claims and equivalents thereof.
Claims (1)
1. A digital speech encoding apparatus comprising an encoder and a decoder, characterized in that: the encoder comprises a pulse code modulation module, a linear programming analysis module, a margin frame energy module, an excitation spectrum analysis module, a fundamental tone estimation and smoothing algorithm module and a molecular band unvoiced and voiced sound judgment module; the decoder includes: the device comprises a sine sum generator, a noise generator, a band-pass filter bank, a synthesis filter module and a post-filter module.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011018258.6A CN112133318A (en) | 2020-09-24 | 2020-09-24 | Digital voice coding device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011018258.6A CN112133318A (en) | 2020-09-24 | 2020-09-24 | Digital voice coding device |
Publications (1)
Publication Number | Publication Date |
---|---|
CN112133318A true CN112133318A (en) | 2020-12-25 |
Family
ID=73840108
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011018258.6A Withdrawn CN112133318A (en) | 2020-09-24 | 2020-09-24 | Digital voice coding device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112133318A (en) |
-
2020
- 2020-09-24 CN CN202011018258.6A patent/CN112133318A/en not_active Withdrawn
Similar Documents
Publication | Publication Date | Title |
---|---|---|
KR100837451B1 (en) | Method and apparatus for improved quality voice transcoding | |
KR101105353B1 (en) | Method and device for efficient in-band dim-and-burst signaling and half-rate max operation in variable bit-rate wideband speech coding for cdma wireless systems | |
CN101577605B (en) | Speech LPC hiding and extraction algorithm based on filter similarity | |
EP1785984A1 (en) | Audio encoding apparatus, audio decoding apparatus, communication apparatus and audio encoding method | |
JP4489960B2 (en) | Low bit rate coding of unvoiced segments of speech. | |
CN1302459C (en) | A low-bit-rate coding method and apparatus for unvoiced speed | |
US20110142126A1 (en) | Low bit rate codec | |
CN1432176A (en) | Method and appts. for predictively quantizing voice speech | |
KR19990037291A (en) | Speech synthesis method and apparatus and speech band extension method and apparatus | |
CN108231083A (en) | A kind of speech coder code efficiency based on SILK improves method | |
JPH0850500A (en) | Voice encoder and voice decoder as well as voice coding method and voice encoding method | |
EP1597721B1 (en) | 600 bps mixed excitation linear prediction transcoding | |
CA2293165A1 (en) | Method for transmitting data in wireless speech channels | |
US20020095284A1 (en) | System of dynamic pulse position tracks for pulse-like excitation in speech coding | |
US20040102966A1 (en) | Apparatus and method for transcoding between CELP type codecs having different bandwidths | |
CN106098072B (en) | A kind of 600bps very low speed rate encoding and decoding speech method based on mixed excitation linear prediction | |
Moriya et al. | Speech coder using phase equalization and vector quantization | |
CN110415713B (en) | Encoding method and device of DMR system, storage medium and digital interphone | |
CN112133318A (en) | Digital voice coding device | |
CN112102845A (en) | Digital voice coding method | |
CN1873777B (en) | Mobile communication terminal with speech decode function and action method of the same | |
Bhatt | Implementation and overall performance evaluation of CELP based GSM AMR NB coder over ABE | |
CN101533639A (en) | Voice signal processing method and device | |
Milkovic et al. | Speech coding methods in mobile radiocommunication systems | |
CN111294147B (en) | Encoding method and device of DMR system, storage medium and digital interphone |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WW01 | Invention patent application withdrawn after publication |
Application publication date: 20201225 |
|
WW01 | Invention patent application withdrawn after publication |