CN1756128A - Method for improving communication terminal intercommunicating voice quality and communication system - Google Patents

Method for improving communication terminal intercommunicating voice quality and communication system Download PDF

Info

Publication number
CN1756128A
CN1756128A CNA2004100854037A CN200410085403A CN1756128A CN 1756128 A CN1756128 A CN 1756128A CN A2004100854037 A CNA2004100854037 A CN A2004100854037A CN 200410085403 A CN200410085403 A CN 200410085403A CN 1756128 A CN1756128 A CN 1756128A
Authority
CN
China
Prior art keywords
data
core net
access network
speech data
terminal
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CNA2004100854037A
Other languages
Chinese (zh)
Other versions
CN100385976C (en
Inventor
丁以钦
范永顺
温斌
梁敏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Xinwei Telecom Technology Inc
Original Assignee
Beijing Xinwei Telecom Technology Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Xinwei Telecom Technology Inc filed Critical Beijing Xinwei Telecom Technology Inc
Priority to CNB2004100854037A priority Critical patent/CN100385976C/en
Publication of CN1756128A publication Critical patent/CN1756128A/en
Application granted granted Critical
Publication of CN100385976C publication Critical patent/CN100385976C/en
Anticipated expiration legal-status Critical
Expired - Fee Related legal-status Critical Current

Links

Images

Landscapes

  • Data Exchanges In Wide-Area Networks (AREA)
  • Telephonic Communication Services (AREA)

Abstract

The invention provides a method for improving the interconnection audio quality of communication terminals and a relative communication system. In the initial stage of connecting the terminal, processing the audio coding/decoding on the terminals and accessed network, and the accessed network outputs the decoded PCM audio data to the core network; the accessed network adds special mark on the PCM audio data; at the same time, the accessed network detects the PCM audio data from said core network; while said special mark is detected or detects the same mark of frame head and head end of the data in one continuous frame from said core network with the distance between the frame head and frame end is not less than the byte number of decoded frame data, the accessed network avoids the audio coding/decoding. Therefore, the terminal can realize the interconnection of coding/decoding only between terminals. The invention can effectively improve the audio quality of accessed network to reach better communication effect.

Description

Improve the method and the communication system of the voice quality of communication terminal intercommunicating
Technical field
The present invention relates to a kind of method and system that improve the voice quality of communication terminal intercommunicating, relate in particular to a kind of method and system that improve the voice quality of the terminal mutual that adopts same encoding and decoding speech standard.
Background technology
Whole telecommunications network can be regarded as by two parts and form, and local telephone network end office (EO) is above to be core net, and local telephone network end office (EO) or remote end module are partly to be called Access Network down to user terminal.The composition of whole telecommunications network as shown in Figure 1.
For making each terminal in Access Network, take less transmission bandwidth, reach the purpose that improves the Access Network capacity, the voice of user terminal and Access Network often need to carry out encoding and decoding.The voice of core net are PCM (impulse waveform coding) voice of 64kbps (kilobits per second).When the terminal of two Access Networks realizes intercommunication, then need through twice encoding and decoding speech.Theoretically, speech coding all is to come voice are carried out compressed encoding according to a certain voice Mathematical Modeling, and these models can only be a kind of approximate of actual conditions.If with 64kbps PCM waveform coding is MOS (Mean Opinion Score) full marks, which kind of encoding and decoding speech standard no matter then, all be lower than full marks, just all voice quality has been caused decline, thereby, when reducing by two communication terminal intercommunicatings as far as possible the encoding and decoding speech number of times of process should be an important method that improves speech quality.
The method of traditional raising Access Network voice quality mainly is the error rate of selecting the most measured encoding and decoding standard of voice quality for use and reducing to transmit as far as possible, still, and as above-mentioned, even good again encoding and decoding standard is also good not as the voice quality of not doing encoding and decoding.
Summary of the invention
An object of the present invention is to provide a kind of method that improves the voice quality of communication terminal intercommunicating, described communication terminal is for adopting the terminal of same encoding and decoding speech standard.Can improve the voice quality of described terminal mutual by the number of times that reduces encoding and decoding speech.
Method of the present invention may further comprise the steps:
(1) starting stage behind terminal mutual, terminal and Access Network all carry out encoding and decoding speech, and wherein, Access Network is decoded the back to core net input PCM speech data to the speech data that comes self terminal, after the core net transmission, pass to Access Network, pass to another terminal behind the Access Network coding then;
(2) Access Network when importing described PCM speech data to core net in these data the lowest order of N continuous byte be revised as synchronous head sign and encoding and decoding standard sign (being called for short the N bit flag); Simultaneously, when Access Network detects from the PCM speech data of core net, judge that the lowest order of a continuous N byte data is whether sliceable and go out described N bit flag;
(3) setting in the time limit when Access Network, detecting when the PCM of core net speech data satisfies one of following condition, the speech data of Access Network after to the coding that comes self terminal no longer decoded, but adds that frame head postamble sign sends to core net; Data from core net are also no longer encoded, speech data are sent to terminal after removing the frame head postamble, finish until communication:
(a) lowest order of N continuous byte can splice described N bit flag;
(b) at the same frame head postamble sign of detecting in the data of core net of a continuous frame length, and the distance of frame head postamble is not less than the byte number after these frame data are encoded;
(4) otherwise, Access Network no longer carries out step (2) and Access Network and no longer detects PCM speech data from core net, but data are still carried out encoding and decoding, finishes until communication.
Preferably, the selection of N is no more than the byte number of a frame in the described method.
Preferably, N equals 64 in the described method.
Be limited to when preferably, setting described in the step (3) in the described method and be no more than 100-300 frame speech data.
Another object of the present invention provides a kind of communication system that improves the voice quality of terminal mutual, comprises core net, Access Network and user terminal, it is characterized in that, described Access Network comprises following processing unit:
(1) codec unit, the starting stage of this unit behind terminal mutual carries out encoding and decoding to speech data and exports the unit of PCM speech data to core net;
(2) N bit flag processing unit, this unit when core net is exported described PCM speech data wherein the lowest order of N continuous byte data be revised as synchronous head sign and the common N position of encoding and decoding standard sign, and detect PCM speech data simultaneously, judge that the lowest order of a continuous N byte data is whether sliceable and go out described N bit flag from core net;
This unit is when setting continuous N the byte PCM speech data lowest order that detects in the time limit from core net and can splice described N bit flag, then make Access Network no longer carry out the encoding and decoding of speech data, otherwise lowest order from described PCM speech data to core net that do not make a continuous N data when exporting is revised, and data are still carried out encoding and decoding;
(3) processing unit of frame head postamble sign, this unit does not add that to speech data frame head postamble sign sends to core net, whether the data that detect from core net simultaneously are the data that same frame head postamble sign is arranged at Access Network when in the future the speech data behind the coding of self terminal is not decoded; This unit removes the unit that the frame head postamble sends to terminal with this speech data when the byte number of setting after the distance that detects described identical frame head postamble flag data and frame head postamble in the time limit is not less than this frame data coding.
(4) error code processing unit, this unit are carried out error code and are handled when error code takes place.Preferably, the selection of N is no more than the byte number of a frame in the described communication system.Preferably, N equals 64 in the described communication system.
Preferably, be limited to during described setting and be no more than 100-200 frame speech data.
Because any encoding and decoding speech standard all can cause the decline of voice quality, in addition, the encoding and decoding speech cascade is many more, voice quality descends big more, therefore, method that this invention is provided and communication system make the terminal that adopts same encoding and decoding speech standard only carry out encoding and decoding in terminal when intercommunication, have saved the step of Access Network encoding and decoding, and the Access Network voice quality is significantly improved.
Description of drawings
Read in conjunction with the accompanying drawings, purpose of the present invention, feature and advantage are more readily understood.In the accompanying drawing:
Fig. 1 is the composition schematic diagram of telecommunications network;
Fig. 2 connects the PCM speech data form that interior Access Network of beginning a period of time exports core net to for channel, and length is the frame length behind the tone decoding;
Fig. 3 only transfers the PCM speech data form that after terminal is carried out encoding and decoding speech Access Network exports core net to, and length is the frame length behind the tone decoding;
Fig. 4 is that two terminals are connected the schematic diagram of starting stage through terminal, Access Network encoding and decoding intercommunication;
Fig. 5 is that two terminals only change the schematic diagram through terminal encoding and decoding intercommunication over to.
Embodiment
Starting stage behind terminal mutual, terminal and Access Network all carry out encoding and decoding speech, and wherein, Access Network is decoded the back to core net input PCM speech data to the speech data that comes self terminal, after the core net transmission, pass to Access Network, pass to another terminal behind the Access Network coding then.G.729, with encoding and decoding speech standard I TU is example, and frame voice PCM data are 80 bytes.
The N bit flag processing unit of Access Network when exporting described PCM speech data to core net in these data the lowest order of 64 continuous bytes be revised as synchronous head sign and encoding and decoding standard sign (being called for short the N bit flag); Simultaneously, when Access Network detects from the PCM speech data of core net, judge that the lowest order of 64 continuous byte datas is whether sliceable and go out described N bit flag.
Wherein, the selection of N bit flag is no more than the byte number of the frame behind the tone decoding, promptly G.729N encoding and decoding speech standard I TU is no more than 80, is the interpolation that just can make a N bit flag for the decoding of making a frame like this.When the N value was too small, it was disconnected to be easy to that then erroneous judgement takes place, and is about to non-N bit flag and is judged as the N bit flag; And the N value then will make the detection calculations amount big when excessive, consume more DSP (Digital Signal Processing) resource.Experiment finds, the N value is 64 to be reasonable compromise value.
Fig. 2 shows channel and connects the PCM speech data form that interior Access Network of beginning a period of time exports core net to, and its N bit flag is 64.Bit0~bit7 represents the bit 0~bit 7 of Access Network to the PCM speech data (8 bit) of core net output among the figure.Lowest order in continuous 64 byte datas is that the bit0 place is provided with 64 N bit flag, and wherein 60bits FAS represents 60 bit synchronous leader will; 4bits vocoder type represents 4 bit terminal encoding and decoding speech standard signs.The common N bit flag of forming 64 of these two groups of bits.Info data represents the PCM data of tone decoding output.
When Access Network detects when the lowest order of 64 continuous bytes of the PCM of core net speech data can splice described 64 N bit flag, then Access Network no longer carries out the encoding and decoding of speech data; Otherwise, when Access Network detects less than the N bit flag, speech data is still carried out encoding and decoding in 100-200 frame speech data, simultaneously, Access Network no longer detects the PCM speech data from core net, finishes until communication.Wherein, 100-200 frame speech data is for setting the time limit, if should the time limit oversize, then wastes the DSP resource; If should the time limit too short, then cause easily and miss the N bit flag, cause erroneous judgement disconnected.The 100-200 frame is the preferred time limit.
When Access Network detects the N bit flag, the speech data of the processing unit of frame head postamble sign after to the coding that comes self terminal do not decoded, but add that frame head postamble sign sends to core net, whether be the byte number that have the distance of the data of same frame head postamble sign and frame head postamble be not less than this frame data coding after: if then do not encode from the data of core net if in 100-200 frame speech data, detecting simultaneously, remove the frame head postamble speech data is sent to terminal, finish until communication; If do not have then and handle (, no longer burdensome) by error code at this because method for processing error codes belongs to prior art by error code processing unit.
Fig. 3 only transfers the PCM speech data form that after terminal is carried out encoding and decoding speech Access Network exports core net to.Bit0 among the figure~bit7 implication is identical with Fig. 2.Head_H and Head_L are two successive bytes, form leader will word jointly.Equally, Tail_H and Tail_L form the tail banner word.Coded_byte0, Coded_bytei represent the 0th, an i speech coding, and wherein each speech coding is i.e. 8 bits of 1 byte.If the word that two continuous bytes are formed in the vocoded data identical with the head or tail sign (a such word can and end to end the sign generation obscure), then in the middle of this two byte, add and destroy byte (destroy the word that byte and the byte of front and back form and indicate inequality end to end), resolve end and make opposite this destruction byte of removal of handling.Because all much larger than 2: 1 compression ratio, so sign, vocoded data consider that again the destruction byte that adds still can not take a frame (G.729 being 80 bytes) end to end, can add FF data such as (end-of-file marks) this moment to the encoding and decoding speech standard.
It is that two terminals are connected the starting stage and only changed situation through terminal encoding and decoding intercommunication over to through terminal, Access Network encoding and decoding intercommunication situation and two terminals that Fig. 4, Fig. 5 show respectively.Wherein Access Network A and Access Network B are the Access Network of supporting the inventive method, can be same Access Networks, also different Access Networks.Among Fig. 4,, then can not enter intercommunication state shown in Figure 5,, then can realize only carrying out the intercommunication of encoding and decoding speech in terminal if the encoding and decoding speech standard of terminal A and B is identical if terminal A and B encoding and decoding speech standard are inequality.
If the Access Network of two terminal correspondences one of them or two are not for supporting the Access Network of the inventive method, perhaps two terminal encoding and decoding speech standard differences, the intercommunication that then can not enter an encoding and decoding speech.Access Network detects in certain time limit (generally being made as the time of encoding and decoding speech 100-200 frame) less than above-mentioned N bit flag or frame head postamble sign and then no longer sends the N bit flag to core net.
Usually, the intercommunication that two terminals transfer encoding and decoding to from the intercommunication of twice encoding and decoding only needs the time of a few tens of milliseconds, the i.e. time of several frames of encoding and decoding speech.Because the error rate that the core net transmission is very little the invention enables the terminal that adopts same encoding and decoding speech standard only to carry out encoding and decoding in terminal when intercommunication, has saved the step of Access Network encoding and decoding, and the Access Network voice quality is significantly improved.

Claims (8)

1. method that improves the voice quality of terminal mutual in the communication system, described communication system comprises core net, Access Network and user terminal, it is characterized in that, this method comprises the steps:
(1) starting stage behind terminal mutual, terminal and Access Network all carry out encoding and decoding speech, and wherein, Access Network is decoded the back to core net input PCM speech data to the speech data that comes self terminal, after the core net transmission, pass to Access Network, pass to another terminal behind the Access Network coding then;
(2) Access Network when importing described PCM speech data to core net in these data the lowest order of N continuous byte be revised as synchronous head sign and encoding and decoding standard sign (being called for short the N bit flag); Simultaneously, when Access Network detects from the PCM speech data of core net, judge that the lowest order of a continuous N byte data is whether sliceable and go out described N bit flag;
(3) setting in the time limit when Access Network, detecting when the PCM of core net speech data satisfies one of following condition, the speech data of Access Network after to the coding that comes self terminal no longer decoded, but adds that frame head postamble sign sends to core net; Data from core net are also no longer encoded, speech data are sent to terminal after removing the frame head postamble, finish until communication:
(a) lowest order of N continuous byte can splice described N bit flag;
(b) at the same frame head postamble sign of detecting in the data of core net of a continuous frame length, and the distance of frame head postamble is not less than the byte number after these frame data are encoded;
(4) otherwise, Access Network no longer carries out step (2) and Access Network and no longer detects PCM speech data from core net, but data are still carried out encoding and decoding, finishes until communication.
2. the method for claim 1 is characterized in that, the selection of described N is no more than the byte number of a frame.
3. the method for claim 1 is characterized in that, described N equals 64.
4. the method for claim 1 is characterized in that, is limited to when setting described in the step (3) and is no more than 100-200 frame speech data.
5. a communication system that improves the voice quality of terminal mutual comprises core net, Access Network and user terminal, it is characterized in that, described Access Network comprises following processing unit:
(1) codec unit, the starting stage of this unit behind terminal mutual carries out encoding and decoding to speech data and exports the unit of PCM speech data to core net;
(2) N bit flag processing unit, this unit when core net is exported described PCM speech data wherein the lowest order of N continuous byte data be revised as synchronous head sign and the common N position of encoding and decoding standard sign, and detect PCM speech data simultaneously, judge that the lowest order of a continuous N byte data is whether sliceable and go out described N bit flag from core net;
This unit is when setting continuous N the byte PCM speech data lowest order that detects in the time limit from core net and can splice described N bit flag, then make Access Network no longer carry out the encoding and decoding of speech data, otherwise lowest order from described PCM speech data to core net that do not make a continuous N data when exporting is revised, and data are still carried out encoding and decoding;
(3) processing unit of frame head postamble sign, this unit does not add that to speech data frame head postamble sign sends to core net, whether the data that detect from core net simultaneously are the data that same frame head postamble sign is arranged at Access Network when in the future the speech data behind the coding of self terminal is not decoded; This unit removes the unit that the frame head postamble sends to terminal with this speech data when the byte number of setting after the distance that detects described identical frame head postamble flag data and frame head postamble in the time limit is not less than this frame data coding;
(4) error code processing unit, this unit are carried out error code and are handled when error code takes place.
6. communication system as claimed in claim 5 is characterized in that the selection of N is no more than the byte number of a frame.
7. communication system as claimed in claim 5 is characterized in that described N equals 64.
8. communication system as claimed in claim 5 is characterized in that, is limited to during described setting and is no more than 100-200 frame speech data.
CNB2004100854037A 2004-09-30 2004-09-30 Method for improving communication terminal intercommunicating voice quality and communication system Expired - Fee Related CN100385976C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB2004100854037A CN100385976C (en) 2004-09-30 2004-09-30 Method for improving communication terminal intercommunicating voice quality and communication system

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB2004100854037A CN100385976C (en) 2004-09-30 2004-09-30 Method for improving communication terminal intercommunicating voice quality and communication system

Publications (2)

Publication Number Publication Date
CN1756128A true CN1756128A (en) 2006-04-05
CN100385976C CN100385976C (en) 2008-04-30

Family

ID=36689120

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB2004100854037A Expired - Fee Related CN100385976C (en) 2004-09-30 2004-09-30 Method for improving communication terminal intercommunicating voice quality and communication system

Country Status (1)

Country Link
CN (1) CN100385976C (en)

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101568043B (en) * 2009-04-30 2011-02-02 尹海盛 In-band signaling interconnection device among PCM devices
CN101656735B (en) * 2009-09-11 2013-02-13 中兴通讯股份有限公司 Code flow processing method and device in communication link
WO2018103661A1 (en) * 2016-12-09 2018-06-14 华为技术有限公司 Method, device and apparatus for transmitting voice data
CN112259087A (en) * 2020-10-16 2021-01-22 四川长虹电器股份有限公司 Method for complementing voice data based on time sequence neural network model

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3782348B2 (en) * 1999-05-31 2006-06-07 ノキア コーポレイション Transmission and interconnection methods
US6600738B1 (en) * 1999-10-02 2003-07-29 Ericsson, Inc. Routing in an IP network based on codec availability and subscriber preference
CN1214548C (en) * 2002-05-29 2005-08-10 华为技术有限公司 Data transmission method in wireless access network
US7406096B2 (en) * 2002-12-06 2008-07-29 Qualcomm Incorporated Tandem-free intersystem voice communication

Cited By (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101568043B (en) * 2009-04-30 2011-02-02 尹海盛 In-band signaling interconnection device among PCM devices
CN101656735B (en) * 2009-09-11 2013-02-13 中兴通讯股份有限公司 Code flow processing method and device in communication link
WO2018103661A1 (en) * 2016-12-09 2018-06-14 华为技术有限公司 Method, device and apparatus for transmitting voice data
CN112259087A (en) * 2020-10-16 2021-01-22 四川长虹电器股份有限公司 Method for complementing voice data based on time sequence neural network model

Also Published As

Publication number Publication date
CN100385976C (en) 2008-04-30

Similar Documents

Publication Publication Date Title
CN104917671B (en) Audio-frequency processing method and device based on mobile terminal
CN101615910B (en) Method, device and equipment of compression coding and compression coding method
CN102124655A (en) Method for encoding a symbol, method for decoding a symbol, method for transmitting a symbol from a transmitter to a receiver, encoder, decoder and system for transmitting a symbol from a transmitter to a receiver
CN1575491A (en) Method and apparatus for decoding a coded digital audio signal which is arranged in frames containing headers
EP0987827A3 (en) Audio signal encoding method without transmission of bit allocation information
US20080117906A1 (en) Payload header compression in an rtp session
CN101051465B (en) Method and apparatus for decoding layer encoded data
CN1342309A (en) Low bit-rate coding of unvoiced segments of speech
CN100493002C (en) A method for computing the data loss in the network transfer
CN101322375B (en) Audio data packet format and decoding method thereof and method for correcting mobile communication terminal codec setup error and mobile communication terminal performance same
CN1756128A (en) Method for improving communication terminal intercommunicating voice quality and communication system
CN1192502C (en) Method and apparatus for digital signal compression without decoding
CN1222190C (en) Method of voice compressing in mobile terminal
CN1293857A (en) Communication device and method of operation
CN103050123B (en) A kind of method and system of transmitting voice information
CN1913384A (en) Audio source play system utilizing head-on earphone and audio-source play method
CN101478616A (en) Instant voice communication method
CN102239518A (en) Encoding and decoding method and device
CN1132460C (en) Telephone device comprising digital processing element for speech signals and method implemented in such device
CN112637703B (en) Real-time intercom system and method for WEB terminal
CN112509591B (en) Audio encoding and decoding method and system
CN2754281Y (en) Network telephone set
CN1285674A (en) Computer software controlled telephone set connected with microcomputer
CN1328398A (en) IP telephone
CN1630896A (en) Speech extender and method for estimating broadband speech signal from narrowband speech signal

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
PE01 Entry into force of the registration of the contract for pledge of patent right

Denomination of invention: Method for improving communication terminal intercommunicating voice quality and communication system

Effective date of registration: 20110509

Granted publication date: 20080430

Pledgee: Datang Telecom Technology Industry Holding Co.,Ltd.|Technical Research Institute of Telecommunications

Pledgor: BEIJING XINWEI TELECOM TECHNOLOGY Inc.

Registration number: 2011990000168

CI01 Publication of corrected invention patent application

Correction item: The Pledgee of the pledge of patent right

Correct: Datang Telecom Technology Industry Holding Co., Ltd.|Telecommunications Science and Technology Research Institute

False: Datang Telecom Technology Industry Holding Co., Ltd.|Telecommunications Science and Technology Research Institute

Number: 27

Volume: 27

ERR Gazette correction
PC01 Cancellation of the registration of the contract for pledge of patent right

Date of cancellation: 20160314

Granted publication date: 20080430

Pledgee: Datang Telecom Technology Industry Holding Co.,Ltd.|CHINA ACADEMY OF TELECOMMUNICATIONS TECHNOLOGY

Pledgor: BEIJING XINWEI TELECOM TECHNOLOGY Inc.

Registration number: 2011990000168

PLDC Enforcement, change and cancellation of contracts on pledge of patent right or utility model
PP01 Preservation of patent right

Effective date of registration: 20191121

Granted publication date: 20080430

PP01 Preservation of patent right
PD01 Discharge of preservation of patent

Date of cancellation: 20200710

Granted publication date: 20080430

PD01 Discharge of preservation of patent
CF01 Termination of patent right due to non-payment of annual fee

Granted publication date: 20080430

Termination date: 20190930

CF01 Termination of patent right due to non-payment of annual fee