CN1231889C - Speech processing method of multi-channel vocoder - Google Patents


Info

Publication number
CN1231889C
Authority
CN
China
Prior art keywords
data
vocoder
mailbox
passage
time
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime
Application number
CNB021525706A
Other languages
Chinese (zh)
Other versions
CN1501350A (en)
Inventor
朱祥文
张晓枫
胡锴
覃景繁
董晓宏
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Zhigu Tech Co Ltd
Original Assignee
Huawei Technologies Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co Ltd
Priority to CNB021525706A
Publication of CN1501350A
Application granted
Publication of CN1231889C
Anticipated expiration
Expired - Lifetime

Abstract

The present invention relates to a speech processing method for a multichannel vocoder. To address the excessive delay of the prior art, the method obtains or sends data dynamically: the phase point at which data is obtained or sent is adjusted according to the moment at which the vocoder of each channel is processed in turn, so that the vocoder of each channel can obtain or send the most recent data. Taking the vocoder of any one channel in the encoding direction as an example, the method comprises: obtaining, from the TDM mailbox that is currently collecting data, the data collected before the moment encoding starts, as the second-part data; obtaining, from the TDM mailbox that has finished collecting data, the data that entered that mailbox last and that can form one complete frame together with the second-part data, as the first-part data; and appending the second-part data to the first-part data to form one complete frame, which the vocoder then encodes. The method can reduce the delay by up to M-M/N.

Description

Speech processing method for a multichannel vocoder
Technical field
The present invention relates to the field of mobile or fixed communications and, more particularly, to a speech processing method for a multichannel vocoder in a mobile or fixed communication network.
Background art
In a communication network, voice signals in G.711 coded format (64 kbps) usually need to be compression-encoded to extract speech feature parameters (generally less than 16 kbps) and so reduce the transmission rate; the mobile or fixed terminal then restores them to a voice signal. This work is usually done by a vocoder. Because vocoder processing is complex, it increases the end-to-end delay of the voice signal. Moreover, to increase system capacity, a communication network usually processes the vocoders of multiple channels together, and multichannel processing increases the delay further. Excessive delay degrades the quality of the voice signal. The principle by which the delay arises is described in detail below.
The encoding-direction processing of an existing vocoder is shown in Fig. 1. The TDM (Time Division Multiplex) mailboxes temporarily store the PCM (pulse code modulation) data collected from the multichannel serial port (the McBSP serial port in the figure). The vocoder of each channel obtains one frame of PCM data from the corresponding channel of a TDM mailbox, and the obtained data is then encoded channel by channel. TDM mailboxes 1 and 2 operate in ping-pong fashion: in the figure, each channel of mailbox 1 has already collected one frame of data, while mailbox 2 is collecting new data. In Fig. 1, the multichannel vocoder first obtains one collected frame from every channel of TDM mailbox 1 at the same time and then processes the channels one after another. After processing is complete, mailbox 1 and mailbox 2 are swapped, driven by an external event (usually an interrupt). Throughout this process the vocoder handles either the data of TDM mailbox 1 or the data of TDM mailbox 2; it never handles data from both mailboxes at once.
Suppose the data acquisition time of the multichannel vocoder is M milliseconds, that is, the time the TDM mailbox needs to collect a channel's frame from its first PCM sample to its last. Because all channels are collected simultaneously, the total acquisition time is also M. Because the multichannel vocoder processes the channels one after another and must finish all of them within one acquisition period, the maximum processing time for a single channel is M/N milliseconds, where N is the number of channels. In addition, let the vocoder's look-ahead time be Δ milliseconds. Under these assumptions, the total processing delay of the vocoder of channel 1 is M+M/N+Δ, and the total processing delay of the vocoder of the last channel N is M+N*M/N+Δ = 2M+Δ. Here, the total processing delay of a vocoder is the time from when the TDM mailbox receives the first PCM sample of a channel until the vocoder of that channel outputs the data for that channel.
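The prior-art delay figures above can be checked with a short calculation. The following is a minimal sketch, not part of the patent: the function name and the 5 ms look-ahead value are illustrative assumptions, and the formula for intermediate channels, M + I*M/N + Δ, is inferred from the two endpoints the text gives (channel 1 and channel N).

```python
def prior_art_encode_delay(channel: int, m_ms: float, n_channels: int,
                           lookahead_ms: float) -> float:
    """Total prior-art encoding delay in ms for a 1-based channel index.

    Assumption: channel I finishes processing I time slices of M/N after
    acquisition ends, which matches the endpoints stated in the text.
    """
    if not 1 <= channel <= n_channels:
        raise ValueError("channel must be in 1..n_channels")
    slot_ms = m_ms / n_channels  # maximum processing time of one channel
    return m_ms + channel * slot_ms + lookahead_ms


# Illustrative values: M = 20 ms, N = 20 channels, look-ahead = 5 ms (assumed).
first = prior_art_encode_delay(1, 20, 20, 5)   # M + M/N + look-ahead
last = prior_art_encode_delay(20, 20, 20, 5)   # 2M + look-ahead
```

With these values the first channel waits 26 ms and the last channel 45 ms, so the spread between channels is M-M/N = 19 ms.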
The above describes the encoding direction. In the decoding direction, the vocoder of each channel receives data sent by the mobile or fixed terminal, decodes each received frame in turn, and writes the result into the corresponding channel of a TDM mailbox, which then passes it to the multichannel serial port. Although this runs in the opposite direction to encoding, the delay arises for similar reasons. By a similar analysis, the maximum delay in the decoding direction, that is, the time from when the multichannel vocoder receives the compressed speech data of channel N until the first PCM sample starts to be sent, is M+N*M/N = 2M.
The analysis above shows that the multichannel vocoder processes different channels at different moments, yet every channel's vocoder obtains or sends its data at a time referenced to channel 1. Consequently, when the vocoder handles the channels one after another, the data of a given channel has in fact already been delayed by (I-1)*M/N, where I is the number of the channel currently being processed and 1≤I≤N. The speech data of the last channel N is therefore the oldest and has the largest delay, which greatly degrades the quality of the voice signal.
Summary of the invention
The technical problem to be solved by the present invention is that excessive delay in the prior art degrades voice quality. The invention provides a speech processing method for a multichannel vocoder that reduces the speech processing delay of the multichannel vocoder and thereby improves voice quality.
The technical solution of the present invention is to obtain or send data dynamically: according to the moment at which the vocoder of each channel is processed in turn, the phase point at which data is obtained or sent is adjusted accordingly, so that the vocoder of each channel can obtain or send the most recent data.
1. For the vocoder of any channel in the encoding direction of the multichannel vocoder:
From the corresponding channel of the TDM mailbox that is currently collecting data, obtain the data collected before the moment encoding starts, as the second-part data;
From the corresponding channel of the other TDM mailbox, which has finished collecting data, obtain the data that entered last and that can form one complete frame together with the second-part data, as the first-part data;
Append the second-part data to the first-part data to form one complete frame, which the vocoder then encodes.
2. For the vocoder of any channel in the decoding direction of the multichannel vocoder:
Divide the complete frame to be sent into first-part data, to be sent first, and second-part data, to be sent afterwards;
At the moment decoding finishes, write the first-part data to the tail of the corresponding channel of the TDM mailbox that is currently transferring data to the multichannel serial port, replacing the data there that has not yet been transferred;
Write the second-part data to the head of the corresponding channel of the other TDM mailbox;
The two TDM mailboxes then transfer the first-part data and the second-part data in turn to the corresponding channel of the multichannel serial port.
With this scheme, the total delay for the vocoder of each channel to complete its encoding-direction processing consists of: the acquisition time M required to obtain a part of the data from the corresponding channels of each of the two TDM mailboxes to form one complete frame; the maximum processing time M/N required to encode the collected data, where N is the maximum number of channels of the multichannel vocoder; and the vocoder's look-ahead time Δ. The total delay for the vocoder of each channel to complete its decoding-direction processing consists of the maximum processing time M/N required to decode the received data and the transmission time M required to send one complete frame, split into two parts, to the corresponding channels of the two TDM mailboxes. Compared with the maximum delay of the prior art, the delay is therefore reduced by M-M/N in both the encoding and the decoding direction. For example, when M=20 and N=20, M-M/N=19, so each direction saves 19 milliseconds, which greatly improves voice quality when several vocoders are cascaded.
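The comparison between the prior-art worst case and the proposed scheme can be written out as a small calculation. This is a sketch for checking the arithmetic only; the function names are hypothetical and do not come from the patent.

```python
def encode_delays(m_ms: float, n_channels: int, lookahead_ms: float):
    """Return (prior-art worst case, proposed scheme) encoding delay in ms."""
    prior_worst = 2 * m_ms + lookahead_ms                # channel N, prior art
    proposed = m_ms + m_ms / n_channels + lookahead_ms   # every channel
    return prior_worst, proposed


def decode_delays(m_ms: float, n_channels: int):
    """Return (prior-art worst case, proposed scheme) decoding delay in ms."""
    return 2 * m_ms, m_ms + m_ms / n_channels


def max_delay_reduction(m_ms: float, n_channels: int) -> float:
    """Maximum saving per direction, M - M/N, in ms."""
    return m_ms - m_ms / n_channels


# The text's example: M = 20 ms, N = 20 channels.
saving = max_delay_reduction(20, 20)
```

Note that the look-ahead term cancels in the encoding direction, which is why both directions show the same saving.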
Description of drawings
The invention is further described below with reference to the drawings and embodiments, in which:
Fig. 1 is a schematic diagram of the data acquisition method commonly used by existing vocoders;
Fig. 2 is a schematic diagram of the vocoder of one channel performing encoding-direction processing with the method of the present invention;
Fig. 3 is a schematic diagram of the vocoder of one channel performing decoding-direction processing with the method of the present invention.
Embodiment
In order to reduce the speech processes time of multichannel vocoder data, the present invention adopts multichannel vocoder Dynamic Data Acquiring method, processing time difference according to different channel vocoders, dynamically adjust the corresponding data acquisition time, make each passage can collect the current up-to-date data of this passage.
Below in conjunction with Fig. 2 with coding staff to be treated to example analysis, in Fig. 2, the passage of TDM mailbox 1 has been gathered frame PCM data, TDM mailbox 2 is gathered the PCM data from multichannel serial.For the ease of analyzing, suppose that all channel vocoders are that to obtain a part of data from the respective channel of two TDM mailboxes respectively be that look-ahead time of M/N, vocoder is Δ to form the required acquisition time M of a frame partial data, the data of being gathered are carried out the required maximum processing time of encoding process to vocoder, the type vocoder of the same type.Below be coding staff to the detailed step of processing procedure:
At first, the processing time of multichannel vocoder is carried out unified planning, the processing time of each channel vocoder is divided according to timeslice, the timeslice size is M/N, and then the processing of each channel vocoder is followed successively by constantly:
Passage 1 vocoder is handled constantly: Codec1_Time=0,
Passage 2 vocoders are handled constantly: Codec2_Time=M/N,
...
Passage I vocoder is handled constantly: CodecI_Time=(I-1) * M/N, wherein 1≤I≤N.
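The time-slice plan above amounts to a simple table of start offsets. A minimal sketch, assuming times are measured in milliseconds from the start of an acquisition period (the function name is hypothetical):

```python
def processing_start_times(m_ms: float, n_channels: int) -> list[float]:
    """Start time CodecI_Time = (I-1)*M/N for channels I = 1..N, in ms."""
    slot_ms = m_ms / n_channels  # one time slice per channel
    return [(i - 1) * slot_ms for i in range(1, n_channels + 1)]


# Small illustrative case: M = 20 ms split across N = 4 channels.
schedule = processing_start_times(20, 4)
```

For M = 20 and N = 4 the schedule is 0, 5, 10 and 15 ms, so channel 3 starts 2M/N = 10 ms into the period.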
Then, as shown in Fig. 2, the phase point at which each channel's vocoder obtains data from the TDM mailboxes is adjusted according to the moment at which that vocoder starts encoding. For example, for the vocoder of channel 3:
From channel 3 of TDM mailbox 2, which is currently collecting data, obtain the data collected up to the moment encoding starts, that is, up to (3-1)*M/N=2M/N (the shaded area on the left of channel 3 of TDM mailbox 2 in Fig. 2), as the second-part data;
From channel 3 of TDM mailbox 1, which has finished collecting data, obtain the data that entered last and that can form one complete frame together with the second-part data (the shaded area on the right of channel 3 of TDM mailbox 1 in Fig. 2), as the first-part data;
Having obtained the first-part and second-part data, the vocoder of channel 3 appends the second-part data to the first-part data (the left shaded area plus the right shaded area of channel 3 of the vocoder in Fig. 2) to form one complete frame;
The vocoder of channel 3 then encodes this complete frame and sends the result on to the mobile or fixed terminal.
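The encoding-direction steps above can be sketched as a sliding read across the two ping-pong buffers. This is an illustration under stated assumptions, not the patent's implementation: each channel of a mailbox is modelled as a Python list of samples, samples arrive at a uniform rate so that a fraction (I-1)/N of the frame has been collected when channel I starts, and all names are hypothetical.

```python
def samples_collected(channel: int, n_channels: int, frame_len: int) -> int:
    """Samples already in the filling mailbox at start time (I-1)*M/N."""
    return (channel - 1) * frame_len // n_channels


def assemble_encode_frame(full_mailbox: list, filling_mailbox: list,
                          channel: int, n_channels: int) -> list:
    """Build one complete frame for a channel from its two mailbox buffers."""
    frame_len = len(full_mailbox)
    k = samples_collected(channel, n_channels, frame_len)
    first_part = full_mailbox[k:]       # newest samples of the completed mailbox
    second_part = filling_mailbox[:k]   # samples collected so far in the other
    return first_part + second_part     # always frame_len samples long


# Illustrative frame of 160 samples (an assumed 20 ms at 8 kHz), N = 20.
done = list(range(160))            # completed mailbox, samples 0..159
active = list(range(160, 320))     # filling mailbox, samples 160..319
frame = assemble_encode_frame(done, active, channel=3, n_channels=20)
```

For channel 3 the frame covers samples 16 to 175, that is, the most recent 160 samples at the moment encoding starts, while channel 1 still reads the completed mailbox unchanged.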
After the phase point for obtaining data has been adjusted dynamically by the steps above, the total delay for the vocoder of channel 3 to complete its encoding-direction processing consists of the acquisition time M required to obtain a part of the data from the corresponding channels of each of the two TDM mailboxes to form one complete frame, the maximum processing time M/N required to encode the collected data, and the vocoder's look-ahead time Δ. The total delay is M+M/N+Δ. The process is the same for every channel, so the total delay for the vocoder of each channel to complete its encoding-direction processing is M+M/N+Δ.
In the prior art, only the vocoder of channel 1 has a total delay of M+M/N+Δ; the total delay of the other channels' vocoders grows channel by channel, reaching 2M+Δ for the vocoder of channel N. The maximum delay reduction achieved by the present invention is therefore (2M+Δ)-(M+M/N+Δ)=M-M/N. When M=20 and N=20, M-M/N=19, a saving of 19 milliseconds, which greatly improves voice quality when several vocoders are cascaded.
The above analyses the encoding direction of the vocoder. Processing in the decoding direction runs the opposite way: the phase point at which each channel's vocoder sends data to the TDM mailboxes is likewise adjusted according to the moment at which that vocoder finishes decoding the data it received. As shown in Fig. 3, for the vocoder of channel 3:
First, divide the complete frame to be sent into first-part data, to be sent first, and second-part data, to be sent afterwards; these correspond to the right shaded area and the left shaded area of channel 3 of the vocoder in Fig. 3, respectively;
Then, at the moment decoding finishes, write the first-part data to the tail of the corresponding channel of TDM mailbox 2, which is currently transferring data to the multichannel serial port (the shaded area on the left of channel 3 of TDM mailbox 2 in Fig. 3), replacing the data there that has not yet been transferred;
Next, write the second-part data to the head of the corresponding channel of the other TDM mailbox (the shaded area on the right of channel 3 of TDM mailbox 1 in Fig. 3);
After TDM mailbox 2 has transferred the first-part data to the corresponding channel of the multichannel serial port, TDM mailbox 1 transfers the second-part data to the multichannel serial port.
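The decoding-direction steps above can be sketched as a split write into the two mailbox buffers. This is an illustration only: each channel of a mailbox is modelled as a fixed-length Python list, and `sent_count` (the number of samples the sending mailbox has already transferred when decoding finishes) is an assumed parameter, since the text does not give its value explicitly.

```python
def place_decoded_frame(frame: list, sending_mailbox: list,
                        next_mailbox: list, sent_count: int):
    """Write a decoded frame across the two ping-pong buffers in place.

    Returns (first_part, second_part) for inspection.
    """
    frame_len = len(frame)
    if len(sending_mailbox) != frame_len or len(next_mailbox) != frame_len:
        raise ValueError("mailbox channels must hold exactly one frame")
    if not 0 <= sent_count <= frame_len:
        raise ValueError("sent_count must be in 0..frame_len")
    remaining = frame_len - sent_count       # slots not yet transferred
    first_part = frame[:remaining]           # sent first
    second_part = frame[remaining:]          # sent afterwards
    sending_mailbox[sent_count:] = first_part   # tail of the sending mailbox
    next_mailbox[:sent_count] = second_part     # head of the other mailbox
    return first_part, second_part


# Tiny illustrative frame of 8 samples, with 3 samples already transferred.
decoded = list(range(100, 108))
sending = [0] * 8
following = [0] * 8
parts = place_decoded_frame(decoded, sending, following, sent_count=3)
```

After the call, the tail of the sending buffer holds the first five decoded samples and the head of the other buffer holds the last three, so the serial port emits the frame in order without waiting for a full mailbox swap.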
From the analysis above, the total delay for the vocoder of each channel to complete its decoding-direction processing consists of the maximum processing time M/N required to decode the received data and the transmission time M required to send one complete frame, split into two parts, to the corresponding channels of the two TDM mailboxes. The total delay is M+M/N. The process is the same for every channel, so the total delay for the vocoder of each channel to complete its decoding-direction processing is M+M/N. Compared with the prior art, the maximum delay reduction is 2M-(M+M/N)=M-M/N, the same reduction as in the encoding direction.

Claims (4)

1. A speech processing method for a multichannel vocoder, in whose encoding-direction processing time division multiplex mailboxes temporarily store pulse code modulation data collected from a multichannel serial port, the vocoder of each channel obtains one frame of data from the corresponding channel of a time division multiplex mailbox, and the obtained data is then encoded channel by channel, characterized in that, in said processing, the phase point at which the vocoder of each channel obtains data from the time division multiplex mailboxes is adjusted according to the moment at which that vocoder starts encoding the data it obtained, and, for the vocoder of any channel:
from the corresponding channel of the time division multiplex mailbox that is currently collecting data, the data collected before said moment at which encoding starts is obtained as second-part data;
from the corresponding channel of the other time division multiplex mailbox, which has finished collecting data, the data that entered last and that can form one complete frame together with said second-part data is obtained as first-part data;
said second-part data is appended to said first-part data to form one complete frame, which said vocoder then encodes.
2. The method according to claim 1, characterized in that the total delay for the vocoder of each channel to complete its encoding-direction processing comprises:
the acquisition time M required to obtain a part of the data from the corresponding channels of each of the two time division multiplex mailboxes to form one complete frame;
the maximum processing time M/N required to encode the collected data, N being the maximum number of channels of the multichannel vocoder;
and the look-ahead time Δ of the vocoder.
3. A speech processing method for a multichannel vocoder, in whose decoding-direction processing the vocoder of each channel receives data sent by a mobile terminal or fixed terminal, decodes each received frame in turn and sends it to the corresponding channel of a time division multiplex mailbox, which then transfers it to a multichannel serial port, characterized in that, in said processing, the phase point at which the vocoder of each channel sends data to the time division multiplex mailboxes is adjusted according to the moment at which that vocoder finishes decoding the data it received, and, for the vocoder of any channel:
the complete frame to be sent is divided into first-part data, to be sent first, and second-part data, to be sent afterwards;
at the moment decoding finishes, the first-part data is written to the tail of the corresponding channel of the time division multiplex mailbox that is currently transferring data to the multichannel serial port, replacing the data there that has not yet been transferred;
the second-part data is written to the head of the corresponding channel of the other time division multiplex mailbox;
said two time division multiplex mailboxes transfer said first-part data and second-part data in turn to the corresponding channel of the multichannel serial port.
4. The method according to claim 3, characterized in that the total delay for the vocoder of each channel to complete its decoding-direction processing comprises:
the maximum processing time M/N required to decode the received data, N being the maximum number of channels of the multichannel vocoder;
the transmission time M required to send one complete frame, split into two parts, to the corresponding channels of the two time division multiplex mailboxes.
CNB021525706A 2002-11-19 2002-11-19 Speech processing method of multi-channel vocoder Expired - Lifetime CN1231889C (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CNB021525706A CN1231889C (en) 2002-11-19 2002-11-19 Speech processing method of multi-channel vocoder

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CNB021525706A CN1231889C (en) 2002-11-19 2002-11-19 Speech processing method of multi-channel vocoder

Publications (2)

Publication Number Publication Date
CN1501350A CN1501350A (en) 2004-06-02
CN1231889C true CN1231889C (en) 2005-12-14

Family

ID=34234802

Family Applications (1)

Application Number Title Priority Date Filing Date
CNB021525706A Expired - Lifetime CN1231889C (en) 2002-11-19 2002-11-19 Speech processing method of multi-channel vocoder

Country Status (1)

Country Link
CN (1) CN1231889C (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7751572B2 (en) * 2005-04-15 2010-07-06 Dolby International Ab Adaptive residual audio coding
EP2631906A1 (en) * 2012-02-27 2013-08-28 Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V. Phase coherence control for harmonic signals in perceptual audio codecs

Also Published As

Publication number Publication date
CN1501350A (en) 2004-06-02

Similar Documents

Publication Publication Date Title
CA2032765C (en) Variable rate encoding and communicating apparatus
CN1043001C (en) Transcoder and improved land system for a mobile radio communication system
US7660328B1 (en) Method and system for generating, transmitting and utilizing bit rate conversion information
USRE38244E1 (en) Method of bypassing vocoders in digital mobile communication system
US5379293A (en) Voice packet assembling/disassembling apparatus
ES2001568A6 (en) Multiplexed digital packet telephone communication method.
EP0927988A3 (en) Encoding speech
CA2514959A1 (en) Systems and methods for digital processing of satellite communications data
JP2001094433A (en) Sub-band coding and decoding medium
EP1439709A3 (en) Apparatus and method for supporting plural codecs
EP1349396A3 (en) Video encoding method and apparatus, and video decoding method and apparatus
JP2011043795A (en) Encoding method, apparatus and device, and decoding method
CN1231889C (en) Speech processing method of multi-channel vocoder
CN1341317A (en) Communication system and method for multiplexing of RTP data systems
CN103229544A (en) Source signal adaptive frame aggregation
CN1871864A (en) Method for retransmitting vocoded data
CN1140894C (en) Variable bitrate speech transmission system
CN101547010B (en) System, method and device for coding and decoding
CN1192656C (en) Transceiver for selecting source coder and processes carried out in such transceiver
CN1284319C (en) Implement method of multi-channel AMR vocoder and its equipment
EP1478112A3 (en) Apparatus and method for switching broadcast channels using virtual channel connection information
CN1581798A (en) Recognition device and method for frame correction sequence in general frame treating package mode
CN1149777C (en) Method, system and equipment for transmitting coding telecommunication signal
FI105248B (en) A method for transmitting digitized, block-coded audio signals using scaling factors
CN1202513C (en) Audio coding method and apparatus

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
C10 Entry into substantive examination
SE01 Entry into force of request for substantive examination
C14 Grant of patent or utility model
GR01 Patent grant
ASS Succession or assignment of patent right

Owner name: SHENZHEN LIANCHUANG INTELLECTUAL PROPERTY SERVICE

Free format text: FORMER OWNER: HUAWEI TECHNOLOGY CO., LTD.

Effective date: 20141208

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518057 SHENZHEN, GUANGDONG PROVINCE TO: 518052 SHENZHEN, GUANGDONG PROVINCE

TR01 Transfer of patent right

Effective date of registration: 20141208

Address after: 518052, Guangdong, Shenzhen province Nanshan District Nanshan digital cultural industry base, east block, room 407-408

Patentee after: Shenzhen LIAN intellectual property service center

Address before: 518057 Guangdong Shenzhen science and Technology Park HUAWEI road user service center building intellectual property department

Patentee before: HUAWEI TECHNOLOGIES Co.,Ltd.

ASS Succession or assignment of patent right

Owner name: BEIJING Z-GOOD TECHNOLOGY SERVICE CO., LTD.

Free format text: FORMER OWNER: SHENZHEN LIANCHUANG INTELLECTUAL PROPERTY SERVICE CENTER

Effective date: 20150122

C41 Transfer of patent application or patent right or utility model
COR Change of bibliographic data

Free format text: CORRECT: ADDRESS; FROM: 518052 SHENZHEN, GUANGDONG PROVINCE TO: 100085 HAIDIAN, BEIJING

TR01 Transfer of patent right

Effective date of registration: 20150122

Address after: 100085 Beijing city Haidian District No. 33 Xiaoying Road 1 1F06 room

Patentee after: BEIJING ZHIGU TECH Co.,Ltd.

Address before: 518052, Guangdong, Shenzhen province Nanshan District Nanshan digital cultural industry base, east block, room 407-408

Patentee before: Shenzhen LIAN intellectual property service center

CX01 Expiry of patent term

Granted publication date: 20051214