WO2009099381A1 - Transmission de parole robuste - Google Patents

Transmission de parole robuste Download PDF

Info

Publication number
WO2009099381A1
WO2009099381A1 PCT/SE2009/050014 SE2009050014W WO2009099381A1 WO 2009099381 A1 WO2009099381 A1 WO 2009099381A1 SE 2009050014 W SE2009050014 W SE 2009050014W WO 2009099381 A1 WO2009099381 A1 WO 2009099381A1
Authority
WO
WIPO (PCT)
Prior art keywords
speech data
transmitter
bits
frames
transmission
Prior art date
Application number
PCT/SE2009/050014
Other languages
English (en)
Inventor
Daniel ENSTRÖM
Hans Hannu
Per Synnergren
Original Assignee
Telefonaktiebolaget L M Ericsson (Publ)
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Telefonaktiebolaget L M Ericsson (Publ) filed Critical Telefonaktiebolaget L M Ericsson (Publ)
Publication of WO2009099381A1 publication Critical patent/WO2009099381A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L1/00Arrangements for detecting or preventing errors in the information received
    • H04L1/0001Systems modifying transmission characteristics according to link quality, e.g. power backoff
    • H04L1/0014Systems modifying transmission characteristics according to link quality, e.g. power backoff by adapting the source coding

Definitions

  • the present invention relates to a method and a device for transmitting speech over a circuit switched connection.
  • CS Cellular Circuit Switched
  • HSPA High Speed Packet Access
  • CPC Continuous Packet Connectivity
  • An originating mobile station connects via HSPA to the base station NodeB.
  • the base station is connected to a Radio Network Controller (RNC) comprising a jitter buffer.
  • RNC Radio Network Controller
  • the RNC is via a Mobile Switching Center (MSC)/Media Gateway (MGW) connected to an RNC of the terminating mobile station.
  • MSC Mobile Switching Center
  • MGW Media Gateway
  • the terminating mobile station is connected to its RNC via a local base station (NodeB).
  • NodeB local base station
  • the mobile station on the terminating side also comprises a jitter buffer.
  • the air interface is using Wideband Code Division Multiple Access (WCDMA) HSPA, which result in that:
  • the uplink is High Speed Uplink Packet Access (HSUPA) running 2 ms Transmission Time Interval TTI and with Dedicated Physical Control Channel (DPCCH) gating.
  • HSUPA High Speed Uplink Packet Access
  • DPCCH Dedicated Physical Control Channel
  • the downlink is High Speed Downlink Packet Access (HSDPA) and can utilize Fractional Dedicated Physical Channel (F-DPCH) gating and Shared Control Channel for HS-DSCH
  • HSDPA High Speed Downlink Packet Access
  • F-DPCH Fractional Dedicated Physical Channel
  • HS-SCCH High Speed Downlink Shared Channel
  • HARQ Hybrid Automatic Repeat Request
  • the use of fast retransmissions for robustness, and HSDPA scheduling requires a jitter buffer to cancel the delay variations that can occur due to the (HARQ) retransmissions, and scheduling delay variations.
  • Two jitter buffers are needed, one at the originating RNC and one in the terminating terminal.
  • the jitter buffers use a time stamp that is created by the originating terminal or the terminating RNC to de-jitter the packets.
  • the timestamp will be included in the Packet Data Convergence Protocol (PDCP) header of a special PDCP packet type.
  • PDCP Packet Data Convergence Protocol
  • a PDCP header is depicted in Fig. 2.
  • the jitter buffer typically needs sequence number information as well to handle reordering.
  • the sequence number used is the RLC sequence number that is passed on to higher layers.
  • the CS over HSPA solution that is being standardized in 3GPP R7 and R8 should include the concept of rate adaptation.
  • the concept of rate adaptation means that an Adaptive Multi Rate (AMR) codec mode is changed in response to the current radio conditions.
  • AMR Adaptive Multi Rate
  • lower AMR codec modes such as 4.75, 5.15, and 5.9 kbit/s can be used when the User Equipment (UE) is transmitting in bad radio conditions.
  • GSM Global System for Mobile communication
  • WCDMA uses a link adaptation based on fast power control that can change the transmission power with a frequency of 1500 Hz.
  • speech data is transmitted from a mobile transmitter over an air interface.
  • the transmitter selects one of the at least two speech data frames for transmission based on one or more predefined parameters.
  • the transport layer will have the possibility to send the speech data frame using the encoding that has the best chance of arriving at the receiver without bit errors.
  • the speech quality is enhanced in poor radio conditions.
  • the selection is based on available transmission power.
  • the selection is based on current radio link conditions.
  • the parallel encoded speech data frames are encoded as Adaptive MuI ti Rate frames.
  • the parallel encoded speech data frames are encoded as different combinations of classes of bits.
  • the speech data is transmitted over an air interface employing High Speed Packet Access transmission.
  • the invention also extends to a transmitter adapted to employ a transmission scheme as described above.
  • the transport layer will be enabled to send a speech data frame using the encoding that has the best chance of arriving at the receiver without bit errors. As a result the speech quality is enhanced in poor radio conditions.
  • - Fig. 1 is a view illustrating CS transmission over HSPA
  • - Fig. 2 is a view of a PDCP header
  • - Fig. 3 is a view illustrating the scheduling grant that corresponds to a transport format (TF)
  • - Fig. 4 is a view of parallel encoding of speech
  • Fig. 5 is a flowchart illustrating procedural steps performed when providing parallel encoding
  • - Fig. 6 is a transmitter adapted to use parallel encoding
  • - Fig. 7 is a flowchart illustrating procedural steps performed when providing parallel encoding in one embodiment
  • FIG. 8 is a flowchart illustrating procedural steps performed when providing parallel encoding in another embodiment
  • WCDMA Wideband Code Division Multiple Access
  • the radio condition in Wideband Code Division Multiple Access (WCDMA) needs to be well known in order to set the correct output power for the transmission.
  • WCDMA utilize an outer loop power control and a fast inner loop power control.
  • the outer loop power control adopts the power relationship between the data channel and the control channels by investigating the current bit error rate.
  • the inner power control loop adapts the terminals total output power in the transmissions by controlling the current radio condition with the help of pilot bits.
  • the inner power control is fast and adopts the output power with 1500 Hz.
  • the operator needs to provide the end user with a sufficient good quality when the user is running a telephony service like CS over HSPA.
  • the basic quality of the service is given by the target AMR codec mode that is used.
  • AMRl 2.2 is used as the target codec.
  • the quality of the air link deteriorates so much that it is not possible to send the bits encoded by the AMR 12.2 codec mode error- free over the air link.
  • AMR 5.9 can be used in such circumstances.
  • Such adaptation is the type of rate adaptation of AMR used in GSM today.
  • the drawback of the solution in GSM is that the information given to the AMR encoder basically comes from a too slow adaptation loop which runs in 50Hz.
  • WCDMA HSPA the link adaptation is much faster.
  • EUL the User Equipment
  • UE is granted a maximum bit rate (the scheduling grant) that in case of CS over HSPA can be set be equal to the bit rate of the AMR 12.2 mode if this mode is the target mode.
  • the scheduling grant that corresponds to a transport format (TF) is depicted in Fig. 3.
  • the fast power control will give an instant value of the power limit, which in the case of Fig. 3 is lower than the scheduling grant. Therefore, to have a large possibility to get an error free transmission a lower TF can be selected.
  • TF2 is selected.
  • an encoded frame that equals the size of the transport format is then provided when it is time to transmit the data. In the scenario above this would result in that an AMR encoded frame corresponding to TF2 is selected.
  • parallel coding of speech using a set of different encoder modes is performed.
  • the encoder can for example be an AMR codec. The result is that there will be a set of speech frames, all with a different compression ratio, that can be sent when it is time to transmit the speech. This is depicted in Fig. 4.
  • Fig. 4 use of parallel encoding of speech is depicted. The example is given in the context of CS over HSPA, but other transmission standards are envisaged.
  • speech is received by the microphone and encoded in parallel for example into a bit-stream using the AMR 12.2, AMR 7.95, AMR 5.9 and AMR 4.75 mode. All these frames are then fed to and available for the mechanism that chooses the speech frame to send.
  • the radio/transmission layer sends information about the current radio conditions to the mechanism that that selects the speech frame to send based on the available speech frames and information about the current radio conditions. In one embodiment the information about the current radio conditions can be power control information.
  • Fig. 5 a flowchart illustrating procedural steps performed in accordance with one embodiment of the invention is shown where speech data is to be transmitted over a radio interface.
  • a step 501 parallel encoding of speech data is performed thereby generating at least two speech data frames available for transmission.
  • a step 503 one of the parallel encoded speech data frames is selected for transmission based on some, one or many, parameter(s) fed to a selector.
  • the parameter(s) can be any suitable parameter(s) available including but not limited to available power and current radio link conditions.
  • the transmitter 600 comprises a parallel encoding block 601 for performing parallel encoding of speech data thereby generating at least two speech data frames available for transmission.
  • the transmitter 600 also comprises a selector 603 for selecting one of the parallel encoded speech data frames for transmission.
  • the selector may employ any suitable criteria for selecting speech data frame for example the selection can be based on some, one or many, parameter(s) fed to a selector.
  • the parameter(s) can be any suitable parameter(s) available including but not limited to available power and current radio link conditions.
  • a flowchart illustrating procedural steps performed in accordance with one embodiment of the invention shows the logical flow of the invention.
  • the power needed to transmit the bits of the target AMR codec rate is calculated.
  • Information from the fast power control will then tell if the UE is power limited or not. This is performed in a check in a step 703.
  • the UE is not power limited it sends the speech using the target rate in a step 705.
  • the UE will calculate how many bits that can be sent in a step 707. In this case it may be -160- 180 bits, and then the UE checks how large the individual encoded AMR frames are in the set of parallel encoded speech frames. Based on the information on which transport format that is possible to send the UE takes the encoded AMR frame that matches this TF and transmit that in a step 709.
  • the TF sizes are set to exactly match the different codec modes that are used in the procedure.
  • the number of different AMR encoding modes is implementation specific.
  • the number can be any suitable number, i.e. two or more.
  • the parallel coding of speech using a set of different encoder modes is performed by coding different classes of bits.
  • A-class, B- class and C-class bits are coded in the transmission in different combinations in the transmission and then selecting one of the modes for transmission.
  • A-class B-class and C-class bits are coded.
  • A-class bits and the B-class bits are coded.
  • A-class bits are coded in a first mode.
  • B-class bits are coded in a second mode.
  • A-class bits and the B-class bits are coded in a third mode.
  • A-class bits are coded in a third mode.
  • the transmission mode is then selected based on for example available power.
  • the choice can for example depend on current radio link conditions. In one embodiment the choice is made based on the importance of the B- and C-class bits.
  • Such an embodiment is illustrated in Fig. 8.
  • the power needed to transmit the bits of the target AMR codec rate is first calculated in a step 801.
  • Information from the fast power control will then tell if the UE is power limited or not. This is performed in a check in a step 803.
  • the UE is not power limited it sends the speech using the all bits, i.e. in the first mode, in a step 805.
  • the UE will calculate how many bits that can be sent in a step 807. In case there is room to transmit both A and B bits this will be done by selecting the second mode in a step 809. In case there is only room to transmit the A bits this mode, i.e. the third mode is selected in a step 811.
  • a protocol enhancement either a higher layer protocol (higher than the PDCP layer) or in the PDCP protocol must be used to inform that only a partial frame is sent. Below a higher layer than the PDCP is used to inform the encoder
  • AMR Partial speech Frame Pn (only A and B-class bits)
  • the transport layer will have the possibility to send the speech data frame using the encoding that has the best chance of arriving at the receiver without bit errors.
  • the speech quality is enhanced in poor radio conditions.

Landscapes

  • Engineering & Computer Science (AREA)
  • Quality & Reliability (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

L'invention porte sur la transmission de données de parole à partir d'un émetteur mobile sur une interface radio, dans laquelle au moins deux trames de données de parole parallèles différentes provenant d'un flux de données de parole sont codées. L'émetteur sélectionne l'une des deux trames, ou plus, de données de parole pour une transmission sur la base d'un ou plusieurs paramètres prédéfinis. La couche transport aura ainsi la possibilité d'envoyer la trame de données de parole en utilisant le codage qui a la plus forte probabilité d'arriver au récepteur sans erreurs binaires. En conséquence, la qualité de parole est améliorée dans de mauvaises conditions radio.
PCT/SE2009/050014 2008-02-05 2009-01-12 Transmission de parole robuste WO2009099381A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US2624308P 2008-02-05 2008-02-05
US61/026,243 2008-02-05

Publications (1)

Publication Number Publication Date
WO2009099381A1 true WO2009099381A1 (fr) 2009-08-13

Family

ID=40545883

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/SE2009/050014 WO2009099381A1 (fr) 2008-02-05 2009-01-12 Transmission de parole robuste

Country Status (1)

Country Link
WO (1) WO2009099381A1 (fr)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO1999060742A1 (fr) * 1998-05-15 1999-11-25 Conexant Systems, Inc. Systeme et procede d'adaptation de la cadence d'un vocodeur multicadence
WO2001003448A2 (fr) * 1999-07-05 2001-01-11 Nokia Corporation Procede de selection de methode de codage
WO2001052467A1 (fr) * 2000-01-10 2001-07-19 Qualcomm Incorporated Procede et appareil de prise en charge de donnees adaptatives a debits multiples (amr) dans un systeme de communications amcr
EP1235374A2 (fr) * 1996-08-28 2002-08-28 Ericsson, Inc. Circuit pour un appareil de communication

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP1235374A2 (fr) * 1996-08-28 2002-08-28 Ericsson, Inc. Circuit pour un appareil de communication
WO1999060742A1 (fr) * 1998-05-15 1999-11-25 Conexant Systems, Inc. Systeme et procede d'adaptation de la cadence d'un vocodeur multicadence
WO2001003448A2 (fr) * 1999-07-05 2001-01-11 Nokia Corporation Procede de selection de methode de codage
WO2001052467A1 (fr) * 2000-01-10 2001-07-19 Qualcomm Incorporated Procede et appareil de prise en charge de donnees adaptatives a debits multiples (amr) dans un systeme de communications amcr

Similar Documents

Publication Publication Date Title
US7787377B2 (en) Selective redundancy for Voice over Internet transmissions
US9179329B2 (en) Apparatus and method for interference mitigation
EP1586211B1 (fr) Système de rétroaction amélioré en utilisant decodage dynamique
US8432818B2 (en) System and method for link adaptation overhead reduction
US20070140123A1 (en) Control station apparatus, base station apparatus, terminal apparatus, packet communication system, and packet communication method
KR101407459B1 (ko) 가변 레이트 보코더들을 이용하는 이용자 장비에 대한 음성 인터넷 프로토콜 용량을 개선하기 위한 방법
JP2003283471A (ja) 基地局装置及びパケット伝送方法
US7945258B2 (en) Wireless device operable to manipulate high-speed shared control channel (HSSCCH) quality control through channel quality indication report manipulation
JP2007053771A (ja) Harqを支援する移動通信システムにおけるフィードバック信号の信頼度を制御する方法及び装置
WO2015035910A1 (fr) Procédé de transmission d'informations de commande en liaison montante, équipement d'utilisateur, et dispositif côté réseau
CN100484301C (zh) 高速上行分组接入业务重配置方法
JP2008099079A (ja) 移動通信システムの無線基地局における通信処理方式
JP5426574B2 (ja) Hspaを介した回線交換データの送信
US8411697B2 (en) Method and arrangement for improving media transmission quality using robust representation of media frames
CN103079261B (zh) 在无线电发射设备和无线电接收设备之间传送数据的方法
WO2009099364A1 (fr) Procédé et dispositif de commande de tampon de décalage
EP1984917B1 (fr) Procede et dispositif utilises pour ameliorer une qualite de transmission de contenu multimedia
WO2009099381A1 (fr) Transmission de parole robuste
US20100316001A1 (en) Method of Transmitting Synchronized Speech and Video
WO2009099373A1 (fr) Procédé de transmission de parole
Axelsson et al. Improved latency performance with gsm/edge continued evolution
KR20080091639A (ko) 이동통신 시스템에서 불연속 수신모드 동작의 지시 방법 및장치
Iwamura et al. Biased adaptive modulation/coding to provide VoIP QoS over HSDPA
Nombela Gordo Evaluating video quality degradation from data packet losses in an LTE environment
Song Jitter Management in Circuit Switched Voice over HSPA Networks

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 09707175

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 09707175

Country of ref document: EP

Kind code of ref document: A1