US20130336333A1 - External jitter buffer in a packet voice system - Google Patents

External jitter buffer in a packet voice system Download PDF

Info

Publication number
US20130336333A1
US20130336333A1 US13/967,975 US201313967975A US2013336333A1 US 20130336333 A1 US20130336333 A1 US 20130336333A1 US 201313967975 A US201313967975 A US 201313967975A US 2013336333 A1 US2013336333 A1 US 2013336333A1
Authority
US
United States
Prior art keywords
voice
jitter buffer
packets
packet
processor
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US13/967,975
Inventor
Wilfrid LeBlanc
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Avago Technologies International Sales Pte Ltd
Original Assignee
Broadcom Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Broadcom Corp filed Critical Broadcom Corp
Priority to US13/967,975 priority Critical patent/US20130336333A1/en
Assigned to BROADCOM CORPORATION reassignment BROADCOM CORPORATION ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: LEBLANC, WILFRID
Publication of US20130336333A1 publication Critical patent/US20130336333A1/en
Assigned to BANK OF AMERICA, N.A., AS COLLATERAL AGENT reassignment BANK OF AMERICA, N.A., AS COLLATERAL AGENT PATENT SECURITY AGREEMENT Assignors: BROADCOM CORPORATION
Assigned to AVAGO TECHNOLOGIES GENERAL IP (SINGAPORE) PTE. LTD. reassignment AVAGO TECHNOLOGIES GENERAL IP (SINGAPORE) PTE. LTD. ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS). Assignors: BROADCOM CORPORATION
Assigned to BROADCOM CORPORATION reassignment BROADCOM CORPORATION TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS Assignors: BANK OF AMERICA, N.A., AS COLLATERAL AGENT
Abandoned legal-status Critical Current

Links

Images

Classifications

    • H04L29/06176
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04JMULTIPLEX COMMUNICATION
    • H04J3/00Time-division multiplex systems
    • H04J3/02Details
    • H04J3/06Synchronising arrangements
    • H04J3/062Synchronisation of signals having the same nominal but fluctuating bit rates, e.g. using buffers
    • H04J3/0632Synchronisation of packets and cells, e.g. transmission of voice via a packet network, circuit emulation service [CES]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/64Hybrid switching systems
    • H04L12/6418Hybrid transport
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L12/00Data switching networks
    • H04L12/64Hybrid switching systems
    • H04L12/6418Hybrid transport
    • H04L2012/6489Buffer Management, Threshold setting, Scheduling, Shaping

Definitions

  • Irregularity (or jitter) in the time of arrival of transmitted packets can jeopardize the quality of speech transmitted by a packet network. Since speech communications is a continuous process, each packet should be available at the receiving end in time for its usage (a packet is used by decoding its content and playing the decoded speech to the listener). Congestion due to inadequate bandwidth often leads to long delays in the delivery of time-sensitive packets. A problem arises, for example, if a few packets are delayed at a node of the packet network. At the receiving end, since the speech packets have not arrived, the listener will experience a discontinuity in speech. Moreover, when the packets finally arrive to their destination, they might arrive too late to be used, and will be dropped. In this case, the listener will lose some of the information. For voice data, packets that are lost or discarded result in gaps, silence, and clipping in real-time audio playback.
  • a FIFO First-In-First-Out buffer type
  • a jitter buffer is commonly called a jitter buffer. If the number of delayed packets is less than the size of the buffer, then the buffer will not become empty, and the listener will not experience speech discontinuity or loss. The greater the potential jitter, the larger the buffer has to be, in order to give more room for the playback of previous packets while waiting for the subsequent arrival of later packets.
  • Some existing voice data processors incorporate an internal jitter buffer that shares the same processor with other voice processing functions. It is desirable to implement a voice processing system that has a jitter buffer that has enough memory to accommodate large amounts of jitter and yet is relatively inexpensive to manufacture.
  • One aspect of the present invention is directed to a packet voice communication system having a voice processor and a jitter buffer external to the voice processor.
  • the external jitter buffer receives voice packets from a network and holds the received voice packets.
  • the voice processor synchronously retrieves voice packets from the external jitter buffer and processes the retrieved voice data packets.
  • the external jitter buffer places an indicator in each of the packets it holds, the indicator being indicative of how long the voice packet was held in the jitter buffer.
  • Another aspect of the present invention is directed to a method of operating a packet voice communication system.
  • a voice packet is received from a network and held in an external jitter buffer external to a voice processor.
  • An indicator is placed in the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer.
  • the voice packet, including the indicator is retrieved from the external jitter buffer and processed by the voice processor.
  • Another aspect of the present invention is directed to a method of operating a packet voice communication system.
  • a voice packet is received from a network and held in a jitter buffer external to a voice processor.
  • the received voice packet is transferred from the external jitter buffer to a packet queue in the voice processor and the voice packet is processed by the voice processor.
  • the external jitter buffer is notified of how many voice packets are in the voice processor's packet queue.
  • FIG. 1 is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 1A is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 2 is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 5 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention.
  • FIG. 1 An exemplary topology is shown in FIG. 1 with a packet-based network 10 providing a communication medium between telephony devices.
  • Each network gateway 12 a , 12 b , 12 c includes a signal processing system which provides an interface between the packet-based network 10 and a number of telephony devices.
  • each network gateway 12 a , 12 b , 12 c supports a telephony device 13 a , 13 b , 13 c .
  • Each network gateway 12 a , 12 b , 12 c could support a variety of different telephony arrangements.
  • each network gateway might support any number of telephony devices, circuit-switched networks and/or packet-based networks including, among others, analog telephones, ethernet phones, fax machines, data modems, PSTN lines (Public Switching Telephone Network), ISDN lines (Integrated Services Digital Network), T1 systems, PBXs, key systems, or any other conventional telephony device and/or circuit-switched/packet-based network.
  • PSTN lines Public Switching Telephone Network
  • ISDN lines Integrated Services Digital Network
  • T1 systems PBXs
  • key systems or any other conventional telephony device and/or circuit-switched/packet-based network.
  • two of the network gateways 12 a , 12 b provide a direct interface between their respective telephony devices and the packet-based network 10 .
  • the other network gateway 12 c is connected to its respective telephony device through a PSTN 19 .
  • the network gateways 12 a , 12 b , 12 c permit voice, fax and modem data to be carried over packet-based networks such as PCs running through a USB (Universal Serial Bus) or an asynchronous serial interface, Local Area Networks (LAN) such as Ethernet, Wide Area Networks (WAN) such as Internet Protocol (IP), Frame Relay (FR), Asynchronous Transfer Mode (ATM), Public Digital Cellular Network such as TDMA (Time Division Multiple Access IS-13 ⁇ ), CDMA (Code Division Multiple Access IS-9x) or GSM (Global System for Mobile communications) for terrestrial wireless applications, or any other packet-based system.
  • packet-based networks such as PCs running through a USB (Universal Serial Bus) or an asynchronous serial interface
  • LAN Local Area Networks (LAN) such as Ethernet
  • WAN Wide Area Networks (WAN) such as Internet Protocol (IP), Frame Relay (FR), Asynchronous Transfer Mode (ATM), Public Digital Cellular Network such as TDMA (Time Division Multiple Access IS-13 ⁇ ), CDMA (Code Division Multiple Access IS-9x) or
  • FIG. 1A Another exemplary topology is shown in FIG. 1A .
  • the topology of FIG. 1A is similar to that of FIG. 1 but includes a second packet-based network 16 that is connected to packet-based network 10 and to telephony device 13 b via network gateway 12 b .
  • the signal processing system of network gateway 12 b provides an interface between packet-based network 10 and packet-based network 16 in addition to an interface between packet-based networks 10 , 16 and telephony device 13 b .
  • Network gateway 12 d includes a signal processing system which provides an interface between packet-based network 16 and telephony device 13 d.
  • FIG. 2 is a functional block diagram representing a communication system that enables the transmission of voice data over a packet-based system such as Voice over IP (VoIP, H.323), Voice over Frame Relay (VOFR, FRF-11), Voice Telephony over ATM (VTOA), or any other proprietary network, according to an illustrative embodiment of the present invention.
  • voice data can also be carried over traditional media such as time division multiplex (TDM) networks and voice storage and playback systems.
  • Network gateway 55 a supports the exchange of voice between a traditional circuit-switched network 58 and packet-based networks 56 a and 56 b .
  • Network gateways 55 b , 55 c , 55 d , 55 e support the exchange of voice between packet-based network 56 a and a number of telephony devices 57 b , 57 c , 57 d , 57 e .
  • network gateways 55 f , 55 g , 55 h , 55 i support the exchange of voice between packet-based network 56 b and telephony devices 57 f , 57 g , 57 h , 57 i .
  • Telephony devices 57 a , 57 b , 57 c , 57 d , 57 e , 57 f , 57 g , 57 h , 57 i can be any type of telephony device including telephones, facsimile machines and modems.
  • voice packets are transmitted over a network synchronously.
  • packets can arrive at the receiving end asynchronously due to a variety of factors. Irregularity in the time of arrival of transmitted packets, often referred to as “jitter,” can jeopardize the quality of speech transmitted by the packet network.
  • jitter Irregularity in the time of arrival of transmitted packets, often referred to as “jitter,” can jeopardize the quality of speech transmitted by the packet network.
  • jitter Irregularity in the time of arrival of transmitted packets, often referred to as “jitter,” can jeopardize the quality of speech transmitted by the packet network.
  • jitter can jeopardize the quality of speech transmitted by the packet network.
  • jitter can jeopardize the quality of speech transmitted by the packet network.
  • each packet should be available at the receiving end in time for its usage (a packet is used by decoding its content and playing the decoded speech to the listener).
  • the listener will experience a discontinuity in speech. Moreover, when the packets finally arrive to their destination, they might arrive too late to be used, and will be dropped. In this case, the listener will lose some of the information. For voice data, packets that are lost or discarded result in gaps, silence, and clipping in real-time audio playback.
  • FIG. 3 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention.
  • Jitter buffer 310 serves to remove jitter in the stream of voice packets that is provided to the voice processor 340 .
  • Jitter buffer processor 320 of jitter buffer 310 receives voice packets from the packet network 300 .
  • the jitter buffer 310 acts as an intermediate buffer at the receiver end, allowing the packets to be played out of the jitter buffer 310 at a regular or standard predetermined replay rate by the voice processor 340 independent of the rate of arrival of the packets from the packet network 300 .
  • the jitter buffer processor 320 stores incoming voice packets in jitter buffer memory 330 before the packets are replayed.
  • the jitter buffer memory 330 is a relatively high-capacity memory unit. Also, jitter buffer memory 330 is illustratively a relatively low-speed (and therefore relatively inexpensive) memory unit. In an illustrative embodiment of the present invention, jitter buffer memory 330 is a Dynamic Random Access Memory (DRAM) unit. Additionally, in an illustrative embodiment of the present invention, jitter buffer processor 320 has a relatively low amount of processor power. In the illustrative embodiment shown in FIG.
  • DRAM Dynamic Random Access Memory
  • the voice processor 340 pulls voice packets out of the jitter buffer 310 at a regular predetermined rate. If the jitter buffer 310 is empty, a null packet is transferred to the voice processor 340 and the listener will experience discontinuity or loss in the audio produced therefrom. If the number of delayed packets is less than the size of the jitter buffer memory 330 , then the jitter buffer memory 330 will not become empty, and the listener will not experience speech discontinuity or loss. Null frames are also released after SID (silence identifier) packets (voice activity detection (VAD) active on far end) and when lost frames are detected.
  • SID seriousness of packets
  • VAD voice activity detection
  • the voice processor 340 decodes the voice packets to produce decoded voice.
  • the voice processor 340 also performs a variety of other functions on the voice data, such as echo cancellation, automatic gain control, call discrimination, DTMF (dual-tone, multi-frequency) generation, comfort noise generation and/or lost frame recovery.
  • the voice processor 340 is a relatively high-speed processor.
  • the voice processor 340 is a higher-speed processor than the jitter buffer processor 320 .
  • the voice processor 340 utilizes voice processor memory 350 for its processing tasks.
  • the voice processor memory 350 is a relatively high-speed memory unit having a relatively moderate amount of storage capacity.
  • the voice processor memory 350 is a higher-speed memory unit than the jitter buffer memory 330 and the voice processor memory 350 has less storage capacity than jitter buffer memory 330 .
  • the jitter buffer 310 is external to the voice processor 340 .
  • the jitter buffer 310 is on a separate processor from the voice processor 340 .
  • the jitter buffer 310 is on the same processor as voice processor 340 , but operates in a different process or thread than the other voice processing functions.
  • the external jitter buffer 310 places an indicator in each of the packets it holds, the indicator being indicative of how long the voice packet was held in the jitter buffer 310 .
  • such an indicator is placed in the packet header of each voice packet buffered by the jitter buffer 310 .
  • the indicator directly indicates a length of time the voice packet was held in the jitter buffer 310 .
  • the indicator indicates the arrival time of the packet, that is, the time that the voice packet arrived at the external jitter buffer 310 .
  • the indicator indicates how many voice packets are held in the jitter buffer 310 .
  • the indicator indicates a time span represented by the packets held in the jitter buffer 310 . That is, if the jitter buffer holds a plurality of packets, each packet represents a time amount of data, say 10 msec worth of data. Now, if one or more packets arrived at the jitter buffer 310 out of order, the jitter buffer 310 might not hold all of the packets that are between hypothetical packets X and Y in the packet sequence, i.e., the sequence in which the packets are intended to be played.
  • the indicator indicates the span of time represented by the earliest packet (call it packet X) and latest packet (packet Y) in the packet sequence that are held in the jitter buffer 310 , along with all of the packets between packets X and Y in the packet sequence, regardless of whether they are held in the jitter buffer 310 . For example, say packets 1, 2, 3 and 5 of a packet sequence are held in the jitter buffer 310 . If each packet represents 10 msec of video, the indicator would indicate 50 msec.
  • the voice processor 340 adjusts the rate at which it retrieves voice packets from the external jitter buffer 310 based upon a value of the indicator.
  • the voice processor 340 retrieves voice packets at a faster rate if the holding times, that is, the length of time that voice packets are held in the jitter buffer, are relatively high. Conversely, the voice processor 340 retrieves voice packets at a slower rate if the holding times are relatively low, or if null packets indicate jitter buffer underflow.
  • the voice processor 340 retrieves voice packets at a faster rate if the jitter buffer 310 is relatively full and retrieves voice packets at a slower rate if the jitter buffer is relatively less full.
  • the voice processor 340 includes a jitter buffer manager that retrieves voice packets from the jitter buffer 310 .
  • the jitter buffer manager adjusts the rate at which it retrieves voice packets from the external jitter buffer 310 based upon a value of the indicator.
  • the jitter buffer 310 passes data to the voice processor 340 once per clock pulse. In such an embodiment, the voice processor 340 speeds up or slows down its clock depending on the value of the indicator.
  • FIG. 4 is a flowchart representing a method of operating a packet voice communication system according to an exemplary embodiment of the present invention.
  • a voice packet is received from a network.
  • the voice packet is held in external jitter buffer 310 ( FIG. 3 ).
  • an indicator is placed in the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer.
  • the voice packet, including the indicator is retrieved from the external jitter buffer and processed by the voice processor.
  • the rate of retrieval of packets from the jitter buffer 310 is increased.
  • the holding time of the voice packet in the jitter buffer 310 was less than or equal to threshold 1 , it is determined if the holding time of the voice packet in the jitter buffer 310 was less than a second threshold, thresholds, that is lower than threshold 1 .
  • the rate of retrieval of packets from the jitter buffer 310 is decreased.
  • the rate of retrieval of packets from jitter buffer 310 is maintained at its present level.
  • the indicator that is placed in the voice packet indicates how full or empty the jitter buffer 310 is.
  • a first threshold is maintained, wherein if the number of packets held in the jitter buffer is above the threshold, the rate of retrieval of packets from the jitter buffer is increased.
  • a second threshold lower than the first threshold, is maintained, wherein if the number of packets held in the jitter buffer is below the second threshold, the rate of retrieval of packets from the jitter buffer is decreased.
  • any number of holding-time thresholds are maintained and the rate of retrieving packets from the jitter buffer is increased or decreased by varying amounts according to the holding time of packets in the buffer. For example, if the holding time is greater than a first, relatively high, threshold, the retrieval rate is increased by a first, relatively high, amount. If the holding time is lower than the first threshold but higher than a second threshold, the retrieval rate is increased by a second amount that is less than the first amount. If the holding time is lower than the second threshold but higher than a third threshold, the retrieval rate is maintained at its present rate. If the holding time is lower than the third threshold but higher than a fourth threshold, the retrieval rate is decreased by a first amount.
  • the retrieval rate is increased by a second amount that is greater than the first amount.
  • the rate of retrieval of packets from the jitter buffer 310 is set at one of a plurality of predetermined levels, according to the holding time. The higher the holding time, the higher the retrieval rate.
  • FIG. 5 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention.
  • the system of FIG. 5 is similar to the system of FIG. 3 .
  • the jitter buffer 310 actively passes voice packets to the voice processor 340
  • voice packets are pulled out of the jitter buffer 310 by the voice processor 340 .
  • the voice processor includes a packet queue in which voice packets received from the jitter buffer 310 are stored prior to processing the voice packets.
  • the packet queue resides in voice processor memory 350 .
  • the packet queue is illustratively a relatively small queue.
  • the packet queue has a capacity of 2-3 voice packets.
  • the voice processor notifies the jitter buffer 310 of how many packets are in the packet queue via feedback line 360 .
  • the voice processor 340 periodically notifies the jitter buffer 310 of how many voice packets are in the packet queue.
  • the voice processor 340 notifies the jitter buffer 310 of how many voice packets are in the packet queue every time the number of packets in the queue changes.
  • the external jitter buffer 310 adjusts the rate at which it provides voice packets to the voice processor 340 based upon how many voice packets are in the voice processor's packet queue.
  • the voice processor 340 retrieves voice packets at a faster rate if there are relatively few voice packets in the voice processor's packet queue. Conversely, voice processor 340 retrieves voice packets at a slower rate if there are a relatively higher number of voice packets in the voice processor's packet queue.
  • the jitter buffer 310 passes data to the voice processor 340 once per clock pulse. In such an embodiment, the voice processor 340 causes the clock that controls the jitter buffer 310 to speed up or slow down depending on the value of the indicator.
  • FIG. 6 is a flowchart representing a method of operating a packet voice communication system according to an illustrative embodiment of the present invention.
  • the example of FIG. 6 pertains to a voice processor having a packet queue that has a capacity of two voice packets. It will be appreciated that the voice processor's packet queue can be of substantially any size in accordance with the present invention.
  • a voice packet is received from a network.
  • the voice packet is held in external jitter buffer 310 ( FIG. 5 ).
  • the voice packet is transferred from the external jitter buffer 310 to a packet queue in the voice processor 340 ( FIG. 5 ).
  • the voice packet is processed by the voice processor 340 .
  • the voice processor 340 notifies the jitter buffer processor 320 ( FIG. 5 ) of how many packets are in the voice processor's packet queue.
  • decision box 650 depending on the number of voice packets in the voice processor's packet queue, one of three actions are taken. If there are not any voice packets in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is increased, as shown at step 660 . If there is one voice packet in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is maintained at its present rate, as shown at step 670 . If there are two voice packets in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is decreased, as shown at step 680 .
  • a first threshold is maintained, wherein if the number of packets held in the voice processor's packet queue is below the threshold, the rate of retrieval of packets from the jitter buffer is increased.
  • a second threshold higher than the first threshold, is maintained, wherein if the number of packets held in the packet queue is above the second threshold, the rate of retrieval of packets from the jitter buffer is decreased.
  • the rate of transferring packets from the jitter buffer to the voice processor is increased or decreased by varying amounts according to the number of voice packets in the packet queue. For example, in a system having a voice processor that has a packet queue having a capacity of four packets, if there are no voice packets in the packet queue, the transfer rate is increased by a first, relatively high, amount. If there is one voice packet in the packet queue, the transfer rate is increased by a second amount that is less than the first amount. If there are two voice packets in the packet queue, the transfer rate is maintained at its present rate.
  • the transfer rate is decreased by a first amount. If the packet queue is full (four packets), the transfer rate is decreased by a second amount that is greater than the first amount.
  • the rate of transferring packets from the jitter buffer 310 ( FIG. 5 ) to the voice processor is set at one of a plurality of predetermined levels, according to the number of packets in the packet queue. The emptier the packet queue, the higher the transfer rate.
  • the present invention is applicable to any real-time media, such as audio and video, in addition to the voice media illustratively described herein.
  • the invention is applicable to any type of transmitted data elements, such as frames, in addition to the application to packet data described herein.
  • Those skilled in the art will understand that various modifications may be made to the described embodiment.
  • the invention itself herein will suggest solutions to other tasks and adaptations for other applications. It is therefore desired that the present embodiments be considered in all respects as illustrative and not restrictive, reference being made to the appended claims rather than the foregoing description to indicate the scope of the invention.

Abstract

A packet voice communication system having a jitter buffer external to a voice processor. The jitter buffer stores voice packets received from a packet network. The voice processor processes the voice packets from the jitter buffer. A jitter buffer processor may place an indicator in each voice packet it holds. The indicator can indicate a length of time the voice packet was held. The rate at which packets come from the jitter buffer may be based upon the indicator, a higher rate if holding times are high and a slower rate if low. The voice processor can store the voice packets in a packet queue prior to processing the voice packets. The rate voice packets come to the voice processor may be based upon how full the packet queue is, a higher rate if the packet queue is relatively empty and a slower rate if relatively full.

Description

    CROSS REFERENCE TO RELATED APPLICATIONS
  • This application is a continuation of U.S. patent application Ser. No. 10/427,807 filed on May 1, 2003, which is hereby expressly incorporated herein by reference and which claims priority to and the benefit of U.S. Provisional Application No. 60/412,276, entitled “EXTERNAL JITTER BUFFER IN A PACKET VOICE SYSTEM” and filed on Sep. 20, 2002, which is hereby expressly incorporated herein by reference.
  • BACKGROUND
  • The present invention relates generally to telecommunications systems, and more particularly, to a packet-based system having a jitter buffer external to a voice processor.
  • Global and local communication systems are rapidly changing from switched network systems to packet network systems. Packet network systems transmit data, speech, and video. Examples of a packet network are the Internet (a globally connected packet network system) and intranets (a local area packet network system). While speech communication in switched network systems is carried by a direct point-to-point connection, speech communication in packet network systems is performed by packing speech frames and transmitting the frames over the network.
  • Irregularity (or jitter) in the time of arrival of transmitted packets can jeopardize the quality of speech transmitted by a packet network. Since speech communications is a continuous process, each packet should be available at the receiving end in time for its usage (a packet is used by decoding its content and playing the decoded speech to the listener). Congestion due to inadequate bandwidth often leads to long delays in the delivery of time-sensitive packets. A problem arises, for example, if a few packets are delayed at a node of the packet network. At the receiving end, since the speech packets have not arrived, the listener will experience a discontinuity in speech. Moreover, when the packets finally arrive to their destination, they might arrive too late to be used, and will be dropped. In this case, the listener will lose some of the information. For voice data, packets that are lost or discarded result in gaps, silence, and clipping in real-time audio playback.
  • One possible solution for the irregular time of arrival of speech packets has been the buffering of several speech packets before using them to produce the speech. The speech packets are put in a FIFO (First-In-First-Out) buffer type, which holds several packets. Such a buffer is commonly called a jitter buffer. If the number of delayed packets is less than the size of the buffer, then the buffer will not become empty, and the listener will not experience speech discontinuity or loss. The greater the potential jitter, the larger the buffer has to be, in order to give more room for the playback of previous packets while waiting for the subsequent arrival of later packets. Some existing voice data processors incorporate an internal jitter buffer that shares the same processor with other voice processing functions. It is desirable to implement a voice processing system that has a jitter buffer that has enough memory to accommodate large amounts of jitter and yet is relatively inexpensive to manufacture.
  • Further limitations and disadvantages of conventional and traditional approaches will become apparent to one of skill in the art through comparison of such systems with the present invention, as set forth in the remainder of the present application with reference to the drawings.
  • BRIEF SUMMARY OF THE INVENTION
  • One aspect of the present invention is directed to a packet voice communication system having a voice processor and a jitter buffer external to the voice processor. The external jitter buffer receives voice packets from a network and holds the received voice packets. The voice processor synchronously retrieves voice packets from the external jitter buffer and processes the retrieved voice data packets. The external jitter buffer places an indicator in each of the packets it holds, the indicator being indicative of how long the voice packet was held in the jitter buffer.
  • Another aspect of the present invention is directed to a method of operating a packet voice communication system. According to the method, a voice packet is received from a network and held in an external jitter buffer external to a voice processor. An indicator is placed in the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer. The voice packet, including the indicator, is retrieved from the external jitter buffer and processed by the voice processor.
  • Another aspect of the present invention is directed to a packet voice communication system having a voice processor and a jitter buffer external to the voice processor. The external jitter buffer receives voice packets from a network, holds the received voice packets, and synchronously provides the voice packets to the voice processor. The voice processor receives the voice packets in a packet queue and processes the received voice packets. The voice processor notifies the external jitter buffer of how many voice packets are in the voice processor's packet queue.
  • Another aspect of the present invention is directed to a method of operating a packet voice communication system. According to the method, a voice packet is received from a network and held in a jitter buffer external to a voice processor. The received voice packet is transferred from the external jitter buffer to a packet queue in the voice processor and the voice packet is processed by the voice processor. The external jitter buffer is notified of how many voice packets are in the voice processor's packet queue.
  • It is understood that other embodiments of the present invention will become readily apparent to those skilled in the art from the following detailed description, wherein embodiments of the invention are shown and described only by way of illustration of the best modes contemplated for carrying out the invention. As will be realized, the invention is capable of other and different embodiments and its several details are capable of modification in various other respects, all without departing from the spirit and scope of the present invention. Accordingly, the drawings and detailed description are to be regarded as illustrative in nature and not as restrictive.
  • BRIEF DESCRIPTION OF THE DRAWINGS
  • These and other features, aspects, and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings where:
  • FIG. 1 is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 1A is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 2 is a functional block diagram representing a packet-based communication system according to an illustrative embodiment of the present invention.
  • FIG. 3 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention.
  • FIG. 4 is a flowchart representing a method of operating a packet voice communication system according to an illustrative embodiment of the present invention.
  • FIG. 5 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention.
  • FIG. 6 is a flowchart representing a method of operating a packet voice communication system according to an illustrative embodiment of the present invention.
  • DETAILED DESCRIPTION OF THE INVENTION
  • In an illustrative embodiment of the present invention, a signal processing system is employed to interface voice telephony devices with packet-based networks. Voice telephony devices include, by way of example, analog and digital phones, ethernet phones, Internet Protocol phones, interactive voice response systems, private branch exchanges (PBXs) and any other conventional voice telephony devices known in the art. The described preferred embodiment of the signal processing system can be implemented with a variety of technologies including, by way of example, embedded communications software that enables transmission of voice data over packet-based networks. The embedded communications software is preferably run on programmable digital signal processors (DSPs) and is used in gateways, remote access servers, PBXs, and other packet-based network appliances.
  • An exemplary topology is shown in FIG. 1 with a packet-based network 10 providing a communication medium between telephony devices. Each network gateway 12 a, 12 b, 12 c includes a signal processing system which provides an interface between the packet-based network 10 and a number of telephony devices. In the described exemplary embodiment, each network gateway 12 a, 12 b, 12 c supports a telephony device 13 a, 13 b, 13 c. Each network gateway 12 a, 12 b, 12 c could support a variety of different telephony arrangements. By way of example, each network gateway might support any number of telephony devices, circuit-switched networks and/or packet-based networks including, among others, analog telephones, ethernet phones, fax machines, data modems, PSTN lines (Public Switching Telephone Network), ISDN lines (Integrated Services Digital Network), T1 systems, PBXs, key systems, or any other conventional telephony device and/or circuit-switched/packet-based network. In the described exemplary embodiment, two of the network gateways 12 a, 12 b provide a direct interface between their respective telephony devices and the packet-based network 10. The other network gateway 12 c is connected to its respective telephony device through a PSTN 19. The network gateways 12 a, 12 b, 12 c permit voice, fax and modem data to be carried over packet-based networks such as PCs running through a USB (Universal Serial Bus) or an asynchronous serial interface, Local Area Networks (LAN) such as Ethernet, Wide Area Networks (WAN) such as Internet Protocol (IP), Frame Relay (FR), Asynchronous Transfer Mode (ATM), Public Digital Cellular Network such as TDMA (Time Division Multiple Access IS-13×), CDMA (Code Division Multiple Access IS-9x) or GSM (Global System for Mobile communications) for terrestrial wireless applications, or any other packet-based system.
  • Another exemplary topology is shown in FIG. 1A. The topology of FIG. 1A is similar to that of FIG. 1 but includes a second packet-based network 16 that is connected to packet-based network 10 and to telephony device 13 b via network gateway 12 b. The signal processing system of network gateway 12 b provides an interface between packet-based network 10 and packet-based network 16 in addition to an interface between packet-based networks 10, 16 and telephony device 13 b. Network gateway 12 d includes a signal processing system which provides an interface between packet-based network 16 and telephony device 13 d.
  • FIG. 2 is a functional block diagram representing a communication system that enables the transmission of voice data over a packet-based system such as Voice over IP (VoIP, H.323), Voice over Frame Relay (VOFR, FRF-11), Voice Telephony over ATM (VTOA), or any other proprietary network, according to an illustrative embodiment of the present invention. In one embodiment of the present invention, voice data can also be carried over traditional media such as time division multiplex (TDM) networks and voice storage and playback systems. Network gateway 55 a supports the exchange of voice between a traditional circuit-switched network 58 and packet-based networks 56 a and 56 b. Network gateways 55 b, 55 c, 55 d, 55 e support the exchange of voice between packet-based network 56 a and a number of telephony devices 57 b, 57 c, 57 d, 57 e. In addition, network gateways 55 f, 55 g, 55 h, 55 i support the exchange of voice between packet-based network 56 b and telephony devices 57 f, 57 g, 57 h, 57 i. Telephony devices 57 a, 57 b, 57 c, 57 d, 57 e, 57 f, 57 g, 57 h, 57 i can be any type of telephony device including telephones, facsimile machines and modems.
  • In general, voice packets are transmitted over a network synchronously. However, packets can arrive at the receiving end asynchronously due to a variety of factors. Irregularity in the time of arrival of transmitted packets, often referred to as “jitter,” can jeopardize the quality of speech transmitted by the packet network. Since speech communication is a continuous process, each packet should be available at the receiving end in time for its usage (a packet is used by decoding its content and playing the decoded speech to the listener). Congestion due to inadequate bandwidth often leads to long delays in the delivery of time-sensitive packets. A problem arises, for example, if a few packets are delayed at a node of the packet network. At the receiving end, since the speech packets have not arrived, the listener will experience a discontinuity in speech. Moreover, when the packets finally arrive to their destination, they might arrive too late to be used, and will be dropped. In this case, the listener will lose some of the information. For voice data, packets that are lost or discarded result in gaps, silence, and clipping in real-time audio playback.
  • FIG. 3 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention. Jitter buffer 310 serves to remove jitter in the stream of voice packets that is provided to the voice processor 340. Jitter buffer processor 320 of jitter buffer 310 receives voice packets from the packet network 300. The jitter buffer 310 acts as an intermediate buffer at the receiver end, allowing the packets to be played out of the jitter buffer 310 at a regular or standard predetermined replay rate by the voice processor 340 independent of the rate of arrival of the packets from the packet network 300. The jitter buffer processor 320 stores incoming voice packets in jitter buffer memory 330 before the packets are replayed. For example, in an exemplary embodiment wherein each packet represents 10 msec of video data, packets are transferred from the jitter buffer 310 to the voice processor once every 10 msec. In an illustrative embodiment of the present invention, the jitter buffer memory 330 is a relatively high-capacity memory unit. Also, jitter buffer memory 330 is illustratively a relatively low-speed (and therefore relatively inexpensive) memory unit. In an illustrative embodiment of the present invention, jitter buffer memory 330 is a Dynamic Random Access Memory (DRAM) unit. Additionally, in an illustrative embodiment of the present invention, jitter buffer processor 320 has a relatively low amount of processor power. In the illustrative embodiment shown in FIG. 3, the voice processor 340 pulls voice packets out of the jitter buffer 310 at a regular predetermined rate. If the jitter buffer 310 is empty, a null packet is transferred to the voice processor 340 and the listener will experience discontinuity or loss in the audio produced therefrom. If the number of delayed packets is less than the size of the jitter buffer memory 330, then the jitter buffer memory 330 will not become empty, and the listener will not experience speech discontinuity or loss. Null frames are also released after SID (silence identifier) packets (voice activity detection (VAD) active on far end) and when lost frames are detected.
  • The voice processor 340 decodes the voice packets to produce decoded voice. In an illustrative embodiment of the present invention, the voice processor 340 also performs a variety of other functions on the voice data, such as echo cancellation, automatic gain control, call discrimination, DTMF (dual-tone, multi-frequency) generation, comfort noise generation and/or lost frame recovery. In an exemplary embodiment of the present invention, the voice processor 340 is a relatively high-speed processor. In particular, in an exemplary embodiment, the voice processor 340 is a higher-speed processor than the jitter buffer processor 320. The voice processor 340 utilizes voice processor memory 350 for its processing tasks. In an exemplary embodiment of the present invention, the voice processor memory 350 is a relatively high-speed memory unit having a relatively moderate amount of storage capacity. In particular, in an exemplary embodiment, the voice processor memory 350 is a higher-speed memory unit than the jitter buffer memory 330 and the voice processor memory 350 has less storage capacity than jitter buffer memory 330.
  • According to an illustrative embodiment of the present invention, the jitter buffer 310 is external to the voice processor 340. In one embodiment of the present invention, the jitter buffer 310 is on a separate processor from the voice processor 340. In an alternative embodiment of the present invention, the jitter buffer 310 is on the same processor as voice processor 340, but operates in a different process or thread than the other voice processing functions.
  • According to one embodiment of the present invention, the external jitter buffer 310 places an indicator in each of the packets it holds, the indicator being indicative of how long the voice packet was held in the jitter buffer 310. In one embodiment, such an indicator is placed in the packet header of each voice packet buffered by the jitter buffer 310. In an exemplary embodiment, the indicator directly indicates a length of time the voice packet was held in the jitter buffer 310. In another embodiment, the indicator indicates the arrival time of the packet, that is, the time that the voice packet arrived at the external jitter buffer 310. In still another embodiment, the indicator indicates how many voice packets are held in the jitter buffer 310.
  • In yet another embodiment, the indicator indicates a time span represented by the packets held in the jitter buffer 310. That is, if the jitter buffer holds a plurality of packets, each packet represents a time amount of data, say 10 msec worth of data. Now, if one or more packets arrived at the jitter buffer 310 out of order, the jitter buffer 310 might not hold all of the packets that are between hypothetical packets X and Y in the packet sequence, i.e., the sequence in which the packets are intended to be played. In this case, in this illustrative embodiment of the present invention, the indicator indicates the span of time represented by the earliest packet (call it packet X) and latest packet (packet Y) in the packet sequence that are held in the jitter buffer 310, along with all of the packets between packets X and Y in the packet sequence, regardless of whether they are held in the jitter buffer 310. For example, say packets 1, 2, 3 and 5 of a packet sequence are held in the jitter buffer 310. If each packet represents 10 msec of video, the indicator would indicate 50 msec.
  • In an illustrative embodiment of the present invention, the voice processor 340 adjusts the rate at which it retrieves voice packets from the external jitter buffer 310 based upon a value of the indicator. The voice processor 340 retrieves voice packets at a faster rate if the holding times, that is, the length of time that voice packets are held in the jitter buffer, are relatively high. Conversely, the voice processor 340 retrieves voice packets at a slower rate if the holding times are relatively low, or if null packets indicate jitter buffer underflow. In the embodiment wherein the indicator indicates how many voice packets are held in the jitter buffer 310, the voice processor 340 retrieves voice packets at a faster rate if the jitter buffer 310 is relatively full and retrieves voice packets at a slower rate if the jitter buffer is relatively less full. In one embodiment, the voice processor 340 includes a jitter buffer manager that retrieves voice packets from the jitter buffer 310. In this embodiment, the jitter buffer manager adjusts the rate at which it retrieves voice packets from the external jitter buffer 310 based upon a value of the indicator. In an exemplary embodiment of the present invention, the jitter buffer 310 passes data to the voice processor 340 once per clock pulse. In such an embodiment, the voice processor 340 speeds up or slows down its clock depending on the value of the indicator.
  • FIG. 4 is a flowchart representing a method of operating a packet voice communication system according to an exemplary embodiment of the present invention. At step 400, a voice packet is received from a network. At step 410, the voice packet is held in external jitter buffer 310 (FIG. 3). At step 420, an indicator is placed in the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer. At step 430, the voice packet, including the indicator, is retrieved from the external jitter buffer and processed by the voice processor. At decision box 440, it is determined if the holding time of the voice packet in the jitter buffer 310 was greater than a first threshold, threshold1. At step 450, if the holding time of the voice packet in the jitter buffer 310 was greater than threshold1, the rate of retrieval of packets from the jitter buffer 310 is increased. At decision box 460, if the holding time of the voice packet in the jitter buffer 310 was less than or equal to threshold1, it is determined if the holding time of the voice packet in the jitter buffer 310 was less than a second threshold, thresholds, that is lower than threshold1. At step 470, if the holding time of the voice packet in the jitter buffer 310 was less than threshold2, the rate of retrieval of packets from the jitter buffer 310 is decreased. At step 480, if the holding time of the voice packet in the jitter buffer 310 was greater than or equal to threshold2, the rate of retrieval of packets from jitter buffer 310 is maintained at its present level.
  • The method represented in FIG. 4 is merely illustrative. Many variations of the method of FIG. 4 are contemplated by the present invention. For example, in one embodiment, the indicator that is placed in the voice packet indicates how full or empty the jitter buffer 310 is. In one implementation of such an embodiment, a first threshold is maintained, wherein if the number of packets held in the jitter buffer is above the threshold, the rate of retrieval of packets from the jitter buffer is increased. A second threshold, lower than the first threshold, is maintained, wherein if the number of packets held in the jitter buffer is below the second threshold, the rate of retrieval of packets from the jitter buffer is decreased.
  • In another variation of the method of FIG. 4, any number of holding-time thresholds are maintained and the rate of retrieving packets from the jitter buffer is increased or decreased by varying amounts according to the holding time of packets in the buffer. For example, if the holding time is greater than a first, relatively high, threshold, the retrieval rate is increased by a first, relatively high, amount. If the holding time is lower than the first threshold but higher than a second threshold, the retrieval rate is increased by a second amount that is less than the first amount. If the holding time is lower than the second threshold but higher than a third threshold, the retrieval rate is maintained at its present rate. If the holding time is lower than the third threshold but higher than a fourth threshold, the retrieval rate is decreased by a first amount. If the holding time is lower than the fourth threshold but higher than a fifth threshold, the retrieval rate is increased by a second amount that is greater than the first amount. In another variation of the method of FIG. 4, rather than increasing, decreasing or maintaining the rate of retrieving packets from the jitter buffer 310 based on the holding time of packets in the jitter buffer 310, the rate of retrieval of packets from the jitter buffer 310 is set at one of a plurality of predetermined levels, according to the holding time. The higher the holding time, the higher the retrieval rate.
  • FIG. 5 is a functional block diagram representing a packet voice communication system according to an illustrative embodiment of the present invention. The system of FIG. 5 is similar to the system of FIG. 3. However, in the system of FIG. 5, the jitter buffer 310 actively passes voice packets to the voice processor 340, while in the system of FIG. 3, voice packets are pulled out of the jitter buffer 310 by the voice processor 340. In an illustrative embodiment wherein the jitter buffer 310 actively pushes voice packets onto the voice processor, the voice processor includes a packet queue in which voice packets received from the jitter buffer 310 are stored prior to processing the voice packets. In an illustrative embodiment of the present invention, the packet queue resides in voice processor memory 350. The packet queue is illustratively a relatively small queue. In an exemplary embodiment, the packet queue has a capacity of 2-3 voice packets. The voice processor notifies the jitter buffer 310 of how many packets are in the packet queue via feedback line 360. In one embodiment, the voice processor 340 periodically notifies the jitter buffer 310 of how many voice packets are in the packet queue. In another embodiment, the voice processor 340 notifies the jitter buffer 310 of how many voice packets are in the packet queue every time the number of packets in the queue changes.
  • In an illustrative embodiment of the present invention, the external jitter buffer 310 adjusts the rate at which it provides voice packets to the voice processor 340 based upon how many voice packets are in the voice processor's packet queue. The voice processor 340 retrieves voice packets at a faster rate if there are relatively few voice packets in the voice processor's packet queue. Conversely, voice processor 340 retrieves voice packets at a slower rate if there are a relatively higher number of voice packets in the voice processor's packet queue. In an exemplary embodiment of the present invention, the jitter buffer 310 passes data to the voice processor 340 once per clock pulse. In such an embodiment, the voice processor 340 causes the clock that controls the jitter buffer 310 to speed up or slow down depending on the value of the indicator.
  • FIG. 6 is a flowchart representing a method of operating a packet voice communication system according to an illustrative embodiment of the present invention. For purposes of illustration, the example of FIG. 6 pertains to a voice processor having a packet queue that has a capacity of two voice packets. It will be appreciated that the voice processor's packet queue can be of substantially any size in accordance with the present invention. At step 600, a voice packet is received from a network. At step 610, the voice packet is held in external jitter buffer 310 (FIG. 5). At step 620, the voice packet is transferred from the external jitter buffer 310 to a packet queue in the voice processor 340 (FIG. 5). At step 630, the voice packet is processed by the voice processor 340. At step 640, the voice processor 340 notifies the jitter buffer processor 320 (FIG. 5) of how many packets are in the voice processor's packet queue. At decision box 650, depending on the number of voice packets in the voice processor's packet queue, one of three actions are taken. If there are not any voice packets in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is increased, as shown at step 660. If there is one voice packet in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is maintained at its present rate, as shown at step 670. If there are two voice packets in the packet queue, the rate of transferring packets from the jitter buffer 310 to the voice processor 340 is decreased, as shown at step 680.
  • The method represented in FIG. 6 is merely illustrative. Many variations of the method of FIG. 6 are contemplated by the present invention. For example, in one embodiment, a first threshold is maintained, wherein if the number of packets held in the voice processor's packet queue is below the threshold, the rate of retrieval of packets from the jitter buffer is increased. A second threshold, higher than the first threshold, is maintained, wherein if the number of packets held in the packet queue is above the second threshold, the rate of retrieval of packets from the jitter buffer is decreased. This embodiment is implementable with any size packet queue.
  • In another variation of the method of FIG. 6, wherein the voice processor's packet queue has a capacity of more than two voice packets, the rate of transferring packets from the jitter buffer to the voice processor is increased or decreased by varying amounts according to the number of voice packets in the packet queue. For example, in a system having a voice processor that has a packet queue having a capacity of four packets, if there are no voice packets in the packet queue, the transfer rate is increased by a first, relatively high, amount. If there is one voice packet in the packet queue, the transfer rate is increased by a second amount that is less than the first amount. If there are two voice packets in the packet queue, the transfer rate is maintained at its present rate. If there are three voice packets in the packet queue, the transfer rate is decreased by a first amount. If the packet queue is full (four packets), the transfer rate is decreased by a second amount that is greater than the first amount. In another variation of the method of FIG. 6, rather than increasing, decreasing or maintaining the rate of retrieving packets from the jitter buffer 310 based on the number of voice packets in the voice processor's packet queue, the rate of transferring packets from the jitter buffer 310 (FIG. 5) to the voice processor is set at one of a plurality of predetermined levels, according to the number of packets in the packet queue. The emptier the packet queue, the higher the transfer rate.
  • Although a preferred embodiment of the present invention has been described, it should not be construed to limit the scope of the appended claims. For example, the present invention is applicable to any real-time media, such as audio and video, in addition to the voice media illustratively described herein. Also, the invention is applicable to any type of transmitted data elements, such as frames, in addition to the application to packet data described herein. Those skilled in the art will understand that various modifications may be made to the described embodiment. Moreover, to those skilled in the various arts, the invention itself herein will suggest solutions to other tasks and adaptations for other applications. It is therefore desired that the present embodiments be considered in all respects as illustrative and not restrictive, reference being made to the appended claims rather than the foregoing description to indicate the scope of the invention.

Claims (19)

Therefore, at least the following is claimed:
1. A packet voice communication system comprising:
an external jitter buffer external to a voice processor and adapted to receive voice packets from a network and to hold the received voice packets, wherein the external jitter buffer is further adapted to place an indicator in each of the packets it holds, the indicator being indicative of how long the voice packet was held in the external jitter buffer; and
a voice processor adapted to synchronously retrieve voice packets from the external jitter buffer and to process the retrieved voice data packets.
2. The system of claim 1 wherein the voice processor is adapted to adjust the rate at which it retrieves voice packets from the external jitter buffer based upon a value of the indicator.
3. The system of claim 2 wherein the voice processor comprises a jitter buffer manager adapted to retrieve voice packets from the external jitter buffer and wherein the jitter buffer manager is adapted to adjust the rate at which it retrieves voice packets from the external jitter buffer based upon a value of the indicator.
4. The system of claim 1 wherein the indicator indicates a length of time the voice packet was held in the external jitter buffer.
5. The system of claim 1 wherein the indicator indicates a time that the voice packet arrived at the external jitter buffer.
6. The system of claim 1 wherein the indicator indicates how many voice packets are held in the jitter buffer.
7. The system of claim 1 wherein the voice packets received by the external jitter buffer are part of a packet sequence according to which the packets are intended to be played out, and wherein the indicator indicates a span of time represented by the earliest and latest packets of a sequence that are held in the external jitter buffer and all packets inbetween the earliest and latest packets in the sequence, regardless of whether the inbetween packets are held in the jitter buffer.
8. The system of claim 1 wherein the external jitter buffer is adapted to place the indicator in a header of each voice packet.
9. The system of claim 1 wherein the external jitter buffer comprises a processor that is separate from the voice processor.
10. A method of operating a packet voice communication system, comprising:
(a) receiving a voice packet from a network;
(b) holding the received voice packet in an external jitter buffer external to a voice processor;
(c) placing an indicator in the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer;
(d) retrieving the voice packet, including the indicator, from the external jitter buffer; and
(e) processing the voice packet with the voice processor.
11. The method of claim 10 further comprising:
(f) adjusting the rate at which voice packets are retrieved from the external jitter buffer based upon a value of the indicator.
12. The method of claim 10 wherein the indicator indicates a length of time the voice packet was held in the jitter buffer.
13. The method of claim 10 wherein the indicator indicates a time that the voice packet arrived at the external jitter buffer.
14. The method of claim 10 wherein the indicator indicates how many voice packets are held in the jitter buffer.
15. The method of claim 10 wherein placing step (c) comprises placing an indicator in the header of the voice packet, the indicator being indicative of how long the voice packet was held in the external jitter buffer.
16. The method of claim 10 wherein the external jitter buffer comprises a processor that is separate from the voice processor.
17. A packet voice communication system comprising:
a voice processor adapted to receive voice packets in a packet queue and to process the received voice packets, an external jitter buffer external to the voice processor and adapted to receive voice packets from a network, hold the received voice packets, and synchronously provide the voice packets to the voice processor, wherein the voice processor is adapted to notify the external jitter buffer of how many voice packets are in the voice processor's packet queue.
18. The system of claim 17 wherein the external jitter buffer is adapted to adjust the rate at which it provides voice packets to the voice processor based upon how many voice packets are in the voice processor's packet queue.
19. The system of claim 17 wherein the external jitter buffer comprises a processor that is separate from the voice processor.
US13/967,975 2002-09-20 2013-08-15 External jitter buffer in a packet voice system Abandoned US20130336333A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
US13/967,975 US20130336333A1 (en) 2002-09-20 2013-08-15 External jitter buffer in a packet voice system

Applications Claiming Priority (3)

Application Number Priority Date Filing Date Title
US41227602P 2002-09-20 2002-09-20
US10/427,807 US8520519B2 (en) 2002-09-20 2003-05-01 External jitter buffer in a packet voice system
US13/967,975 US20130336333A1 (en) 2002-09-20 2013-08-15 External jitter buffer in a packet voice system

Related Parent Applications (1)

Application Number Title Priority Date Filing Date
US10/427,807 Continuation US8520519B2 (en) 2002-09-20 2003-05-01 External jitter buffer in a packet voice system

Publications (1)

Publication Number Publication Date
US20130336333A1 true US20130336333A1 (en) 2013-12-19

Family

ID=31981646

Family Applications (2)

Application Number Title Priority Date Filing Date
US10/427,807 Active 2030-08-07 US8520519B2 (en) 2002-09-20 2003-05-01 External jitter buffer in a packet voice system
US13/967,975 Abandoned US20130336333A1 (en) 2002-09-20 2013-08-15 External jitter buffer in a packet voice system

Family Applications Before (1)

Application Number Title Priority Date Filing Date
US10/427,807 Active 2030-08-07 US8520519B2 (en) 2002-09-20 2003-05-01 External jitter buffer in a packet voice system

Country Status (2)

Country Link
US (2) US8520519B2 (en)
EP (1) EP1404076A3 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130170369A1 (en) * 2012-01-04 2013-07-04 C-Media Electronics Inc. Clock rate controller and method thereof and electronic device thereof
CN108737673A (en) * 2017-04-21 2018-11-02 纬创资通股份有限公司 Network telephone equipment, external connection card and communication method of network telephone equipment

Families Citing this family (31)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
SE0004839D0 (en) * 2000-12-22 2000-12-22 Ericsson Telefon Ab L M Method and communication apparatus in a communication system
JP3825007B2 (en) * 2003-03-11 2006-09-20 沖電気工業株式会社 Jitter buffer control method
US7415044B2 (en) 2003-08-22 2008-08-19 Telefonaktiebolaget Lm Ericsson (Publ) Remote synchronization in packet-switched networks
US20050114118A1 (en) * 2003-11-24 2005-05-26 Jeff Peck Method and apparatus to reduce latency in an automated speech recognition system
JP2007532053A (en) * 2004-03-16 2007-11-08 スノーシヨア・ネツトワークス・インコーポレーテツド Jitter buffer management
US7379466B2 (en) * 2004-04-17 2008-05-27 Innomedia Pte Ltd In band signal detection and presentation for IP phone
MX2007002483A (en) * 2004-08-30 2007-05-11 Qualcomm Inc Adaptive de-jitter buffer for voice over ip.
US8085678B2 (en) * 2004-10-13 2011-12-27 Qualcomm Incorporated Media (voice) playback (de-jitter) buffer adjustments based on air interface
US7970020B2 (en) * 2004-10-27 2011-06-28 Telefonaktiebolaget Lm Ericsson (Publ) Terminal having plural playback pointers for jitter buffer
JP4318651B2 (en) * 2005-02-25 2009-08-26 富士通株式会社 Output method, output device, and communication system
US8355907B2 (en) * 2005-03-11 2013-01-15 Qualcomm Incorporated Method and apparatus for phase matching frames in vocoders
US8155965B2 (en) * 2005-03-11 2012-04-10 Qualcomm Incorporated Time warping frames inside the vocoder by modifying the residual
US7539219B2 (en) * 2005-05-12 2009-05-26 Radioshack Corporation Method and apparatus for synchronization of digital multimedia packets
US7983309B2 (en) * 2007-01-19 2011-07-19 Nokia Corporation Buffering time determination
US8160112B2 (en) * 2007-09-17 2012-04-17 Cisco Technology, Inc. Buffering a media stream
US8281369B2 (en) * 2008-03-12 2012-10-02 Avaya Inc. Method and apparatus for creating secure write-enabled web pages that are associated with active telephone calls
US20100006527A1 (en) * 2008-07-10 2010-01-14 Interstate Container Reading Llc Collapsible merchandising display
US8879464B2 (en) 2009-01-29 2014-11-04 Avaya Inc. System and method for providing a replacement packet
US9525710B2 (en) * 2009-01-29 2016-12-20 Avaya Gmbh & Co., Kg Seamless switch over from centralized to decentralized media streaming
US8238335B2 (en) 2009-02-13 2012-08-07 Avaya Inc. Multi-route transmission of packets within a network
US7936746B2 (en) * 2009-03-18 2011-05-03 Avaya Inc. Multimedia communication session coordination across heterogeneous transport networks
US20100265834A1 (en) * 2009-04-17 2010-10-21 Avaya Inc. Variable latency jitter buffer based upon conversational dynamics
US8094556B2 (en) * 2009-04-27 2012-01-10 Avaya Inc. Dynamic buffering and synchronization of related media streams in packet networks
US8553849B2 (en) 2009-06-17 2013-10-08 Avaya Inc. Personal identification and interactive device for internet-based text and video communication services
US8800049B2 (en) * 2009-08-26 2014-08-05 Avaya Inc. Licensing and certificate distribution via secondary or divided signaling communication pathway
US8976648B2 (en) 2012-08-09 2015-03-10 Blackberry Limited Method and apparatus for managing load on service devices in a communication network
US8995260B2 (en) * 2012-08-09 2015-03-31 Blackberry Limited Method and apparatus for managing load in a communication network
US9559965B2 (en) * 2013-08-20 2017-01-31 Verizon Patent And Licensing Inc. Real-time traffic management for machine to machine communications
GB2521883B (en) * 2014-05-02 2016-03-30 Imagination Tech Ltd Media controller
US9806967B2 (en) * 2014-05-30 2017-10-31 Sony Corporation Communication device and data processing method
US20190102223A1 (en) * 2017-09-29 2019-04-04 Niall Power System, Apparatus And Method For Real-Time Activated Scheduling In A Queue Management Device

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5758076A (en) * 1995-07-19 1998-05-26 International Business Machines Corporation Multimedia server system having rate adjustable data retrieval based on buffer capacity
US5901149A (en) * 1994-11-09 1999-05-04 Sony Corporation Decode and encode system
US6507592B1 (en) * 1999-07-08 2003-01-14 Cisco Cable Products And Solutions A/S (Av) Apparatus and a method for two-way data communication
US20030035371A1 (en) * 2001-07-31 2003-02-20 Coke Reed Means and apparatus for a scaleable congestion free switching system with intelligent control
US6580694B1 (en) * 1999-08-16 2003-06-17 Intel Corporation Establishing optimal audio latency in streaming applications over a packet-based network
US6748000B1 (en) * 2000-09-28 2004-06-08 Nokia Networks Apparatus, and an associated method, for compensating for variable delay of a packet data in a packet data communication system
US6754295B1 (en) * 2000-04-07 2004-06-22 Comrex Corporation Method and apparatus for synchronizing data transmission and reception over a network

Family Cites Families (19)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4914650A (en) * 1988-12-06 1990-04-03 American Telephone And Telegraph Company Bandwidth allocation and congestion control scheme for an integrated voice and data network
EP0485971B1 (en) 1990-11-14 1998-02-18 Fujitsu Limited Delay distortion supressing system for ATM communication system
DE4217911A1 (en) 1992-05-30 1993-12-02 Bundesrep Deutschland Receiver end digital signal timing recovery - providing timing recovery for constant bit rate digital signals after cell structure async. transmission
EP0683945B1 (en) * 1993-11-19 2002-10-30 Codex Corporation Method for adaptive smoothing delay for packet voice applications
JPH07221761A (en) * 1994-02-04 1995-08-18 Fujitsu Ltd Cell delay absorption circuit
DE4417286A1 (en) 1994-05-13 1995-11-23 Deutsche Bundespost Telekom ATM buffer circuit data read-out method
US5966387A (en) * 1995-09-25 1999-10-12 Bell Atlantic Network Services, Inc. Apparatus and method for correcting jitter in data packets
US5615214A (en) * 1995-10-30 1997-03-25 Motorola, Inc. System and method of compensating propagation time variations and substituting for lost packets in a packetized voice communication system
US5640388A (en) * 1995-12-21 1997-06-17 Scientific-Atlanta, Inc. Method and apparatus for removing jitter and correcting timestamps in a packet stream
DE69732150T2 (en) 1997-10-31 2006-01-12 Stmicroelectronics Asia Pacific (Pte) Ltd. ARRANGEMENT AND METHOD FOR PACKET DISTRIBUTION AND FOR ORIENTATION OF INPUT DATA PACKAGES
US6219339B1 (en) * 1998-02-20 2001-04-17 Lucent Technologies Inc. Method and apparatus for selectively discarding packets
US6452950B1 (en) * 1999-01-14 2002-09-17 Telefonaktiebolaget Lm Ericsson (Publ) Adaptive jitter buffering
US6549587B1 (en) * 1999-09-20 2003-04-15 Broadcom Corporation Voice and data exchange over a packet based network with timing recovery
AU6776200A (en) * 1999-08-17 2001-03-13 Conexant Systems, Inc. Integrated circuit with a core processor and a co-processor to provide traffic stream processing
US6707821B1 (en) * 2000-07-11 2004-03-16 Cisco Technology, Inc. Time-sensitive-packet jitter and latency minimization on a shared data link
JP2002077233A (en) * 2000-08-25 2002-03-15 Matsushita Electric Ind Co Ltd Real-time information receiving apparatus
JP4110733B2 (en) * 2000-11-24 2008-07-02 沖電気工業株式会社 Voice packet communication quality evaluation system
JP4110734B2 (en) * 2000-11-27 2008-07-02 沖電気工業株式会社 Voice packet communication quality control device
SE0004839D0 (en) * 2000-12-22 2000-12-22 Ericsson Telefon Ab L M Method and communication apparatus in a communication system

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5901149A (en) * 1994-11-09 1999-05-04 Sony Corporation Decode and encode system
US5758076A (en) * 1995-07-19 1998-05-26 International Business Machines Corporation Multimedia server system having rate adjustable data retrieval based on buffer capacity
US6507592B1 (en) * 1999-07-08 2003-01-14 Cisco Cable Products And Solutions A/S (Av) Apparatus and a method for two-way data communication
US6580694B1 (en) * 1999-08-16 2003-06-17 Intel Corporation Establishing optimal audio latency in streaming applications over a packet-based network
US6754295B1 (en) * 2000-04-07 2004-06-22 Comrex Corporation Method and apparatus for synchronizing data transmission and reception over a network
US6748000B1 (en) * 2000-09-28 2004-06-08 Nokia Networks Apparatus, and an associated method, for compensating for variable delay of a packet data in a packet data communication system
US20030035371A1 (en) * 2001-07-31 2003-02-20 Coke Reed Means and apparatus for a scaleable congestion free switching system with intelligent control

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20130170369A1 (en) * 2012-01-04 2013-07-04 C-Media Electronics Inc. Clock rate controller and method thereof and electronic device thereof
US8908719B2 (en) * 2012-01-04 2014-12-09 C-Media Electronics Inc. Clock rate controller and method thereof and electronic device thereof
CN108737673A (en) * 2017-04-21 2018-11-02 纬创资通股份有限公司 Network telephone equipment, external connection card and communication method of network telephone equipment

Also Published As

Publication number Publication date
EP1404076A3 (en) 2010-02-17
US8520519B2 (en) 2013-08-27
EP1404076A2 (en) 2004-03-31
US20040057445A1 (en) 2004-03-25

Similar Documents

Publication Publication Date Title
US8520519B2 (en) External jitter buffer in a packet voice system
US6977942B2 (en) Method and a device for timing the processing of data packets
US7496086B2 (en) Techniques for jitter buffer delay management
US9967307B2 (en) Implementing a high quality VoIP device
EP0683945B1 (en) Method for adaptive smoothing delay for packet voice applications
US6658027B1 (en) Jitter buffer management
US7274691B2 (en) Network switch with packet scheduling
US6377931B1 (en) Speech manipulation for continuous speech playback over a packet network
US6859460B1 (en) System and method for providing multimedia jitter buffer adjustment for packet-switched networks
US7746881B2 (en) Mechanism for modem pass-through with non-synchronized gateway clocks
US6570849B1 (en) TDM-quality voice over packet
JP4067133B2 (en) Two-way video communication over packet data network
JP2610115B2 (en) Isochronous connection processing method and packet switching network
US7079486B2 (en) Adaptive threshold based jitter buffer management for packetized data
US6049565A (en) Method and apparatus for audio communication
US20010020280A1 (en) Sub-packet insertion for packet loss compensation in voice over IP networks
JP2002009842A (en) Automatic adaptive jitter buffer
US6996059B1 (en) Increasing duration of information in a packet to reduce processing requirements
KR20030018059A (en) Priority packet transmission method and system for multimedia in a shared
EP1449378A2 (en) System and method for compensating packet delay variations
US7542465B2 (en) Optimization of decoder instance memory consumed by the jitter control module
US6931029B1 (en) System and method for synchronizing with data received over an unreliable asynchronous medium
GB2283152A (en) Audio transmission over a computer network
JP3669660B2 (en) Call system
WO2002037813A1 (en) System and method for announcing messages to phone users in a telecommunications network

Legal Events

Date Code Title Description
AS Assignment

Owner name: BROADCOM CORPORATION, CALIFORNIA

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:LEBLANC, WILFRID;REEL/FRAME:031543/0277

Effective date: 20030429

AS Assignment

Owner name: BANK OF AMERICA, N.A., AS COLLATERAL AGENT, NORTH CAROLINA

Free format text: PATENT SECURITY AGREEMENT;ASSIGNOR:BROADCOM CORPORATION;REEL/FRAME:037806/0001

Effective date: 20160201

Owner name: BANK OF AMERICA, N.A., AS COLLATERAL AGENT, NORTH

Free format text: PATENT SECURITY AGREEMENT;ASSIGNOR:BROADCOM CORPORATION;REEL/FRAME:037806/0001

Effective date: 20160201

STCB Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

AS Assignment

Owner name: AVAGO TECHNOLOGIES GENERAL IP (SINGAPORE) PTE. LTD., SINGAPORE

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BROADCOM CORPORATION;REEL/FRAME:041706/0001

Effective date: 20170120

Owner name: AVAGO TECHNOLOGIES GENERAL IP (SINGAPORE) PTE. LTD

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BROADCOM CORPORATION;REEL/FRAME:041706/0001

Effective date: 20170120

AS Assignment

Owner name: BROADCOM CORPORATION, CALIFORNIA

Free format text: TERMINATION AND RELEASE OF SECURITY INTEREST IN PATENTS;ASSIGNOR:BANK OF AMERICA, N.A., AS COLLATERAL AGENT;REEL/FRAME:041712/0001

Effective date: 20170119