WO2007080788A1 - Teleconference control device and teleconference control method - Google Patents

Teleconference control device and teleconference control method

Info

Publication number
WO2007080788A1
Authority
WO
WIPO (PCT)
Prior art keywords
delay
video
conference
priority
transmission
Prior art date
Application number
PCT/JP2006/326033
Other languages
French (fr)
Japanese (ja)
Inventor
Yoshimasa Honda
Original Assignee
Matsushita Electric Industrial Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Matsushita Electric Industrial Co., Ltd.
Publication of WO2007080788A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/567 Multimedia conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302 Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4307 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen
    • H04N21/43072 Synchronising the rendering of multiple content streams or additional data on devices, e.g. synchronisation of audio on a mobile phone with the video output on the TV screen of multiple content streams on the same device
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20 Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23 Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236 Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2368 Multiplexing of audio and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/42203 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS] sound input device, e.g. microphone
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/41 Structure of client; Structure of client peripherals
    • H04N21/422 Input-only peripherals, i.e. input devices connected to specially adapted client devices, e.g. global positioning system [GPS]
    • H04N21/4223 Cameras
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43 Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434 Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341 Demultiplexing of audio and video streams
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00 Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40 Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47 End-user applications
    • H04N21/478 Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4788 Supplemental services, e.g. displaying phone caller identification, shopping application communicating with other users, e.g. chatting
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/16 Analogue secrecy systems; Analogue subscription systems
    • H04N7/173 Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309 Transmission or handling of upstream communications
    • H04N7/17318 Direct or substantially direct transmission and handling of requests
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/568 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities audio processing specific to telephonic conferencing, e.g. spatial distribution, mixing of participants

Definitions

  • The present invention relates to a TV conference control device that controls a TV conference device.
  • In recent years, ADSL (Asymmetric Digital Subscriber Line) and optical fiber networks have spread rapidly, and low-cost, high-speed Internet connections are becoming available.
  • As a result, it has become possible to build systems in which video and audio data (hereinafter referred to as “video/audio data”) is transmitted bidirectionally between multiple remote sites.
  • However, an IP network such as the current Internet is a best-effort network that guarantees nothing about the effective bandwidth in which data can be transmitted without loss. Therefore, for example, if data is sent at a rate that exceeds the effective bandwidth, the network becomes congested and the transmission of video and audio data is delayed; if the congestion exceeds what the network devices can absorb, data loss occurs.
  • FIG. 9 is a diagram showing a conventional video conference apparatus.
  • The video conference apparatus 910 includes a video coding/decoding unit 901 that performs video coding and decoding, and an audio coding/decoding unit that performs audio coding and decoding.
  • It also includes a transmission delay circuit 903 that delays voice input, a transmission switcher 904 that switches the delay input, and an audio input monitoring unit 905 that monitors the voice input on the transmitting and receiving sides and shortens the delay time when both are input simultaneously.
  • It further includes a unit that performs reception switching, a reception delay circuit 907 that delays received audio, and a multiplexing/separation unit 908 that multiplexes and separates video and audio. The apparatus is connected to the multipoint connection control device 909 and performs two-way video and audio communication between TV conference devices.
  • The TV conference device in Patent Document 1 monitors the audio input on the transmitting and receiving sides in the audio input monitoring unit 905 and shortens the delay time when both are input simultaneously, thereby enabling low-delay transmission of voice.
  • Patent Document 1: JP-A-8-317362
  • Patent Document 1 has a problem in that only audio data is transmitted with low delay, so that the video data is displayed without being synchronized with the audio data.
  • The present invention determines the degree to which low-delay transmission of video/audio data is needed, using the frequency of audio data transmission and reception as a low-delay priority. By controlling the video/audio encoding parameters or transmission parameters so that the delay becomes lower as the priority becomes higher, audio and video stay synchronized even in a best-effort network.
  • The purpose is to provide a device capable of seamless transmission with a delay amount that matches the conference state.
  • The TV conference control device of the present invention is a TV conference control device that controls a TV conference device that transmits video data and audio data through a transmission path.
  • It comprises: low-delay priority determination means for determining a low-delay priority, which indicates the degree to which the delay of video data and audio data should be suppressed, to be higher as the frequency of switching between transmission and reception of voiced audio data (sound detected by the TV conference device) increases; delay amount determination means for determining a smaller delay amount as the low-delay priority is higher; and parameter control means for determining the encoding parameter or the packet priority used in the TV conference device according to the determined delay amount.
  • The encoding parameter is a parameter used for compression encoding; it controls the amount of data after encoding and the amount of data used for decoding.
  • Data with a high packet priority is processed with priority over data with a low packet priority by a router on the transmission path, and is transferred first on the transmission path.
  • The low-delay priority determination means measures the frequency as the number of times, within a predetermined time, that transmission and reception of voiced audio data detected by the TV conference device are switched.
  • The number of switches between transmission and reception of audio data within a predetermined time is used as an index of how quickly transmission and reception alternate. The low-delay priority can therefore be determined by the simple process of monitoring the transmission/reception timing.
  • The low-delay priority determination means compares the frequency with a threshold and, when the frequency is greater than the threshold, determines a low-delay priority that is higher than when the frequency is equal to or less than the threshold.
  • The low-delay priority can thus be determined by the simple process of comparing the audio data transmission/reception frequency with a preset threshold.
  • Alternatively, the low-delay priority determination means measures the frequency based on the difference between the transmission time and the reception time of audio data detected by the TV conference device, and determines a higher low-delay priority as that difference is smaller.
  • The difference between the transmission time and the reception time of the audio data is used as an index of how quickly transmission and reception alternate. The low-delay priority can therefore be determined by the simple process of monitoring the transmission and reception times and their difference.
  • The low-delay priority determination means compares the difference between the transmission time and the reception time with a threshold and, when the difference is smaller than the threshold, determines a low-delay priority that is higher than when the difference is equal to or greater than the threshold.
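The time-difference comparison described above can be sketched as follows. This is a minimal illustration, not the patent's implementation: the function name, the two priority levels, and the one-second default threshold are all assumptions chosen for the example.

```python
def priority_from_time_gap(send_time_s, receive_time_s,
                           threshold_s=1.0, high=2, low=1):
    """Return a low-delay priority from the gap between a voiced
    transmission and a voiced reception.

    All names and default values here are hypothetical; the text only
    states that a gap smaller than the threshold yields a higher
    priority than a gap equal to or greater than the threshold.
    """
    gap = abs(send_time_s - receive_time_s)
    # A small gap means the speakers alternate quickly, so delay matters more.
    return high if gap < threshold_s else low
```

A gap exactly equal to the threshold falls on the lower-priority side, matching the "equal to or greater than" wording in the text.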
  • The parameter control means determines, as the compression encoding parameter, the capacity of the receive buffer used for decoding the video data and audio data in the TV conference device that receives them.
  • The parameter control means determines, as the compression encoding parameter, the maximum generated code amount for encoding the video data and audio data in the video conference device.
  • As a result, the transmission delay becomes smaller and low-delay transmission is performed, so the delay can be kept at or below the requested delay amount.
  • the TV conference control device of the present invention is a TV conference control device that controls a plurality of TV conference devices that communicate video data and audio data of a plurality of conferences through a common transmission path.
  • the frequency of switching between transmission and reception of voice data that is detected by each TV conference device is high!
  • the present invention can be realized not only as such a TV conference control apparatus but also as an integrated circuit and a TV conference apparatus including characteristic means included in the TV conference control apparatus.
  • the present invention can also be realized as a TV conference system including a TV conference control device and a TV conference device.
  • the present invention can also be realized by a TV conference control method in which characteristic means included in the TV conference control apparatus is a step.
  • The parameter for controlling the delay amount common to video and audio is determined according to the speed of switching between transmission and reception of audio data. For this reason, the greater the need for low delay, as in TV conferences where active discussions are taking place, the lower the delay becomes while video and audio remain synchronized, so the conference can proceed smoothly.
  • FIG. 1 is a block diagram showing a configuration of a video conference apparatus according to Embodiment 1.
  • FIG. 2 is a flowchart showing processing executed when the TV conference apparatus according to Embodiment 1 performs a TV conference.
  • FIG. 3 is a flowchart showing details of delay control processing S204 shown in FIG. 2.
  • FIG. 4 is a conceptual diagram showing audio data transmission/reception frequency according to the first embodiment.
  • FIG. 5 is a block diagram showing a configuration of a video conference system according to Embodiment 2 of the present invention.
  • FIG. 6 is a diagram showing a relationship between a plurality of video conference apparatuses and a delay control server according to Embodiment 2.
  • FIG. 7 is a flowchart showing processing executed by the video conference apparatus according to Embodiment 2.
  • FIG. 8 is a flowchart showing processing executed by the delay control server according to the second embodiment.
  • FIG. 9 is a diagram showing a configuration of a conventional video conference terminal device.
  • FIG. 1 is a block diagram showing the configuration of the video conference apparatus according to Embodiment 1 of the present invention.
  • The video conference apparatus of this embodiment is placed at a conference participant's site and communicates audio and video.
  • Each video conference apparatus according to the present embodiment includes a video conference control device that dynamically changes, according to the state of the conference, the delay from the input of audio and video at one apparatus to their output at the other TV conference apparatus.
  • The video conference apparatus 101 shown in FIG. 1 includes a video/audio input unit 102 that is connected to a camera and a microphone and inputs video and audio,
  • a video/audio encoding unit 103 that performs compression encoding of the video data and audio data (hereinafter referred to as “video/audio data”),
  • a transmission unit 104 that is connected to the transmission path 111 and transmits the encoded video/audio data,
  • a low-delay priority determination unit 105 that determines the low-delay priority using the video/audio input, a delay amount determination unit 106, a parameter control unit 107,
  • a receiving unit 110 that receives encoded video/audio data transmitted from the video conference terminal that is the connected communication partner, a video/audio decoding unit 109 that decodes the encoded video/audio data, and a video/audio output unit 108 that is connected to a monitor and a speaker and outputs the decoded video/audio data to them.
  • The encoding parameter is a parameter used for encoding in the video/audio encoding unit 103 and for decoding in the video/audio decoding unit 109. The same applies hereinafter.
  • The video/audio input unit 102 outputs uncompressed video data and audio data input from the camera and microphone to the video/audio encoding unit 103 and the low-delay priority determination unit 105 in units of frames.
  • The video/audio encoding unit 103 performs compression encoding such as MPEG-2 on the video/audio data input from the video/audio input unit 102, using the encoding parameter input from the parameter control unit 107.
  • The encoded video/audio data is output to the transmission unit 104.
  • The transmission unit 104 transmits the encoded video/audio data input from the video/audio encoding unit 103 to the other TV conference apparatus that is the communication partner, using the transmission parameter input from the parameter control unit 107.
  • The low-delay priority determination unit 105 detects the voiced portion of the audio data in the video/audio data input from the video/audio input unit 102, and determines the low-delay priority based on the detected voiced portion.
  • The low-delay priority is output to the delay amount determination unit 106.
  • The voiced portion is detected, for example, by checking whether the sound volume in the audio data is greater than or equal to a threshold value.
  • A method for determining the low-delay priority will be described later.
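The volume-threshold test for a voiced portion can be sketched as below. The text does not say how volume is measured, so the use of the root-mean-square of signed PCM samples, the function name, and the default threshold are assumptions made for illustration.

```python
import math


def is_voiced(samples, threshold=500.0):
    """Return True when a frame of PCM samples counts as voiced.

    Assumption: "sound volume" is taken to be the RMS amplitude of the
    frame; the patent text only requires volume >= threshold.
    """
    if not samples:
        return False  # an empty frame carries no sound
    rms = math.sqrt(sum(s * s for s in samples) / len(samples))
    return rms >= threshold
```

In practice the threshold would be tuned to the microphone and room noise level; the value here is only a placeholder.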
  • the delay amount determination unit 106 determines a delay amount using the low delay priority input from the low delay priority determination unit 105, and outputs the delay amount to the parameter control unit 107.
  • The parameter control unit 107 changes the encoding parameter using the delay amount input from the delay amount determination unit 106, and outputs the encoding parameter to the video/audio encoding unit 103.
  • The parameter control unit 107 may change the transmission parameter instead of the encoding parameter, and output the transmission parameter to the transmission unit 104.
  • This transmission parameter is, for example, packet priority.
  • Receiving section 110 receives the encoded video / audio data transmitted from the video conference terminal as the communication partner through transmission path 111, and outputs the received data to video / audio decoding section 109.
  • The video/audio decoding unit 109 decodes the encoded video/audio data input from the receiving unit 110 in accordance with the encoding method used to encode it.
  • The decoded video/audio data is output to the video/audio output unit 108, and the audio data is output to the low-delay priority determination unit 105.
  • the video / audio output unit 108 outputs the video / audio data input from the video / audio decoding unit 109 to the connected monitor and speaker.
  • FIG. 2 is a flowchart of processing executed when the TV conference apparatus 101 according to Embodiment 1 performs a TV conference.
  • Each process shown in FIG. 2 is stored as a control program in a storage device (not shown) such as a ROM or flash memory of the TV conference apparatus 101, and is controlled by the CPU (not shown).
  • <Step S201: Video/audio input processing>
  • The video/audio input unit 102 inputs uncompressed video data and audio data in units of frames from the connected camera and microphone, outputs the video data and audio data to the video/audio encoding unit 103,
  • and outputs the audio data to the low-delay priority determination unit 105 (step S201).
  • <Step S202: Data reception processing>
  • The receiving unit 110 receives the encoded video/audio data transmitted from the video conference terminal that is the communication partner through the transmission path 111, and outputs the received encoded data to the video/audio decoding unit 109 (step S202).
  • <Step S203: Video/audio decoding and display output processing>
  • The video/audio decoding unit 109 decodes the encoded video/audio data input from the receiving unit 110 in accordance with the encoding method used to encode it, outputs the decoded video/audio data to the video/audio output unit 108, and outputs the audio data to the low-delay priority determination unit 105.
  • The video/audio output unit 108 outputs the video/audio data input from the video/audio decoding unit 109 to the connected monitor and speaker (step S203).
  • <Step S204: Delay control processing>
  • The low-delay priority determination unit 105 determines the low-delay priority,
  • the delay amount determination unit 106 determines the delay amount according to the determined low-delay priority,
  • and the parameter control unit 107 uses the delay amount to determine the encoding parameter (step S204). Details of this delay control process will be described with reference to the drawings.
  • The parameter control unit 107 may determine the transmission parameter using the delay amount instead of the encoding parameter.
  • FIG. 3 is a flowchart showing details of the delay control process (step S204) shown in FIG.
  • The low-delay priority determination unit 105 calculates the frequency of audio data transmission and reception using the transmitted audio data input from the video/audio input unit 102 and the received audio data input from the video/audio decoding unit 109 (step S301).
  • The frequency of transmission and reception of audio data represents the degree of conversation activity.
  • The higher the frequency of audio data transmission and reception, the more active the discussion and the greater the effect of delay, so low-delay transmission is required.
  • (Equation 1) shows an example of a calculation formula for the frequency of transmission and reception of audio data.
  • In (Equation 1), N(t) represents the frequency of audio data transmission and reception at time t,
  • Ns(t) represents the number of times audio data was transmitted during the period T preceding time t,
  • and Nr(t) represents the number of times audio data was received during the period T preceding time t.
  • The number of transmissions and the number of receptions count only voiced audio data; the low-delay priority determination unit 105 performs the voiced determination.
  • The frequency of transmission and reception of such audio data is an example of the frequency at which transmission and reception of voiced audio data are switched, and is indicated by the number of times transmission and reception of voiced audio data switch during a certain period of time.
  • (Equation 1) is only an example of a method for determining the transmission/reception frequency of audio data; any calculation method that represents the transmission/reception frequency of audio data can be used.
  • Because of transmission delay, the transmitted data and the received data may be offset in time at time t. The calculation may therefore take the transmission delay into account by adding it to the time t of the transmitted data.
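The formula for (Equation 1) is not present in this text, only its variable definitions. The sketch below therefore assumes the simplest form consistent with those definitions, N(t) = Ns(t) + Nr(t), counted over a sliding window of length T. The class name, method names, and the choice of a half-open window (t - T, t] are hypothetical.

```python
from collections import deque


class TxRxFrequency:
    """Sliding-window count of voiced transmissions and receptions.

    Assumed form of (Equation 1): N(t) = Ns(t) + Nr(t). Queries must be
    made with non-decreasing t because old events are discarded.
    """

    def __init__(self, window_s):
        self.window_s = window_s
        self.sent = deque()      # timestamps of voiced transmissions
        self.received = deque()  # timestamps of voiced receptions

    def record_sent(self, t):
        self.sent.append(t)

    def record_received(self, t):
        self.received.append(t)

    def _count(self, events, t):
        # Drop events at or before the start of the window (t - T, t].
        while events and events[0] <= t - self.window_s:
            events.popleft()
        return sum(1 for e in events if e <= t)

    def frequency(self, t):
        """N(t): voiced sends plus voiced receives in the last T seconds."""
        return self._count(self.sent, t) + self._count(self.received, t)
```

To account for transmission delay as the text suggests, a caller could add an estimated delay to each timestamp passed to `record_sent` before counting.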
  • FIG. 4 is a conceptual diagram showing the frequency of audio data transmission / reception according to the first embodiment. With reference to this figure, a method for calculating the frequency of audio data transmission / reception between two TV conference devices will be described.
  • A hatched section 401 labeled S indicates a voice data transmission state,
  • and a white section 402 labeled R indicates a voice data reception state.
  • Priority is given to the louder volume.
  • Compared with FIG. 4(b), the case in FIG. 4(a) is more affected by delay, so the need for low-delay transmission is high and the low-delay priority must be raised.
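The S/R sections of FIG. 4 and the count of switches between them can be sketched as follows. The input format (one pair of local and remote volume per interval), the silence threshold, and the tie-breaking rule in favor of the local side are assumptions for illustration only.

```python
def label_intervals(volumes, voiced_threshold=500.0):
    """Label each interval 'S' (sending), 'R' (receiving) or None (silent).

    `volumes` is a list of (local_volume, remote_volume) pairs; this
    input shape is hypothetical. When both sides are voiced, the louder
    one wins, as stated in the text. Ties go to 'S' by assumption.
    """
    labels = []
    for local, remote in volumes:
        if max(local, remote) < voiced_threshold:
            labels.append(None)
        elif local >= remote:
            labels.append("S")
        else:
            labels.append("R")
    return labels


def count_switches(labels):
    """Count S<->R changes, ignoring silent intervals in between."""
    switches = 0
    previous = None
    for label in labels:
        if label is None:
            continue
        if previous is not None and label != previous:
            switches += 1
        previous = label
    return switches
```

A sequence that alternates often, as in FIG. 4(a), yields a larger switch count than one with long uninterrupted runs, as in FIG. 4(b).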
  • After calculating the audio data transmission/reception frequency, the low-delay priority determination unit 105 determines the low-delay priority to be higher as the transmission/reception frequency is higher.
  • The low-delay priority P(t) is determined as in (Equation 2) (step S302).
  • In (Equation 2), P(t) is the low-delay priority at time t,
  • N(t) is the frequency of audio data transmission and reception at time t,
  • TH1 and TH2 are predetermined thresholds (where TH1 < TH2),
  • and PMAX is a predetermined maximum priority.
  • (Equation 2) is only an example of a method for determining the low-delay priority. Any method can be used as long as the low-delay priority increases as the frequency of audio data transmission/reception increases.
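The body of (Equation 2) is missing from this text, so the following is only one possible mapping that uses the stated symbols TH1, TH2 and PMAX and satisfies the stated requirement that the priority rises with the frequency. The minimum priority of 1 and the linear interpolation between the thresholds are assumptions, not the patent's formula.

```python
def low_delay_priority(n, th1, th2, p_max):
    """Map the transmission/reception frequency N(t) to a priority P(t).

    Hypothetical shape: 1 at or below TH1, PMAX at or above TH2, and a
    straight line in between. Only monotonicity is grounded in the text.
    """
    if not th1 < th2:
        raise ValueError("TH1 must be smaller than TH2")
    if n <= th1:
        return 1.0
    if n >= th2:
        return float(p_max)
    return 1.0 + (p_max - 1.0) * (n - th1) / (th2 - th1)
```

A step function with three fixed levels would satisfy the same requirement; the interpolation is used here only because it avoids abrupt jumps in the resulting delay amount.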
  • The delay amount determination unit 106 calculates a delay amount using the low-delay priority calculated as described above (step S303).
  • (Equation 3) is an example showing a method of calculating the delay amount.
  • In (Equation 3), Delay(t) is the delay amount at time t,
  • DMAX is a predetermined maximum delay amount,
  • and P(t) is the low-delay priority at time t.
  • The delay amount determination unit 106 determines the delay amount so that its value decreases as the low-delay priority increases. Note that (Equation 3) is only an example of a method for determining the delay amount; any method can be used as long as the value becomes lower as the low-delay priority becomes higher.
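Since the body of (Equation 3) is not reproduced in this text, the sketch below assumes an inverse relation, Delay(t) = DMAX / P(t), which uses only the symbols defined above and decreases as the priority rises. The function name and the requirement that P(t) be at least 1 are assumptions.

```python
def delay_amount_ms(priority, d_max_ms):
    """Return the target delay for a given low-delay priority.

    Assumed form: Delay(t) = DMAX / P(t) with P(t) >= 1, so the lowest
    priority yields the maximum delay DMAX. Not the patent's formula.
    """
    if priority < 1:
        raise ValueError("priority is assumed to be at least 1")
    return d_max_ms / priority
```

With DMAX set to 400 ms, for instance, a priority of 1 keeps the full 400 ms while a priority of 4 reduces the target to 100 ms.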
  • <Step S304: Parameter calculation/update process>
  • The parameter control unit 107 calculates an encoding parameter using the delay amount calculated in the delay amount calculation process (step S303), and outputs the calculated encoding parameter to the video/audio encoding unit 103.
  • Here, the parameter to be calculated is the fluctuation width of the bit rate (the amount of code generated per unit time), which is one of the encoding parameters.
  • In video encoding, the target code amount per frame, which is uniquely determined from the bit rate and the frame rate (screen update frequency), is set as the target, and the generated code amount per frame is kept below that target.
  • The maximum generated code amount is defined as the maximum amount of code generated per frame. For example, if the generated code amount for a frame is N times the average code amount, transmitting the encoded data for that frame takes N times the usual time because the data amount is N times as large. Therefore, if the frame rate is 30 fps, the delay amount is N/30 (seconds).
  • (Expression 4) is a mathematical expression showing a method of calculating the maximum generated code amount.
  • BITS_MAX is the maximum generated code amount (bits) per frame,
  • Delay(t) is the delay amount (ms) at time t,
  • and BITRATE is the video encoding bit rate (bits/second), which is the amount of code generated per second.
  • Here the maximum generated code amount is calculated from the delay amount, but the parameter to be changed may instead be, for example, the TOS (Type of Service) value, which is the packet priority in TCP/IP communication. Any parameter can be used as long as it allows the delay amount to be controlled relatively.
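The body of (Expression 4) is not included in this text. From the stated units (bits, milliseconds, bits per second) and the N/30 example, one consistent reading is BITS_MAX = BITRATE x Delay(t) / 1000; the sketch below uses that reading as an assumption, and all function names are hypothetical.

```python
def max_bits_per_frame(delay_ms, bitrate_bps):
    """Largest frame, in bits, that can be sent within delay_ms at bitrate_bps.

    Assumed reading of (Expression 4): BITS_MAX = BITRATE * Delay(t) / 1000.
    """
    return bitrate_bps * delay_ms / 1000.0


def frame_delay_s(frame_bits, bitrate_bps, fps=30):
    """Transmission time of a frame, expressed as N / fps.

    N is the frame size relative to the average frame (bitrate / fps),
    following the N/30 example in the text.
    """
    average_bits = bitrate_bps / fps
    n = frame_bits / average_bits
    return n / fps
```

As a worked example under these assumptions: at 3,000,000 bits/second and 30 fps the average frame is 100,000 bits, so a 100 ms delay budget allows a frame of up to 300,000 bits, which is N = 3 average frames and takes 3/30 = 0.1 seconds to send.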
  • The delay control process described above is performed for each frame, but it may be performed at predetermined intervals to reduce the processing load.
  • Instead of calculating the encoding parameter, the parameter control unit 107 may calculate a transmission parameter using the delay amount calculated in the delay amount calculation process (step S303) and output the calculated transmission parameter to the transmission unit 104.
  • the transmission side increases the packet priority included in the packet as the low delay priority increases. Since routers on the transmission path preferentially process packets with higher packet priority, packets with higher packet priority reach the TV conference device on the receiving side earlier. For this reason, packets with a higher packet priority are processed earlier at the router on the transmission path than packets with a lower packet priority, and low-delay transmission is realized.
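Raising the packet priority with the low-delay priority can be sketched as below. The mapping onto the three IP precedence bits of the TOS byte, the function names, and the priority range of 1 to PMAX are assumptions; whether routers actually honor the value depends on the network. `socket.IP_TOS` is available on typical Linux and macOS builds of Python but may be missing on other platforms.

```python
import socket


def tos_from_priority(priority, p_max):
    """Scale a low-delay priority in [1, p_max] to a TOS byte.

    Hypothetical mapping: the priority is scaled to an IP precedence of
    0..7 and placed in the top three bits of the TOS byte.
    """
    if p_max <= 1:
        raise ValueError("p_max must be greater than 1")
    clamped = min(max(priority, 1), p_max)
    precedence = int(7 * (clamped - 1) / (p_max - 1))
    return precedence << 5


def apply_tos(sock, tos):
    """Set the TOS byte on an IPv4 socket (platform support assumed)."""
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_TOS, tos)
```

A sender would call `apply_tos` on its UDP socket whenever the delay control process produces a new priority.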
  • <Step S205: Video/audio encoding and transmission processing>
  • The video/audio encoding unit 103 performs compression encoding such as MPEG-2 on the video/audio data input from the video/audio input unit 102, using the encoding parameters input from the parameter control unit 107,
  • and outputs the encoded video/audio data to the transmission unit 104 (step S205).
  • Here, the generated code amount per frame is kept at or below the input value,
  • and the bit rate is controlled so that the amount of code generated per unit time is at or below a certain value.
  • the encoding method in the video / audio encoding unit 103 is not limited to MPEG, and any encoding method can be used.
  • The transmission unit 104 transmits the encoded video/audio data input from the video/audio encoding unit 103 to the other TV conference device that is the communication partner, using the predetermined transmission parameter.
  • IP/UDP/RTP is used as the transmission method, but any method can be used as long as it can transmit video and audio data through the transmission path.
  • Alternatively, the transmission unit 104 transmits the data to the other TV conference device that is the communication partner, using the transmission parameter input from the parameter control unit 107.
  • the video / audio input unit 102 determines that the process has ended when the input of the video / audio data is completed, or when a preset time has elapsed (Yes in step S206), and ends the process. In other cases (No in step S206), the video conference apparatus 101 continues the video / audio input process (step S201) and the data reception process (step S202).
  • the low delay priority determination unit 105 determines a low delay priority,
  • the delay amount determination unit 106 determines a delay amount using the low delay priority, and
  • the parameter control unit 107 changes the video / audio encoding parameter or the transmission parameter according to the delay amount.
  • the low delay priority determination unit 105 determines a higher low delay priority as the frequency of switching between transmission and reception of voiced audio data increases.
  • when determining the low delay priority from the transmission / reception frequency of voiced audio data, the low delay priority determination unit 105 compares the frequency with a preset threshold.
  • the low delay priority determination unit 105 determines the low delay priority using the audio data transmission / reception frequency. However, it may instead use the difference between the transmission time and the reception time of the audio data, and determine a higher low delay priority as that difference becomes smaller.
  • the low delay priority determination unit 105 determines the low delay priority by comparing the transmission / reception frequency of voiced audio data with a preset threshold value. However, the difference between the transmission time and the reception time of the audio data may instead be compared with a preset threshold value, and a higher low delay priority may be determined when the difference is smaller.
  • the delay amount determination unit 106 performs predetermined threshold processing on the determined low delay priority and sets the delay amount to a smaller value as the priority is higher. [0101] According to this configuration, since the delay amount is determined by threshold processing using the low delay priority, the delay amount can be determined by simple processing.
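The threshold processing described above might look like the following minimal sketch. The priority scale (0.0 to 1.0), the threshold values, and the delay values in milliseconds are all assumptions chosen for illustration; the text here does not specify them.

```python
# Hypothetical (threshold, delay_ms) pairs, ordered from highest priority down.
DELAY_TABLE_MS = [
    (0.8, 100),  # priority >= 0.8 -> 100 ms
    (0.5, 200),  # priority >= 0.5 -> 200 ms
    (0.2, 400),  # priority >= 0.2 -> 400 ms
]
DEFAULT_DELAY_MS = 800  # used when the priority is below every threshold


def delay_from_priority(low_delay_priority):
    """Return a delay amount that shrinks as the low delay priority grows."""
    for threshold, delay_ms in DELAY_TABLE_MS:
        if low_delay_priority >= threshold:
            return delay_ms
    return DEFAULT_DELAY_MS
```

A table lookup of this kind keeps the decision to a handful of comparisons, which matches the stated aim of determining the delay amount by simple processing.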
  • the parameter control unit 107 reduces the maximum generated code amount of the video / audio encoding so that the delay is equal to or less than the determined delay amount.
  • the smaller the maximum generated code amount of the video / audio encoding, the smaller the maximum transmission delay, so the transmission delay can be kept at or below the delay amount and low-delay transmission within the designated delay amount can be performed.
  • the parameter control unit 107 may change the buffer capacity used for video / audio decoding so that the delay is equal to or less than the determined delay amount.
  • the parameter control unit 107 may set the packet priority of transmission data higher as the determined delay amount is smaller.
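As a rough illustration of how a determined delay amount could be turned into the three kinds of parameters mentioned above, the sketch below assumes a known channel rate and uses the relation that a frame of B bits needs B / rate seconds on the wire. The function name, the eight priority levels, and the 800 ms ceiling are hypothetical, and the text presents these parameters as alternatives rather than requiring all three at once.

```python
from dataclasses import dataclass


@dataclass
class ControlParameters:
    max_frame_bits: int       # cap on the generated code amount per frame
    decoder_buffer_bits: int  # receive buffer capacity on the decoding side
    packet_priority: int      # larger value = forwarded earlier by routers


def parameters_for_delay(delay_ms, channel_bps, priority_levels=8, max_delay_ms=800):
    """Derive illustrative control parameters from a determined delay amount."""
    # A frame of B bits takes B / channel_bps seconds to send, so B must not
    # exceed channel_bps * delay for the frame to arrive within the delay amount.
    budget_bits = channel_bps * delay_ms // 1000
    # Map smaller delay amounts to higher packet priorities (0 .. priority_levels - 1).
    clamped = min(max(delay_ms, 0), max_delay_ms)
    packet_priority = (priority_levels - 1) * (max_delay_ms - clamped) // max_delay_ms
    return ControlParameters(budget_bits, budget_bits, packet_priority)


tight = parameters_for_delay(delay_ms=100, channel_bps=1_000_000)
relaxed = parameters_for_delay(delay_ms=800, channel_bps=1_000_000)
```

With these assumed numbers, the 100 ms conference gets a smaller frame budget and a higher packet priority than the 800 ms conference.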
  • the delay control servers connected via transmission lines determine the delay amounts of a plurality of video conference apparatuses in an integrated manner.
  • FIG. 5 is a block diagram showing a configuration of the video conference system according to Embodiment 2 of the present invention.
  • the TV conference system according to the present embodiment includes a plurality of TV conference devices, each placed with a conference participant and communicating audio and video, and a delay control server 504 that determines and controls the delay amount in the TV conferences using the plurality of TV conference devices.
  • the plurality of video conference apparatuses and the delay control server 504 are configured by being connected through a common transmission path 111.
  • a video conference apparatus 501 shown in the figure shows one of a plurality of video conference apparatuses connected to the transmission path 111.
  • the video conference apparatus 501 shown in FIG. 5 includes: a video / audio input unit 102 that is connected to a camera and a microphone and receives video / audio; a video / audio encoding unit 103 that encodes the video / audio; a transmission unit 104 that is connected to the transmission path 111 and transmits the encoded video / audio data; a low delay priority determination unit 105 that determines the low delay priority using the video / audio input; a transmission / reception unit 502 that transmits the low delay priority to the delay control server 504 and receives the delay amount; a parameter control unit 503 that changes the encoding parameter using the delay amount; a receiving unit 110 that is connected to the transmission path 111 and receives the encoded video / audio data transmitted from the video conference apparatus that is the communication partner; a video / audio decoding unit 109 that decodes the encoded video / audio data; and a monitor and speaker connected to the decoding unit.
  • the delay control server 504 includes a low delay priority receiving unit 505 that receives a low delay priority through the transmission path, a delay amount determining unit 506 that determines a delay amount using the low delay priority, and a delay amount transmitting unit 507 that transmits the delay amount.
  • the delay amount determination unit 506 provided in the delay control server 504, the low delay priority determination unit 105 and the parameter control unit 503 provided in the TV conference device 501 correspond to the TV conference control device.
  • the communication between the delay control server 504 and the TV conference device 501 is an example realized by the low delay priority receiving unit 505, the delay amount transmitting unit 507, and the transmitting / receiving unit 502.
  • the processing units having the same operation contents as those of the first embodiment are given the same numbers as those in FIG. 1, and the description of the operations is omitted. Therefore, the processing units that differ in operation content from Embodiment 1 are the transmission / reception unit 502, the parameter control unit 503, and the delay control server 504 in the TV conference device 501.
  • the transmission / reception unit 502 transmits the low delay priority input from the low delay priority determination unit 105 to the delay control server 504 through the transmission path 111.
  • the low delay priority receiving unit 505 receives the low delay priority transmitted from the transmission / reception unit 502 through the transmission path 111 and outputs the low delay priority to the delay amount determination unit 506.
  • the delay amount determination unit 506 determines a delay amount using the low delay priority input from the low delay priority receiving unit 505, and outputs the delay amount to the delay amount transmitting unit 507.
  • the delay amount transmitting unit 507 transmits the delay amount input from the delay amount determining unit 506 to the TV conference device 501 through the transmission path 111.
  • FIG. 6 shows the relationship between a plurality of video conference apparatuses and a delay control server according to Embodiment 2.
  • the six TV conference devices shown in the figure are holding three separate conferences in pairs, and the delay control server 607 controls the delay amounts of these TV conference devices.
  • TV conference apparatuses 601 to 606 have the same function as the TV conference apparatus 501 in FIG. 5.
  • the delay control server 607 has the same function as the delay control server 504 in FIG. 5.
  • the operations of the flowcharts shown in FIGS. 7 and 8 are stored as control programs in storage devices (not shown; for example, a ROM or a flash memory) of the video conference device 501 and the delay control server 504, and are executed under the control of a CPU (not shown).
  • FIG. 7 is a flowchart showing processing executed by the video conference apparatus 501 according to Embodiment 2.
  • steps having the same processing contents as those in the first embodiment are given the same numbers as those in FIG. 2 and will not be described.
  • the video conference apparatus 501 performs the video / audio input process (step S201), the data reception process (step S202), and the video / audio decoding output process (step S203) of the first embodiment, and then executes the following processing.
  • <Step S701 Low delay priority calculation processing>
  • the low delay priority determination unit 105 calculates the low delay priority using the audio data transmission / reception frequency, and outputs the low delay priority to the transmission / reception unit 502 (step S701).
  • the low delay priority determination unit 105 calculates the low delay priority through the same processes as the audio data transmission / reception frequency calculation process (step S301) and the low delay priority calculation process (step S302) of the first embodiment.
  • <Step S702 Low delay priority transmission processing>
  • the transmission / reception unit 502 transmits the low delay priority input from the low delay priority determination unit 105 to the delay control server 504 through the transmission path 111 (step S702).
  • <Step S703 Receive delay amount, update parameter>
  • the transmission / reception unit 502 receives the delay amount transmitted from the delay control server 504 through the transmission path 111. The parameter control unit 503 then, in the same way as the delay amount calculation process (step S303) and the parameter calculation / update process of the first embodiment, obtains the delay amount, determines the encoding parameter, and inputs the encoding parameter to the video / audio encoding unit 103.
  • parameter control section 503 may determine a transmission parameter based on the calculated delay amount, and output the transmission parameter to transmission section 104.
  • the TV conference device 501 executes the video / audio encoding and transmission process (step S205) and the end determination process (step S206) of the first embodiment, and ends the processing.
  • FIG. 8 is a flowchart showing a process executed by the delay control server according to the second embodiment.
  • here it is assumed that three video conference sessions are established as shown in FIG. 6, and the case where low delay priorities are transmitted from all video conference devices to the delay control server is described. That is, in FIG. 6, the TV conference devices 601 and 602, the TV conference devices 603 and 604, and the TV conference devices 605 and 606 are holding separate TV conferences, and each TV conference device transmits a low delay priority to the delay control server 607.
  • <Step S801 Low delay priority reception processing>
  • the low-delay priority receiving unit 505 receives the low-delay priority transmitted from the transmission / reception unit 502 of the video conference apparatus 501 via the transmission path 111 and outputs the low-delay priority to the delay amount determination unit 506 (step S801).
  • <Step S802 Delay amount calculation process>
  • the delay amount determination unit 506 receives six low delay priorities from the TV conference devices 601 to 606 in FIG. 6, determines an individual delay amount for each of the six TV conference devices, and outputs the delay amounts to the delay amount transmission unit 507 (step S802).
  • (Formula 5) shows an example of a delay amount calculation formula.
  • Delay (t, x) and P (t, x) are the delay amount and the low delay priority of TV conference device x at time t.
  • DAVE is a predetermined average value of the delay amount.
  • PAVE (t) is the average value of the low delay priorities of all TV conference devices at time t.
  • K is a predetermined delay adjustment parameter.
  • time t is synchronized across all terminals, and each delay amount is calculated from the low delay priorities at the same time t.
  • Formula 5 is only an example of a method for calculating the delay amount; any method can be used as long as it yields a smaller delay amount for a higher low delay priority.
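Formula 5 itself is not reproduced in this text. Purely as an illustration, the sketch below assumes one form that is consistent with the definitions above and with the requirement that a higher low delay priority gives a smaller delay: Delay(t, x) = DAVE - K * (P(t, x) - PAVE(t)). This assumed form, the function name, and the sample numbers are not confirmed by the source.

```python
def delay_amounts(priorities, d_ave, k, min_delay=0.0):
    """Assumed form: Delay(t, x) = d_ave - k * (P(t, x) - P_ave(t)).

    priorities maps a device id to its low delay priority at the same time t.
    The result is clamped at min_delay so that it never becomes negative.
    """
    p_ave = sum(priorities.values()) / len(priorities)
    return {
        device: max(min_delay, d_ave - k * (p - p_ave))
        for device, p in priorities.items()
    }


# Three conferences as in FIG. 6; the priority values are made up for illustration.
priorities = {601: 9, 602: 9, 603: 5, 604: 5, 605: 1, 606: 1}
delays = delay_amounts(priorities, d_ave=300.0, k=20.0)
```

Under this assumed form, a device whose priority equals the average receives exactly DAVE, devices above the average receive less delay, and devices below it receive more, so the average delay across devices stays at DAVE as long as no value is clamped.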
  • <Step S803 Delay amount transmission processing>
  • the delay amount transmission unit 507 transmits the delay amount for each of the plurality of TV terminals input from the delay amount determination unit 506 to the respective TV conference apparatuses 601 to 606 through the transmission path 111 (step S803).
  • as described above, the transmission / reception unit 502 transmits the low delay priority to, and receives the delay amount from, the delay control server 607, and the delay control server 607 determines the delay amounts so that, among the plurality of TV conference devices, a device with a relatively high low delay priority is given a smaller delay amount.
  • because the delay control server centrally manages the delay amounts, the low delay priorities of multiple TV conferences can be grasped easily and low-delay transmission becomes possible.
  • the transmission / reception section 502 transmits the low delay priority to the delay control server, but the low delay priority can also be transmitted to other video conference apparatuses, and the delay amount can be determined in the video conference apparatus instead of in the delay control server. [0140] In that case a delay control server is unnecessary, the delay amount is determined by comparison with other TV conference devices, and a smaller delay amount can be set for a TV conference that requires lower delay.
  • the transmission / reception unit 502 transmits the low delay priority to the delay control server, but instead of the low delay priority, the transmission / reception frequency of the audio data may be exchanged with other TV conference devices.
  • in that case, the low delay priority determination unit 105 can determine a higher low delay priority the more its own frequency exceeds the transmission / reception frequencies of the other TV conference devices received by the transmission / reception unit 502.
  • likewise, instead of transmitting the low delay priority to the delay control server, the transmission / reception unit 502 may exchange the transmission / reception time difference of the audio data with other TV conference devices.
  • in that case, the low delay priority determination unit 105 can determine a smaller delay amount the smaller its own difference is compared with the transmission / reception time differences of the other TV conference devices received by the transmission / reception unit 502.
  • the video conference apparatus according to the present invention uses the transmission / reception frequency of audio data, determines the delay amount so that it decreases as the frequency increases, and controls the TV conference according to the determined delay amount.
  • as a result, the delay amount becomes smaller in a TV conference state where audio data is transmitted and received frequently and the need for low delay is high, and larger in a TV conference state where audio data is transmitted and received infrequently and low delay is not needed.
  • an optimal delay amount can therefore be set on a transmission path with limited bandwidth. This is particularly useful in video conference systems that require low-delay transmission over the best-effort Internet.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
  • Data Exchanges In Wide-Area Networks (AREA)
  • Telephonic Communication Services (AREA)

Abstract

Provided is a device capable of continuously performing transmission with a delay amount based on the conference state while synchronizing audio with video even in a best effort type network. A teleconference control device controls a teleconference device (101) communicating video data and audio data via a transmission path. The teleconference control device includes: a low delay priority decision unit (105) for deciding the low delay priority indicating the degree for suppressing the delay of the video data and the audio data to a higher value as the frequency of switching between transmission and reception of the audio data as a sound detected by the teleconference device (101) increases; a delay amount decision unit (106) for deciding a smaller delay amount as the low delay priority increases; and a parameter control unit (107) for deciding an encoding parameter or packet priority used in the teleconference device (101) in accordance with the decided delay amount.

Description

Specification
TV conference control device and TV conference control method
Technical field
[0001] The present invention relates to a TV conference control device that controls a TV conference device.
Background art
[0002] In recent years, ADSL (Asymmetric Digital Subscriber Line) and optical fiber networks have spread rapidly, and low-cost, high-speed Internet connections have become available. By using such low-cost high-speed Internet connections to transmit video data and audio data (hereinafter referred to as "video/audio data") bidirectionally between multiple remote sites, it has become possible to build a TV conference system easily.
[0003] However, IP networks, as typified by the current Internet, are best-effort networks that guarantee nothing about the effective bandwidth over which data can be transmitted without loss. Therefore, if, for example, data is transmitted at a rate exceeding the available effective bandwidth, congestion occurs in the network and the transmission of video/audio data is delayed; under congestion severe enough to overflow the buffers of network equipment, data is lost.
[0004] In TV conferences, transmitting video and audio without interruption and with low delay is generally considered very important for smooth communication. However, as described above, a low-cost best-effort network does not guarantee bandwidth, so it is very difficult to achieve low-delay transmission at all times. Low-delay transmission can instead be achieved by using a leased-line service, which is a bandwidth-guaranteed network rather than a best-effort network, but this requires a usage fee of about 10 million yen per month and is therefore very costly.
[0005] As a conventional method for solving this, Patent Document 1, for example, normally synchronizes video and audio in a TV conference by inserting a delay into the audio to match the delay of the higher-bandwidth video, but when audio is input at both ends, it stops inserting the audio delay, thereby lowering the audio delay and improving responsiveness.
[0006] FIG. 9 shows a conventional TV conference device. In FIG. 9, the TV conference device 910 includes a video encoding/decoding unit 901 that encodes and decodes video, an audio encoding/decoding unit 902 that encodes and decodes audio, a transmission delay circuit 903 that delays audio input, a transmission switch 904 that switches the delay input, an audio input monitoring unit 905 that monitors the audio input on the transmitting and receiving sides and shortens the delay time when both are input simultaneously, a reception switching unit 906 that switches reception, a reception delay circuit 907 that applies a delay to received audio, and a multiplexing/demultiplexing unit 908 that multiplexes and demultiplexes video and audio. It is connected to a multipoint connection control device 909 and performs bidirectional video and audio communication between TV conference devices.
[0007] In this way, the TV conference device of Patent Document 1 monitors the audio input on the transmitting and receiving sides in the audio input monitoring unit 905 and shortens the delay time when both are input simultaneously, thereby enabling low-delay audio transmission.
Patent Document 1: JP-A-8-317362
Disclosure of the invention
Problems to be solved by the invention
[0008] However, Patent Document 1 transmits only the audio data with low delay, so the video data is displayed out of synchronization with the audio data.
[0009] For example, the speaker's video is played back with a delay after the speaker's voice has been played back, resulting in unnatural video and audio playback that causes considerable discomfort in a TV conference. Also, for example, when a conference is held while pointing at a model, synchronization between the voice and the pointing gesture is essential for smooth communication.
[0010] In view of these points, the present invention uses the transmission/reception frequency of audio data to determine, as a low-delay priority, the degree to which low-delay transmission of video/audio data is required, and controls the video/audio encoding parameters or transmission parameters so that lower-delay transmission is achieved as the low-delay priority becomes higher. Its object is to provide a device that, even on a best-effort network, can transmit without interruption with a delay amount suited to the state of the conference while always keeping audio and video synchronized.
Means for solving the problem
[0011] The TV conference control device of the present invention controls a TV conference device that communicates video data and audio data through a transmission path, and comprises: low-delay priority determining means for determining a higher low-delay priority, which indicates the degree to which the delay of the video data and audio data should be suppressed, as the frequency of switching between transmission and reception of voiced audio data detected by the TV conference device increases; delay amount determining means for determining a smaller delay amount as the low-delay priority increases; and parameter control means for determining an encoding parameter or a packet priority used in the TV conference device in accordance with the determined delay amount.
[0012] According to this configuration, the higher the frequency of switching between transmission and reception of audio data, the higher the low-delay priority of audio and video is set, and a delay amount corresponding to it is determined. Furthermore, the encoding parameter or packet priority is determined according to the delay amount.
[0013] The encoding parameter is a parameter used for compression encoding; it makes it possible to control the amount of data after encoding and the amount of data used for decoding. Data with a high packet priority is processed by routers on the transmission path in preference to data with a low packet priority and is forwarded first on the transmission path.
[0014] Therefore, by determining the encoding parameter or packet priority, video and audio can be communicated with lower delay for conferences in which discussion is more active, while audio and video are kept synchronized.
[0015] Preferably, the low-delay priority determining means measures the frequency as the number of times, within a fixed time, that transmission and reception of voiced audio data detected by the TV conference device switch.
[0016] According to this configuration, the frequency with which transmission and reception of audio data switch within a fixed time is used as an indicator of how quickly transmission and reception of audio data alternate. The low-delay priority can therefore be determined by the simple process of monitoring transmission and reception timing.
[0017] More preferably, the low-delay priority determining means compares the frequency with a threshold and, when the frequency is greater than the threshold, determines a higher low-delay priority than when the frequency is equal to or less than the threshold.
[0018] According to this configuration, the low-delay priority can be determined by the simple process of comparing the transmission/reception frequency of audio data with a preset threshold.
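The counting and threshold comparison described in the preceding paragraphs can be sketched as follows. The event representation (timestamps in seconds tagged "tx" or "rx" for voiced audio), the window bounds, and the two priority levels are assumptions made for illustration; the specification does not fix them here.

```python
def count_switches(events, window_start, window_end):
    """Count how often voiced audio alternates between sending and receiving.

    events is a list of (timestamp_s, direction) pairs, where direction is
    "tx" for voiced audio sent and "rx" for voiced audio received.
    Only events with window_start <= timestamp < window_end are considered.
    """
    switches = 0
    previous = None
    for timestamp, direction in sorted(events):
        if not (window_start <= timestamp < window_end):
            continue
        if previous is not None and direction != previous:
            switches += 1
        previous = direction
    return switches


def low_delay_priority(switches, threshold, high=2, low=1):
    """Return the higher priority only when the frequency exceeds the threshold."""
    return high if switches > threshold else low


events = [(0.0, "tx"), (1.5, "rx"), (2.0, "rx"), (3.2, "tx"), (4.0, "rx"), (12.0, "tx")]
switches = count_switches(events, window_start=0.0, window_end=10.0)
```

Here the event at 12.0 s falls outside the ten-second window, and consecutive "rx" events do not count as a switch, so only genuine changes of speaker direction raise the count.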
[0019] More preferably, the low-delay priority determining means measures the frequency by the difference between the transmission time and the reception time of audio data detected by the TV conference device, and determines a higher low-delay priority as the difference between the transmission time and the reception time becomes smaller.
[0020] According to this configuration, the difference between the transmission time and the reception time of audio data is used as an indicator of how quickly transmission and reception of audio data alternate. The low-delay priority can therefore be determined by the simple process of monitoring the transmission and reception times and their difference.
[0021] More preferably, the low-delay priority determining means compares the difference between the transmission time and the reception time with a threshold and, when the difference is smaller than the threshold, determines a higher low-delay priority than when the difference is equal to or greater than the threshold.
[0022] According to this configuration, the low-delay priority can be determined by the simple process of comparing the difference between the transmission and reception times of audio data with a preset threshold.
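The time-difference variant can be sketched in the same style. This sketch interprets the difference as the gap between one voiced event and the next voiced event in the opposite direction; that interpretation, the event format, and the priority levels are assumptions rather than details given in the text.

```python
def turn_gaps(events):
    """Gaps in seconds between a voiced event and the next one in the other direction.

    events is a list of (timestamp_s, direction) pairs, direction being "tx" or "rx".
    """
    ordered = sorted(events)
    return [
        later_time - earlier_time
        for (earlier_time, earlier_dir), (later_time, later_dir) in zip(ordered, ordered[1:])
        if earlier_dir != later_dir
    ]


def low_delay_priority_from_gap(gap_s, threshold_s, high=2, low=1):
    """Return the higher priority only when the gap is smaller than the threshold."""
    return high if gap_s < threshold_s else low


gaps = turn_gaps([(0.0, "tx"), (0.5, "rx"), (3.0, "tx")])
```

A short gap suggests a quick back-and-forth exchange, so it maps to the higher priority, while a gap at or above the threshold maps to the lower one.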
[0023] More preferably, the parameter control means determines, as the compression code amount parameter, the receive buffer capacity used to decode the video data and audio data in the TV conference device that receives them.
[0024] According to this configuration, the buffer capacity for video/audio decoding is changed dynamically; because a smaller receive buffer capacity means a shorter reception wait, the delay amount can be kept at or below the requested delay value.
[0025] More preferably, the parameter control means determines, as the compression code amount parameter, the maximum generated code amount for encoding the video data and audio data in the TV conference device.
[0026] The smaller the maximum generated code amount of the video/audio encoding, the smaller the transmission delay and the lower the delay of transmission, so the delay can be kept at or below the requested delay amount.
[0027] The TV conference control device of the present invention may also be a TV conference control device that controls a plurality of TV conference devices communicating the video data and audio data of a plurality of conferences through a common transmission path, comprising: low-delay priority determining means for determining a higher low-delay priority, which indicates the degree to which the delay of the video data and audio data should be suppressed, as the frequency of switching between transmission and reception of voiced audio data detected by each TV conference device increases; delay amount determining means for determining a smaller delay amount for the group of TV conference devices used in each conference as the low-delay priority increases; and parameter control means for determining an encoding parameter or a packet priority used in the TV conference devices in accordance with the determined delay amount.
[0028] According to this configuration, the low-delay priority is determined in comparison with other TV conferences, so among multiple TV conferences sharing a common transmission path, those with a stronger need for low delay can be given lower delay. In addition, video and audio can be communicated with lower delay for conferences in which discussion is more active, while audio and video are kept synchronized.
[0029] The present invention can be realized not only as such a TV conference control device but also as an integrated circuit or a TV conference device that includes the characteristic means of the TV conference control device. The present invention can also be realized as a TV conference system composed of a TV conference control device and TV conference devices. Furthermore, the present invention can be realized as a TV conference control method whose steps are the characteristic means of the TV conference control device.
Effects of the Invention
[0030] According to the present invention, a parameter for controlling a delay amount common to video and audio is determined according to how quickly transmission and reception of audio data alternate. As a result, the delay becomes lower where the need for low delay is greater, such as in a TV conference with an active discussion, and because video and audio remain synchronized, the conference can proceed smoothly.
Brief Description of Drawings
[0031] [FIG. 1] FIG. 1 is a block diagram showing the configuration of the TV conference device according to Embodiment 1.
[FIG. 2] FIG. 2 is a flowchart showing the processing that the TV conference device according to Embodiment 1 executes when holding a TV conference.
[FIG. 3] FIG. 3 is a flowchart showing details of the delay control process S204 shown in FIG. 2.
[FIG. 4] FIG. 4 is a conceptual diagram showing the transmission/reception frequency of audio data according to Embodiment 1.
[FIG. 5] FIG. 5 is a block diagram showing the configuration of the TV conference system according to Embodiment 2 of the present invention.
[FIG. 6] FIG. 6 is a diagram showing the relationship between a plurality of TV conference devices and the delay control server according to Embodiment 2.
[FIG. 7] FIG. 7 is a flowchart showing the processing executed by the TV conference device according to Embodiment 2.
[FIG. 8] FIG. 8 is a flowchart showing the processing executed by the delay control server according to Embodiment 2.
[FIG. 9] FIG. 9 is a diagram showing the configuration of a conventional TV conference terminal device.
Explanation of Symbols
101, 501, 601, 602, 603, 604 TV conference device
102 Video/audio input unit
103 Video/audio encoding unit
104 Transmission unit
105 Low-delay priority determination unit
106 Delay amount determination unit
107, 503 Parameter control unit
108 Video/audio output unit
109 Video/audio decoding unit
110 Reception unit
111 Transmission path
502 Transmission/reception unit
504, 607 Delay control server
505 Low-delay priority reception unit
506 Delay amount determination unit
507 Delay amount transmission unit
901 Video encoding/decoding unit
902 Audio encoding/decoding unit
903 Transmission delay circuit
904 Transmission switch
905 Audio input monitoring unit
906 Reception switching unit
907 Reception delay circuit
908 Multiplexing/demultiplexing unit
909 Multipoint connection control device
Best Mode for Carrying Out the Invention
[0033] Embodiments of the present invention are described in detail below with reference to the drawings.
[0034] (Embodiment 1)
FIG. 1 is a block diagram showing the configuration of the TV conference device according to Embodiment 1 of the present invention. The TV conference device of this embodiment is placed with a conference participant and communicates audio and video. Each TV conference device of this embodiment includes a TV conference control device that dynamically controls, according to the state of the conference, the delay from when a party's audio and video are input until that audio and video are output at the other TV conference device.
[0035] The TV conference device 101 shown in FIG. 1 includes: a video/audio input unit 102, connected to a camera and a microphone, to which video and audio are input; a video/audio encoding unit 103 that compresses and encodes the video data and audio data (hereinafter, "video/audio data"); a transmission unit 104, connected to a transmission path 111, that transmits the encoded video/audio data; a low-delay priority determination unit 105 that determines a low-delay priority from the video/audio input; a delay amount determination unit 106 that determines a delay amount from the low-delay priority; a parameter control unit 107 that changes an encoding parameter according to the delay amount; a reception unit 110, connected to the transmission path 111, that receives encoded video/audio data transmitted from the TV conference terminal of the communication partner; a video/audio decoding unit 109 that decodes the encoded video/audio data; and a video/audio output unit 108, connected to a monitor and a speaker, that outputs the decoded video/audio data to the monitor and the speaker.
[0036] Here, and in the rest of this description, an encoding parameter is a parameter used for encoding in the video/audio encoding unit 103 and for decoding in the video/audio decoding unit 109.
[0037] The video/audio input unit 102 outputs the uncompressed video data and audio data input from the camera and the microphone, frame by frame, to the video/audio encoding unit 103 and the low-delay priority determination unit 105.
[0038] The video/audio encoding unit 103 applies compression encoding such as MPEG-2 to the video/audio data input from the video/audio input unit 102, using the encoding parameter input from the parameter control unit 107, and outputs the encoded video/audio data to the transmission unit 104.
[0039] The transmission unit 104 transmits the encoded video/audio data input from the video/audio encoding unit 103 to another TV conference device, the communication partner, using the transmission parameter input from the parameter control unit 107.
[0040] The low-delay priority determination unit 105 detects voiced portions in the audio data contained in the video/audio data input from the video/audio input unit 102, determines a low-delay priority based on the detected voiced portions, and outputs the low-delay priority to the delay amount determination unit 106.
[0041] A voiced portion is detected, for example, by whether the volume of the audio data is at or above a threshold. The method of determining the low-delay priority is described later.
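As an illustration only, the threshold-based detection of voiced portions might be sketched in Python as follows. The function names, the use of the RMS level as the "volume", and the default threshold value are assumptions introduced for this sketch; the description only states that the volume is compared with a threshold.

```python
import math


def frame_rms(samples):
    """Root-mean-square level of one audio frame.

    `samples` is assumed to be a sequence of floats in [-1.0, 1.0].
    An empty frame is treated as silence.
    """
    if not samples:
        return 0.0
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def is_voiced(samples, threshold=0.05):
    """Return True when the frame level is at or above the threshold.

    The threshold value 0.05 is a hypothetical default, not a value
    taken from the description.
    """
    return frame_rms(samples) >= threshold
```

A frame for which `is_voiced` returns True would count as a voiced portion when the low-delay priority is determined.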
[0042] The delay amount determination unit 106 determines a delay amount using the low-delay priority input from the low-delay priority determination unit 105, and outputs the delay amount to the parameter control unit 107.
[0043] The parameter control unit 107 changes the encoding parameter using the delay amount input from the delay amount determination unit 106, and outputs the encoding parameter to the video/audio encoding unit 103.
[0044] Instead of the encoding parameter, the parameter control unit 107 may change a transmission parameter and output it to the transmission unit 104. The transmission parameter is, for example, a packet priority.
[0045] The reception unit 110 receives, through the transmission path 111, the encoded video/audio data transmitted from the TV conference terminal of the communication partner, and outputs the received data to the video/audio decoding unit 109.
[0046] The video/audio decoding unit 109 decodes the encoded video/audio data input from the reception unit 110 according to the encoding method with which it was encoded, outputs the decoded video/audio data to the video/audio output unit 108, and outputs the audio data to the low-delay priority determination unit 105.
[0047] The video/audio output unit 108 outputs the video/audio data input from the video/audio decoding unit 109 to the connected monitor and speaker.
[0048] Next, the operation of the TV conference device 101 of this embodiment is described using the flowcharts shown in FIG. 2 and FIG. 3.
[0049] FIG. 2 is a flowchart of the processing that the TV conference device 101 according to Embodiment 1 executes when holding a TV conference. Each process shown in FIG. 2 is stored as a control program in a storage device (not shown; for example, ROM or flash memory) of the TV conference device 101 and is controlled by a CPU (not shown).
[0050] <Step S201: Video/audio input process>
First, the video/audio input unit 102 takes in uncompressed video data and audio data, frame by frame, from the connected camera and microphone, outputs the video data and audio data to the video/audio encoding unit 103, and outputs the audio data to the low-delay priority determination unit 105 (step S201).
[0051] <Step S202: Data reception process>
The reception unit 110 receives, through the transmission path 111, the encoded video/audio data transmitted from the TV conference terminal of the communication partner, and outputs the received encoded data to the video/audio decoding unit 109 (step S202).
[0052] <Step S203: Video/audio decoding and display output process>
The video/audio decoding unit 109 decodes the encoded video/audio data input from the reception unit 110 according to the encoding method with which it was encoded, outputs the decoded video/audio data to the video/audio output unit 108, and outputs the audio data to the low-delay priority determination unit 105. The video/audio output unit 108 then outputs the video/audio data input from the video/audio decoding unit 109 to the connected monitor and speaker (step S203).
[0053] <Step S204: Delay control process>
The low-delay priority determination unit 105 determines the low-delay priority, the delay amount determination unit 106 determines the delay amount according to the determined low-delay priority, and the parameter control unit 107 determines the encoding parameter using the delay amount (step S204). The details of this delay control process are described with reference to the drawings.
[0054] Here, the parameter control unit 107 may use the delay amount to determine a transmission parameter instead of the encoding parameter.
[0055] FIG. 3 is a flowchart showing details of the delay control process (step S204) shown in FIG. 2.
[0056] <Step S301: Audio packet interval calculation process>
First, the low-delay priority determination unit 105 calculates the frequency of transmission and reception of audio data, using the transmitted audio data input from the video/audio input unit 102 and the received audio stream input from the video/audio decoding unit 109 (step S301).
[0057] Here, the frequency of transmission and reception of audio data represents how active the conversation is. That is, the higher this frequency, the more active the discussion, the greater the impact of delay, and the greater the need for low-delay transmission.
[0058] [Equation 1]
$$N(t) = N_s(t) + N_r(t) \qquad \text{(Equation 1)}$$
[0059] (Equation 1) is one example of a formula for calculating the frequency of transmission and reception of audio data. In (Equation 1), N(t) is the transmission/reception frequency of audio data at time t, Ns(t) is the number of times audio data was transmitted during the period of length T preceding time t, and Nr(t) is the number of times audio data was received during the period of length T preceding time t.
[0060] The transmission count and the reception count cover only voiced audio data, and the low-delay priority determination unit 105 performs the voiced/unvoiced decision. In other words, this frequency of transmission and reception of audio data is one example of the frequency at which transmission and reception of voiced audio data alternate, and it is expressed as the number of times transmission and reception of voiced audio data alternate within a fixed period.
[0061] (Equation 1) is only one example of a method of determining the transmission/reception frequency of audio data; any calculation method that represents the transmission/reception frequency of audio data can be used.
[0062] Strictly speaking, the timing of time t may differ between the transmitted data and the received data because of the transmission delay. The calculation may therefore take the transmission delay into account by adding it to the time t of the transmitted data.
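A minimal sketch of (Equation 1) in Python is given below, under stated assumptions: each voiced transmission or reception is represented by a timestamp in seconds, the window is the half-open interval (t - T, t], and the optional `tx_delay` argument applies the transmission-delay correction mentioned above. The function names and this event representation are hypothetical and are not taken from the description.

```python
def count_in_window(event_times, t, window):
    """Count events whose timestamp lies in the interval (t - window, t]."""
    return sum(1 for ts in event_times if t - window < ts <= t)


def tx_rx_frequency(tx_times, rx_times, t, window, tx_delay=0.0):
    """N(t) = Ns(t) + Nr(t) over the period of length `window` before t.

    tx_times / rx_times: timestamps of voiced audio transmissions and
    receptions. tx_delay is added to each transmission timestamp so that
    both lists are compared on the same time base.
    """
    shifted_tx = [ts + tx_delay for ts in tx_times]
    ns = count_in_window(shifted_tx, t, window)  # Ns(t)
    nr = count_in_window(rx_times, t, window)    # Nr(t)
    return ns + nr
```

For example, with voiced transmissions at 1, 3, 5 and 7 seconds and voiced receptions at 2, 4, 6 and 8 seconds, `tx_rx_frequency(..., t=8, window=8)` counts four of each and returns 8.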
[0063] FIG. 4 is a conceptual diagram showing the transmission/reception frequency of audio data according to Embodiment 1. A method of calculating the transmission/reception frequency of audio data between two TV conference devices is described with reference to this figure.
[0064] In FIG. 4, the hatched sections 401 labeled S indicate the audio data transmission state, and the white sections 402 labeled R indicate the audio data reception state. When there is input on both the transmitting and receiving sides, the louder one takes precedence.
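The labeling of sections as S or R, with the louder side taking precedence, could be sketched as follows. This is an illustrative Python sketch only: the per-frame level inputs, the tie-breaking rule (equal levels are labeled S), and the function names are assumptions, since the description does not specify them.

```python
def label_frame(tx_level, rx_level, threshold):
    """Label one frame as 'S' (sending), 'R' (receiving) or None (silent).

    A side is voiced when its level is at or above the threshold. When
    both sides are voiced, the louder one wins; a tie is labeled 'S',
    which is an arbitrary choice made for this sketch.
    """
    tx_voiced = tx_level >= threshold
    rx_voiced = rx_level >= threshold
    if tx_voiced and rx_voiced:
        return "S" if tx_level >= rx_level else "R"
    if tx_voiced:
        return "S"
    if rx_voiced:
        return "R"
    return None


def count_switches(labels):
    """Count changes between 'S' and 'R', skipping silent frames."""
    switches = 0
    last = None
    for label in labels:
        if label is None:
            continue
        if last is not None and label != last:
            switches += 1
        last = label
    return switches
```

Counting the switches over a fixed period gives one possible measure of how often voiced transmission and reception alternate.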
[0065] When the transmission/reception frequency of audio data is calculated according to (Equation 1), for example, N(t1) = 4 + 4 = 8 at time t1 and N(t2) = 2 + 2 = 4 at time t2, so the transmission/reception frequency of audio data is higher at time t1.
[0066] That is, in the state of FIG. 4(a), at time t1, audio data is being transmitted and received more frequently, which for a TV conference corresponds to an active discussion. Accordingly, the state of FIG. 4(a) is affected by delay more strongly than that of FIG. 4(b), the need for low-delay transmission is greater, and the low-delay priority needs to be set higher.
[0067] <Step S302: Low-delay priority calculation process>
Next, after calculating the transmission/reception frequency of audio data, the low-delay priority determination unit 105 sets the low-delay priority higher as the transmission/reception frequency increases. For example, the low-delay priority P(t) is determined as in (Equation 2) (step S302).
[0068] [Equation 2]
$$P(t) = \begin{cases} 0 & (N(t) < TH1) \\ \dfrac{PMAX \times (N(t) - TH1)}{TH2 - TH1} & (TH1 \le N(t) \le TH2) \\ PMAX & (N(t) > TH2) \end{cases} \qquad \text{(Equation 2)}$$
[0069] In (Equation 2), P(t) is the low-delay priority at time t, N(t) is the transmission/reception frequency of audio data at time t, TH1 and TH2 are predetermined thresholds (where TH1 < TH2), and PMAX is the predetermined maximum value of the priority. (Equation 2) is only one example of a method of determining the low-delay priority; any calculation method in which the low-delay priority increases as the transmission/reception frequency of audio data increases can be used.
[0070] <Step S303: Delay amount calculation process>
Next, the delay amount determination unit 106 calculates the delay amount using the low-delay priority calculated as described above (step S303).
[0071] Here, (Equation 3) shows one example of a method of calculating the delay amount. In (Equation 3), Delay(t) is the delay amount at time t, DMAX is a predetermined maximum delay amount, and P(t) is the low-delay priority at time t.
[0072] [Equation 3]
$$Delay(t) = \begin{cases} DMAX & (P(t) = 0) \\ \dfrac{DMAX}{P(t) + 1} & (P(t) \neq 0) \end{cases} \qquad \text{(Equation 3)}$$
[0073] According to (Equation 3), the delay amount determination unit 106 determines the delay amount so that its value becomes smaller as the low-delay priority becomes higher. (Equation 3) is only one example of a method of determining the delay amount; any calculation method in which the value becomes smaller as the low-delay priority becomes higher can be used.
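The two mappings of (Equation 2) and (Equation 3) can be sketched together in Python as shown below. This is a sketch under assumptions: the function and argument names are hypothetical, the boundary values N(t) = TH1 and N(t) = TH2 are assigned to the linear segment, and (Equation 3) is read as DMAX / (P(t) + 1), which also yields DMAX when P(t) = 0. The numeric values in the usage note are illustrative only.

```python
def low_delay_priority(n, th1, th2, p_max):
    """Piecewise-linear priority in the manner of (Equation 2).

    n: transmission/reception frequency N(t)
    th1, th2: thresholds with th1 < th2
    p_max: maximum priority PMAX
    """
    if th1 >= th2:
        raise ValueError("th1 must be smaller than th2")
    if n < th1:
        return 0.0
    if n > th2:
        return float(p_max)
    # Linear ramp from 0 at th1 up to p_max at th2.
    return p_max * (n - th1) / (th2 - th1)


def delay_amount(priority, d_max):
    """Delay that shrinks as the priority grows, in the manner of (Equation 3).

    With priority 0 the expression evaluates to d_max, so the case
    distinction of the equation collapses into a single formula here.
    """
    if priority < 0:
        raise ValueError("priority must not be negative")
    return d_max / (priority + 1)
```

With hypothetical settings TH1 = 2, TH2 = 10, PMAX = 8 and DMAX = 400 ms, a frequency of N(t) = 6 gives a priority of 4.0 and a delay amount of 80 ms, while a frequency below TH1 leaves the delay amount at 400 ms.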
[0074] <Step S304: Parameter calculation and update process>
Further, the parameter control unit 107 calculates the encoding parameter using the delay amount calculated in the delay amount calculation process (step S303), and outputs the calculated encoding parameter to the video/audio encoding unit 103.
[0075] Here, the case is described in which the parameter to be calculated is the fluctuation range of the bit rate (the code amount generated per unit time), which is one of the encoding parameters. For low-delay transmission, it is important to perform constant-bit-rate control in video encoding. That is, taking as the target the average code amount per frame, which is uniquely determined by the bit rate and the frame rate (the screen update frequency), the code amount generated for each frame is kept at or below that target.
[0076] Here, the maximum generated code amount is defined as the maximum code amount generated per frame. For example, when the code amount generated for a frame is N times the average code amount, transmitting the encoded data of that frame takes N times the usual time because there is N times as much data. Accordingly, at a frame rate of 30 fps, the delay amount is N/30 (seconds).
[0077] [Equation 4]
$$BITSMAX = \frac{Delay(t) \times BITRATE}{1000} \qquad \text{(Equation 4)}$$
[0078] (Equation 4) shows a method of calculating the maximum generated code amount. In (Equation 4), BITSMAX is the maximum generated code amount per frame (bits), Delay(t) is the delay amount at time t (ms), and BITRATE is the bit rate (bits/second), that is, the code amount generated per second in video encoding.
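As a worked illustration of (Equation 4) and of the N/30 relation described for a 30 fps stream, the following Python sketch converts a delay amount into a per-frame bit budget. The function names and the example figures (100 ms, 1 Mbit/s) are assumptions chosen for the sketch, not values from the description.

```python
def max_bits_per_frame(delay_ms, bitrate_bps):
    """BITSMAX = Delay(t) * BITRATE / 1000, with Delay(t) in milliseconds."""
    return delay_ms * bitrate_bps / 1000


def frame_delay_seconds(times_average, frame_rate):
    """Time to send a frame that is `times_average` times the average size.

    The average frame takes 1 / frame_rate seconds to send at the nominal
    bit rate, so a frame N times larger takes N / frame_rate seconds.
    """
    return times_average / frame_rate
```

For a delay amount of 100 ms at 1,000,000 bits/second, `max_bits_per_frame` gives a budget of 100,000 bits per frame, and sending that many bits at the same bit rate takes 0.1 seconds, which matches the requested delay. A frame three times the average size at 30 fps takes 3/30 = 0.1 seconds by `frame_delay_seconds`.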
[0079] This embodiment shows an example in which the maximum generated code amount is calculated from the delay amount. However, any parameter that allows the delay amount to be controlled in relative terms can be used, for example by changing the TOS (Type of Service) value, which is the packet priority in TCP/IP communication.
[0080] The delay control process described above is performed for every frame, but it may instead be performed at predetermined fixed intervals to reduce the processing load.
[0081] Instead of calculating the encoding parameter, the parameter control unit 107 may calculate a transmission parameter using the delay amount calculated in the delay amount calculation process (step S303), and output the calculated transmission parameter to the transmission unit 104.
[0082] When packet priority is used, the transmitting side sets the packet priority carried in each packet higher as the low-delay priority increases. Routers on the transmission path process packets with higher packet priority preferentially, so a packet with higher packet priority reaches the receiving TV conference device sooner than a packet with lower packet priority, and low-delay transmission is achieved.
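One way the low-delay priority might be turned into a packet priority is sketched below in Python. The mapping itself is an assumption made for illustration: the priority is scaled to the 3-bit IP precedence field (the top three bits of the IPv4 TOS byte), and `apply_tos` sets the value on a socket through the standard `IP_TOS` socket option where the platform exposes it. The description does not specify a mapping, and routers on a given network may ignore or rewrite this field.

```python
import socket


def priority_to_tos(priority, p_max):
    """Map a low-delay priority in [0, p_max] to an IPv4 TOS byte.

    The priority is clamped, scaled to a precedence value 0..7 (rounded
    down), and shifted into the top three bits of the TOS byte.
    """
    if p_max <= 0:
        raise ValueError("p_max must be positive")
    clamped = min(max(priority, 0), p_max)
    precedence = int(7 * clamped / p_max)
    return precedence << 5


def apply_tos(sock, tos):
    """Set the TOS byte for packets sent on an IPv4 socket."""
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_TOS, tos)
```

A sender could call `apply_tos(sock, priority_to_tos(p, p_max))` on its UDP socket each time the low-delay priority is updated.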
[0083] <Step S205: Video/audio encoding and transmission process>
The video/audio encoding unit 103 applies compression encoding such as MPEG-2 to the video/audio data input from the video/audio input unit 102, using the encoding parameter input from the parameter control unit 107, and outputs the encoded video/audio data to the transmission unit 104 (step S205).
[0084] For example, when the maximum generated code amount is input from the parameter control unit 107, the video/audio encoding unit 103 controls the bit rate so that the code amount generated per frame stays at or below the input value while the code amount generated per unit time stays at or below a fixed value.
[0085] However, the encoding method of the video/audio encoding unit 103 is not limited to MPEG, and any encoding method can be used.
[0086] Further, the transmission unit 104 transmits the encoded video/audio data input from the video/audio encoding unit 103 to another TV conference device, the communication partner, using a predetermined transmission parameter. Here, for example, IP/UDP/RTP is used as the transmission method, but any method capable of transmitting video/audio data over the transmission path can be used.
[0087] When the parameter control unit 107 calculates a packet priority instead of the encoding parameter, the transmission unit 104 transmits the data to the other TV conference device, the communication partner, using the transmission parameter input from the parameter control unit 107.
[0088] <Step S206: End determination>
When input of the video/audio data has ended, or when a preset time has elapsed, the video/audio input unit 102 determines that processing is finished (Yes in step S206) and ends the processing. Otherwise (No in step S206), the TV conference device 101 continues with the video/audio input process (step S201) and the data reception process (step S202).
[0089] This concludes the description of the processing executed by the TV conference device 101 of this embodiment.
[0090] As described above, in this embodiment, the low-delay priority determination unit 105 determines the low-delay priority, the delay amount determination unit 106 determines the delay amount using the low-delay priority, and the parameter control unit 107 changes the video/audio encoding parameter or the transmission parameter according to the delay amount.
[0091] This makes it possible to determine the low-delay priority dynamically during a TV conference and to control the delay amount by changing the encoding parameter or the transmission parameter. The delay amount can likewise be controlled by determining the low-delay priority dynamically during a TV conference and changing only the transmission parameter.
[0092] In this embodiment, the low-delay priority determination unit 105 uses the frequency of transmission and reception of voiced audio data and sets the low-delay priority higher as that frequency increases.
[0093] As a result, the low-delay priority can be raised for a TV conference in which audio data is transmitted and received frequently, that is, in which an active discussion is taking place and low delay matters most.
[0094] In this embodiment, when setting the low-delay priority higher as the frequency of transmission and reception of voiced audio data increases, the low-delay priority determination unit 105 determines the low-delay priority by comparing that frequency with preset thresholds.
[0095] With this configuration, the low-delay priority can be determined by the simple process of comparing the transmission/reception frequency of audio data with preset thresholds.
[0096] Although in the present embodiment the low-delay priority determination unit 105 determines the low-delay priority from the transmission/reception frequency of audio data, it may instead use the transmission time of audio data and the reception time of voiced audio data, and set the low-delay priority higher as the difference between the transmission time and the reception time becomes smaller.
[0097] This makes it possible to reduce the delay amount when the difference between the transmission and reception times is small, that is, when the participants' utterances in the video conference overlap in time.
[0098] Likewise, although in the present embodiment the low-delay priority determination unit 105 determines the low-delay priority by comparing the frequency of transmission and reception of voiced audio data with a preset threshold, it may instead compare the difference between the transmission time and the reception time of audio data with a preset threshold, and set the low-delay priority higher as the difference becomes smaller.
[0099] In this way, the low-delay priority can be determined by simple threshold processing.
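The time-difference variant can be sketched in the same spirit. The function name `priority_from_time_gap`, the two-level result, and the 0.5-second default threshold are hypothetical; the source states only that a difference smaller than the threshold yields a higher priority than a difference at or above it.

```python
def priority_from_time_gap(tx_time: float, rx_time: float, gap_threshold: float = 0.5) -> int:
    """Return 1 (high priority) when the send/receive gap is below the threshold, else 0."""
    gap = abs(tx_time - rx_time)  # times in seconds on a common clock (assumed)
    return 1 if gap < gap_threshold else 0
```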
[0100] In the present embodiment, the delay amount determination unit 106 applies predetermined threshold processing to the determined low-delay priority and sets the delay amount to a smaller value as the priority becomes higher.
[0101] With this configuration, because the delay amount is determined from the low-delay priority by threshold processing, it can be determined by a simple process.
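One way to picture this threshold processing is a small lookup from priority to delay amount. The table `DELAY_TABLE_MS`, the default `DEFAULT_DELAY_MS`, and every millisecond value below are assumptions chosen for illustration; the source requires only that a higher priority give a smaller delay amount.

```python
# Hypothetical table: (minimum low-delay priority, delay amount in milliseconds).
DELAY_TABLE_MS = [(3, 100), (2, 200), (1, 400)]
DEFAULT_DELAY_MS = 800  # used when the priority is below every threshold


def delay_for_priority(priority: int) -> int:
    """Return a smaller delay amount for a higher low-delay priority."""
    for min_priority, delay_ms in DELAY_TABLE_MS:
        if priority >= min_priority:
            return delay_ms
    return DEFAULT_DELAY_MS
```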
[0102] In the present embodiment, the parameter control unit 107 reduces the maximum generated code amount of the video/audio encoding so that the delay does not exceed the determined delay amount.
[0103] The smaller the maximum generated code amount of the video/audio encoding, the more readily the maximum transmission delay can be kept at or below the delay amount, so low-delay transmission within the specified delay amount becomes possible.
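The relation between the delay amount and the maximum generated code amount can be illustrated under one simplifying assumption that is not stated in the source: the transmission delay of a coded unit is approximated by its size divided by the available bandwidth. The function `max_code_bits` and its parameters are hypothetical.

```python
def max_code_bits(delay_ms: int, bandwidth_bps: int) -> int:
    """Largest coded unit, in bits, whose transfer time at bandwidth_bps fits within delay_ms.

    Assumes transfer time = size / bandwidth, ignoring propagation and queueing delay.
    """
    return bandwidth_bps * delay_ms // 1000
```

For instance, on a 1 Mbit/s path a delay amount of 100 ms would cap each coded unit at 100,000 bits under this assumption, and halving the delay amount halves the cap.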
[0104] In the present embodiment, the parameter control unit 107 may also change the buffer capacity of the video/audio decoding so that the delay does not exceed the determined delay amount.
[0105] By dynamically reducing the buffer capacity set on the receiving side, the maximum reception-wait delay can be reduced, so the delay can be held at or below the specified delay value.
[0106] In the present embodiment, the parameter control unit 107 may also set the packet priority of the transmitted data higher as the determined delay amount becomes smaller.
[0107] Because data with a higher packet priority is forwarded earlier on the transmission path, the smaller the delay amount that has been set, the lower the delay of the resulting transmission.
[0108] (Embodiment 2)
This embodiment describes an example in which a delay control server connected via the transmission path centrally determines the delay amounts of a plurality of video conference devices.
[0109] FIG. 5 is a block diagram showing the configuration of a video conference system according to Embodiment 2 of the present invention. The video conference system of this embodiment comprises a plurality of video conference devices, each placed with a conference participant and communicating audio and video, and a delay control server 504 that determines and controls the delay amounts in the video conferences held with those devices. The video conference devices and the delay control server 504 are connected through a common transmission path 111. The video conference device 501 shown in the figure is one of the plurality of video conference devices connected to the transmission path 111.
[0110] The video conference device 501 shown in FIG. 5 comprises: a video/audio input unit 102, connected to a camera and a microphone, to which video and audio are input; a video/audio encoding unit 103 that encodes the video and audio; a transmission unit 104, connected to the transmission path 111, that transmits the encoded video/audio data; a low-delay priority determination unit 105 that determines the low-delay priority from the video/audio input; a transmission/reception unit 502 that sends the low-delay priority to the delay control server 504 via the transmission path and receives the delay amount; a parameter control unit 503 that changes the encoding parameters according to the delay amount; a reception unit 110, connected to the transmission path 111, that receives encoded video/audio data sent from the video conference terminal of the communication partner; a video/audio decoding unit 109 that decodes the encoded video/audio data; and a video/audio output unit 108, connected to a monitor and a speaker, that outputs the decoded video/audio data to the monitor and the speaker.
[0111] The delay control server 504 comprises a low-delay priority reception unit 505 that receives the low-delay priority through the transmission path, a delay amount determination unit 506 that determines the delay amount from the low-delay priority, and a delay amount transmission unit 507 that transmits the delay amount.
[0112] That is, in this embodiment, the delay amount determination unit 506 of the delay control server 504, together with the low-delay priority determination unit 105 and the parameter control unit 503 of the video conference device 501, corresponds to the video conference control device. In this example, communication between the delay control server 504 and the video conference device 501 is realized by the low-delay priority reception unit 505, the delay amount transmission unit 507, and the transmission/reception unit 502.
[0113] In FIG. 5, processing units whose operation is the same as in Embodiment 1 are given the same numbers as in FIG. 1, and their description is omitted. The processing units whose operation differs from Embodiment 1 are therefore the transmission/reception unit 502 and the parameter control unit 503 in the video conference device 501, and the delay control server 504.
[0114] The transmission/reception unit 502 sends the low-delay priority input from the low-delay priority determination unit 105 to the delay control server 504 through the transmission path 111.
[0115] The low-delay priority reception unit 505 receives the low-delay priority sent from the transmission/reception unit 502 through the transmission path 111 and outputs it to the delay amount determination unit 506. The delay amount determination unit 506 determines the delay amount from the low-delay priority input from the low-delay priority reception unit 505 and outputs the delay amount to the delay amount transmission unit 507. The delay amount transmission unit 507 sends the delay amount input from the delay amount determination unit 506 to the video conference device 501 through the transmission path 111.
[0116] FIG. 6 shows the relationship between the plurality of video conference devices and the delay control server according to Embodiment 2. For example, the six video conference devices shown in the figure hold three separate conferences in pairs, and the delay control server 607 controls the delay amounts of those devices. In FIG. 6, the video conference devices 601 to 606 have the same functions as the video conference device 501 of FIG. 5, and the delay control server 607 has the same functions as the delay control server 504 of FIG. 5.
[0117] Next, the operation of the video conference device 501 and the delay control server 504 configured as above is described with reference to the flowcharts of FIGS. 7 and 8.
[0118] The operations in the flowcharts of FIGS. 7 and 8 are stored as control programs in storage devices (not shown; for example, ROM or flash memory) of the video conference device 501 and the delay control server 504, and are executed under the control of a CPU (not shown).
[0119] FIG. 7 is a flowchart showing the processing executed by the video conference device 501 according to Embodiment 2. In FIG. 7, steps whose processing is the same as in Embodiment 1 are given the same numbers as in FIG. 2 and are not described again.
[0120] That is, in this embodiment, the video conference device 501 executes the video/audio input processing (step S201), the data reception processing (step S202), and the video/audio decoding and output processing (step S203) of Embodiment 1, and then executes the following processing.
[0121] <Step S701: Low-delay priority calculation processing>
The low-delay priority determination unit 105 calculates the low-delay priority from the transmission/reception frequency of audio data and outputs it to the transmission/reception unit 502 (step S701). Here, the low-delay priority determination unit 105 calculates the low-delay priority through the same processing as the audio data transmission/reception frequency calculation processing (step S301) and the low-delay priority calculation processing (step S302) of Embodiment 1.
[0122] <Step S702: Low-delay priority transmission processing>
Next, the transmission/reception unit 502 sends the low-delay priority input from the low-delay priority determination unit 105 to the delay control server 504 through the transmission path 111 (step S702).
[0123] <Step S703: Delay amount reception and parameter update processing>
Next, the transmission/reception unit 502 receives the delay amount sent from the delay control server 504 through the transmission path 111. The parameter control unit 503 then calculates the delay amount by executing the same processing as the delay amount calculation processing (step S303) and the parameter calculation/update processing (step S304) of Embodiment 1, determines the encoding parameters, and outputs them to the video/audio encoding unit 103. Here, the parameter control unit 503 may also determine transmission parameters based on the calculated delay amount and output them to the transmission unit 104.
[0124] The video conference device 501 then executes the video/audio encoding and transmission processing (step S205) and the end determination processing (step S206) of Embodiment 1, and ends the processing.
[0125] The processing executed in each of the video conference devices 601 to 606 shown in FIG. 6 is the same as the processing executed by the video conference device 501 described above with reference to FIG. 7.
[0126] Next, the operation of the delay control server 504 is described with reference to FIG. 8.
[0127] FIG. 8 is a flowchart showing the processing executed by the delay control server according to Embodiment 2. Here it is assumed that three video conference sessions are established as shown in FIG. 6, and the case where low-delay priorities are sent to the delay control server from all the video conference devices is described. That is, in FIG. 6, video conference devices 601 and 602, video conference devices 603 and 604, and video conference devices 605 and 606 are each holding a separate video conference, and each device sends its low-delay priority to the delay control server 607.
[0128] <Step S801: Low-delay priority reception processing>
First, the low-delay priority reception unit 505 receives the low-delay priority sent from the transmission/reception unit 502 of the video conference device 501 via the transmission path 111 and outputs it to the delay amount determination unit 506 (step S801).
[0129] <Step S802: Delay amount calculation processing>
Next, the delay amount determination unit 506 receives six low-delay priorities from the video conference terminals 601 to 606 of FIG. 6, determines an individual delay amount for each of the six video conference devices, and outputs the delay amounts to the delay amount transmission unit 507 (step S802).
[0130] [Equation 5]
$$\mathrm{Delay}(t, x) = D_{AVE} - \bigl(P(t, x) - P_{AVE}(t)\bigr) \times K \qquad \text{(Equation 5)}$$
[0131] Equation 5 shows an example of a formula for calculating the delay amount. In Equation 5, $\mathrm{Delay}(t, x)$ and $P(t, x)$ are the delay amount and the low-delay priority of video conference device $x$ at time $t$, $D_{AVE}$ is a predetermined average delay amount, $P_{AVE}(t)$ is the average low-delay priority over all video conference devices at time $t$, and $K$ is a predetermined delay adjustment parameter.
[0132] Time t is assumed to be synchronized across all terminals, and the delay amounts are calculated from low-delay priorities taken at the same time t.
[0133] In this way, among the plurality of video conference devices, the higher a device's low-delay priority relative to the other devices, the smaller its delay amount can be made.
[0134] Equation 5 is only one example of a method for calculating the delay amount; any method may be used as long as it yields a smaller delay amount for a higher low-delay priority.
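Equation 5 can be worked through for the six devices of FIG. 6 with a short Python sketch. The function name `delays_from_priorities`, the device labels used as dictionary keys, and the sample values of the priorities, the average delay, and K are assumptions for illustration; the equation places no lower bound on the result, so a real system would presumably need to keep the values within a valid range.

```python
from statistics import mean
from typing import Dict


def delays_from_priorities(priorities: Dict[str, float], d_ave: float, k: float) -> Dict[str, float]:
    """Apply Delay(t, x) = D_AVE - (P(t, x) - P_AVE(t)) * K to priorities sampled at the same time t."""
    p_ave = mean(list(priorities.values()))  # P_AVE(t): average priority over all devices
    return {device: d_ave - (p - p_ave) * k for device, p in priorities.items()}


# Hypothetical sample: three conferences (601-602, 603-604, 605-606), D_AVE = 200 ms, K = 50.
sample = {"601": 3, "602": 3, "603": 1, "604": 1, "605": 2, "606": 2}
delays = delays_from_priorities(sample, d_ave=200.0, k=50.0)
```

Because each priority is offset by the average priority, the resulting delay amounts average to the predetermined average delay: devices above the average priority receive less delay and devices below it receive more.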
[0135] <Step S803: Delay amount transmission processing>
Next, the delay amount transmission unit 507 sends the per-terminal delay amounts input from the delay amount determination unit 506 to the respective video conference devices 601 to 606 through the transmission path 111 (step S803).
[0136] As described above, in this embodiment the transmission/reception unit 502 sends the low-delay priority to, and receives the delay amount from, the delay control server 607, and the delay control server 607 uses the low-delay priorities of the plurality of video conference devices to determine the delay amounts so that a device with a relatively higher low-delay priority receives a smaller value.
[0137] Because the delay control server manages the delay amounts centrally, the low-delay priorities of the multiple video conferences are easy to grasp, and a video conference with a stronger low-delay requirement can be encoded or transmitted with lower delay.
[0138] When multiple video conferences share a common transmission path, bandwidth is used so that conferences in which conversation is exchanged frequently proceed smoothly. The limited bandwidth of the transmission path can therefore be used effectively.
[0139] Although in this embodiment the transmission/reception unit 502 sends the low-delay priority to the delay control server, it may instead send it to other video conference devices, so that the delay amount is determined within a video conference device rather than by the delay control server.
[0140] This removes the need for a delay control server: the delay amount is determined by comparison with other video conference devices, and a smaller delay amount can be set for a video conference that needs lower delay.
[0141] Although in this embodiment the transmission/reception unit 502 sends the low-delay priority to the delay control server, the transmission/reception frequency of audio data may instead be exchanged with other video conference devices in place of the low-delay priority. The low-delay priority determination unit 105 may then compare its own frequency with the transmission/reception frequencies of the other video conference devices received by the transmission/reception unit 502, and set the low-delay priority higher as the frequency is higher.
[0142] Because the low-delay priority is then determined by comparison with other video conferences, the conferences with the greater need for low delay among the multiple video conferences can be given lower delay.
[0143] Although in this embodiment the transmission/reception unit 502 sends the low-delay priority to the delay control server, the difference between the transmission and reception times of audio data may instead be exchanged with other video conference devices in place of the low-delay priority. The low-delay priority determination unit 105 may then compare its own difference with the differences of the other video conference devices received by the transmission/reception unit 502, and set the delay amount smaller as the difference is smaller.
[0144] Because the difference between the transmission time and the reception time is then compared with that of other video conferences when the low-delay priority is determined, a video conference with a stronger low-delay requirement can be given lower delay.
Industrial Applicability
[0145] The video conference device according to the present invention uses the frequency of transmission and reception of audio data, determines the delay amount so that it becomes smaller as the frequency becomes higher, and dynamically controls the encoding or transmission parameters according to the determined delay amount. The delay amount is thus made smaller in conference states where audio data is exchanged frequently and low delay is more necessary, and larger in conference states where audio data is exchanged infrequently and low delay is not needed, so that an optimal delay amount can be set on a bandwidth-limited transmission path. The invention is particularly useful in video conference systems that require low-delay transmission of video and audio over the best-effort Internet.

Claims

[1] A video conference control device that controls a video conference device communicating video data and audio data through a transmission path, comprising:
low-delay priority determination means for determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency at which transmission and reception of voiced audio data detected by the video conference device switch becomes higher;
delay amount determination means for determining a smaller delay amount as the low-delay priority becomes higher; and
parameter control means for determining, according to the determined delay amount, an encoding parameter or a packet priority used by the video conference device.
[2] The video conference control device according to claim 1, wherein the low-delay priority determination means measures the frequency as the number of times, within a fixed period, that transmission and reception of voiced audio data detected by the video conference device switch.
[3] The video conference device according to claim 2, wherein the low-delay priority determination means compares the frequency with a threshold and, when the frequency is greater than the threshold, determines a higher low-delay priority than when the frequency is equal to or less than the threshold.
[4] The video conference control device according to claim 1, wherein the low-delay priority determination means measures the frequency by the difference between the transmission time and the reception time of audio data detected by the video conference device, and determines a higher low-delay priority as the difference between the transmission time and the reception time becomes smaller.
[5] The video conference device according to claim 4, wherein the low-delay priority determination means compares the difference between the transmission time and the reception time with a threshold and, when the difference is smaller than the threshold, determines a higher low-delay priority than when the difference is equal to or greater than the threshold.
[6] The video conference control device according to claim 1, wherein the parameter control means determines, as the parameter of the compression code amount, a reception buffer capacity for decoding the video data and audio data in a video conference device that receives the video data and audio data.
[7] The video conference control device according to claim 1, wherein the parameter control means determines, as the parameter of the compression code amount, a maximum generated code amount for the encoding of the video data and audio data in the video conference device.
[8] A video conference control device that controls a plurality of video conference devices communicating video data and audio data of a plurality of conferences through a common transmission path, comprising:
low-delay priority determination means for determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency at which transmission and reception of voiced audio data detected by each of the video conference devices switch becomes higher;
delay amount determination means for determining, as the low-delay priority becomes higher, a smaller delay amount for the group of video conference devices used in each conference; and
parameter control means for determining, according to the determined delay amount, an encoding parameter or a packet priority used by the video conference devices.
[9] A video conference control method for controlling a video conference device that communicates video data and audio data through a transmission path, comprising:
a low-delay priority determination step of determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency at which transmission and reception of voiced audio data detected by the video conference device switch becomes higher;
a delay amount determination step of determining a smaller delay amount as the low-delay priority becomes higher; and
a parameter control step of determining, according to the determined delay amount, an encoding parameter or a packet priority used by the video conference device.
[10] An integrated circuit that controls a video conference device communicating video data and audio data through a transmission path, comprising:
low-delay priority determination means for determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency at which transmission and reception of voiced audio data detected by the video conference device switch becomes higher;
delay amount determination means for determining a smaller delay amount as the low-delay priority becomes higher; and
parameter control means for determining, according to the determined delay amount, an encoding parameter or a packet priority used by the video conference device.
[11] A TV conference device that communicates video data and audio data through a transmission path, the TV conference device comprising:
input means to which video data and audio data containing video and audio are input;
encoding means for encoding the input video data and audio data;
transmitting means for transmitting the encoded video data and audio data to another TV conference device through the transmission path;
receiving means for receiving encoded video data and audio data from another TV conference device through the transmission path;
decoding means for decoding the received video data and audio data;
output means for outputting the video and audio contained in the decoded video data and audio data;
low-delay priority determination means for detecting the voiced portion of the audio data transmitted by the transmitting means and the voiced portion of the audio data received by the receiving means, and for determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency of switching between transmission and reception of audio data containing a detected voiced portion is higher;
delay amount determination means for determining a smaller delay amount as the low-delay priority is higher; and
parameter control means for determining, according to the determined delay amount, an encoding parameter used by the encoding means or a packet priority included in the data transmitted by the transmitting means.
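The detection described in this device claim, finding voiced portions in the transmitted and received audio and measuring how often voiced audio switches between the two directions, might be sketched as follows. The energy-threshold voice detector, the handling of silence and double-talk, and every name and constant here are assumptions; the claim does not specify a detection technique.

```python
from typing import Optional, Sequence


def is_voiced(frame: Sequence[float], threshold: float = 0.01) -> bool:
    """Assumed energy-based detector: mean squared amplitude above a threshold."""
    if not frame:
        return False
    energy = sum(sample * sample for sample in frame) / len(frame)
    return energy > threshold


def count_direction_switches(sent_voiced: Sequence[bool],
                             received_voiced: Sequence[bool]) -> int:
    """Count how often voiced audio flips between sending and receiving.

    Frames where neither side or both sides are voiced are skipped, so
    they do not change the current direction (an assumed policy).
    """
    switches = 0
    last: Optional[str] = None
    for sent, received in zip(sent_voiced, received_voiced):
        if sent and not received:
            current = "send"
        elif received and not sent:
            current = "receive"
        else:
            continue
        if last is not None and current != last:
            switches += 1
        last = current
    return switches


def switches_per_minute(switches: int, frame_ms: float, n_frames: int) -> float:
    """Normalize a switch count to a per-minute frequency."""
    duration_min = n_frames * frame_ms / 60000.0
    return switches / duration_min if duration_min > 0 else 0.0


# Local talker, a pause, remote talker, then the local talker again.
sent = [True, True, False, False, False, True]
received = [False, False, False, True, True, False]
print(count_direction_switches(sent, received))
```

The resulting per-minute frequency is the quantity from which a low-delay priority would then be derived.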
[12] A TV conference system comprising a plurality of TV conference devices that communicate video data and audio data of a plurality of conferences through a common transmission path, and a TV conference control device that controls the plurality of TV conference devices, wherein
each of the TV conference devices comprises:
input means for inputting video data and audio data containing video and audio;
encoding means for encoding the input video data and audio data;
transmitting means for transmitting the encoded video data and audio data to the other TV conference devices through the transmission path;
receiving means for receiving encoded video data and audio data from the other TV conference devices;
decoding means for decoding the received video data and audio data; and
output means for outputting the video and audio contained in the decoded video data and audio data, and
the TV conference control device comprises:
low-delay priority determination means for determining a low-delay priority, which indicates the degree to which delay of the video data and audio data should be suppressed, to be higher as the frequency of switching between transmission and reception of voiced audio data detected by each of the TV conference devices is higher;
delay amount determination means for determining, as the low-delay priority is higher, a smaller delay amount for the group of TV conference devices used for each conference; and
parameter control means for determining, according to the determined delay amount, an encoding parameter or a packet priority used by the TV conference devices.
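For the system claim, where several conferences share one transmission path, the per-conference delay determination could look like the sketch below. Ranking conferences relative to each other and interpolating linearly between a minimum and maximum delay is an assumed policy, and the conference and device identifiers are hypothetical; the claim only requires that a conference with a higher low-delay priority gets a smaller delay amount for its device group.

```python
from typing import Dict, List


def delays_per_conference(priorities: Dict[str, float],
                          min_delay_ms: float = 100.0,
                          max_delay_ms: float = 500.0) -> Dict[str, float]:
    """Assign each conference a delay amount from its relative priority.

    The conference with the highest priority receives min_delay_ms and the
    one with the lowest receives max_delay_ms. If all priorities are
    equal, every conference receives the midpoint.
    """
    if not priorities:
        return {}
    lowest = min(priorities.values())
    span = max(priorities.values()) - lowest
    delays = {}
    for conference, priority in priorities.items():
        relative = (priority - lowest) / span if span > 0 else 0.5
        delays[conference] = max_delay_ms - relative * (max_delay_ms - min_delay_ms)
    return delays


def delays_per_device(groups: Dict[str, List[str]],
                      conference_delays: Dict[str, float]) -> Dict[str, float]:
    """Apply a conference's delay amount to every device in its group."""
    return {device: conference_delays[conference]
            for conference, devices in groups.items()
            for device in devices}


# Hypothetical conferences sharing one transmission path.
priorities = {"discussion": 1.0, "lecture": 0.0, "review": 0.5}
groups = {"discussion": ["d1", "d2"], "lecture": ["l1"], "review": ["r1", "r2"]}
conference_delays = delays_per_conference(priorities)
print(delays_per_device(groups, conference_delays))
```

Each device's delay amount would then feed the parameter control means that selects its encoding parameter or packet priority.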
PCT/JP2006/326033 2006-01-12 2006-12-27 Teleconference control device and teleconference control method WO2007080788A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2006-004520 2006-01-12
JP2006004520A JP2009076952A (en) 2006-01-12 2006-01-12 Tv conference apparatus and method

Publications (1)

Publication Number Publication Date
WO2007080788A1 (en) 2007-07-19

Family

ID=38256199

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2006/326033 WO2007080788A1 (en) 2006-01-12 2006-12-27 Teleconference control device and teleconference control method

Country Status (2)

Country Link
JP (1) JP2009076952A (en)
WO (1) WO2007080788A1 (en)

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009089156A (en) * 2007-10-01 2009-04-23 Yamaha Corp Distribution system and method
JP2009089157A (en) * 2007-10-01 2009-04-23 Yamaha Corp Distribution system and method
JP2012253823A (en) * 2012-09-24 2012-12-20 Yamaha Corp Distribution system, distribution method, distribution server and communication terminal

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP5434390B2 (en) * 2009-09-01 2014-03-05 株式会社リコー Electronic conference system, multipoint connection device, data communication method, program, recording medium, and communication device
JP5987915B2 (en) * 2012-11-12 2016-09-07 日本電気株式会社 Communication relay device, communication relay system, communication relay method, and communication relay program
JP2014212407A (en) * 2013-04-18 2014-11-13 富士通株式会社 Transmission device and path switching method
EP2945393A4 (en) * 2014-01-20 2016-06-22 Panasonic Ip Man Co Ltd Reproduction device and data reproduction method
WO2020095728A1 (en) 2018-11-06 2020-05-14 ソニー株式会社 Information processing device and information processing method

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH09163333A (en) * 1995-12-06 1997-06-20 Nec Corp Voice delay controller
JPH10191245A (en) * 1996-12-24 1998-07-21 Fuji Xerox Co Ltd Information accumulation device
JP2005318535A (en) * 2004-03-19 2005-11-10 Marconi Intellectual Property (Ringfence) Inc Method an apparatus for holding conference by controlling bandwidth

Also Published As

Publication number Publication date
JP2009076952A (en) 2009-04-09

Similar Documents

Publication Publication Date Title
US10027818B2 (en) Seamless codec switching
JP4367657B2 (en) Voice communication method and apparatus
TWI439086B (en) Jitter buffer adjustment
US7817625B2 (en) Method of transmitting data in a communication system
US7817557B2 (en) Method and system for buffering audio/video data
JP4661373B2 (en) Transmission device and transmission program for controlling discard of specific media data
WO2007080788A1 (en) Teleconference control device and teleconference control method
JP5442771B2 (en) Data transmission method in communication system
US20100118114A1 (en) Video rate adaptation for congestion control
WO2006054442A1 (en) Transmitting apparatus, receiving apparatus and communication system
KR20180031016A (en) Downside of the transmitter side video phone
WO2012075951A1 (en) Method and device for adjusting bandwidth in conference place, conference terminal and media control server
KR20060111036A (en) Method providing service of an image telephone call in mobile terminal considering situation of a weak current
CN113242436B (en) Live broadcast data processing method and device and electronic equipment
JPWO2005039180A1 (en) Media signal transmission method and reception method, and transmission / reception method and apparatus
US8438016B2 (en) Silence-based adaptive real-time voice and video transmission methods and system
JP3707369B2 (en) Video phone equipment
US7697553B2 (en) Method for managing variation in a data flow rate
JP2012151555A (en) Television conference system, television conference relay device, television conference relay method and relay program
CN108206925A (en) Implementation method, device and the mostly logical terminal of multi-channel video call
JP4496755B2 (en) COMMUNICATION PROCESSING DEVICE, COMMUNICATION PROCESSING METHOD, AND COMPUTER PROGRAM
KR102109607B1 (en) System for reducing delay of transmission and reception in communication network, and apparatus thereof
JP4050961B2 (en) Packet-type voice communication terminal
CN108353035B (en) Method and apparatus for multiplexing data
WO2007031924A2 (en) Video telephone system, video telephone terminal, and method for video telephoning

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 06843415

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP