WO2012043290A1 - Video call system, main-side terminal, and subordinate-side terminals - Google Patents


Info

Publication number
WO2012043290A1
Authority
WO
WIPO (PCT)
Prior art keywords
terminal
call
video
slave
video data
Application number
PCT/JP2011/071306
Other languages
French (fr)
Japanese (ja)
Inventor
和田 真
田中 克明
大野 博
潤 山形
哲夫 楠
Original Assignee
コクヨ株式会社
Application filed by コクヨ株式会社
Publication of WO2012043290A1

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00 Television systems
    • H04N7/14 Systems for two-way working
    • H04N7/15 Conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/1066 Session management
    • H04L65/1069 Session establishment or de-establishment
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L65/00 Network arrangements, protocols or services for supporting real-time applications in data packet communication
    • H04L65/80 Responding to QoS
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M3/00 Automatic or semi-automatic exchanges
    • H04M3/42 Systems providing special services or facilities to subscribers
    • H04M3/56 Arrangements for connecting several subscribers to a common circuit, i.e. affording conference facilities
    • H04M3/567 Multimedia conference systems
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04M TELEPHONIC COMMUNICATION
    • H04M7/00 Arrangements for interconnection between switching centres
    • H04M7/006 Networks other than PSTN/ISDN providing telephone service, e.g. Voice over Internet Protocol (VoIP), including next generation networks with a packet-switched transport layer

Definitions

  • The present invention relates to a system for realizing a video call function in which three or more parties can participate.
  • Video call systems that realize video lectures or video conferences between multiple points via telecommunications lines are well known.
  • It is effective to install a guaranteed (non-best-effort) dedicated line with a sufficiently high communication speed, or to use an MCU (Multi-point Control Unit), a high-performance relay device, between the bases.
  • MCU: Multi-point Control Unit
  • IP: Internet Protocol
  • An off-the-shelf system automatically optimizes the transfer rate of the video data to be transmitted in response to the congestion status of the line, which varies with the day or time.
  • However, optimizing the transfer rate of the video data takes a certain amount of time, which may cause problems such as a wait before the video sent from the other party appears on the screen or, even once it is displayed, unstable video.
  • The main object of the present invention is to stabilize the video display as much as possible immediately after the start of a video call with three or more participants.
  • The present invention is a system for realizing a video call function for transmitting and receiving video between terminals installed at each of three or more bases, in which any one terminal is a master terminal and the remaining terminals are slave terminals.
  • The master terminal comprises: a maximum rate calculation unit that, when a video call starts, refers for each terminal to the transfer rate of data that can be transmitted via the communication line connected to that terminal (uplink rate) or the transfer rate of data that can be received (downlink rate), and calculates the maximum transfer rate at which each terminal should transmit to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals; a maximum rate specification information transmission unit that transmits maximum rate specification information describing the calculated maximum transfer rate to each of the slave terminals; a call transmission unit that generates video data at a rate equal to or less than the calculated maximum transfer rate from video captured by the camera at the master terminal's own installation site and transmits the video data to each of the slave terminals; a call reception unit that receives the video data transmitted from each of the slave terminals; and a call output control unit that displays the received video data on the display screen.
  • The slave terminal comprises: a maximum rate specification information receiving unit that receives the maximum rate specification information transmitted from the master terminal at the start of the video call; a call transmission unit that generates video data at a rate equal to or less than the specified maximum transfer rate and transmits it to each of the master terminal and the other slave terminals; and a call reception unit that receives the video data transmitted from each of the master terminal and the other slave terminals.
  • The maximum rate calculation unit of the master terminal basically calculates the maximum transfer rate by dividing the transfer rate by the number of slave terminals.
  • The system may also be one that can switch arbitrarily between a group view mode, in which video data is transmitted from every terminal to each of the other terminals and every terminal displays the video of all bases other than its own, and a single view mode, in which video data is transmitted from the master terminal to each of the slave terminals and the slave terminals display the video of the installation base of the master terminal.
  • In that case, the master terminal comprises an operation input receiving unit that receives an operation input indicating that the group view mode and the single view mode should be switched, and a command transmission unit that, in response to the received operation input, transmits to each of the slave terminals a switching command, which is a command to switch between the group view mode and the single view mode. In the single view mode, the call transmission unit of the master terminal generates video data with a higher transfer rate than in the group view mode, without being restricted by the maximum transfer rate, and transmits it to each of the slave terminals.
  • The slave terminal further includes a command receiving unit that receives the switching command transmitted from the master terminal. In the single view mode, its call transmission unit transmits video data only to the master terminal, and its call reception unit receives video data only from the master terminal.
  • In reality, in video lectures and video conferences, not all participants need the video of all other participants at all times. In the single view mode, therefore, transmission and reception of video between the slave terminals is stopped, and higher-quality video at an increased transfer rate can be distributed from the master terminal to each slave terminal.
  • In the single view mode, it is also preferable to be able to change which terminal sends high-quality video to the other terminals, that is, which subject attracts the attention of others in the video call. Specifically, the system can switch between a group view mode, in which video data is transmitted from every terminal to each of the other terminals and every terminal displays the video of all bases other than its own, and a single view mode, in which video data is transmitted from any one selected terminal to each of the other terminals and the other terminals display the video of the installation base of that selected terminal.
  • For this purpose, the master terminal comprises an operation input receiving unit that accepts an operation input for switching between the group view mode and the single view mode and an operation input for selecting any one terminal in the single view mode, and a command transmission unit that, in response to the accepted operation input, transmits to each of the slave terminals a switching command, which is a command to switch between the group view mode and the single view mode or a command indicating which terminal is selected in the single view mode. When the single view mode is active and the master terminal itself is selected, the call transmission unit of the master terminal generates video data with a higher transfer rate than in other cases, without being restricted by the maximum transfer rate, and transmits it to each of the slave terminals; when a slave terminal is selected, it transmits video data only to the selected terminal. When the single view mode is active and a slave terminal is selected, the call reception unit of the master terminal receives video data only from the selected terminal.
  • The slave terminal further includes a command receiving unit that receives the switching command transmitted from the master terminal. When the single view mode is active and the slave terminal itself is selected, its call transmission unit generates video data with a higher transfer rate than in other cases, without being restricted by the maximum transfer rate, and transmits it to each of the master terminal and the other slave terminals; when the master terminal or another slave terminal is selected, it transmits video data only to the selected terminal, and its call reception unit receives video data only from the selected terminal. With such a system, a terminal other than the master terminal can be selected in the single view mode.
  • Furthermore, when the single view mode is active and a slave terminal is selected, the operation input receiving unit of the master terminal accepts an operation input for operating the pan, tilt, or zoom of the camera associated with that slave terminal, and the command transmission unit of the master terminal transmits, in response to the received operation input, a camera operation command for operating the pan, tilt, or zoom of the camera to the selected slave terminal.
  • If the command receiving unit of the slave terminal receives the camera operation command transmitted from the master terminal when the slave terminal itself is selected in the single view mode, then, in a situation where high-definition video is being distributed from a slave terminal to the master terminal and the other slave terminals, the user of the master terminal can remotely control the camera of that slave terminal as necessary.
  • The master terminal and the slave terminal can be built to a common design, which contributes to a reduction in the production cost of the terminal.
  • The video display can be stabilized as much as possible immediately after the start of a video call with three or more participants.
  • the figure which shows the hardware resource which the terminal in the embodiment comprises. Functional block diagram of the terminal. Functional block diagram of the terminal. The figure explaining the content of the group view mode in this embodiment. The figure explaining the content of the single view mode in this embodiment. The figure explaining the content of the single view mode in this embodiment. The figure which illustrates the information memorize
  • The system is constructed by connecting the terminals 1 distributed among the bases to each other via a telecommunication line 2 such as a public network represented by the Internet, a WAN (Wide Area Network), or a LAN (Local Area Network). At least one of the terminals 1 is the master terminal 11, and the remaining plurality are slave terminals 12.
  • The terminal 1 includes hardware resources such as a processor 1a, a main memory 1b, an auxiliary storage device 1c, an operation input device 1d, a microphone 1e, a speaker 1f, an audio codec 1g, a camera 1h, a display 1i, a video codec 1j, and a communication interface 1k, which are controlled by a controller (system controller, I/O controller, etc.) 1l so as to operate cooperatively.
  • The auxiliary storage device 1c is a hard disk drive, a flash memory, an optical disk drive, or the like.
  • The operation input device 1d is a push button that can be operated with a finger, or a pointing device such as a touch panel or a track pad.
  • A remote controller capable of wireless communication with the terminal 1 is assumed.
  • The audio codec 1g encodes the sound picked up by the microphone 1e (picking up here means collecting on-site sound, including recording) and decodes encoded audio information for output from the speaker 1f.
  • The video codec 1j encodes video captured by the camera 1h and decodes encoded video information for display on the screen of the display 1i.
  • Each of the audio codec 1g and the video codec 1j can be implemented as software instead of hardware.
  • The camera 1h may be built into the terminal 1 or externally attached to it. If it is externally attached, a general-purpose camera distributed on the market can be used.
  • It is preferable that the camera 1h can pan (change the angle of the optical axis in the horizontal direction), tilt (change the angle of the optical axis in the vertical direction), and/or zoom (change the focal length) on receiving a control signal from the terminal 1 in which it is mounted or to which it is directly connected.
  • The communication interface 1k is a device for performing information communication via the telecommunication line 2 and is typified by a NIC (Network Interface Card) or a wireless LAN transceiver, but may also be USB (Universal Serial Bus), IEEE 1394, etc.
  • NIC: Network Interface Card
  • USB: Universal Serial Bus
  • The terminal 1 is basically a dedicated device manufactured for the purpose of constructing this system. However, this does not preclude configuring the terminal 1 according to the present invention by installing the necessary program on a general-purpose personal computer, workstation, video game machine, or the like.
  • The program to be executed by the processor 1a is stored in the auxiliary storage device 1c.
  • The program is read from the auxiliary storage device 1c into the main memory 1b and decoded by the processor 1a.
  • The terminal 1 operates its hardware resources according to the program and performs the function of the master terminal 11 shown in FIG. 3 and the function of the slave terminal 12 shown in FIG.
  • The functional units of the master terminal 11 include a setting information storage unit 111, a call request transmission unit 112, a maximum rate calculation unit 116, a maximum rate designation information transmission unit 117, a call reception unit 113, a call output control unit 114, a call transmission unit 115, an operation input reception unit 118, and a command transmission unit 119.
  • The functional units of the slave terminal 12 include a setting information storage unit 124, a call request receiving unit 121, a maximum rate designation information receiving unit 125, a call receiving unit 122, a call output control unit 123, a call transmitting unit 127, and a command receiving unit 126.
  • Prior to the description of each functional unit, the video call modes realized by the video call system of this embodiment will be described.
  • The system can switch as appropriate between a “group view mode”, in which every base views the video of all other bases at the same time, and a “single view mode”, in which only one selected base views the video of all other bases at the same time while the other bases view only the video of that one base.
  • The terminals 1 installed at the sites participating in the video call establish session connections with one another individually.
  • When the terminal 1 at one base is the master terminal 11 and the terminals 1 at the remaining four bases are slave terminals 12, each of the terminals 11 and 12 individually establishes a session connection with each of the other four terminals 11 and 12.
  • In the group view mode, the terminals 11 and 12 installed at the sites participating in the video call transmit video captured at their own sites to all other terminals 11 and 12 (indicated by broken lines in the figure).
  • That is, each terminal 11, 12 sends video to each of the other four terminals 11, 12 and receives video from each of them.
  • These videos are arranged and displayed on the screen of the display 1i.
  • The transfer rate of the video transmitted from the terminals 11 and 12 is kept at or below the maximum transfer rate described later.
  • The system is in the group view mode immediately after the start of the video call.
  • In the single view mode, one terminal 11 of the terminals 11 and 12 installed at the sites participating in the video call transmits video captured at its own site to all other terminals 12 (indicated by solid lines in the figure).
  • Each of the other terminals 12 transmits video captured at its installation site to the one terminal 11 (indicated by broken lines in the figure).
  • The other terminals 12 do not transmit or receive video to or from each other.
  • The transfer rate of the video transmitted from the one terminal 11 to each of the other terminals 12 is allowed to be larger than that of the video transmitted in the group view mode.
  • The video transmitted in the group view mode has a relatively small frame size (definition), for example 960 × 540 pixels or 720 × 480 pixels, whereas the video transmitted from the one terminal 11 in the single view mode has a relatively large frame size, for example 1920 × 1080 pixels.
  • Each of the other terminals 12 receives the high-definition video from the one terminal 11 and displays it on substantially the entire screen of the display 1i.
  • The other terminals 12 in the single view mode each send low-quality video to the one terminal 11.
  • The one terminal 11 that has received the video from each of the other terminals 12 arranges and displays it on the screen of the display 1i, as in the group view mode.
  • In the above, the master terminal 11 is the subject that attracts the attention of others in the single view mode.
  • Alternatively, any one of the slave terminals 12 can be selected in the single view mode as the subject that attracts the attention of others, in which case high-definition video is transmitted from that terminal 12 to each of the other terminals 11 and 12 (indicated by solid lines in the figure), and low-quality video is transmitted from each of them to that terminal 12. In this situation, the user of the master terminal 11 can remotely operate the pan, tilt, or zoom of the camera 1h of that slave terminal 12.
  • The terminals 11 and 12 always transmit the audio collected at their installation bases to all other terminals 11 and 12.
  • This is because the bit rate of audio data is an order of magnitude smaller than that of video data, so transmitting it from all terminals 11 and 12 participating in a lecture or a meeting to all other terminals 11 and 12 is unlikely to cause a shortage of communication bandwidth.
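The routing of video in the two modes described above can be summarized in a short sketch. This is only an illustration under stated assumptions: the function name `video_send_plan`, the terminal identifiers, and the quality labels "capped" (at or below the maximum transfer rate) and "high" (not restricted by it) are hypothetical and do not appear in the publication.

```python
from enum import Enum


class Mode(Enum):
    GROUP = "group"
    SINGLE = "single"


def video_send_plan(sender, participants, mode, selected=None):
    """Return {destination: quality} for the video that `sender` transmits.

    "capped" means at or below the maximum transfer rate; "high" means
    not restricted by it. Both labels are illustrative assumptions.
    """
    others = [t for t in participants if t != sender]
    if mode is Mode.GROUP:
        # Group view: every terminal sends capped video to every other terminal.
        return {t: "capped" for t in others}
    if selected is None:
        raise ValueError("single view mode needs a selected terminal")
    if sender == selected:
        # The selected terminal distributes high-definition video to all others.
        return {t: "high" for t in others}
    # Non-selected terminals send low-quality video only to the selected one.
    return {selected: "capped"}
```

Audio is not modeled here, since the text states that audio is always sent from every terminal to every other terminal regardless of mode.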
  • The setting information storage unit 111 stores various setting information using the storage area of the main memory 1b or the auxiliary storage device 1c.
  • The setting information includes, first, telephone book (or address book) information.
  • The phone book information associates identification information of other terminals 1 (for example, a maximum of 100 entries) that can be candidates for a video call partner with attribute information related to each terminal 1.
  • FIG. 8 illustrates phone book information.
  • The setting information storage unit 111 stores, for each of the other terminals 1, identification information and attribute information in association with an ID number or the like.
  • The identification information is information for identifying a communication destination, for example an IP address, a SIP (Session Initiation Protocol) address, or another URI.
  • The attribute information is information about the destination, such as the other party's personal or organizational name, affiliation, and location, and is character string information in this embodiment.
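As a rough picture of the phone book information described above, the following sketch keys each entry by an ID number and holds identification and attribute strings. The class and field names are hypothetical, and the limit of 100 entries is only the example figure given in the text.

```python
from dataclasses import dataclass, field


@dataclass
class PhoneBookEntry:
    id_number: int
    identification: str  # IP address, SIP address, or other URI
    attributes: dict = field(default_factory=dict)  # name, affiliation, location (strings)


class PhoneBook:
    MAX_ENTRIES = 100  # example capacity mentioned in the text, not a fixed requirement

    def __init__(self):
        self._entries = {}

    def add(self, entry):
        # Updating an existing ID is allowed; adding a new one respects the capacity.
        is_new = entry.id_number not in self._entries
        if is_new and len(self._entries) >= self.MAX_ENTRIES:
            raise ValueError("phone book is full")
        self._entries[entry.id_number] = entry

    def lookup(self, id_number):
        return self._entries[id_number]
```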
  • The setting information also includes allowable bandwidth information, which indicates the allowable transfer rate of data that can be transmitted via the communication line 2 connected to each terminal 1 and/or the allowable transfer rate of data that can be received.
  • The allowable transfer rate can be said to express either the communication speed quality, that is, the bit rate at which data can be transmitted and received during a video call over the communication line 2 to which the terminal 1 is connected, or a communication speed limit that takes into account the balance with other services (or other parties sharing the same line 2), that is, the extent to which the video call is allowed to occupy the communication bandwidth.
  • The size of the allowable transfer rate varies depending on the type of the communication line 2 to which the terminal 1 is connected, the number of sharers of the line 2, the location of the terminal 1, the date or time zone, and various other circumstances.
  • The allowable bandwidth information may be set and input in advance by the user or administrator of the terminal 1 or of the system.
  • In that case, the allowable bandwidth information related to the terminal 1 itself is stored in the setting information storage unit 111 as part of the setting information. Further, when the allowable bandwidth information for another terminal 1 has already been acquired, it is also stored in the setting information storage unit 111 in association with the ID number or the like identifying that terminal 1.
  • However, the allowable bandwidth information need not necessarily be set and input in advance and stored in the setting information storage unit 111.
  • The above setting information may be input manually and directly, or may be transmitted from another terminal 1 or a computer connected via the telecommunication line 2.
  • The terminal 1 accepts manual input of setting information via the operation input device 1d, or receives it using the function of the communication interface 1k, and stores it in the setting information storage unit 111.
  • Identification information and attribute information relating to other terminals 1 with which a video call was made in the past may also be kept in the telephone directory as a call history, together with information indicating whether the call was outgoing (made from this terminal) or incoming (received from the other party), a time stamp indicating the date and time of the video call with that terminal 1, and the like.
  • When holding a video call, that is, a video lecture or a video conference, the call request transmitting unit 112 sends a call request for performing a video call to the terminals 1 that are to participate in the lecture or conference, using the function of the communication interface 1k. In principle, the call request is transmitted from the master terminal 11 to the slave terminals 12. Of course, this does not preclude transmitting a call request from a slave terminal 12 to the master terminal 11.
  • The master terminal 11 displays on the screen of the display 1i a list of other terminals 1 that can be candidates for video call destinations, that is, candidates for participation in lectures or meetings.
  • FIG. 9 shows a candidate list display example.
  • The terminals 1 displayed first as candidates are those that have participated in a lecture or a meeting in the past (whose history information remains in the setting information storage unit 111).
  • The master terminal 11 reads the identification information of each of the candidate terminals 1 from the setting information storage unit 111. Then, based on the identification information, it transmits a live icon request, which is information requesting the terminal 1 to transmit a live icon (symbol B in the figure).
  • The live icon is a real-time video or still image taken at the place where the terminal 1 is installed.
  • The size of the live icon (for example, 192 × 108 pixels) is smaller than the size of the video exchanged during the video call.
  • The live icon frame rate is about 5 fps or more.
  • The terminals 1 are interconnected in a peer-to-peer manner via the telecommunication line 2.
  • When the identification information does not require address resolution, as with an IP address, a live icon request can be transmitted immediately to the terminal 1 using the identification information.
  • When the identification information requires address resolution, as with a SIP address, an invitation message is transmitted using the identification information to another computer (for example, a SIP proxy server), a session is established with the destination terminal 12, and the live icon request is then sent.
  • Alternatively, the identification information is transmitted to another computer having an address resolution capability (for example, a DNS (Domain Name System) server), the IP address corresponding to the identification information is received from that computer, and a live icon request is transmitted to the terminal 1 using that IP address.
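The three cases above (no resolution needed, SIP session establishment, and resolution by another computer such as a DNS server) can be told apart from the form of the identification information. The sketch below only classifies the string; it does not contact any server. The function name and the route labels are hypothetical, and treating every non-IP, non-SIP identifier as a name to be resolved is a simplifying assumption.

```python
import ipaddress


def resolution_route(identification):
    """Classify identification info as 'direct', 'sip', or 'dns' (labels are assumptions)."""
    try:
        # A literal IPv4 or IPv6 address needs no address resolution.
        ipaddress.ip_address(identification)
        return "direct"
    except ValueError:
        pass
    if identification.lower().startswith(("sip:", "sips:")):
        # SIP addresses go through an invitation to, for example, a SIP proxy server.
        return "sip"
    # Anything else is assumed to need lookup by a resolver such as a DNS server.
    return "dns"
```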
  • The terminal 1 that has received the live icon request normally transmits the live icon captured by its camera 1h to the terminal 1 that sent the live icon request.
  • The master terminal 11 receives the live icon returned from each terminal 1 and displays it on the screen of the display 1i together with the attribute information (symbol A in the figure) of that terminal. For a terminal 1 from which no live icon has been received, the live icon cannot be displayed, so an image or a character string indicating that fact is displayed instead. In the example shown in FIG. 9, the display “Connecting ...” (symbol C) indicates that a live icon request has been transmitted to the terminal 1 and a response is awaited.
  • The display “communication impossible” indicates that no response was returned within a certain time after transmitting a live icon request to the terminal 1, that is, that a session with the terminal 1 cannot be established for some reason (the terminal 1 is powered off, the communication line to which the terminal 1 is connected is interrupted, etc.).
  • The display “not displayable” indicates that a live icon request was transmitted to the terminal 1 but was rejected; in other words, a response stating that the live icon will not be transmitted was sent instead of a live icon.
  • The display “busy” (symbol F in the figure) indicates that a live icon request was transmitted to the terminal 1, but a response stating that a video call is already in progress with another terminal 1 was sent instead of a live icon.
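The candidate-list states above map naturally onto a small state function. In this sketch the response labels ("icon", "rejected", "busy"), the function name, and the 10-second default timeout are assumptions made for illustration; the text only says that a response is not returned "until a certain time has elapsed".

```python
from enum import Enum


class IconStatus(Enum):
    LIVE = "live icon"
    CONNECTING = "Connecting ..."
    UNREACHABLE = "communication impossible"
    REFUSED = "not displayable"
    BUSY = "busy"


def icon_status(response, elapsed_s, timeout_s=10.0):
    """Map the reply to a live icon request onto what the candidate list shows.

    `response` is None while no reply has arrived; otherwise one of the
    hypothetical labels "icon", "rejected", or "busy".
    """
    if response is None:
        # No reply yet: still waiting, or given up after the timeout.
        if elapsed_s >= timeout_s:
            return IconStatus.UNREACHABLE
        return IconStatus.CONNECTING
    return {
        "icon": IconStatus.LIVE,
        "rejected": IconStatus.REFUSED,
        "busy": IconStatus.BUSY,
    }[response]
```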
  • The master terminal 11 accepts, via the operation input device 1d, operation inputs designating from the candidates displayed on the screen the plurality of slave terminals 12 that are to participate in the video lecture or video conference.
  • The user of the master terminal 11 operates the operation input device 1d to move the cursor (symbol G in the figure) and clicks the information display field (symbols A and B in the figure) of the desired slave terminal 12.
  • The master terminal 11 that has received this operation recognizes the terminal 12 corresponding to the clicked display field as a participant and, based on the identification information related to that terminal 12, sends it a call request asking to start a video call.
  • This call request includes the identification information and ID number of each of the plurality of slave terminals 12 and the master terminal 11 that are participants in the video lecture or video conference. Even if the identification information related to the slave terminal 12 requires address resolution, if the session has already been established or address resolution has already been completed at the live icon request transmission stage, the call request can be sent directly to the slave terminal 12. Otherwise, session establishment or address resolution is performed in the same manner as when the live icon request was transmitted, and the call request is then transmitted to the slave terminal 12. Incidentally, a terminal 1 for which “communication impossible” or “busy” is displayed instead of the live icon cannot be designated as a participant (the cursor cannot be moved to the information display field of that terminal 1).
  • Prior to the start of the video lecture or video conference, the maximum rate calculation unit 116 calculates the maximum value of the transfer rate of the video data that each of the terminals 11 and 12 should transmit during the video call. In doing so, it refers to the allowable transfer rate of each of the participating slave terminals 12 and the allowable transfer rate of the master terminal 11 itself.
  • The master terminal 11 acquires and aggregates the allowable transfer rate of data that each slave terminal 12 can transmit and receive via the communication line 2, either by a known method of exchanging data packets on a trial basis with each slave terminal 12 to which a call request was sent (measuring the delay time of packet transmission and reception, the error packet rate, etc.), or by receiving from each such slave terminal 12 the allowable bandwidth information that it stores in advance as setting information.
  • If the allowable bandwidth information for the slave terminal 12 is already stored in the setting information storage unit 111 as part of the setting information, the stored information may be read from the setting information storage unit 111.
  • The master terminal 11 also learns the allowable transfer rate of data that it can itself transmit and receive via the communication line 2, either by the known method of exchanging data packets with a slave terminal 12 on a trial basis or by reading the allowable bandwidth information that the master terminal 11 itself stores in advance as setting information.
  • Then, for each of the terminals 11 and 12, the maximum value of the transfer rate of the video to be transmitted from that terminal to the other terminals 11 and 12 is calculated.
  • Basically, the allowable transmission rate via the communication line 2 to which the terminal 11 or 12 is connected is divided by the number of slave terminals 12 participating in the video lecture or video conference, in other words the number of participating bases other than the terminal's own, to obtain the maximum rate of data sent during the video call.
  • However, the data transmitted from the terminals 11 and 12 during a video call includes not only video but also audio, headers defined by the protocol, error correction (particularly forward error correction) codes, and the like. It is therefore more desirable to determine the maximum rate of the video to be transmitted by subtracting the corresponding band.
  • (Maximum rate of video to be transmitted during video call) = (Allowable transmission rate of communication line 2) ÷ (Number of participating slave terminals 12) − (Band for transmission of protocol header, error correction code, etc.)
  • the maximum value of the transfer rate of the video to be transmitted from the terminals 11 and 12 to the other terminals 11 and 12 is calculated.
  • instead of dividing the allowable transmission rate by the number of slave terminals 12, the allowable transmission rate may be discounted by multiplying it by a ratio that corresponds to the number of slave terminals 12, or by a predetermined ratio that does not depend on the number of slave terminals 12, in order to determine the maximum transfer rate of the video transmitted from each terminal 11, 12.
  • the maximum rate designation information transmission unit 117 uses the function of the communication interface 1k to transmit the maximum rate designation information, which describes the value of the maximum transfer rate calculated by the maximum rate calculation unit 116, to each of the slave terminals 12 designated as participants in a video lecture or a video conference. In the present embodiment, the maximum transfer rate of video transmitted to the other terminals 11 and 12 is calculated individually for each of the main terminal 11 and the slave terminals 12. Therefore, the maximum rate designation information transmission unit 117 transmits to each slave terminal 12 the maximum rate designation information describing the maximum transfer rate that corresponds to that slave terminal 12. The value of the maximum transfer rate for the main terminal 11 itself is temporarily stored in the main memory 1b or the auxiliary storage device 1c of the main terminal 11.
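A minimal sketch of the per-terminal calculation described above, assuming the simple division rule and ignoring the overhead band. The dictionary layout, the terminal identifiers, and the function name are hypothetical; the source does not define a message format for the maximum rate designation information.

```python
def build_rate_designations(allowable_kbps: dict, master_id: str):
    """Return (rate kept by the master, {slave_id: rate to designate}).

    allowable_kbps maps every participating terminal ID (master included)
    to the allowable transfer rate of its own line. Hypothetical layout.
    """
    if master_id not in allowable_kbps:
        raise ValueError("master terminal must be among the participants")
    num_slaves = len(allowable_kbps) - 1
    if num_slaves < 1:
        raise ValueError("at least one slave terminal must participate")
    # Each terminal's own allowable rate is divided by the number of slaves.
    per_terminal = {tid: rate / num_slaves
                    for tid, rate in allowable_kbps.items()}
    # The master keeps its own value locally; the rest are sent to the slaves.
    master_rate = per_terminal.pop(master_id)
    return master_rate, per_terminal


master_rate, designations = build_rate_designations(
    {"master": 4000, "slave_a": 2000, "slave_b": 1000}, "master")
print(master_rate, designations)
```

Each slave receives only its own entry, which mirrors the statement that the designation information is prepared individually for each slave terminal 12.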
  • the call receiving unit 113 receives the video and audio transmitted from the slave terminal 12 after the video call is started, using the function of the communication interface 1k.
  • video data is received from all the slave terminals 12 participating in a video conference or a video lecture.
  • the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the main terminal 11 itself is the subject that attracts the attention of others, low-definition video data is received from all participating slave terminals 12. While any slave terminal 12 is the subject that attracts the attention of others, high-quality video data is received only from that slave terminal 12.
  • the call output control unit 114 displays the video received during a call from any one or all of the slave terminals 12 on the screen of the display 1i, and outputs the voice received during the call from all the slave terminals 12 through the speaker 1f.
  • FIG. 10 shows a display example of the image of the other party during the video call. However, what is shown in FIG. 10 is a call screen in the group view mode, and the call screen in the single view mode is different from this. On the call screen in the group view mode, the video (symbol H in the figure) taken at all participating slave terminals 12 is displayed in parallel with the same dimensions.
  • the call output control unit 114 can also display a video image captured by the camera 1h at its installation location in a small size within the call screen. Further, the call output control unit 114 superimposes the voice received from each slave terminal 12 and outputs the voice from the speaker 1f.
  • the call transmission unit 115 uses the function of the communication interface 1k to send the video captured by the camera 1h and the sound collected by the microphone 1e at the installation location of the main terminal 11 to the slave terminals 12 participating in a video lecture or a video conference. In the group view mode, the same video data is transmitted to all participating slave terminals 12. For this purpose, the video captured by the camera 1h is encoded into video data having a rate equal to or lower than the maximum transfer rate calculated by the maximum rate calculation unit 116, and then transmitted to each of the slave terminals 12.
  • Methods for increasing or decreasing the bit rate of video data include adjusting the quality after encoding (in other words, the degree of compression), changing the compression algorithm, changing the frame rate, or the size (number of pixels) of the video frame.
  • the bit rate of the video data is suppressed to the maximum transfer rate or less by a method other than enlarging or reducing the size of the video frame.
  • the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the main terminal 11 itself is a subject that attracts the attention of others, the same high-definition video data is transmitted to all participating slave terminals 12.
  • the transfer rate of the high-quality video data is not necessarily restricted by the maximum transfer rate calculated by the maximum rate calculation unit 116. This is because each slave terminal 12 receives video only from the master terminal 11.
  • the low-quality video data is transmitted only to the slave terminal 12 that is the subject that attracts the attention of others.
  • the low-quality video data is preferably encoded into data having a rate equal to or lower than the maximum transfer rate calculated by the maximum rate calculation unit 116.
  • it is not necessarily bound by the maximum transfer rate. This is because no video is exchanged between the terminals 11 and 12 other than the slave terminal 12 that is the subject that attracts the attention of others.
  • the operation input reception unit 118 receives, via the operation input device 1d, an operation input for switching between the group view mode and the single view mode, and an operation input for selecting any one of the terminals 11 and 12 as the subject that attracts the attention of others in the single view mode. For example, on the group view mode screen shown in the figure, the user of the master terminal 11 operates the operation input device 1d to place the cursor on the display field of the desired slave terminal 12 and then clicks the “1 site” button (symbol P in the figure). The master terminal 11 that has received this operation learns that it should switch from the group view mode to the single view mode, and further that the slave terminal 12 corresponding to the position of the cursor is the terminal 12 that should be the subject that attracts the attention of others in the single view mode.
  • When the user simply wants to switch between the group view mode and the single view mode (the state in which the main terminal 11 is the subject that attracts the attention of others), the user need only press a specific key of the operation input device 1d or the like.
  • the main terminal 11 that has received this operation knows that switching from the group view mode to the single view mode or from the single view mode to the group view mode is required.
  • In response to the operation input received by the operation input reception unit 118, the command transmission unit 119 uses the function of the communication interface 1k to transmit to each slave terminal 12 a switching command, which is a command that instructs switching between the group view mode and the single view mode and / or indicates which of the terminals 11 and 12 has been selected as the subject that attracts the attention of others in the single view mode.
  • the operation input reception unit 118 can also receive, via the operation input device 1d, an operation input made by the user of the master terminal 11 indicating that the pan, tilt, or zoom of the camera 1h attached to a slave terminal 12 should be operated.
  • the command transmission unit 119 uses the function of the communication interface 1k to send to the slave terminal 12 a camera operation command, corresponding to the operation input, for operating the pan, tilt, or zoom of the camera 1h.
  • the setting information storage unit 124 stores setting information, in particular, allowable bandwidth information regarding the slave terminal 12 using the storage area of the main memory 1b or the auxiliary storage device 1c.
  • the setting information may be manually input directly or may be transmitted from another terminal 1 or a computer connected via the telecommunication line 2.
  • the terminal 1 accepts input of setting information via the operation input device 1d or receives it using the function of the communication interface 1k and stores it in the setting information storage unit 124.
  • the allowable bandwidth information is not necessarily set and input in advance and stored in the setting information storage unit 124.
  • the call request receiving unit 121 receives a call request transmitted from the main terminal 11 by using the function of the communication interface 1k.
  • the call request includes identification information and an ID number of each of the plurality of slave terminals 12 and the master terminal 11 that are participants in a video lecture or a video conference. Therefore, by referring to the call request, the slave terminal 12 can learn the identification information of the other participating terminals 11 and 12, and it becomes possible to establish a session connection with those terminals 11 and 12.
  • the slave terminal 12 that has received the call request from the master terminal 11 further sends call requests to the other slave terminals 12 listed in the call request, just as the master terminal 11 did for the slave terminal 12.
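The way a slave terminal derives its onward call requests from the participant list can be sketched as follows. The list-of-strings representation of the call request and all identifiers are assumptions made for illustration; the source does not specify how the call request is encoded, nor how duplicate requests between two slave terminals are resolved.

```python
def peers_to_call(participant_ids: list, own_id: str, caller_id: str) -> list:
    """Return the terminals that still need a call request from this slave.

    participant_ids: IDs listed in the received call request (master and
    all slaves). own_id: this slave terminal. caller_id: the terminal that
    sent the call request (the master). All parameters are hypothetical.
    """
    if own_id not in participant_ids or caller_id not in participant_ids:
        raise ValueError("own_id and caller_id must be listed participants")
    # Skip ourselves and the caller, which is already connected to us.
    return [tid for tid in participant_ids if tid not in (own_id, caller_id)]


request = ["master", "slave_1", "slave_2", "slave_3"]
print(peers_to_call(request, own_id="slave_2", caller_id="master"))
```

Run on every slave terminal, this yields a connection between each pair of participating terminals, which is what lets every terminal exchange video directly with every other terminal in the group view mode.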
  • the slave terminal 12 may return the allowable bandwidth information stored and held in its own setting information storage unit 124 to the master terminal 11 in response to the call request.
  • the maximum rate designation information receiving unit 125 receives the maximum rate designation information transmitted from the main terminal 11 by using the function of the communication interface 1k. By referring to this maximum rate designation information, the maximum transfer rate of video data to be transmitted, particularly in the group view mode, is obtained. The value of the maximum transfer rate is temporarily stored in the main memory 1b or the auxiliary storage device 1c of the slave terminal 12.
  • the call receiving unit 122 receives the video and audio transmitted from the main terminal 11 and / or the slave terminal 12 after the start of the video call using the function of the communication interface 1k.
  • video data is received from the master terminal 11 and all other slave terminals 12 participating in a video conference or a video lecture.
  • the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the slave terminal 12 itself is the subject that attracts the attention of others, it receives low-quality video data from the participating master terminal 11 and all other slave terminals 12. While either the master terminal 11 or a slave terminal 12 other than itself is the subject that attracts the attention of others, it receives high-quality video data only from that master terminal 11 or slave terminal 12.
  • the call output control unit 123 displays on the screen of the display 1i the video received during a call from any one or all of the main terminal 11 and the slave terminals 12, and outputs from the speaker 1f the voice received during the call from all of the main terminal 11 and the slave terminals 12.
  • the sound received from the terminals 11 and 12 is superimposed and output from the speaker 1f.
  • the call transmission unit 127 uses the function of the communication interface 1k to transmit the video captured by the camera 1h and the sound collected by the microphone 1e at the installation location of the slave terminal 12 to the master terminal 11 and / or the other slave terminals 12 participating in the video lecture or video conference. In the group view mode, the same video data is transmitted to the participating master terminal 11 and all other slave terminals 12. For this purpose, the video captured by the camera 1h is encoded into video data having a rate equal to or less than the maximum transfer rate that was calculated by the maximum rate calculation unit 116 of the main terminal 11 and described in the maximum rate designation information, and is then transmitted to each of the main terminal 11 and the slave terminals 12.
  • the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the slave terminal 12 itself is the subject that attracts the attention of others, the same high-definition video data is transmitted to the participating master terminal 11 and all other slave terminals 12.
  • the transfer rate of the high-quality video data is not necessarily restricted by the maximum transfer rate described in the maximum rate designation information. This is because the master terminal 11 and the other slave terminals 12 receive video only from this side.
  • the low-definition video data is sent only to the master terminal 11 or the slave terminal 12 that is the subject that attracts the attention of others.
  • the low-quality video data at this time is preferably encoded into data having a rate equal to or less than the maximum transfer rate described in the maximum rate designation information. However, it is not necessarily bound by the maximum transfer rate. This is because video exchange does not occur between the terminals 11 and 12 other than the terminals 11 and 12 that are the subject that attracts the attention of others.
  • the command receiving unit 126 receives the switching command transmitted from the main terminal 11 using the function of the communication interface 1k. By referring to the switching command, the slave terminal 12 learns whether it is currently in the group view mode or the single view mode, and which of the terminals 11 and 12 is the subject that attracts the attention of others in the single view mode.
  • the command receiving unit 126 may also receive a camera operation command provided from the master terminal 11.
  • a control signal is input to the camera 1h based on the camera operation command to operate pan, tilt, or zoom.
  • when the main terminal 11 is currently in the group view mode (step S1), it receives the low-definition video and audio transmitted from all the slave terminals 12 participating in the video lecture or video conference (step S2), and outputs the received video and audio via the display 1i and the speaker 1f (step S3).
  • the user's own video captured by the camera 1h and the user's own voice picked up by the microphone 1e are transmitted to all participating slave terminals 12 (step S4).
  • in step S4, the video data is encoded into a low-quality video, and the transfer rate is kept at or below the maximum transfer rate.
  • if the current mode is the single view mode and the main terminal 11 itself is selected as the subject that attracts the attention of others (step S5), the low-quality video and audio transmitted from all of the participating slave terminals 12 are received (step S6), and the received video and audio are output via the display 1i and the speaker 1f (step S7).
  • the user's own video captured by the camera 1h and the user's own voice collected by the microphone 1e are transmitted to all participating slave terminals 12 (step S8).
  • in step S8, the video data is encoded into a high quality video.
  • the transfer rate is not necessarily kept at or below the maximum transfer rate.
  • when currently in the single view mode and any of the slave terminals 12 is selected as the subject that attracts the attention of others, the high-definition video transmitted only from that slave terminal 12 is received together with the audio transmitted from all the slave terminals 12 (step S9), and the received video and audio are output via the display 1i and the speaker 1f (step S10).
  • the user's own video captured by the camera 1h and the user's own voice picked up by the microphone 1e are transmitted only to the slave terminal 12 that is the subject that attracts the attention of others (step S11).
  • in step S11, the video data is encoded into a low-quality video, and the transfer rate is kept at or below the maximum transfer rate.
  • when an operation input by the user of the main terminal 11 to switch between the group view mode and the single view mode, or to select any one of the terminals 11 and 12 as the subject that attracts the attention of others in the single view mode, is received (step S12), a switching command corresponding to the operation input is transmitted to all participating slave terminals 12 (step S13).
  • the main terminal 11 repeats the above steps S1 to S13 until the end of the video lecture or video conference.
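The branching in steps S1 to S11 can be summarized as a small decision function. This is a sketch under stated assumptions: the mode strings, the `MasterPlan` structure, and the function name are all hypothetical, and only the video path is modeled (the source states that audio is received from all slave terminals 12 in every case).

```python
from dataclasses import dataclass

GROUP_VIEW = "group"    # hypothetical label for the group view mode
SINGLE_VIEW = "single"  # hypothetical label for the single view mode


@dataclass(frozen=True)
class MasterPlan:
    receive_video_from: tuple   # slave IDs whose video is received
    send_video_to: tuple        # slave IDs that receive the master's video
    send_quality: str           # "low" or "high"
    capped_by_max_rate: bool    # whether the maximum transfer rate applies


def plan_master_iteration(mode, focus_id, master_id, slave_ids) -> MasterPlan:
    slaves = tuple(slave_ids)
    if mode == GROUP_VIEW:                      # steps S1 to S4
        return MasterPlan(slaves, slaves, "low", True)
    if mode != SINGLE_VIEW:
        raise ValueError(f"unknown mode: {mode}")
    if focus_id == master_id:                   # steps S5 to S8
        return MasterPlan(slaves, slaves, "high", False)
    if focus_id not in slaves:
        raise ValueError("focus must be a participating terminal")
    # Steps S9 to S11: video is exchanged only with the focused slave.
    return MasterPlan((focus_id,), (focus_id,), "low", True)


print(plan_master_iteration(SINGLE_VIEW, "slave_a", "master",
                            ["slave_a", "slave_b"]))
```

Keeping the decision separate from the actual send and receive calls makes it easy to re-evaluate the plan on every pass of the loop, including right after a switching command is issued in steps S12 and S13.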
  • when the slave terminal 12 is currently in the group view mode (step S14), it receives the low-definition video and audio transmitted from the master terminal 11 and all other slave terminals 12 participating in the video lecture or video conference (step S15), and outputs the received video and audio via the display 1i and the speaker 1f (step S16).
  • the self-image captured by the camera 1h and the own sound collected by the microphone 1e are transmitted to the main terminal 11 and all other slave terminals 12 (step S17).
  • in step S17, the video data is encoded into a low-quality video, and the transfer rate is kept at or below the maximum transfer rate.
  • when the current mode is the single view mode and the slave terminal 12 itself has been selected as the subject that attracts the attention of others (step S18), the low-quality video and audio transmitted from the master terminal 11 and all other slave terminals 12 are received (step S19), and the received video and audio are output via the display 1i and the speaker 1f (step S20).
  • the self-image captured by the camera 1h and the own sound collected by the microphone 1e are transmitted to the main terminal 11 and all other slave terminals 12 (step S21).
  • in step S21, the video data is encoded into a high quality video.
  • the transfer rate is not necessarily kept at or below the maximum transfer rate.
  • if the current mode is the single view mode and the master terminal 11 or a slave terminal 12 other than itself is selected as the subject that attracts the attention of others, the high-definition video transmitted only from that master terminal 11 or slave terminal 12 is received together with the audio transmitted from all the slave terminals 12 (step S22), and the received video and audio are output via the display 1i and the speaker 1f (step S23). At the same time, the user's own video captured by the camera 1h and the user's own voice collected by the microphone 1e are transmitted only to the main terminal 11 or the slave terminal 12 that is the subject that attracts the attention of others (step S24). In step S24, the video data is encoded into a low-quality video, and the transfer rate is kept at or below the maximum transfer rate.
  • when a switching command is provided from the main terminal 11, it is received (step S25).
  • the slave terminal 12 repeats the above steps S14 to S25 until the end of the video lecture or video conference.
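How a slave terminal might track the current mode when a switching command arrives (step S25) can be sketched as follows. The dictionary keys `mode` and `focus_id`, the mode strings, and the `ViewState` class are assumptions; the source only says that the command conveys the mode and the terminal selected as the subject that attracts attention.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ViewState:
    mode: str = "group"              # "group" or "single" (hypothetical labels)
    focus_id: Optional[str] = None   # terminal attracting attention, if any


def apply_switching_command(state: ViewState, command: dict) -> ViewState:
    """Return the new state after a switching command (hypothetical format)."""
    mode = command.get("mode", state.mode)
    if mode not in ("group", "single"):
        raise ValueError(f"unknown mode: {mode}")
    if mode == "group":
        # No single terminal is the focus in the group view mode.
        return ViewState("group", None)
    focus = command.get("focus_id", state.focus_id)
    if focus is None:
        raise ValueError("single view mode requires a focus terminal")
    return ViewState("single", focus)


state = apply_switching_command(ViewState(), {"mode": "single", "focus_id": "master"})
state = apply_switching_command(state, {"focus_id": "slave_b"})
print(state)
```

The updated state then determines which of the branches starting at steps S14 and S18, or the branch of steps S22 to S24, the slave terminal follows on its next pass.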
  • when any of the terminals 11 and 12 has lost its communication connection with another terminal 11, 12, it automatically re-sends the call request to that terminal to attempt reconnection.
  • the slave terminal 12 can leave the currently participating video call according to the convenience of the user of the slave terminal 12.
  • when the slave terminal 12 receives, via the operation input device 1d, an operation input by the user instructing it to leave the video call, it transmits information indicating that it is leaving the video call to the master terminal 11 and to the other slave terminals 12.
  • the master terminal 11 and the slave terminal 12 that have received the above information disconnect the communication connection with the slave terminal 12 that is about to leave the video call.
  • the main terminal 11 calculates the maximum transfer rate again in the same manner as when the video call starts, and transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals 12 other than the slave terminal 12 that leaves the video call.
  • the slave side terminal 12 receives the maximum rate designation information brought again from the master side terminal 11. Thereafter, the master terminal 11 and the slave terminal 12 that make a video call each generate and transmit a video that is less than or equal to the recalculated maximum transfer rate at least in the group view mode.
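The recalculation after a slave terminal leaves can be sketched as below, again assuming the simple division rule with no overhead band. The dictionary layout and names are hypothetical.

```python
def rates_after_leave(allowable_kbps: dict, master_id: str, leaving_id: str) -> dict:
    """Recompute per-terminal maximum rates once leaving_id has left.

    allowable_kbps maps each participating terminal ID (master included)
    to the allowable rate of its line. Hypothetical data layout.
    """
    if leaving_id == master_id:
        raise ValueError("this sketch only handles a slave terminal leaving")
    if leaving_id not in allowable_kbps:
        raise ValueError("leaving terminal is not a participant")
    remaining = {tid: rate for tid, rate in allowable_kbps.items()
                 if tid != leaving_id}
    num_slaves = len(remaining) - 1  # everyone except the master
    if num_slaves < 1:
        raise ValueError("no slave terminal remains in the call")
    return {tid: rate / num_slaves for tid, rate in remaining.items()}


lines = {"master": 3000, "slave_a": 3000, "slave_b": 1500, "slave_c": 3000}
print(rates_after_leave(lines, "master", "slave_c"))
```

In this example the master's limit rises from 1000 kbps (3000 ÷ 3) to 1500 kbps (3000 ÷ 2), since the same line capacity is now shared among fewer outgoing streams.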
  • the main terminal 11 can newly add a participating base to a video call that has already been started.
  • the master terminal 11 receives from the user, via the operation input device 1d, an operation input designating the slave terminal 12 that is to newly participate in the already started video call. For example, as shown in FIG. 9, candidates for the slave terminal 12 to be newly joined are displayed on the screen of the display 1i, and the user designates the slave terminal 12 to be the new participant from among the candidates.
  • the master terminal 11 that has received the operation input transmits a call request for requesting the slave terminal 12 to start a video call based on the identification information related to the slave terminal 12 designated as a new participant.
  • This call request includes identification information and ID numbers of each of the plurality of slave terminals 12 and the master terminal 11 already participating in the video call.
  • the slave terminal 12 designated as the new participant receives the call request from the master terminal 11 and, by referring to the identification information and ID numbers included in the call request, transmits a call request to each slave terminal 12 already participating in the video call. Through these processes, communication is established between the slave terminal 12 newly participating in the video call and the slave terminals 12 and the master terminal 11 already participating in the video call.
  • the master terminal 11 also transmits information including identification information and ID number of the slave terminal 12 designated as a new participant to the slave terminal 12 already participating in the video call.
  • the slave terminal 12 having received this information may transmit a call request to the slave terminal 12 newly participating in the video call.
  • the main terminal 11 calculates the maximum transfer rate again in the same manner as when starting the video call, and transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals 12 (including bases newly participating in the video call).
  • the slave side terminal 12 receives the maximum rate designation information brought again from the master side terminal 11. Thereafter, the master terminal 11 and the slave terminal 12 that make a video call each generate and transmit a video that is less than or equal to the recalculated maximum transfer rate at least in the group view mode.
  • the main terminal 11 can selectively exclude any of the bases participating in the video call from the video call.
  • the master terminal 11 receives from the user, via the operation input device 1d, an operation input designating the slave terminal 12 to be excluded from the already started video call.
  • the master terminal 11 that has received the operation input transmits information including the identification information and ID number of the slave terminal 12 to be excluded to the slave terminals 12 other than that slave terminal 12.
  • the slave terminal 12 that has received the information disconnects the communication connection with the slave terminal 12 that should be excluded.
  • the master terminal 11 transmits information indicating the exclusion to the slave terminal 12 to be excluded.
  • the slave terminal 12 that has received this information may disconnect the communication connection with the other slave terminals 12 and the master terminal 11.
  • the main terminal 11 calculates the maximum transfer rate again in the same manner as when the video call starts, and transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals 12 other than the slave terminal 12 excluded from the video call.
  • the slave side terminal 12 receives the maximum rate designation information brought again from the master side terminal 11. Thereafter, the master terminal 11 and the slave terminal 12 that make a video call each generate and transmit a video that is less than or equal to the recalculated maximum transfer rate at least in the group view mode.
  • transmission rate optimization processing is not required at the start of a video call, so video transmission / reception and screen display can be started quickly in the group view mode, and the image is stable from immediately after the start.
  • the present invention is not limited to the embodiment described in detail above.
  • the method for calculating the maximum transfer rate of video by the maximum rate calculation unit 116 of the main terminal 11 is not limited to that in the above embodiment.
  • the transfer rate of data that can be transmitted via the line 2 to which the terminals 11 and 12 are connected becomes the bottleneck. This is because an xDSL line has an uplink communication speed that is significantly slower than its downlink communication speed. In such a case, it is reasonable to calculate the maximum video transfer rate by referring only to the uplink transfer rate of the line 2 to which the terminals 11 and 12 are connected. The transfer rate of data that can be received through the network, that is, the downlink transfer rate, may be ignored.
  • the maximum transfer rate may be calculated by referring only to the uplink transfer rate, or alternatively by referring only to the downlink transfer rate. In the latter case, for example, the maximum transfer rate of video can be calculated by taking the minimum value, which is the bottleneck, among the downlink transfer rates of the terminals 11 and 12 and dividing or discounting it according to the number of slave terminals 12.
  • the maximum video transmission rate may be calculated by referring to both the upstream transfer rate and the downstream transfer rate.
  • the downlink transfer rate of the communication line 2 to which the terminal 11, 12 is connected must not be exceeded, and when each terminal 11, 12 transmits video to all other terminals 11, 12 simultaneously, the uplink transfer rate of the communication line 2 to which that terminal 11, 12 is connected must not be exceeded; the maximum transfer rate is required to be calculated so as to satisfy these two conditions.
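One possible way to satisfy both conditions at once is sketched below. The source does not prescribe a specific procedure, so the rule used here (cap each terminal by its own uplink share and by the share of the smallest downlink in the call) is an assumption, as are the names and the dictionary layout.

```python
def max_rates_both_directions(uplink_kbps: dict, downlink_kbps: dict) -> dict:
    """Per-terminal maximum video rate respecting uplink and downlink limits.

    Both dictionaries map every participating terminal ID to the rate of
    its own line. Each terminal sends one stream to each of the other
    terminals and receives one stream from each of them.
    """
    if set(uplink_kbps) != set(downlink_kbps):
        raise ValueError("both tables must list the same terminals")
    peers = len(uplink_kbps) - 1  # equals the number of slave terminals
    if peers < 1:
        raise ValueError("at least two terminals are required")
    # Assumption: no sender may exceed an equal share of the smallest
    # downlink, so the sum received by any terminal stays within its line.
    downlink_cap = min(downlink_kbps.values()) / peers
    return {tid: min(uplink_kbps[tid] / peers, downlink_cap)
            for tid in uplink_kbps}


up = {"master": 3000, "slave_a": 1000, "slave_b": 2000}
down = {"master": 8000, "slave_a": 2000, "slave_b": 8000}
print(max_rates_both_directions(up, down))
```

In the example, slave_a receives 1000 kbps from each of the two other terminals, which exactly fills its 2000 kbps downlink, while its own outgoing streams are limited to 500 kbps each by its 1000 kbps uplink.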
  • the present invention can be used, for example, as a system for realizing video lectures or video conferences between multiple points.

Abstract

A main-side terminal (11), when starting a video call, refers to allowed bandwidth information for each of terminals (11 and 12), and calculates a value which is the allowed transfer rate divided by the number of subordinate-side terminals (12) as the maximum value of the transfer rate for video image data to be transmitted from each of the terminals (11 and 12) during the video call. Then, information of the calculated maximum transfer rate is transmitted from the main-side terminal (11) toward each of the subordinate-side terminals (12).

Description

Video call system, master terminal, slave terminal
 The present invention relates to a system for realizing a video call function in which three or more parties can participate.
 Video call systems that realize video lectures, video conferences, and the like between multiple points via telecommunication lines are well known.
 Recently, the demand for higher-quality video has been growing steadily, but the bandwidth of the communication line remains a bottleneck: if three or more parties each try to send high-quality video to one another, the communication bandwidth immediately runs short. When the bandwidth runs short, video frames are dropped and audio is interrupted, and the call is no longer suitable for lectures or meetings on important topics.
 To avoid this, it is effective to lay a guaranteed-type (not best-effort) dedicated line with an assured, sufficiently high communication speed, or to interpose an MCU (Multi-point Control Unit), which is a high-performance relay device, between the bases. However, it cannot be denied that both require a huge investment. Using IP (Internet Protocol) multicast technology is also conceivable, but at present multicast communication over public lines such as the Internet is generally not permitted.
 In view of the above, the applicant has proposed a new video call system that allows high-quality video to be exchanged with a relatively inexpensive investment (see the patent document and non-patent document below).
 Incidentally, existing systems automatically optimize the transfer rate of the video data they send in response to line congestion, which varies with the day and the time of day. However, optimizing the transfer rate of the video data takes a certain amount of time, which can cause problems: the user may have to wait until the video sent from the other party appears on the screen, or the video may be displayed but take a long time to stabilize.
Specification of Japanese Patent Application No. 2009-170612
 The main object of the present invention is to stabilize the video display as far as possible from immediately after the start of a video call with three or more participants.
 The present invention provides a video call system that realizes a video call function for transmitting and receiving video between terminals installed at each of three or more bases, in which any one terminal serves as the master terminal and the remaining terminals serve as slave terminals. The master terminal comprises: a maximum rate calculation unit that, at the start of a video call, refers to, for each terminal, the transfer rate of data that can be transmitted (uplink rate) or received (downlink rate) via the communication line to which that terminal is connected, and calculates the maximum value of the transfer rate to be used when each terminal sends data to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals; a maximum rate designation information transmission unit that transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals; a call transmission unit that generates, from video captured by a camera at the installation site of the master terminal, video data at a rate equal to or less than the calculated maximum transfer rate and transmits it to each of the slave terminals; a call receiving unit that receives the video data transmitted from each of the slave terminals; and a call output control unit that displays the received video data on the screen of a display. Each slave terminal comprises: a maximum rate designation information receiving unit that, at the start of the video call, receives the maximum rate designation information transmitted from the master terminal; a call transmission unit that generates, from video captured by a camera at the installation site of the slave terminal, video data at a rate equal to or less than the maximum transfer rate described in the maximum rate designation information and transmits it to each of the master terminal and the other slave terminals; a call receiving unit that receives the video data transmitted from each of the master terminal and the other slave terminals; and a call output control unit that displays the received video data on the screen of a display.
The maximum rate calculation unit of the master terminal basically calculates the maximum transfer rate by dividing the referenced transfer rate by the number of slave terminals.
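The basic calculation can be sketched as follows. This is a minimal illustration, not the implementation described in the patent: the function name `max_transfer_rate`, the kbps unit, and the choice to take the smallest referenced rate across all terminals before dividing are assumptions. The text states only that the referenced transfer rate is divided by the number of slave terminals.

```python
def max_transfer_rate(link_rates_kbps, num_slaves):
    """Return a per-stream maximum transfer rate in kbps.

    link_rates_kbps: referenced uplink or downlink rate of each terminal
                     (hypothetical input format, one value per terminal).
    num_slaves:      number of slave terminals in the call.
    """
    if num_slaves < 1:
        raise ValueError("a call needs at least one slave terminal")
    if not link_rates_kbps:
        raise ValueError("no link rates given")
    # Assumption: the most constrained line bounds every stream.
    bottleneck = min(link_rates_kbps)
    # Basic rule from the text: divide the rate by the number of slaves.
    return bottleneck / num_slaves


# Five sites: one master and four slaves (illustrative figures only).
rates = [8000, 4000, 6000, 4000, 10000]
print(max_transfer_rate(rates, num_slaves=4))  # 1000.0
```

With five sites, each terminal sends four streams, so dividing by the four slave terminals keeps the combined outgoing video within the referenced rate of the slowest line.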
The system may also be configured so that it can be switched freely between a group view mode, in which every terminal transmits video data to each of the other terminals and every terminal displays the video of all sites other than its own, and a single view mode, in which the master terminal transmits video data to each slave terminal and the slave terminals display the video of the site where the master terminal is installed. In this case, the master terminal further comprises an operation input reception unit that accepts an operation input requesting a switch between the group view mode and the single view mode, and a command transmission unit that, in response to the accepted operation input, transmits to each slave terminal a switching command for switching between the group view mode and the single view mode. In the single view mode, the call transmission unit of the master terminal is not bound by the maximum transfer rate and generates video data at a higher transfer rate than in the group view mode, transmitting it to each slave terminal. Each slave terminal further comprises a command reception unit that receives the switching command transmitted from the master terminal; in the single view mode, its call transmission unit transmits video data only to the master terminal and its call reception unit receives video data only from the master terminal. In practice, not every participant in a video lecture, video conference, or the like needs the video of every other participant at all times. In the single view mode, therefore, the exchange of video between slave terminals is stopped, and the master terminal can deliver higher-quality video to each slave terminal at a correspondingly higher transfer rate.
In the single view mode, it is also preferable to be able to change which terminal sends high-quality video to the other terminals, that is, which party is the focus of attention in the video call. To that end, the system can be switched between a group view mode, in which every terminal transmits video data to each of the other terminals and every terminal displays the video of all sites other than its own, and a single view mode, in which one selected terminal transmits video data to each of the other terminals and the other terminals display the video of the site where the selected terminal is installed. The master terminal further comprises an operation input reception unit that accepts an operation input requesting a switch between the group view mode and the single view mode and an operation input selecting one of the terminals for the single view mode, and a command transmission unit that, in response to the accepted operation input, transmits to each slave terminal a switching command, which is a command for switching between the group view mode and the single view mode or a command indicating which terminal has been selected in the single view mode. When the single view mode is active and the master terminal itself is selected, the call transmission unit of the master terminal is not bound by the maximum transfer rate and generates video data at a higher transfer rate than in other cases, transmitting it to each slave terminal; when the single view mode is active and a slave terminal is selected, it transmits video data only to the selected terminal, and the call reception unit of the master terminal receives video data only from the selected terminal. Each slave terminal further comprises a command reception unit that receives the switching command transmitted from the master terminal. When the single view mode is active and the slave terminal itself is selected, its call transmission unit is not bound by the maximum transfer rate and generates video data at a higher transfer rate than in other cases, transmitting it to the master terminal and to each of the other slave terminals; when the single view mode is active and the master terminal or another slave terminal is selected, it transmits video data only to the selected terminal, and its call reception unit receives video data only from the selected terminal. With such a system, a terminal other than the master terminal can be selected in the single view mode so that it delivers high-quality video to the other terminals, including the master terminal. In other words, the user operating the master terminal can change, as appropriate, which party is the focus of attention in the video call.
Furthermore, the operation input reception unit of the master terminal may accept, when the single view mode is active and a slave terminal is selected, an operation input for operating the pan, tilt, or zoom of the camera attached to that slave terminal; the command transmission unit of the master terminal may transmit to the selected slave terminal, in response to the accepted operation input, a camera operation command for operating the pan, tilt, or zoom of the camera; and the command reception unit of the slave terminal may receive the camera operation command transmitted from the master terminal when the single view mode is active and the slave terminal itself is selected. With this configuration, while one of the slave terminals is delivering high-quality video to the master terminal and the other slave terminals, the user of the master terminal can remotely control the camera of that slave terminal as needed.
If the master terminal also has the functions of a slave terminal, the master terminal and the slave terminals can share a common design, which helps reduce the manufacturing cost of the terminals.
According to the present invention, the video display can be stabilized as far as possible from immediately after the start of a video call with three or more participants.
A diagram showing an overview of a video call system according to an embodiment of the present invention.
A diagram showing the hardware resources of a terminal in the embodiment.
A functional block diagram of the terminal.
A functional block diagram of the terminal.
A diagram explaining the group view mode in the embodiment.
A diagram explaining the single view mode in the embodiment.
A diagram explaining the single view mode in the embodiment.
A diagram illustrating the information stored in the phone book storage unit.
A diagram showing an example of a screen display on a terminal.
A diagram showing an example of a screen display on a terminal.
A flowchart showing the procedure of processing that a terminal executes according to a program.
A flowchart showing the procedure of processing that a terminal executes according to a program.
An embodiment of the present invention will be described with reference to the drawings. As shown in FIG. 1, the video call system of this embodiment is built by connecting terminals 1, installed at separate sites, so that they can communicate with one another via a telecommunication line 2 such as a public network typified by the Internet, a WAN (Wide Area Network), or a LAN (Local Area Network). At least one of the terminals 1 serves as the master terminal 11, and the remaining terminals serve as slave terminals 12.
As shown in FIG. 2, the terminal 1 includes hardware resources such as a processor 1a, a main memory 1b, an auxiliary storage device 1c, an operation input device 1d, a microphone 1e, a speaker 1f, an audio codec 1g, a camera 1h, a display 1i, a video codec 1j, and a communication interface 1k, which operate in cooperation under the control of a controller 1l (a system controller, an I/O controller, or the like).
The auxiliary storage device 1c is a hard disk drive, flash memory, optical disc drive, or the like.
The operation input device 1d may be a push button operable with the fingers or a pointing device such as a touch panel or trackpad; this embodiment assumes a remote controller capable of wireless communication with the terminal 1.
The audio codec 1g encodes sound picked up through the microphone 1e (sound pickup here means capturing sound on site and includes recording) and decodes encoded audio information for output from the speaker 1f. The video codec 1j encodes video captured by the camera 1h and decodes encoded video information for display on the screen of the display 1i. The audio codec 1g and the video codec 1j can each be implemented in software rather than hardware.
The camera 1h may be built into the terminal 1 or attached to it externally. In the external case, a general-purpose camera available on the market can be used. The camera 1h is preferably of a type whose pan (change of the optical-axis angle in the horizontal direction), tilt (change of the optical-axis angle in the vertical direction), and/or zoom (change of focal length) can be operated by control signals from the terminal 1 in which it is mounted or to which it is directly connected.
The communication interface 1k is a device for communicating information via the telecommunication line 2. Typical examples are a NIC (Network Interface Card) and a wireless LAN transceiver, but interfaces such as USB (Universal Serial Bus) and IEEE 1394 can also be adopted.
The terminal 1 is basically a dedicated device built for the purpose of constructing this system. This does not, however, preclude configuring the terminal 1 according to the present invention by installing the necessary programs on a general-purpose personal computer, workstation, video game console, or the like.
The programs to be executed by the processor 1a are stored in the auxiliary storage device 1c; at execution time they are read from the auxiliary storage device 1c into the main memory 1b and interpreted by the processor 1a. By operating the hardware resources described above according to the programs, the terminal 1 provides the functions of the master terminal 11 shown in FIG. 3 and the functions of the slave terminal 12 shown in FIG. 4.
The functional units of the master terminal 11 are a setting information storage unit 111, a call request transmission unit 112, a maximum rate calculation unit 116, a maximum rate designation information transmission unit 117, a call reception unit 113, a call output control unit 114, a call transmission unit 115, an operation input reception unit 118, and a command transmission unit 119.
The functional units of the slave terminal 12 are a setting information storage unit 124, a call request reception unit 121, a maximum rate designation information reception unit 125, a call reception unit 122, a call output control unit 123, a call transmission unit 127, and a command reception unit 126.
Before describing each functional unit, the video call modes provided by the video call system of this embodiment are described. In this embodiment, the system can be switched as appropriate between a "group view mode," in which all sites can simultaneously view the video of all other sites, and a "single view mode," in which only one selected site can simultaneously view the video of all other sites while the other sites view only the video of that one site.
Regardless of whether the group view mode or the single view mode is active, the terminals 1 installed at the sites participating in the video call each establish session connections individually. When five sites participate, with the terminal 1 at one site serving as the master terminal 11 and the terminals 1 at the remaining four sites serving as slave terminals 12, each of the terminals 11 and 12 establishes an individual session connection with each of the other four terminals 11 and 12.
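The five-site case can be checked with a short sketch of a fully connected set of sessions. The terminal names and the helper functions `session_pairs` and `peers_of` are hypothetical and serve only to count connections; they say nothing about how sessions are actually negotiated.

```python
from itertools import combinations


def session_pairs(terminals):
    """Return every unordered pair of terminals that share a session."""
    return list(combinations(terminals, 2))


def peers_of(terminal, terminals):
    """Return the terminals that `terminal` connects to individually."""
    return [t for t in terminals if t != terminal]


# One master terminal and four slave terminals (illustrative names).
sites = ["master", "slave1", "slave2", "slave3", "slave4"]
print(len(peers_of("master", sites)))  # 4
print(len(session_pairs(sites)))       # 10
```

Each terminal holds four sessions, and the call as a whole consists of ten pairwise sessions.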
As shown in FIG. 5, in the group view mode, the terminals 11 and 12 installed at the sites participating in the video call each transmit the video captured at their own site to all of the other terminals 11 and 12 (broken lines in the figure). When five sites participate, each of the terminals 11 and 12 sends video to each of the other four terminals 11 and 12 and receives video from each of them. The received videos are arranged and displayed on the screen of the display 1i. Because the group view mode inevitably strains the bandwidth of the communication line 2 to which each of the terminals 11 and 12 is connected, the transfer rate of the video sent by each terminal is held at or below the maximum transfer rate described later. Note that the system is in the group view mode immediately after a video call starts.
As shown in FIG. 6, in the single view mode, one terminal 11 among the terminals 11 and 12 installed at the sites participating in the video call transmits the video captured at its own site to all of the other terminals 12 (solid lines in the figure). The other terminals 12, in turn, transmit the video captured at their own sites to the one terminal 11 (broken lines in the figure), but they do not exchange video with one another. This frees up bandwidth at least on the communication lines 2 to which the other terminals 12 are connected. Accordingly, the transfer rate of the video sent from the one terminal 11 to each of the other terminals 12 is allowed to be higher than that of the video sent in the group view mode, and it need not be held at or below the maximum transfer rate. In this embodiment, low-quality video has a relatively small frame size (resolution), for example 960×540 pixels or 720×480 pixels, and high-quality video has a relatively large frame size, for example 1920×1080 pixels. Each of the other terminals 12 receives high-quality video from the one terminal 11 and displays it over substantially the entire screen of the display 1i. Conversely, each of the other terminals 12 in the single view mode sends low-quality video to the one terminal 11. As in the group view mode, the transfer rate of the video sent by the other terminals 12 in the single view mode is preferably held at or below the maximum transfer rate. The one terminal 11, having received video from each of the other terminals 12, arranges and displays those videos on the screen of the display 1i as in the group view mode.
In the example shown in FIG. 6, the master terminal 11 is the party that is the focus of attention in the single view mode. As shown in FIG. 7, however, any one of the slave terminals 12 can instead be selected as that party, in which case the selected terminal 12 transmits high-quality video to each of the other terminals 11 and 12 (solid lines in the figure) and each of the other terminals 11 and 12 transmits low-quality video to the selected terminal 12 (broken lines in the figure). In this situation, the user of the master terminal 11 can remotely operate the pan, tilt, or zoom of the camera 1h of the selected slave terminal 12.
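The routing rules of the two modes can be summarized in a small sketch. The function `video_targets`, the mode strings "group" and "single", and the quality labels "low" and "high" are illustrative assumptions; the text specifies only which terminals send video to which, and at which relative quality.

```python
def video_targets(me, terminals, mode, selected=None):
    """Return {peer: quality} for the video streams `me` should send.

    mode is "group" or "single"; selected is the terminal in focus in
    the single view mode. The quality labels are illustrative only.
    """
    others = [t for t in terminals if t != me]
    if mode == "group":
        # Every terminal sends rate-capped video to every other terminal.
        return {t: "low" for t in others}
    if mode != "single" or selected not in terminals:
        raise ValueError("single view mode needs a valid selected terminal")
    if me == selected:
        # The terminal in focus sends high-quality video to all others.
        return {t: "high" for t in others}
    # All other terminals send low-quality video to the selected one only.
    return {selected: "low"}


terminals = ["master", "slave1", "slave2"]
print(video_targets("slave1", terminals, "single", selected="slave2"))
# {'slave2': 'low'}
print(video_targets("slave2", terminals, "single", selected="slave2"))
# {'master': 'high', 'slave1': 'high'}
```

The same function covers both the FIG. 6 case (the master terminal selected) and the FIG. 7 case (a slave terminal selected), because the rule depends only on whether a terminal is the selected one.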
Audio is handled the same way in both the group view mode and the single view mode: each of the terminals 11 and 12 always transmits the sound picked up at its own site to all of the other terminals 11 and 12. The bit rate of audio data is orders of magnitude lower than that of video data, so transmitting it from every terminal 11 and 12 participating in the lecture, conference, or the like to every other terminal 11 and 12 is unlikely to cause a shortage of communication bandwidth.
The functional units of the master terminal 11 are described next. The setting information storage unit 111 stores various kinds of setting information using a storage area of the main memory 1b or the auxiliary storage device 1c. The first kind of setting information is phone book (or address book) information, which associates the identification information of other terminals 1 that are potential video call parties (for example, up to 100 entries) with attribute information about those terminals. FIG. 8 illustrates the phone book information. For each of the other terminals 1, the setting information storage unit 111 stores identification information and attribute information in association with an ID number or the like. Identification information identifies a communication destination; typical examples are an IP address, a SIP (Session Initiation Protocol) address, or another URI. Attribute information is information about the other party, such as a personal or organization name, affiliation, and location, and is character string information in this embodiment.
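One way to picture the phone book information is as a keyed collection of small records. The sketch below is an assumption about layout, not the stored format: the class names, field names, and the sample addresses (taken from documentation-reserved ranges) are all hypothetical, and the limit of 100 entries is only the example figure given in the text.

```python
from dataclasses import dataclass

MAX_ENTRIES = 100  # example upper limit mentioned in the text


@dataclass
class PhoneBookEntry:
    entry_id: int         # ID number the record is keyed by
    identification: str   # IP address, SIP address, or other URI
    name: str             # attribute information (character strings)
    affiliation: str = ""
    location: str = ""


class PhoneBook:
    def __init__(self):
        self._entries = {}

    def add(self, entry):
        """Store or overwrite an entry, refusing to exceed the limit."""
        is_new = entry.entry_id not in self._entries
        if is_new and len(self._entries) >= MAX_ENTRIES:
            raise ValueError("phone book is full")
        self._entries[entry.entry_id] = entry

    def identification_of(self, entry_id):
        """Return the communication destination for the given ID."""
        return self._entries[entry_id].identification


book = PhoneBook()
book.add(PhoneBookEntry(1, "192.0.2.10", "Tokyo office", location="Tokyo"))
book.add(PhoneBookEntry(2, "sip:room@example.com", "Osaka office"))
print(book.identification_of(2))  # sip:room@example.com
```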
Another kind of setting information is allowable bandwidth information for the master terminal 11. Allowable bandwidth information is specific to each terminal 1 and indicates the allowable transfer rate of data that can be transmitted, and/or the allowable transfer rate of data that can be received, via the communication line 2 to which that terminal 1 is connected. The allowable transfer rate can be understood as expressing either the communication speed quality, that is, how high a bit rate the communication line 2 to which the terminal 1 is connected can sustain for sending or receiving data during a video call, or a communication speed limit that takes into account how much of the communication bandwidth the video call may occupy relative to other services (or other people sharing the same line 2). The allowable transfer rate varies with the type of communication line 2 to which the terminal 1 is connected, the number of users sharing that line 2, the location of the terminal 1, the day or time of day, and various other circumstances.
Allowable bandwidth information may be entered in advance by a user or administrator of the terminal 1 or the system, in which case the allowable bandwidth information for the terminal 1 itself is stored in the setting information storage unit 111 as part of the setting information. If allowable bandwidth information for another terminal 1 has already been obtained, it is also stored in the setting information storage unit 111 in association with the ID number or the like identifying that terminal 1. Allowable bandwidth information is not, however, necessarily entered in advance and stored in the setting information storage unit 111.
The setting information described above may be entered directly by hand or may be transmitted from another terminal 1 or a computer connected via the telecommunication line 2. The terminal 1 accepts manual entry of setting information through the operation input device 1d, or receives it using the communication interface 1k, and stores it in the setting information storage unit 111.
Although not shown in FIG. 8, the identification information and attribute information of other terminals 1 with which video calls have been made in the past can also be accumulated in the phone book as a call history. Specifically, information indicating whether the call was outgoing (placed by this terminal) or incoming (received from the other party), a timestamp indicating the date and time of the video call with that terminal 1, and the like are stored together in the setting information storage unit 111. In particular, for multiple terminals 1 that took part in a past video lecture, video conference, or the like, information to that effect and a timestamp indicating the date and time of the lecture or conference may be stored.
To hold a video call, that is, a video lecture, video conference, or the like, the call request transmission unit 112 uses the communication interface 1k to transmit a call request for the video call to the terminals 1 that will take part in the lecture or conference. In principle, the call request is transmitted from the master terminal 11 to the slave terminals 12. This does not, of course, preclude transmitting a call request from a slave terminal 12 to the master terminal 11.
The master terminal 11 displays on the screen of the display 1i a list of other terminals 1 that are candidate parties for the video call, that is, candidate participants in the lecture, conference, or the like. FIG. 9 shows an example of the candidate list. The candidates displayed first are terminals 1 that have taken part in a lecture or conference in the past (whose history information remains in the setting information storage unit 111). For each of the candidate terminals 1, the master terminal 11 reads the identification information from the setting information storage unit 111 and, based on that identification information, transmits to the terminal 1 a live icon request, which is information requesting transmission of a live icon (reference sign B in the figure). A live icon is real-time video or a still image captured at the location where the terminal 1 is installed. The size of a live icon (for example, 192×108 pixels) is smaller than that of the video exchanged during a video call. The frame rate of a live icon is about 5 fps or higher.
In the system of this embodiment, the terminals 1 are in principle interconnected peer to peer via the telecommunication line 2. That is, no separate relay server or relay device such as an MCU is needed to conduct a video lecture, video conference, or the like. If the identification information does not require address resolution, as with an IP address, a live icon request can be transmitted to the terminal 1 immediately using that identification information. If the identification information requires address resolution, as with a SIP address, a session with the called terminal 12 is first established, for example by using the identification information to send an invitation message to another computer (for example, a SIP proxy server), and the live icon request is transmitted afterward. Alternatively, the identification information is sent to another computer with address resolution capability (for example, a DNS (Domain Name System) server), the IP address or the like corresponding to the identification information is received from that computer, and the live icon request is transmitted to the terminal 1 using that IP address or the like.
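The branching on whether address resolution is needed can be sketched with Python's standard `ipaddress` module. The function names and the returned plan strings are hypothetical, and routing `sip:` identifiers to a SIP proxy while sending everything else to DNS is an assumption made for illustration; the text presents the SIP proxy and the DNS lookup as alternatives without fixing which identifiers use which.

```python
import ipaddress


def needs_address_resolution(identification):
    """Return False if the identification is a literal IP address."""
    try:
        ipaddress.ip_address(identification)
        return False
    except ValueError:
        return True


def live_icon_request_plan(identification):
    """Describe the steps before a live icon request can be sent."""
    if not needs_address_resolution(identification):
        return "send live icon request directly"
    # Assumption: SIP identifiers go through a SIP proxy invitation.
    if identification.startswith("sip:"):
        return "establish session via SIP proxy, then send live icon request"
    # Assumption: anything else is resolved through DNS first.
    return "resolve via DNS, then send live icon request"


print(live_icon_request_plan("192.0.2.10"))
print(live_icon_request_plan("sip:room@example.com"))
print(live_icon_request_plan("conference.example.com"))
```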
 A terminal 1 that has received a live icon request normally transmits a live icon captured by its camera 1h to the terminal 1 that issued the request. The main terminal 11 receives the live icon returned from each terminal 1 and displays it on the screen of the display 1i together with the attribute information of that terminal (symbol A in the figure). For a terminal 1 from which no live icon has been received, the live icon cannot be displayed, so an image or character string indicating that fact is displayed instead. In the example shown in FIG. 9, the display “Connecting ...” (symbol C in the figure) indicates that a live icon request has been transmitted to the terminal 1 and a response is awaited. The display “communication impossible” (symbol D in the figure) indicates that no response was returned within a certain time after the live icon request was transmitted, that is, a session cannot be established with the terminal 1 for some reason (the terminal 1 is powered off, the communication line to which the terminal 1 is connected has a fault, and so on). The display “not displayable” (symbol E in the figure) indicates that a live icon request was transmitted to the terminal 1, but a response rejecting the request, in other words stating that no live icon will be sent, was returned instead of a live icon. The display “busy” (symbol F in the figure) indicates that a live icon request was transmitted to the terminal 1, but a response stating that the terminal is already in a video call with another terminal 1 was returned instead of a live icon.
 The main terminal 11 then accepts, via the operation input device 1d, an operation input designating, from among the candidates displayed on the screen, the plurality of slave terminals 12 that are to participate in the video lecture, video conference, or the like. The user of the main terminal 11 operates the operation input device 1d to move the cursor (symbol G in the figure) and clicks the information display field (symbols A and B in the figure) of the desired slave terminal 12. On accepting this operation, the main terminal 11 recognizes the terminal 12 corresponding to the clicked field as a participant and, based on the identification information of that terminal 12, transmits to it a call request requesting the start of a video call. The call request contains the identification information and ID number of each of the slave terminals 12 and the main terminal 11 that are to participate. Even if the identification information of the slave terminal 12 requires address resolution, the call request can be transmitted directly to the slave terminal 12 when session establishment or address resolution was already completed at the live icon request stage. Otherwise, session establishment or address resolution is performed in the same manner as for the live icon request, and the call request is then transmitted to the slave terminal 12. Note that a terminal 1 for which “communication impossible” or “busy” is displayed instead of a live icon cannot be designated as a participant (the cursor cannot be moved to the information display field of that terminal 1).
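 The candidate-list states of FIG. 9 (symbols C to F) and the rule on which candidates can be designated can be summarized in a short sketch. This is only an illustration under assumptions: the names IconReply, candidate_label, and selectable are hypothetical, and the 10-second timeout stands in for the unspecified "certain time" in the description.

```python
from enum import Enum
from typing import Optional


class IconReply(Enum):
    """Hypothetical reply kinds a candidate terminal may return."""
    ICON = "icon"        # a live icon was returned
    REFUSED = "refused"  # terminal declines to send a live icon
    BUSY = "busy"        # terminal is already in another video call


def candidate_label(reply: Optional[IconReply], elapsed_s: float,
                    timeout_s: float = 10.0) -> str:
    """Return what the candidate list shows for one terminal.

    timeout_s is an assumed value; the description only says
    "a certain time".
    """
    if reply is None:
        # No response yet: still waiting (C) or timed out (D).
        if elapsed_s < timeout_s:
            return "Connecting ..."
        return "communication impossible"
    if reply is IconReply.REFUSED:
        return "not displayable"        # symbol E
    if reply is IconReply.BUSY:
        return "busy"                   # symbol F
    return "live icon"                  # symbol B


def selectable(label: str) -> bool:
    """Terminals shown as unreachable or busy cannot be designated."""
    return label not in ("communication impossible", "busy")
```

 Under this sketch a terminal that refused to send a live icon remains selectable, which matches the description: only the "communication impossible" and "busy" states block designation.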
 最大レート算定部116は、ビデオ講義またはビデオ会議等の開始に先んじて、各端末11、12がビデオ通話中に送出するべき映像データの転送レートの最大値を算定する。この際、参加者となる複数の従側端末12の各々についての許容転送レート、さらには主側端末11自身についての許容転送レートを参照する。 The maximum rate calculation unit 116 calculates the maximum value of the transfer rate of the video data that each of the terminals 11 and 12 should transmit during the video call prior to the start of the video lecture or video conference. At this time, the allowable transfer rate for each of the plurality of slave terminals 12 as participants and the allowable transfer rate for the master terminal 11 itself are referred to.
 The method of calculating the maximum transfer rate is as follows. The main terminal 11 first learns and collects the allowable transfer rate of data that each slave terminal 12 can transmit and receive via the communication line 2. It does so either by a known technique of exchanging test data packets with each slave terminal 12 to which a call request was sent (measuring the delay of packet transmission and reception, the error packet rate, and so on), or by receiving, from each slave terminal 12 to which the transmission request was sent, the allowable bandwidth information that the slave terminal 12 holds in advance as setting information. However, when the allowable bandwidth information for a slave terminal 12 is already stored in the setting information storage unit 111 as part of the setting information, it suffices to read it from there.
 In addition, the main terminal 11 learns the allowable transfer rate of data that it can itself transmit and receive via the communication line 2, either by the known technique of exchanging test data packets with the slave terminals 12 or by reading the allowable bandwidth information that the main terminal 11 holds in advance as setting information.
 Then, for each of the terminals 11 and 12 that are to participate in the video lecture, video conference, or the like, the maximum transfer rate of the video that the terminal should send to the other terminals 11 and 12 is calculated. Basically, the maximum rate of data sent during the video call is obtained by dividing the allowable transmission rate over the communication line 2 to which the terminal 11 or 12 is connected by the number of slave terminals 12 participating in the video lecture, video conference, or the like, in other words by the number of participating sites other than the terminal itself.
 In practice, however, the entire bandwidth of the communication line 2 cannot always be devoted to sending video data alone. The data that the terminals 11 and 12 must transmit during a video call includes not only video but also audio, headers defined by the protocol, error correction codes (in particular, forward error correction codes), and so on. It is therefore more desirable to deduct the corresponding bandwidth when fixing the maximum rate of the video to be sent. That is, the maximum transfer rate of the video to be sent from a terminal 11 or 12 to the other terminals 11 and 12 is calculated as
(maximum rate of video sent during a video call) = (allowable transmission rate of communication line 2) ÷ (number of participating slave terminals 12) − (bandwidth for transmitting protocol headers, error correction codes, etc.)
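 As a worked illustration of the formula above, the following minimal sketch computes the maximum video rate for one terminal. The function name, the parameter names, and the use of kbit/s are assumptions made for the example; clamping the result at zero is also an added safeguard that the description does not mention.

```python
def max_video_rate_kbps(allowed_tx_kbps: float,
                        num_slave_terminals: int,
                        overhead_kbps: float = 0.0) -> float:
    """Maximum video rate = allowed rate / number of slaves - overhead.

    allowed_tx_kbps     : allowable transmission rate of the terminal's line
    num_slave_terminals : number of participating slave terminals 12
    overhead_kbps       : band reserved for headers, error correction, etc.
    """
    if num_slave_terminals < 1:
        raise ValueError("at least one slave terminal must participate")
    rate = allowed_tx_kbps / num_slave_terminals - overhead_kbps
    # Assumption: never report a negative rate.
    return max(rate, 0.0)


# Example: a 2000 kbit/s uplink, 4 slave terminals, 100 kbit/s overhead.
example_rate = max_video_rate_kbps(2000.0, 4, 100.0)
```

 With N slave terminals, the main terminal sends to N sites and each slave terminal also sends to N sites (the main terminal plus N − 1 other slaves), which is why the same divisor applies to every terminal.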
 Alternatively, instead of dividing the allowable transmission rate by the number of slave terminals 12, the maximum transfer rate of the video sent from each terminal 11, 12 may be determined by discounting the allowable transmission rate, either by multiplying it by a ratio that depends on the number of slave terminals 12 or by multiplying it by a predetermined ratio that does not depend on that number.
 最大レート指定情報送信部117は、最大レート算定部116で算定した最大転送レートの値を記述した最大レート指定情報を、ビデオ講義またはビデオ会議等の参加者として指定された従側端末12の各々に対して、通信インタフェース1kの機能を利用して送信する。本実施形態では、主側端末11及び従側端末12のそれぞれについて、他の端末11、12に向けて送出する映像の最大転送レートを個別に算定している。従って、最大レート指定情報送信部は、各従側端末12に向けて、その従側端末12に対応する最大転送レートを記述した最大レート指定情報を送信する。主側端末11自身についての最大転送レートの値は、主側端末11のメインメモリ1bまたは補助記憶デバイス1cに一時記憶しておく。 The maximum rate designation information transmission unit 117 transmits the maximum rate designation information describing the value of the maximum transfer rate calculated by the maximum rate calculation unit 116 to each of the slave terminals 12 designated as participants in a video lecture or a video conference. On the other hand, it transmits using the function of the communication interface 1k. In the present embodiment, the maximum transfer rate of video transmitted to the other terminals 11 and 12 is individually calculated for each of the main terminal 11 and the slave terminal 12. Therefore, the maximum rate designation information transmitting unit transmits the maximum rate designation information describing the maximum transfer rate corresponding to each slave terminal 12 to each slave terminal 12. The value of the maximum transfer rate for the main terminal 11 itself is temporarily stored in the main memory 1b or the auxiliary storage device 1c of the main terminal 11.
 通話受信部113は、ビデオ通話の開始後、従側端末12から送信される映像及び音声を、通信インタフェース1kの機能を利用して受信する。グループビューモードでは、ビデオ会議またはビデオ講義等に参加している全ての従側端末12から映像データを受信する。 The call receiving unit 113 receives the video and audio transmitted from the slave terminal 12 after the video call is started, using the function of the communication interface 1k. In the group view mode, video data is received from all the slave terminals 12 participating in a video conference or a video lecture.
 シングルビューモードでは、何れの端末11、12が他者の注目を集める主体となっているかに応じて処理が異なる。主側端末11自身が他者の注目を集める主体となっている間は、参加している全ての従側端末12から低品位の映像データを受信する。何れかの従側端末12が他者の注目を集める主体となっている間は、当該従側端末12からのみ、高品位の映像データを受信する。 In the single view mode, the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the main terminal 11 itself is a subject that attracts the attention of others, low-definition video data is received from all participating slave terminals 12. While any slave terminal 12 is a subject that attracts the attention of others, high-quality video data is received only from the slave terminal 12.
 通話出力制御部114は、何れか一または全ての従側端末12から受信した通話中の映像をディスプレイ1iの画面に表示させるとともに、全ての従側端末12から受信した通話中の音声をスピーカ1fから音声出力させる。図10に、ビデオ通話中の相手先の映像の表示例を示す。但し、図10に示しているものは、グループビューモードにおける通話画面であり、シングルビューモードにおける通話画面はこれとは異なる。グループビューモードでの通話画面には、参加している全ての従側端末12において撮影された映像(図中符号H)が同じ寸法で並列的に映し出される。加えて、図示していないが、通話出力制御部114は、同通話画面内に、自己の設置場所にてカメラ1hで撮影した映像を小さく表示させることもできる。また、通話出力制御部114は、各従側端末12から受信した音声を重畳した上でスピーカ1fから出力させる。 The call output control unit 114 displays the video during a call received from any one or all of the slave terminals 12 on the screen of the display 1i, and the voice during a call received from all the slave terminals 12 to the speaker 1f. To output sound. FIG. 10 shows a display example of the image of the other party during the video call. However, what is shown in FIG. 10 is a call screen in the group view mode, and the call screen in the single view mode is different from this. On the call screen in the group view mode, the video (symbol H in the figure) taken at all participating slave terminals 12 is displayed in parallel with the same dimensions. In addition, although not shown, the call output control unit 114 can also display a video image captured by the camera 1h at its installation location in a small size within the call screen. Further, the call output control unit 114 superimposes the voice received from each slave terminal 12 and outputs the voice from the speaker 1f.
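 The description does not say how the received audio is superimposed. One common approach, shown here purely as an assumed sketch, is to sum the samples of the incoming streams and clip the result to the sample range; the function name mix_pcm16 and the choice of 16-bit signed PCM are illustrative assumptions, not part of the embodiment.

```python
from typing import List, Sequence


def mix_pcm16(streams: Sequence[Sequence[int]]) -> List[int]:
    """Superimpose several 16-bit PCM streams by summing and clipping.

    Only the span covered by every stream is mixed, so the output is
    as long as the shortest input.
    """
    if not streams:
        return []
    length = min(len(s) for s in streams)
    mixed: List[int] = []
    for i in range(length):
        total = sum(s[i] for s in streams)
        # Clip to the signed 16-bit range to avoid wrap-around.
        mixed.append(max(-32768, min(32767, total)))
    return mixed
```

 A production mixer would typically also align timestamps and attenuate loud inputs rather than hard-clipping, but those details are outside what the description states.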
 通話送信部115は、主側端末11の設置場所にてカメラ1hで撮影した映像及びマイク1eで収音した音声を、ビデオ講義またはビデオ会議等に参加している従側端末12に対して、通信インタフェース1kの機能を利用して送信する。グループビューモードでは、参加している全ての従側端末12に対して同じ映像データを送信する。そのために、カメラ1hで撮影した映像を、最大レート算定部116で算定した最大転送レート以下のレートの映像データにエンコードした上で、従側端末12の各々に対して送信する。映像データのビットレートを増減させる手法としては、エンコード後の品質(いわば、圧縮の度合い)を調整したり、圧縮アルゴリズムを変更したり、フレームレートを変えたり、または映像フレームの寸法(ピクセル数)を拡縮したりする手法が知られている。本実施形態では、映像の寸法はグループビューモード、シングルビューモードでそれぞれ固定としているので、映像フレームの寸法を拡縮する以外の方法で映像データのビットレートを最大転送レート以下に抑制する。 The call transmission unit 115 sends the video captured by the camera 1h and the sound collected by the microphone 1e at the installation location of the main terminal 11 to the slave terminal 12 participating in a video lecture or a video conference. Transmission is performed using the function of the communication interface 1k. In the group view mode, the same video data is transmitted to all participating slave terminals 12. For this purpose, the video captured by the camera 1h is encoded into video data having a rate equal to or lower than the maximum transfer rate calculated by the maximum rate calculation unit 116, and then transmitted to each of the slave terminals 12. Methods for increasing or decreasing the bit rate of video data include adjusting the quality after encoding (in other words, the degree of compression), changing the compression algorithm, changing the frame rate, or the size (number of pixels) of the video frame. There are known methods for scaling. In the present embodiment, since the video size is fixed in the group view mode and the single view mode, the bit rate of the video data is suppressed to the maximum transfer rate or less by a method other than enlarging or reducing the size of the video frame.
 シングルビューモードでは、何れの端末11、12が他者の注目を集める主体となっているかに応じて処理が異なる。主側端末11自身が他者の注目を集める主体となっている間は、参加している全ての従側端末12に対して、同じ高品位の映像データを送信する。この高品位の映像データの転送レートは、最大レート算定部116で算定した最大転送レートには必ずしも拘束されない。各従側端末12は主側端末11からのみ映像を受け取ることとなるからである。 In the single view mode, the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the main terminal 11 itself is a subject that attracts the attention of others, the same high-definition video data is transmitted to all participating slave terminals 12. The transfer rate of the high-quality video data is not necessarily restricted by the maximum transfer rate calculated by the maximum rate calculation unit 116. This is because each slave terminal 12 receives video only from the master terminal 11.
 他方、シングルビューモードにおいて、何れかの従側端末12が他者の注目を集める主体となっている間は、当該従側端末12に対してのみ、低品位の映像データを送信する。このときの低品位の映像データは、最大レート算定部116で算定した最大転送レート以下のレートのデータにエンコードすることが好ましい。とは言え、最大転送レートに拘束されるとは限られない。他者の注目を集める主体となっている従側端末12以外の端末11、12間では映像のやり取りが発生しないからである。 On the other hand, in the single view mode, while any slave terminal 12 is a subject that attracts the attention of others, low-quality video data is transmitted only to the slave terminal 12. At this time, the low-quality video data is preferably encoded into data having a rate equal to or lower than the maximum transfer rate calculated by the maximum rate calculation unit 116. However, it is not necessarily bound by the maximum transfer rate. This is because video exchange does not occur between the terminals 11 and 12 other than the slave terminal 12 that is the subject that attracts the attention of others.
 The operation input reception unit 118 accepts, via the operation input device 1d, an operation input for switching between the group view mode and the single view mode, and an operation input for selecting one of the terminals 11 and 12 as the subject that attracts the attention of the others in the single view mode. For example, on the group view mode screen shown in FIG. 10, the user of the main terminal 11 operates the operation input device 1d to move the cursor (symbol O in the figure) onto the video display field of the desired slave terminal 12 and then clicks the “1 site” button (symbol P in the figure). On accepting this operation, the main terminal 11 recognizes that the mode should be switched from the group view mode to the single view mode and, further, that the slave terminal 12 corresponding to the cursor position is the terminal 12 that should be the subject attracting the attention of the others in the single view mode. To simply switch between the group view mode and the single view mode (the state in which the main terminal 11 is the subject attracting the attention of the others), the user may, for example, press a specific key of the operation input device 1d. On accepting this operation, the main terminal 11 recognizes that the mode should be switched from the group view mode to the single view mode, or from the single view mode to the group view mode.
 In response to the operation input accepted by the operation input reception unit 118, the command transmission unit 119 transmits a switching command to each of the slave terminals 12 using the function of the communication interface 1k. The switching command is a command for switching between the group view mode and the single view mode and/or a command indicating which of the terminals 11 and 12 has been selected in the single view mode.
 Further, in the single view mode with a slave terminal 12 selected as the subject that attracts the attention of the others, the operation input reception unit 118 can accept, via the operation input device 1d, an operation input by the user of the main terminal 11 for operating the pan, tilt, or zoom of the camera 1h attached to that slave terminal 12. When such an operation input is accepted, the command transmission unit 119 transmits a corresponding camera operation command for operating the pan, tilt, or zoom of the camera 1h to that slave terminal 12 using the function of the communication interface 1k.
 Next, the functional units of the slave terminal 12 are described. The setting information storage unit 124 stores setting information, in particular the allowable bandwidth information for the slave terminal 12, using the storage area of the main memory 1b or the auxiliary storage device 1c. The setting information may be entered directly by hand, or may be transmitted from another terminal 1 or a computer connected via the telecommunication line 2. The terminal 1 accepts input of the setting information via the operation input device 1d, or receives it using the function of the communication interface 1k, and stores it in the setting information storage unit 124. However, the allowable bandwidth information is not necessarily entered in advance and stored in the setting information storage unit 124.
 The call request receiving unit 121 receives the call request transmitted from the main terminal 11 using the function of the communication interface 1k. The call request contains the identification information and ID number of each of the slave terminals 12 and the main terminal 11 that are to participate in the video lecture, video conference, or the like. By referring to the call request, the slave terminal 12 can therefore learn the identification information of the other participating terminals 11 and 12 and establish session connections with them. A slave terminal 12 that has received the call request from the main terminal 11 transmits further call requests to the other slave terminals 12 listed in the call request, just as the main terminal 11 did toward the slave terminal 12. In other words, reception of the call request triggers the establishment of mutual communication between the slave terminal 12 and the main terminal 11 and among the slave terminals 12, and the participation of the slave terminal 12 in the video lecture, video conference, or the like hosted by the main terminal 11 is confirmed. In response to the call request, the slave terminal 12 may return the allowable bandwidth information held in its own setting information storage unit 124 to the main terminal 11.
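 The way a single call request leads to a full peer-to-peer mesh can be sketched as follows. The identifiers and function names are hypothetical; the sketch only derives, from the ID list carried in the call request, which other slave terminals a slave terminal still has to contact and how many pairwise connections result in total.

```python
from itertools import combinations
from typing import FrozenSet, List, Sequence, Set


def slaves_to_contact(self_id: str, main_id: str,
                      request_ids: Sequence[str]) -> List[str]:
    """Other slave terminals listed in the call request.

    The main terminal is excluded because it sent the call request,
    so communication with it is already being established.
    """
    return [tid for tid in request_ids if tid not in (self_id, main_id)]


def mesh_links(request_ids: Sequence[str]) -> Set[FrozenSet[str]]:
    """All pairwise connections among the participating terminals."""
    return {frozenset(pair) for pair in combinations(request_ids, 2)}


# Hypothetical participant list carried in a call request.
ids = ["main", "s1", "s2", "s3"]
targets_for_s2 = slaves_to_contact("s2", "main", ids)
total_links = len(mesh_links(ids))
```

 For n participating terminals the mesh has n(n − 1)/2 connections, which is one reason the per-terminal video rate has to be capped in the group view mode.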
 最大レート指定情報受信部125は、主側端末11から送信される最大レート指定情報を、通信インタフェース1kの機能を利用して受信する。この最大レート指定情報を参照することにより、特にグループビューモードにおいて送出するべき映像データの最大転送レートを知得する。最大転送レートの値は、従側端末12のメインメモリ1bまたは補助記憶デバイス1cに一時記憶しておく。 The maximum rate designation information receiving unit 125 receives the maximum rate designation information transmitted from the main terminal 11 by using the function of the communication interface 1k. By referring to this maximum rate designation information, the maximum transfer rate of video data to be transmitted, particularly in the group view mode, is obtained. The value of the maximum transfer rate is temporarily stored in the main memory 1b or the auxiliary storage device 1c of the slave terminal 12.
 通話受信部122は、ビデオ通話の開始後、主側端末11及び/または従側端末12から送信される映像及び音声を、通信インタフェース1kの機能を利用して受信する。グループビューモードでは、ビデオ会議またはビデオ講義等に参加している主側端末11及び他の全ての従側端末12から映像データを受信する。 The call receiving unit 122 receives the video and audio transmitted from the main terminal 11 and / or the slave terminal 12 after the start of the video call using the function of the communication interface 1k. In the group view mode, video data is received from the master terminal 11 and all other slave terminals 12 participating in a video conference or a video lecture.
 In the single view mode, the processing differs depending on which of the terminals 11 and 12 is the subject that attracts the attention of the others. While the slave terminal 12 itself is that subject, it receives low-quality video data from the participating main terminal 11 and all the other slave terminals 12. While the main terminal 11 or a slave terminal 12 other than itself is that subject, it receives high-quality video data only from that main terminal 11 or slave terminal 12.
 通話出力制御部123は、主側端末11及び従側端末12の何れか一または全てから受信した通話中の映像をディスプレイ1iの画面に表示させるとともに、主側端末11及び従側端末12の全てから受信した通話中の音声をスピーカ1fから音声出力させる。音声については、各端末11、12から受信した音声を重畳した上でスピーカ1fから出力させる。 The call output control unit 123 displays a video during a call received from any one or all of the main side terminal 11 and the subordinate terminal 12 on the screen of the display 1i and all of the main side terminal 11 and the subordinate side terminal 12. The voice during a call received from the voice is output from the speaker 1f. As for the sound, the sound received from the terminals 11 and 12 is superimposed and output from the speaker 1f.
 通話送信部127は、従側端末12の設置場所にてカメラ1hで撮影した映像及びマイク1eで収音した音声を、ビデオ講義またはビデオ会議等に参加している主側端末11及び/または従側端末12に対して、通信インタフェース1kの機能を利用して送信する。グループビューモードでは、参加している主側端末11及び他の全ての従側端末12に対して同じ映像データを送信する。そのために、カメラ1hで撮影した映像を、主側端末11の最大レート算定部116で算定され、最大レート指定情報に記述された最大転送レート以下のレートの映像データにエンコードした上で、主側端末11及び従側端末12の各々に対して送信する。 The call transmission unit 127 receives the video captured by the camera 1h at the installation location of the slave terminal 12 and the sound collected by the microphone 1e and / or the master terminal 11 participating in the video lecture or video conference. It transmits to the side terminal 12 using the function of the communication interface 1k. In the group view mode, the same video data is transmitted to the participating master terminal 11 and all other slave terminals 12. For this purpose, the video captured by the camera 1h is calculated by the maximum rate calculation unit 116 of the main terminal 11 and encoded into video data having a rate equal to or less than the maximum transfer rate described in the maximum rate designation information. The data is transmitted to each of the terminal 11 and the slave terminal 12.
 シングルビューモードでは、何れの端末11、12が他者の注目を集める主体となっているかに応じて処理が異なる。従側端末12自身が他者の注目を集める主体となっている間は、参加している主側端末11及び他の全ての従側端末12に対して、同じ高品位の映像データを送信する。この高品位の映像データの転送レートは、最大レート指定情報に記述された最大転送レートには必ずしも拘束されない。主側端末11及び他の従側端末12は、此方からのみ映像を受け取るからである。 In the single view mode, the processing differs depending on which terminal 11 or 12 is the subject that attracts the attention of others. While the slave terminal 12 itself is a subject that attracts the attention of others, the same high-definition video data is transmitted to the participating master terminal 11 and all other slave terminals 12. . The transfer rate of the high-quality video data is not necessarily restricted by the maximum transfer rate described in the maximum rate designation information. This is because the master terminal 11 and the other slave terminals 12 receive video only from this side.
 While the main terminal 11 or a slave terminal 12 other than itself is the subject that attracts the attention of the others, the slave terminal 12 transmits low-quality video data only to that main terminal 11 or slave terminal 12. This low-quality video data is preferably encoded at a rate at or below the maximum transfer rate described in the maximum rate designation information. It is not, however, necessarily bound by the maximum transfer rate, because no video is exchanged among the terminals 11 and 12 other than the one that is the subject attracting the attention of the others.
 The command receiving unit 126 receives the switching command transmitted from the main terminal 11 using the function of the communication interface 1k. By referring to the switching command, the slave terminal 12 learns whether the current mode is the group view mode or the single view mode and, in the single view mode, which of the terminals 11 and 12 is the subject that attracts the attention of the others.
 In addition, in the single view mode with the slave terminal 12 itself selected as the subject that attracts the attention of the others, the command receiving unit 126 may receive a camera operation command from the main terminal 11. On receiving a camera operation command, the slave terminal 12 inputs a control signal to the camera 1h based on that command to operate its pan, tilt, or zoom.
 The procedure of the processing that the terminals 1 constituting the video call system of this embodiment execute during a video call is outlined here. As shown in FIG. 11, when the current mode is the group view mode (step S1), the main terminal 11 receives the low-quality video and the audio transmitted from all the slave terminals 12 participating in the video lecture, video conference, or the like (step S2) and outputs the received video and audio via the display 1i and the speaker 1f (step S3). At the same time, it transmits its own video captured by the camera 1h and its own audio collected by the microphone 1e to all participating slave terminals 12 (step S4). In step S4, the video data is encoded as low-quality video and its transfer rate is kept at or below the maximum transfer rate.
 When the current mode is the single view mode and the main terminal 11 itself is selected as the subject that attracts the attention of the others (step S5), the main terminal 11 likewise receives the low-quality video and the audio transmitted from all participating slave terminals 12 (step S6) and outputs the received video and audio via the display 1i and the speaker 1f (step S7). At the same time, it transmits its own video captured by the camera 1h and its own audio collected by the microphone 1e to all participating slave terminals 12 (step S8). In step S8, the video data is encoded as high-quality video. Its transfer rate is not necessarily kept at or below the maximum transfer rate.
When the current mode is the single view mode and one of the slave terminals 12 has been selected as the focus of attention, the master terminal 11 receives high-quality video only from that slave terminal 12 while receiving the audio transmitted from all the slave terminals 12 (step S9), and outputs the received video and audio via the display 1i and the speaker 1f (step S10). At the same time, it transmits its own video captured by the camera 1h and its own audio picked up by the microphone 1e only to the slave terminal 12 that is the focus of attention (step S11). In step S11, the video data is encoded as low-quality video, and its transfer rate is kept at or below the maximum transfer rate.
When the master terminal 11 accepts an operation input from its user instructing a switch between the group view mode and the single view mode, or an operation input selecting one of the terminals 11, 12 as the focus of attention in the single view mode (step S12), it transmits a switching command corresponding to that operation input to all participating slave terminals 12 (step S13).
The master terminal 11 repeats steps S1 to S13 above until the video lecture, video conference, or the like ends.
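The switching command of steps S12 and S13 can be sketched as a small message that the master terminal fans out to every participating slave terminal. This is a minimal illustration only: the names `ViewMode`, `SwitchCommand`, and `build_switch_commands`, and the choice to carry the selected terminal's ID in the command, are assumptions and are not taken from the specification.

```python
from dataclasses import dataclass
from enum import Enum
from typing import List, Optional, Tuple


class ViewMode(Enum):
    GROUP = "group"
    SINGLE = "single"


@dataclass(frozen=True)
class SwitchCommand:
    """Hypothetical payload of the switching command (step S13)."""
    mode: ViewMode
    focus_id: Optional[str] = None  # terminal selected as the focus in single view


def build_switch_commands(
    mode: ViewMode, focus_id: Optional[str], slave_ids: List[str]
) -> List[Tuple[str, SwitchCommand]]:
    """Return one (destination, command) pair per participating slave terminal."""
    if mode is ViewMode.SINGLE and focus_id is None:
        raise ValueError("single view mode needs a selected terminal")
    # The focus is only meaningful in single view mode, so drop it otherwise.
    command = SwitchCommand(mode, focus_id if mode is ViewMode.SINGLE else None)
    return [(slave_id, command) for slave_id in slave_ids]
```

The same command object is sent to every slave terminal, so all terminals end up agreeing on the mode and on which terminal is the focus.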
As shown in FIG. 12, when the slave terminal 12 is currently in the group view mode (step S14), it receives the low-quality video and audio transmitted from the master terminal 11 and all the other slave terminals 12 participating in the video lecture, video conference, or the like (step S15), and outputs the received video and audio via the display 1i and the speaker 1f (step S16). At the same time, it transmits its own video captured by the camera 1h and its own audio picked up by the microphone 1e to the master terminal 11 and all the other slave terminals 12 (step S17). In step S17, the video data is encoded as low-quality video, and its transfer rate is kept at or below the maximum transfer rate.
When the current mode is the single view mode and the slave terminal 12 itself has been selected as the focus of attention (step S18), the slave terminal 12 likewise receives the low-quality video and audio transmitted from the master terminal 11 and all the slave terminals 12 (step S19), and outputs the received video and audio via the display 1i and the speaker 1f (step S20). At the same time, it transmits its own video captured by the camera 1h and its own audio picked up by the microphone 1e to the master terminal 11 and all the other slave terminals 12 (step S21). In step S21, the video data is encoded as high-quality video. Its transfer rate is not necessarily kept at or below the maximum transfer rate.
When the current mode is the single view mode and the master terminal 11 or a slave terminal 12 other than itself has been selected as the focus of attention, the slave terminal 12 receives high-quality video only from that master terminal 11 or slave terminal 12 while receiving the audio transmitted from all the slave terminals 12 (step S22), and outputs the received video and audio via the display 1i and the speaker 1f (step S23). At the same time, it transmits its own video captured by the camera 1h and its own audio picked up by the microphone 1e only to the master terminal 11 or slave terminal 12 that is the focus of attention (step S24). In step S24, the video data is encoded as low-quality video, and its transfer rate is kept at or below the maximum transfer rate.
When a switching command arrives from the master terminal 11, the slave terminal 12 receives it (step S25).
The slave terminal 12 repeats steps S14 to S25 above until the video lecture, video conference, or the like ends.
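The per-mode behavior of FIG. 11 and FIG. 12 follows one pattern for both master and slave terminals, which the sketch below condenses into a single decision function. It is an illustration under assumptions: `StreamPlan` and `plan_streams` are hypothetical names, terminals are identified by plain strings, and audio reception is left out of the model.

```python
from dataclasses import dataclass
from typing import FrozenSet, Iterable, Optional


@dataclass(frozen=True)
class StreamPlan:
    video_from: FrozenSet[str]  # terminals whose video this terminal receives
    send_to: FrozenSet[str]     # terminals that receive this terminal's video and audio
    high_quality: bool          # encode as high-quality video?
    rate_capped: bool           # keep the rate at or below the maximum transfer rate?


def plan_streams(
    self_id: str,
    all_ids: Iterable[str],
    single_view: bool,
    focus_id: Optional[str] = None,
) -> StreamPlan:
    """Decide what one terminal sends and receives in the current mode."""
    others = frozenset(all_ids) - {self_id}
    if not single_view:
        # Group view: low-quality, capped video exchanged with every other terminal.
        return StreamPlan(others, others, high_quality=False, rate_capped=True)
    if focus_id == self_id:
        # This terminal is the focus: high-quality video to everyone, cap not enforced.
        return StreamPlan(others, others, high_quality=True, rate_capped=False)
    if focus_id not in others:
        raise ValueError("single view mode needs a participating focus terminal")
    # Another terminal is the focus: exchange video with that terminal only.
    only_focus = frozenset({focus_id})
    return StreamPlan(only_focus, only_focus, high_quality=False, rate_capped=True)
```

For example, `plan_streams("master", ["master", "s1", "s2"], True, "s2")` yields a plan in which the master exchanges video with `s2` only, matching steps S9 to S11.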
Some supplementary remarks on the functions of the master terminal 11 and the slave terminals 12 follow. When the communication connection between a terminal 11, 12 and any other terminal 11, 12 is lost during a video call (in either the group view mode or the single view mode), the terminal automatically re-sends a call request to the terminal 11, 12 at the other end of the lost connection and attempts to reconnect.
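The automatic reconnection can be sketched as a retry loop around the call request. The specification states only that the call request is re-sent automatically; the attempt limit, the fixed delay, and the names `reconnect` and `send_call_request` below are assumptions made for illustration.

```python
import time
from typing import Callable, Optional


def reconnect(
    peer_id: str,
    send_call_request: Callable[[str], bool],
    max_attempts: int = 5,
    delay_s: float = 1.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Optional[int]:
    """Re-send the call request until the peer accepts or attempts run out.

    Returns the number of the successful attempt, or None if all failed.
    `send_call_request` is a hypothetical callback that returns True once
    the connection to `peer_id` has been re-established.
    """
    for attempt in range(1, max_attempts + 1):
        if send_call_request(peer_id):
            return attempt
        if attempt < max_attempts:
            sleep(delay_s)  # assumed fixed pause between attempts
    return None
```

Passing `sleep` as a parameter keeps the sketch testable without real waiting.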
After a video call has started, a slave terminal 12 can leave the call in which it is currently participating, for example at its user's convenience. When the slave terminal 12 accepts, via the operation input device 1d, an operation input from its user instructing it to leave the video call, it transmits information indicating that it is leaving the video call to the master terminal 11 and the other slave terminals 12. The master terminal 11 and the slave terminals 12 that receive this information disconnect their communication connections with the leaving slave terminal 12.
Furthermore, because the number of sites participating in the video call decreases, the master terminal 11 recalculates the maximum transfer rate in the same manner as at the start of the video call, and transmits maximum rate designation information describing the calculated maximum transfer rate to each slave terminal 12 other than the one leaving the call. Each slave terminal 12 receives the maximum rate designation information newly sent from the master terminal 11. From then on, the master terminal 11 and the slave terminals 12 in the call each generate and send video at or below the recalculated maximum transfer rate, at least in the group view mode.
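The recalculation after a departure can be sketched as follows. The formula is an assumption for illustration: the bottleneck uplink rate among the remaining terminals is divided by the remaining number of slave terminals, which is one way of "discounting according to the number of slave terminals". The function names and the kbps unit are likewise hypothetical.

```python
from typing import Dict, Tuple


def max_rate_kbps(uplink_kbps: Dict[str, float]) -> float:
    """Assumed rule: smallest uplink rate divided by the number of slave terminals."""
    num_slaves = len(uplink_kbps) - 1  # every terminal except the master
    if num_slaves < 1:
        raise ValueError("need the master and at least one slave terminal")
    return min(uplink_kbps.values()) / num_slaves


def on_slave_left(
    uplink_kbps: Dict[str, float], master_id: str, leaving_id: str
) -> Tuple[float, Dict[str, float]]:
    """Recompute the cap and build the rate notices for the remaining slaves."""
    remaining = {t: r for t, r in uplink_kbps.items() if t != leaving_id}
    rate = max_rate_kbps(remaining)
    # One maximum rate designation notice per remaining slave terminal.
    notices = {t: rate for t in remaining if t != master_id}
    return rate, notices
```

With uplink rates of 3000, 1200, 900, and 2000 kbps for the master and three slaves, the departure of the 900 kbps slave raises the cap from 300 kbps to 600 kbps under this assumed rule.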
The master terminal 11 can add a new participating site to a video call that has already started. The master terminal 11 accepts, via the operation input device 1d, an operation input from its user that designates the slave terminal 12 to be newly added to the ongoing video call. For example, as shown in FIG. 9, candidate slave terminals 12 are displayed on the screen of the display 1i, and the user designates from among them the slave terminal 12 that is to become the new participant. Having accepted the operation input, the master terminal 11 transmits, based on the identification information of the designated slave terminal 12, a call request asking that slave terminal 12 to start a video call. This call request contains the identification information (ID numbers) of the master terminal 11 and of each of the slave terminals 12 already participating in the video call. The slave terminal 12 designated as the new participant receives the call request from the master terminal 11 and, referring to the identification information (ID numbers) contained in it, transmits a call request to each slave terminal 12 already participating. Through these steps, communication is established between the newly joining slave terminal 12 and the slave terminals 12 and master terminal 11 already in the call. The master terminal 11 also transmits information containing the identification information (ID number) of the newly designated slave terminal 12 to the slave terminals 12 already participating. A slave terminal 12 that receives this information may instead be the one to transmit a call request to the newly joining slave terminal 12.
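The message flow for adding a participant can be laid out as an ordered list of messages. This is a sketch only: the dictionary layout, the message type strings, and the function name `join_messages` are assumptions, and the variant in which existing slave terminals call the newcomer is not modeled.

```python
from typing import Dict, List


def join_messages(
    master_id: str, existing_slave_ids: List[str], new_id: str
) -> List[Dict[str, object]]:
    """List the messages exchanged when `new_id` joins an ongoing call."""
    participants = [master_id, *existing_slave_ids]
    # Master -> newcomer: call request carrying the IDs of all current participants.
    invite = {
        "from": master_id,
        "to": new_id,
        "type": "call_request",
        "participants": participants,
    }
    # Newcomer -> each existing slave: its own call request, using those IDs.
    follow_ups = [
        {"from": new_id, "to": slave_id, "type": "call_request"}
        for slave_id in existing_slave_ids
    ]
    # Master -> each existing slave: notice carrying the newcomer's ID.
    notices = [
        {"from": master_id, "to": slave_id, "type": "new_participant", "new_id": new_id}
        for slave_id in existing_slave_ids
    ]
    return [invite, *follow_ups, *notices]
```

Because the invite already lists every participant, the newcomer can complete the full mesh without any central relay.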
Furthermore, because the number of sites participating in the video call increases, the master terminal 11 recalculates the maximum transfer rate in the same manner as at the start of the video call, and transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals 12, including the newly joining site. Each slave terminal 12 receives the maximum rate designation information newly sent from the master terminal 11. From then on, the master terminal 11 and the slave terminals 12 in the call each generate and send video at or below the recalculated maximum transfer rate, at least in the group view mode.
The master terminal 11 can also selectively exclude any of the sites participating in a video call. The master terminal 11 accepts, via the operation input device 1d, an operation input from its user that designates the slave terminal 12 to be excluded from the ongoing video call. Having accepted the operation input, the master terminal 11 transmits information containing the identification information (ID number) of the slave terminal 12 to be excluded to the slave terminals 12 other than that one. The slave terminals 12 that receive this information disconnect their communication connections with the slave terminal 12 to be excluded. The master terminal 11 also transmits information to that effect to the slave terminal 12 to be excluded. The slave terminal 12 that receives this information may itself disconnect its communication connections with the other slave terminals 12 and the master terminal 11.
Furthermore, because the number of sites participating in the video call decreases, the master terminal 11 recalculates the maximum transfer rate in the same manner as at the start of the video call, and transmits maximum rate designation information describing the calculated maximum transfer rate to each slave terminal 12 other than the one excluded from the call. Each slave terminal 12 receives the maximum rate designation information newly sent from the master terminal 11. From then on, the master terminal 11 and the slave terminals 12 in the call each generate and send video at or below the recalculated maximum transfer rate, at least in the group view mode.
According to the video call system of this embodiment, no transfer rate optimization processing is needed when a video call starts, so video transmission, reception, and screen display can begin promptly in the group view mode, and the video is stable immediately after the start.
Furthermore, although the system allows three or more parties to participate simultaneously, high-quality video can be sent by only a limited subset of participants in the single view mode. This effectively resolves the problem of higher-quality video exhausting the communication bandwidth, and the system can be put to practical use at relatively low cost without relying on dedicated lines, an MCU, or IP multicast.
The present invention is not limited to the embodiment detailed above. The method by which the maximum rate calculation unit 116 of the master terminal 11 calculates the maximum video transfer rate is not limited to that of the above embodiment.
If the communication line 2 to which the terminals 11, 12 are connected is an xDSL line, the bottleneck is the rate at which data can be transmitted via that line 2, that is, the uplink transfer rate, because the uplink speed of an xDSL line is markedly lower than its downlink speed. In such a case, it is reasonable to calculate the maximum video transfer rate by referring only to the uplink transfer rate of the line 2 to which each terminal 11, 12 is connected; the rate at which each terminal 11, 12 can receive data via the line 2, that is, the downlink transfer rate, may be ignored.
Conversely, if the line is symmetric, with equal uplink and downlink speeds, the maximum transfer rate may be calculated by referring only to the uplink transfer rate or only to the downlink transfer rate. In the latter case, for example, the maximum video transfer rate could be calculated by taking the smallest of the downlink transfer rates of the terminals 11, 12, which is the bottleneck, and dividing it by, or discounting it according to, the number of slave terminals 12.
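The downlink-only calculation mentioned above amounts to a single line of arithmetic. The sketch below uses the plain division variant; the function name and the kbps unit are assumptions, and the text equally allows some other discount in place of the division.

```python
from typing import Sequence


def max_rate_from_downlink(downlink_kbps: Sequence[float], num_slaves: int) -> float:
    """Smallest downlink rate among all terminals, divided by the slave count."""
    if num_slaves < 1 or not downlink_kbps:
        raise ValueError("need at least one slave terminal and one downlink rate")
    bottleneck = min(downlink_kbps)  # the slowest line limits every terminal
    return bottleneck / num_slaves
```

With one master and two slaves whose downlink rates are 8000, 4000, and 6000 kbps, the bottleneck is 4000 kbps, and each terminal receives two streams in the group view mode, so the cap comes out at 2000 kbps per stream.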
Of course, the maximum video transfer rate may also be calculated by referring to both the uplink and downlink transfer rates. Ultimately, the maximum transfer rate must be calculated so that two conditions hold in the group view mode: when each terminal 11, 12 receives video from all the other terminals 11, 12 simultaneously, the downlink capacity of the communication line 2 to which it is connected is not exceeded; and when each terminal 11, 12 sends video to all the other terminals 11, 12 simultaneously, the uplink capacity of that line 2 is not exceeded.
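The two conditions can be checked mechanically for any candidate rate. The sketch below is a simplification under assumptions: it counts video traffic only, ignoring audio and protocol overhead, and the function names and dictionary inputs are hypothetical.

```python
from typing import Dict


def fits_group_view(
    rate_kbps: float, uplink_kbps: Dict[str, float], downlink_kbps: Dict[str, float]
) -> bool:
    """True if every terminal can send to and receive from all others at rate_kbps."""
    peers = len(uplink_kbps) - 1  # each terminal exchanges video with all the others
    return all(
        rate_kbps * peers <= uplink_kbps[t] and rate_kbps * peers <= downlink_kbps[t]
        for t in uplink_kbps
    )


def max_rate_both(uplink_kbps: Dict[str, float], downlink_kbps: Dict[str, float]) -> float:
    """Largest rate satisfying both conditions under the same simplification."""
    peers = len(uplink_kbps) - 1
    if peers < 1:
        raise ValueError("need at least two terminals")
    bottleneck = min(min(uplink_kbps.values()), min(downlink_kbps.values()))
    return bottleneck / peers
```

On asymmetric lines the uplink side is usually the binding constraint, which is consistent with the xDSL remark above.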
In addition, the specific configuration of each unit, the processing procedures, and so on can be modified in various ways without departing from the spirit of the present invention.
The present invention can be used, for example, as a system for realizing multipoint video lectures, video conferences, and the like.
DESCRIPTION OF SYMBOLS
 1 ... Terminal
 11 ... Master terminal
 12 ... Slave terminal

Claims (12)

  1. A video call system that realizes a video call function for transmitting and receiving video between terminals installed at each of three or more sites, one of the terminals serving as a master terminal and the remaining terminals serving as slave terminals, wherein
    the master terminal comprises:
    a maximum rate calculation unit that, at the start of a video call, refers to, for each terminal, the transfer rate at which data can be transmitted via the communication line to which that terminal is connected, and calculates the maximum transfer rate at which each terminal should send to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals;
    a maximum rate designation information transmission unit that transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals;
    a call transmission unit that generates, from video captured by a camera at the site where the master terminal is installed, video data at a rate at or below the calculated maximum transfer rate and transmits it to each of the slave terminals;
    a call reception unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen, and
    each slave terminal comprises:
    a maximum rate designation information reception unit that, at the start of the video call, receives the maximum rate designation information transmitted from the master terminal;
    a call transmission unit that generates, from video captured by a camera at the site where the slave terminal is installed, video data at a rate at or below the maximum transfer rate described in the maximum rate designation information and transmits it to each of the master terminal and the other slave terminals;
    a call reception unit that receives video data transmitted from each of the master terminal and the other slave terminals; and
    a call output control unit that displays the received video data on a display screen.
  2. A video call system that realizes a video call function for transmitting and receiving video between terminals installed at each of three or more sites, one of the terminals serving as a master terminal and the remaining terminals serving as slave terminals, wherein
    the master terminal comprises:
    a maximum rate calculation unit that, at the start of a video call, refers to, for each terminal, the transfer rate at which data can be received via the communication line to which that terminal is connected, and calculates the maximum transfer rate at which each terminal should send to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals;
    a maximum rate designation information transmission unit that transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals;
    a call transmission unit that generates, from video captured by a camera at the site where the master terminal is installed, video data at a rate at or below the calculated maximum transfer rate and transmits it to each of the slave terminals;
    a call reception unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen, and
    each slave terminal comprises:
    a maximum rate designation information reception unit that, at the start of the video call, receives the maximum rate designation information transmitted from the master terminal;
    a call transmission unit that generates, from video captured by a camera at the site where the slave terminal is installed, video data at a rate at or below the maximum transfer rate described in the maximum rate designation information and transmits it to each of the master terminal and the other slave terminals;
    a call reception unit that receives video data transmitted from each of the master terminal and the other slave terminals; and
    a call output control unit that displays the received video data on a display screen.
  3. The video call system according to claim 1 or 2, capable of switching between a group view mode, in which every terminal transmits video data to each of the other terminals and every terminal displays the video of all sites other than its own, and a single view mode, in which the master terminal transmits video data to each of the slave terminals and the slave terminals display the video of the site where the master terminal is installed, wherein
    the master terminal further comprises:
    an operation input reception unit that accepts an operation input instructing a switch between the group view mode and the single view mode; and
    a command transmission unit that transmits to each of the slave terminals a switching command, which is a command for switching between the group view mode and the single view mode in accordance with the accepted operation input,
    the call transmission unit of the master terminal, in the single view mode, generates video data with a higher transfer rate than in the group view mode without being bound by the maximum transfer rate, and transmits it to each of the slave terminals,
    each slave terminal further comprises a command reception unit that receives the switching command transmitted from the master terminal,
    the call transmission unit of the slave terminal, in the single view mode, transmits video data only to the master terminal, and
    the call reception unit of the slave terminal, in the single view mode, receives video data only from the master terminal.
  4. The video call system according to claim 1 or 2, capable of switching between a group view mode, in which every terminal transmits video data to each of the other terminals and every terminal displays the video of all sites other than its own, and a single view mode, in which one selected terminal transmits video data to each of the other terminals and the other terminals display the video of the site where the selected terminal is installed, wherein
    the master terminal further comprises:
    an operation input reception unit that accepts an operation input instructing a switch between the group view mode and the single view mode, and an operation input selecting one of the terminals for the single view mode; and
    a command transmission unit that transmits to each of the slave terminals a switching command, which is a command for switching between the group view mode and the single view mode, or a command indicating which terminal has been selected in the single view mode, in accordance with the accepted operation input,
    the call transmission unit of the master terminal, in the single view mode with the master terminal itself selected, generates video data with a higher transfer rate than in other cases without being bound by the maximum transfer rate and transmits it to each of the slave terminals, and, in the single view mode with a slave terminal selected, transmits video data only to the selected terminal,
    the call reception unit of the master terminal, in the single view mode with a slave terminal selected, receives video data only from the selected terminal,
    each slave terminal further comprises a command reception unit that receives the switching command transmitted from the master terminal,
    the call transmission unit of the slave terminal, in the single view mode with the slave terminal itself selected, generates video data with a higher transfer rate than in other cases without being bound by the maximum transfer rate and transmits it to each of the master terminal and the other slave terminals, and, in the single view mode with the master terminal or another slave terminal selected, transmits video data only to the selected terminal, and
    the call reception unit of the slave terminal, in the single view mode with the master terminal or another slave terminal selected, receives video data only from the selected terminal.
  5. The video call system according to claim 4, wherein
    the operation input reception unit of the master terminal, in the single view mode with a slave terminal selected, accepts an operation input instructing that the pan, tilt, or zoom of the camera attached to that slave terminal be operated,
    the command transmission unit of the master terminal transmits to the selected slave terminal a camera operation command for operating the pan, tilt, or zoom of the camera in accordance with the accepted operation input, and
    the command reception unit of the slave terminal, in the single view mode with the slave terminal itself selected, receives the camera operation command transmitted from the master terminal.
  6. A master terminal used to constitute the video call system according to claim 1, comprising:
    a maximum rate calculation unit that, at the start of a video call, refers to, for each terminal, the transfer rate at which data can be transmitted via the communication line to which that terminal is connected, and calculates the maximum transfer rate at which each terminal should send to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals;
    a maximum rate designation information transmission unit that transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals;
    a call transmission unit that generates, from video captured by a camera at the site where the master terminal is installed, video data at a rate at or below the calculated maximum transfer rate and transmits it to each of the slave terminals;
    a call reception unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen.
  7. A master terminal used to constitute the video call system according to claim 2, comprising:
    a maximum rate calculation unit that, at the start of a video call, refers to, for each terminal, the transfer rate at which data can be received via the communication line to which that terminal is connected, and calculates the maximum transfer rate at which each terminal should send to the other terminals during the video call by discounting that transfer rate according to the number of slave terminals;
    a maximum rate designation information transmission unit that transmits maximum rate designation information describing the calculated maximum transfer rate to each of the slave terminals;
    a call transmission unit that generates, from video captured by a camera at the site where the master terminal is installed, video data at a rate at or below the calculated maximum transfer rate and transmits it to each of the slave terminals;
    a call reception unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen.
  8. A master terminal used to constitute the video call system according to claim 3, comprising:
    an operation input receiving unit that receives an operation input for switching between a group view mode and a single view mode;
    a command transmission unit that transmits, to each of the slave terminals, a switching command for switching between the group view mode and the single view mode in response to the received operation input;
    a call transmission unit that generates video data from video captured by a camera at the installation site of the master terminal and transmits the video data to each of the slave terminals;
    a call receiving unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen,
    wherein, in the single view mode, the call transmission unit generates video data having a higher transfer rate than in the group view mode and transmits the video data to each of the slave terminals.
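As a rough illustration of the mode-dependent rate in this claim, the sketch below picks the encoder target rate from the current view mode. The `ViewMode` enum, the function name, and the concrete rates are hypothetical; the claim only requires that the single view rate be higher than the group view rate.

```python
from enum import Enum


class ViewMode(Enum):
    GROUP = "group"
    SINGLE = "single"


def target_rate_kbps(mode: ViewMode, group_rate_kbps: float,
                     single_rate_kbps: float) -> float:
    """Pick the master terminal's encoder rate for the current view mode."""
    if single_rate_kbps <= group_rate_kbps:
        # The claim requires a higher rate in single view mode.
        raise ValueError("single view rate must exceed group view rate")
    return single_rate_kbps if mode is ViewMode.SINGLE else group_rate_kbps


print(target_rate_kbps(ViewMode.GROUP, 600.0, 1800.0))
print(target_rate_kbps(ViewMode.SINGLE, 600.0, 1800.0))
```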
  9. A master terminal used to constitute the video call system according to claim 4, comprising:
    an operation input receiving unit that receives an operation input for switching between a group view mode and a single view mode, and an operation input for selecting any one terminal in the single view mode;
    a command transmission unit that transmits, to each of the slave terminals and in response to the received operation input, a switching command that is either a command for switching between the group view mode and the single view mode or a command indicating which terminal has been selected in the single view mode;
    a call transmission unit that generates video data from video captured by a camera at the installation site of the master terminal and transmits the video data to each of the slave terminals;
    a call receiving unit that receives video data transmitted from each of the slave terminals; and
    a call output control unit that displays the received video data on a display screen,
    wherein, when the single view mode is in effect and the master terminal itself is selected, the call transmission unit generates video data having a higher transfer rate than in other cases and transmits the video data to each of the slave terminals, whereas, when the single view mode is in effect and a slave terminal is selected, the call transmission unit transmits video data only to the selected terminal, and
    wherein, when the single view mode is in effect and a slave terminal is selected, the call receiving unit receives video data only from the selected terminal.
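The send and receive rules of this claim can be sketched as two small functions on the master terminal. The names `SendPlan`, `master_send_plan`, and `master_receive_sources` are hypothetical, and treating a single view mode with no selection like the group view mode is an assumption, since the claim does not cover that case.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class SendPlan:
    recipients: Tuple[str, ...]  # terminals that receive the master's video
    high_rate: bool              # True when the higher transfer rate applies


def master_send_plan(single_view: bool, selected: Optional[str],
                     master_id: str, slave_ids: List[str]) -> SendPlan:
    if single_view and selected == master_id:
        # Master is the selected terminal: high-rate video to every slave.
        return SendPlan(tuple(slave_ids), True)
    if single_view and selected in slave_ids:
        # A slave is selected: send only to that slave.
        return SendPlan((selected,), False)
    # Group view (or, by assumption, single view with no selection).
    return SendPlan(tuple(slave_ids), False)


def master_receive_sources(single_view: bool, selected: Optional[str],
                           slave_ids: List[str]) -> Tuple[str, ...]:
    if single_view and selected in slave_ids:
        return (selected,)  # receive only from the selected slave
    return tuple(slave_ids)


slaves = ["slave-1", "slave-2"]
print(master_send_plan(True, "slave-2", "master", slaves))
print(master_receive_sources(True, "slave-2", slaves))
```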
  10. A slave terminal used to constitute the video call system according to claim 1, comprising:
    a maximum rate designation information receiving unit that receives, at the start of a video call, the maximum rate designation information transmitted from the master terminal;
    a call transmission unit that generates, from video captured by a camera at the installation site of the slave terminal, video data at a rate equal to or less than the maximum transfer rate described in the maximum rate designation information, and transmits the video data to each of the master terminal and the other slave terminals;
    a call receiving unit that receives video data transmitted from each of the master terminal and the other slave terminals; and
    a call output control unit that displays the received video data on a display screen.
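On the slave side, the claim amounts to reading the maximum rate from the received designation information and never encoding above it. The sketch below assumes a JSON message with a `max_rate_kbps` field. That wire format and both function names are hypothetical, since the claim does not specify how the information is encoded.

```python
import json


def parse_max_rate_kbps(message: str) -> float:
    """Extract the maximum transfer rate from a (hypothetical) JSON message."""
    info = json.loads(message)
    rate = float(info["max_rate_kbps"])
    if rate <= 0:
        raise ValueError("maximum rate must be positive")
    return rate


def encoder_rate_kbps(preferred_kbps: float, max_rate_kbps: float) -> float:
    """Clamp the slave terminal's preferred encoder rate to the designated cap."""
    return min(preferred_kbps, max_rate_kbps)


cap = parse_max_rate_kbps('{"max_rate_kbps": 600}')
print(encoder_rate_kbps(1500.0, cap))
```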
  11. A slave terminal used to constitute the video call system according to claim 3, comprising:
    a command receiving unit that receives a switching command, transmitted from the master terminal, for switching between a group view mode and a single view mode;
    a call transmission unit that generates video data from video captured by a camera at the installation site of the slave terminal and transmits the video data to each of the master terminal and the other slave terminals;
    a call receiving unit that receives video data transmitted from each of the master terminal and the other slave terminals; and
    a call output control unit that displays the received video data on a display screen,
    wherein, in the single view mode, the call transmission unit transmits video data only to the master terminal, and
    wherein, in the single view mode, the call receiving unit receives video data only from the master terminal.
  12. A slave terminal used to constitute the video call system according to claim 4, comprising:
    a command receiving unit that receives a switching command transmitted from the master terminal, the switching command being either a command for switching between a group view mode and a single view mode or a command indicating which terminal has been selected in the single view mode;
    a call transmission unit that generates video data from video captured by a camera at the installation site of the slave terminal and transmits the video data to each of the master terminal and the other slave terminals;
    a call receiving unit that receives video data transmitted from each of the master terminal and the other slave terminals; and
    a call output control unit that displays the received video data on a display screen,
    wherein, when the single view mode is in effect and the slave terminal itself is selected, the call transmission unit generates video data having a higher transfer rate than in other cases and transmits the video data to each of the master terminal and the other slave terminals, whereas, when the single view mode is in effect and the master terminal or another slave terminal is selected, the call transmission unit transmits video data only to the selected terminal, and
    wherein, when the single view mode is in effect and the master terminal or another slave terminal is selected, the call receiving unit receives video data only from the selected terminal.
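The command receiving unit of this claim has to track two things: the current view mode and, in single view mode, the selected terminal. A minimal sketch of that state handling follows. The command dictionaries (`type`, `mode`, `terminal_id`) are a hypothetical format. It is also an assumption that a selection command implies single view mode and that returning to group view clears the selection.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class ViewState:
    single_view: bool = False
    selected: Optional[str] = None  # selected terminal id in single view mode


def apply_switch_command(state: ViewState, command: dict) -> ViewState:
    """Return the new view state after a switching command from the master."""
    kind = command["type"]
    if kind == "set_mode":
        if command["mode"] not in ("group", "single"):
            raise ValueError("unknown mode")
        single = command["mode"] == "single"
        # Assumption: leaving single view mode clears the selection.
        return ViewState(single, state.selected if single else None)
    if kind == "select_terminal":
        # Assumption: selecting a terminal implies single view mode.
        return ViewState(True, command["terminal_id"])
    raise ValueError("unknown switching command")


state = ViewState()
state = apply_switch_command(state, {"type": "set_mode", "mode": "single"})
state = apply_switch_command(state, {"type": "select_terminal", "terminal_id": "slave-2"})
print(state)
```

The resulting state is what the call transmission and call receiving units would consult to decide whether to send at the higher rate to all peers or to exchange video only with the selected terminal.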
PCT/JP2011/071306 2010-09-30 2011-09-20 Video call system, main-side terminal, and subordinate-side terminals WO2012043290A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2010221091A JP2012080186A (en) 2010-09-30 2010-09-30 Video speech system, master terminal, and slave terminal
JP2010-221091 2010-09-30

Publications (1)

Publication Number Publication Date
WO2012043290A1 (en) 2012-04-05

Family

ID=45892754

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2011/071306 WO2012043290A1 (en) 2010-09-30 2011-09-20 Video call system, main-side terminal, and subordinate-side terminals

Country Status (3)

Country Link
JP (1) JP2012080186A (en)
TW (1) TW201220847A (en)
WO (1) WO2012043290A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106465087A (en) * 2014-05-16 2017-02-22 华为技术有限公司 Method for transferring communication message and related apparatus

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106572320A (en) * 2016-11-11 2017-04-19 上海斐讯数据通信技术有限公司 Multiparty video conversation method and system

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2005260384A (en) * 2004-03-10 2005-09-22 Fujitsu Ltd Video conference system
JP2006005422A (en) * 2004-06-15 2006-01-05 Nec Corp Mobile communication system, wireless base station, mobile communication method, and mobile communication program
JP2006191283A (en) * 2005-01-05 2006-07-20 Hitachi Communication Technologies Ltd Conference server, client terminal device for teleconferencing, speed adjusting method, and program

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106465087A (en) * 2014-05-16 2017-02-22 华为技术有限公司 Method for transferring communication message and related apparatus
CN106465087B (en) * 2014-05-16 2020-03-20 华为技术有限公司 Communication message transfer method and related device

Also Published As

Publication number Publication date
TW201220847A (en) 2012-05-16
JP2012080186A (en) 2012-04-19

Similar Documents

Publication Publication Date Title
AU2017254981B2 (en) Reduced latency server-mediated audio-video communication
US20170208291A1 (en) System and method for video communication on mobile devices
KR100939182B1 (en) Group communication server
EP2437491A1 (en) Method and system for video conference control, network equipment and meeting places for video conference
EP1676439B1 (en) Video conference with improved multi media capabilities
WO2012075937A1 (en) Video call method and videophone
WO2015154608A1 (en) Method, system and apparatus for sharing video conference material
KR20140098573A (en) Apparatus and Methd for Providing Video Conference
US20100066806A1 (en) Internet video image producing method
US20160205347A1 (en) Video conferencing system and multi-way video conference switching method
JP2004187170A (en) Video conference system
JP2016163222A (en) Communication system, communication method, relay device, and program
WO2016206471A1 (en) Multimedia service processing method, system and device
US20130265380A1 (en) Method, Device, and Network System for Controlling Multiple Auxiliary Streams
EP3734967A1 (en) Video conference transmission method and apparatus, and mcu
WO2012043290A1 (en) Video call system, main-side terminal, and subordinate-side terminals
JP5481693B2 (en) Video call system, calling terminal, receiving terminal, program
CN111405229A (en) Video conference processing method, system, client, electronic device and storage medium
WO2011010563A1 (en) Video call system, master-side terminal, slave-side terminal, and program
KR20170071251A (en) Multi-point control unit for providing conference service
KR100911692B1 (en) Communication System and Method for Providing Real-time Watching of Multi-point Conversation Service
CN112839197B (en) Image code stream processing method, device, system and storage medium
KR101458408B1 (en) Method and System for Sharing Information using SIP Based Smart Devices
CN117880422A (en) Audio telephone video capability expanding method, service system, equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11828847

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11828847

Country of ref document: EP

Kind code of ref document: A1