JP2010093479A

JP2010093479A - Conference system and conference terminal device

Info

Publication number: JP2010093479A
Application number: JP2008260538A
Authority: JP
Inventors: Shoichi Koga; 正一古賀
Original assignee: Panasonic Corp
Current assignee: Panasonic Corp
Priority date: 2008-10-07
Filing date: 2008-10-07
Publication date: 2010-04-22

Abstract

PROBLEM TO BE SOLVED: To provide a conference system and a conference terminal device that can be built from simple hardware by reducing the load required for distribution to the communication terminals operated by conference observers.

SOLUTION: The conference system has a host terminal M operated by the host among a plurality of speakers, speaker terminals A1 and A2 operated by other speakers, a speaker terminal A3 cascaded to the speaker terminal A1 and transmitting its conference video to the host terminal M through the speaker terminal A1, a distribution server SV that receives the conference video from the host terminal M and broadcasts it, and one or more observer terminals B, each operated by an observer, that receive the conference video broadcast from the distribution server SV. The host terminal M synthesizes the conference video transmitted from the speaker terminal A1 with its own conference video and transmits the result to the speaker terminal A1 and the distribution server SV through an IP network NW. The distribution server SV transmits the conference video received from the host terminal M to the observer terminals B.

COPYRIGHT: (C)2010,JPO&INPIT

Description

The present invention relates to a conference system and a conference terminal device that allow one or more observers to listen in on a conference in which a plurality of speakers speak.

Conference systems have come into use in which terminals located at multiple sites are communicably connected to a network, and a server synthesizes the conference audio sent by the speakers at each terminal and distributes it to the terminals at those sites.

For example, the multipoint conference system described in Patent Document 1 has a conference call server that generates audio data by mixing the audio data from terminals belonging to a speaker group, and an audio distribution server that distributes the mixed audio data to the terminals belonging to the speaker group and to a listener group.
Patent Document 1: JP 2005-269347 A

However, in the multipoint conference system of Patent Document 1, the audio distribution server distributes the conference audio, which the conference call server synthesizes from the speaker-group terminals, not only to every terminal in the speaker group but also to every terminal in the listener group. The system therefore needs hardware capable of high-speed audio processing and high-speed delivery to the network.

Accordingly, an object of the present invention is to provide a conference system and a conference terminal device that can be configured with simple hardware by reducing the load required for distribution to the communication terminals operated by observers.

In the present invention, the following are communicably connected to a network: a first communication terminal operated by one of a plurality of speakers; at least one second communication terminal operated by another speaker; a distribution server that receives conference audio from the first communication terminal and broadcasts it; and one or more third communication terminals, each operated by an observer, that receive the conference audio broadcast from the distribution server so that the observer can listen in on the conference. The first communication terminal comprises synthesizing means for synthesizing the conference audio transmitted from the second communication terminal with its own conference audio, and transmitting means for transmitting the synthesized conference audio to the distribution server and the second communication terminal via the network. The distribution server comprises storage means for storing the third communication terminals that are the destinations of the broadcast, and distribution means for transmitting the conference audio received from the first communication terminal, via the network, to the destination third communication terminals stored in the storage means.

In the present invention, the first communication terminal both synthesizes the conference audio and transmits it to the second communication terminal, so the distribution server only has to bear the load of distributing to the third communication terminals operated by observers. Reducing the server's load in this way allows the system to be configured with simple hardware.

The first invention of the present application is a conference system in which the following are communicably connected to a network: a first communication terminal operated by one of a plurality of speakers; at least one second communication terminal operated by another speaker; a distribution server that receives conference audio from the first communication terminal and broadcasts it; and one or more third communication terminals, each operated by an observer, that receive the conference audio broadcast from the distribution server so that the observer can listen in on the conference. The first communication terminal comprises synthesizing means for synthesizing the conference audio transmitted from the second communication terminal with its own conference audio, and transmitting means for transmitting the synthesized conference audio to the distribution server and the second communication terminal via the network. The distribution server comprises storage means for storing the third communication terminals that are the destinations of the broadcast, and distribution means for transmitting the conference audio received from the first communication terminal, via the network, to the destination third communication terminals stored in the storage means.

According to the first invention, the synthesizing means of the first communication terminal, which is operated by one of the plurality of speakers, synthesizes the conference audio transmitted from the second communication terminal with its own conference audio, and the transmitting means sends the synthesized conference audio to the distribution server and the second communication terminal. The distribution server merely forwards the conference audio received from the first communication terminal, through its distribution means, to the destination third communication terminals stored in the storage means, so its distribution load is light. Consequently, if the first communication terminal already has the function of synthesizing the conference audio from the second communication terminal with its own conference audio and transmitting the result to the second communication terminal, it only needs to add the distribution server as a further destination. The distribution server does not need to distribute to the first communication terminal or to the one or more second communication terminals; it only has to distribute to the third communication terminals operated by observers, so it does not require high performance. The present invention can therefore be configured with simple hardware by reducing the load required for distribution to the third communication terminals operated by observers.
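The load argument above can be illustrated with a simple count of outgoing streams. The following is a minimal sketch, not part of the disclosure: it assumes one unicast stream per destination and that every second communication terminal is connected directly to the first communication terminal. All function names are hypothetical.

```python
def server_streams_prior_art(speakers: int, observers: int) -> int:
    """Streams sent by a server that delivers the mix to every terminal,
    as in Patent Document 1 (speaker group plus listener group)."""
    return speakers + observers


def server_streams_invention(speakers: int, observers: int) -> int:
    """Streams sent by the distribution server when the first terminal
    serves the speakers itself: only the observers remain."""
    return observers


def first_terminal_streams(speakers: int) -> int:
    """Streams sent by the first terminal: one per other speaker,
    plus one extra destination for the distribution server."""
    return (speakers - 1) + 1
```

Under these assumptions, a conference with 4 speakers and 100 observers needs 104 server streams in the prior-art arrangement and 100 in the arrangement described here, while the first communication terminal sends 4.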

In the second invention of the present application, according to the first invention, the synthesizing means of the first communication terminal synthesizes video in addition to the conference audio, and the distribution means of the distribution server distributes the audio and video synthesized by the synthesizing means as conference audio/video. Because the synthesizing means synthesizes video as well as conference audio, the first communication terminal can transmit video as well as audio.

In the third invention of the present application, according to the second invention, the third communication terminal has streaming playback means and uses it to play back the conference audio/video distributed from the distribution server. As a result, an observer operating the third communication terminal can follow not only the audio but also the video of the conference.

In the fourth invention of the present application, according to any one of the first to third inventions, the distribution server has notification means for notifying the third communication terminals stored in the storage means that the conference is starting. According to the fourth invention, an observer operating a third communication terminal learns from this notification that the conference has started.

In the fifth invention of the present application, according to any one of the first to fourth inventions, when the first communication terminal receives a conference participation request from a fourth communication terminal that wants to newly join the conference, it instructs the fourth communication terminal to send its conference participation request to the second communication terminal. When the second communication terminal receives the conference participation request from the fourth communication terminal, it permits the call, synthesizes the conference audio from the fourth communication terminal with its own audio, and transmits the result to the first communication terminal. According to the fifth invention, even when a fourth communication terminal newly joins the conference, the first communication terminal still only has to synthesize the conference audio transmitted from the second communication terminal with its own audio, so the number of audio streams it synthesizes does not increase.

In the sixth invention of the present application, according to the fifth invention, when a second communication terminal leaves the conference and there is a fourth communication terminal to which that second communication terminal had granted a conference call, the first communication terminal continues the conference treating that fourth communication terminal as a second communication terminal. According to the sixth invention, the first communication terminal synthesizes the conference audio from the fourth communication terminal with the other conference audio and transmits the result to the fourth communication terminal in place of the second communication terminal, so the conference can continue without disruption even after the second communication terminal leaves.

In the seventh invention of the present application, according to the fifth invention, when the fourth communication terminal detects a failure in the second communication terminal, it notifies the first communication terminal that a failure has occurred, and the first communication terminal continues the conference treating that fourth communication terminal as a second communication terminal. According to the seventh invention, the first communication terminal synthesizes the conference audio from the fourth communication terminal with the other conference audio and transmits the result to the fourth communication terminal in place of the second communication terminal, so the conference can continue without disruption even when a failure forces the second communication terminal out of the conference.
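The cascade behaviour of the fifth to seventh inventions can be sketched as bookkeeping at the first communication terminal. This is an illustrative model only: the class `FirstTerminal`, its methods, and the restriction to one cascaded terminal per second terminal are assumptions made for the example, not details taken from the disclosure.

```python
from dataclasses import dataclass, field
from typing import Dict, Optional


@dataclass
class FirstTerminal:
    # Maps each directly connected (second) terminal to the (fourth)
    # terminal cascaded behind it, or None when nothing is cascaded.
    direct: Dict[str, Optional[str]] = field(default_factory=dict)

    def add_direct(self, terminal: str) -> None:
        self.direct[terminal] = None

    def join_via(self, newcomer: str, second: str) -> None:
        """Redirect a newcomer so that it cascades behind `second`."""
        if second not in self.direct:
            raise KeyError(second)
        if self.direct[second] is not None:
            raise ValueError(f"{second} already has a cascaded terminal")
        self.direct[second] = newcomer

    def remove(self, second: str) -> None:
        """Handle `second` leaving or failing: promote its cascaded
        terminal, if any, to a directly connected terminal."""
        cascaded = self.direct.pop(second)
        if cascaded is not None:
            self.direct[cascaded] = None

    def mix_inputs(self) -> int:
        """Number of remote streams mixed with the terminal's own."""
        return len(self.direct)
```

With A1 and A2 connected directly, letting A3 join behind A1 leaves `mix_inputs()` at 2, and removing A1 afterwards promotes A3 so that the count is still 2.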

In the eighth invention of the present application, according to any one of the fifth to seventh inventions, the first, second, and fourth communication terminals are connected by an intranet, and the first communication terminal, the distribution server, and the third communication terminal are connected by the Internet. According to the eighth invention, the first, second, and fourth communication terminals can be located within the same company or facility, and they can communicate via the Internet even when the distribution server and the third communication terminal are at remote locations.

In the ninth invention of the present application, the first communication terminal, the second communication terminal, the fourth communication terminal, and the distribution server are connected by an intranet, and the distribution server and the third communication terminal are connected by the Internet. According to the ninth invention, the first, second, and fourth communication terminals and the distribution server can be located within the same company or facility, and communication via the Internet is possible even when the third communication terminal is at a remote location.

The tenth invention of the present application is a conference terminal device operated by one of a plurality of speakers, comprising synthesizing means for synthesizing the conference audio transmitted from at least one speaker terminal operated by another speaker with its own conference audio, and transmitting means for transmitting the synthesized conference audio to the speaker terminal and also, via a network, to a distribution server that broadcasts it to one or more observer terminals used to listen in on the conference. According to the tenth invention, as with the first invention, if the conference terminal device already has the function of synthesizing the conference audio from the speaker terminal with its own conference audio and transmitting the result to the speaker terminal, it only needs to add the distribution server as a further destination. The distribution server does not need to distribute to the conference terminal device or to the one or more speaker terminals; it only has to distribute to the observer terminals operated by observers, so it does not require high performance. The system can therefore be configured with simple hardware by reducing the load required for distribution to the observer terminals operated by observers.

(Embodiment)
A conference system according to an embodiment of the present invention will be described with reference to the drawings. FIG. 1 shows the overall configuration of the conference system according to the embodiment.

As shown in FIG. 1, the conference system 1 mixes the audio and video of conference attendees and distributes the result as conference audio/video. In the conference system 1, an organizer terminal M (first communication terminal) operated by the attendee who organizes the conference, speaker terminals A (A1 and A2: second communication terminals; A3: fourth communication terminal) operated by attendees who speak in the conference, a distribution server SV, and observer terminals B (B1 to Bn: third communication terminals) operated by observers who listen in on the conference are communicably connected via an IP (Internet Protocol) network NW.

In the present embodiment, the organizer terminal M, which functions as the conference terminal device, and the speaker terminals A1 to A4 are connected to a corporate intranet N1, and the distribution server SV and the observer terminals B are connected to them via the Internet N2. Hereinafter, the organizer terminal M, the speaker terminals A, and the observer terminals B may be referred to simply as "terminals".

FIG. 1 does not show the hubs, routers, and similar devices that connect each terminal and the distribution server SV to the IP network NW. The multicast-capable routers located on the path along which the distribution server SV sends IP packets by multicast are also omitted.
(Configuration of organizer terminal M)
Next, the organizer terminal M will be described with reference to FIG. 2. FIG. 2 is a block diagram showing the configuration of the organizer terminal, which functions as the conference terminal device of the conference system shown in FIG. 1.

The organizer terminal M synthesizes its own conference audio and video with the conference audio and video sent from each speaker terminal A connected to it (the combined audio and video is hereinafter called conference audio/video) and returns the result to each speaker terminal A. The organizer terminal M also transmits the synthesized conference audio/video to the distribution server SV so that observers operating the observer terminals B can listen in on the conference. The organizer terminal M includes an input/output unit 2, an AD/DA unit 3, an arithmetic processing unit 4, an encoding unit 5, a decoding unit 6, a channel data processing unit 7, a packet control unit 8, a call control unit 9, a communication unit 10, an information storage unit 11, and a communication control unit 12.

The input/output unit 2 captures the attendee's audio (including ambient sound) and video and outputs them to the AD/DA unit 3 as audio and video signals, and it plays back the audio and video signals received from the AD/DA unit 3. For audio, the input/output unit 2 can be a microphone and a speaker; for video, it can be a camera and a display.

The AD/DA unit 3 converts between the analog signals of the input/output unit 2 and the digital signals of the arithmetic processing unit 4. Analog signals are output to the input/output unit 2, and digital signals are output to the arithmetic processing unit 4 as uncompressed linear audio/video data.

The arithmetic processing unit 4 functions as the synthesizing means: it synthesizes the audio and video from other terminals, output by the decoding unit 6, with the audio and video from the input/output unit 2, and outputs the result to the encoding unit 5. When no audio or video is being received from other terminals, the arithmetic processing unit 4 outputs the audio and video from the input/output unit 2 to the encoding unit 5 as the synthesized data. When synthesizing audio, the arithmetic processing unit 4 adjusts the level of each audio stream before combining them. When synthesizing video, it divides the full display area according to the number of videos to be combined, under the control of the communication control unit 12, so that all of them fit. For example, when combining the video of the organizer terminal M with that of the speaker terminal A1, each is scaled to half the display size and the two are placed side by side to form a single video.
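The synthesis performed by the arithmetic processing unit 4 can be sketched as follows. This is a minimal illustration under stated assumptions: audio is taken to be equal-length 16-bit linear PCM, the level adjustment is modelled as a fixed gain per stream, and the display is always split into equal side-by-side tiles (the description only gives the two-terminal case). The function names are hypothetical.

```python
from typing import List, Sequence, Tuple


def mix_audio(streams: Sequence[Sequence[int]],
              gains: Sequence[float]) -> List[int]:
    """Mix equal-length 16-bit PCM streams after a per-stream gain."""
    if len(streams) != len(gains):
        raise ValueError("one gain per stream is required")
    mixed = []
    for samples in zip(*streams):
        total = sum(s * g for s, g in zip(samples, gains))
        # Clip to the signed 16-bit range so loud passages do not wrap.
        mixed.append(max(-32768, min(32767, int(round(total)))))
    return mixed


def tile_layout(width: int, height: int,
                count: int) -> List[Tuple[int, int, int, int]]:
    """Split the display into `count` equal side-by-side tiles,
    returned as (x, y, tile_width, tile_height)."""
    if count < 1:
        raise ValueError("at least one video is required")
    tile_w = width // count
    return [(i * tile_w, 0, tile_w, height) for i in range(count)]
```

For two terminals on a 640x480 display, `tile_layout(640, 480, 2)` yields a left and a right tile of 320x480 each, matching the half-and-half example in the description.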

The encoding unit 5 encodes the input uncompressed linear data to generate conference audio/video with a reduced overall data size and outputs it to the channel data processing unit 7. For audio data, the encoding scheme can be, for example, G.711. For video data, it can be H.261, H.263, or H.264, but any codec that the other terminals can also handle may be adopted.
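As background on the G.711 example, the mu-law variant of that codec is built on logarithmic companding. The sketch below shows only the continuous companding curve with mu = 255; G.711 itself specifies a segmented 8-bit approximation of it, so this is an illustration of the idea and not a bit-exact encoder. The function names are hypothetical.

```python
import math

MU = 255  # compression parameter of the mu-law curve


def mu_law_compress(x: float) -> float:
    """Map a sample in [-1, 1] to [-1, 1], boosting quiet samples."""
    return math.copysign(math.log1p(MU * abs(x)) / math.log1p(MU), x)


def mu_law_expand(y: float) -> float:
    """Inverse of mu_law_compress."""
    return math.copysign(math.expm1(abs(y) * math.log1p(MU)) / MU, y)
```

Quiet samples occupy a larger share of the output range after compression, which is why a coarse 8-bit quantizer applied afterwards still preserves low-level speech reasonably well.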

The decoding unit 6 decodes the conference audio/video from the plurality of attendees, output by the channel data processing unit 7, and outputs the decoded audio and video to the arithmetic processing unit 4.

The channel data processing unit 7 takes the conference audio/video (compressed data) synthesized by the arithmetic processing unit 4 and encoded by the encoding unit 5, associates it with each destination call channel designated by the communication control unit 12, and outputs it to the packet control unit 8. It also takes the conference audio/video from the packet control unit 8, associates it with the call channel of each source, and outputs it to the decoding unit 6.

The packet control unit 8 divides the encoded conference audio/video from the channel data processing unit 7 into frames, adds a header (including the codec type and a sequence number) to the audio data of each frame to generate RTP (Real-time Transport Protocol) packets, and transmits them to the IP network NW via the communication unit 10. The packet control unit 8 also extracts the conference audio/video from RTP packets received via the communication unit 10 and outputs it to the channel data processing unit 7.
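The framing step can be sketched with the 12-byte fixed RTP header defined in RFC 3550, in which the payload type field carries the codec type and the sequence number is a 16-bit counter. This is an illustration only: the frame size, the SSRC value, and the use of the byte offset as the timestamp (which suits 8 kHz G.711, one byte per sample) are assumptions, and the parser ignores CSRC entries and header extensions.

```python
import struct
from typing import Dict, List

RTP_VERSION = 2
HEADER = struct.Struct("!BBHII")  # fixed RTP header, network byte order


def build_rtp_packet(payload_type: int, sequence: int, timestamp: int,
                     ssrc: int, payload: bytes, marker: bool = False) -> bytes:
    first = RTP_VERSION << 6  # no padding, no extension, no CSRC entries
    second = (int(marker) << 7) | (payload_type & 0x7F)
    return HEADER.pack(first, second, sequence & 0xFFFF,
                       timestamp & 0xFFFFFFFF, ssrc & 0xFFFFFFFF) + payload


def parse_rtp_packet(packet: bytes) -> Dict[str, object]:
    first, second, sequence, timestamp, ssrc = HEADER.unpack_from(packet)
    if first >> 6 != RTP_VERSION:
        raise ValueError("not an RTP version 2 packet")
    return {"payload_type": second & 0x7F, "sequence": sequence,
            "timestamp": timestamp, "ssrc": ssrc,
            "payload": packet[HEADER.size:]}


def packetize(encoded: bytes, frame_size: int, payload_type: int,
              ssrc: int, first_sequence: int = 0) -> List[bytes]:
    """Split encoded audio into frames and wrap each in an RTP packet."""
    packets = []
    for i, offset in enumerate(range(0, len(encoded), frame_size)):
        frame = encoded[offset:offset + frame_size]
        packets.append(build_rtp_packet(payload_type, first_sequence + i,
                                        offset, ssrc, frame))
    return packets
```

For example, 400 bytes of encoded audio split into 160-byte frames produce three packets, the last carrying the remaining 80 bytes. Payload type 0 is the static RTP assignment for G.711 mu-law (PCMU).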

The packet control unit 8 also transmits call control messages from the call control unit 9 as IP packets, and it extracts call control messages from IP packets received by the communication unit 10 and outputs them to the call control unit 9.

The call control unit 9 controls operations such as placing calls to a communication partner, receiving calls from a communication partner, and conducting a call with a communication partner, based on SIP (Session Initiation Protocol).

The communication unit 10 is a LAN (Local Area Network) physical layer interface, such as IEEE 802.3, that sends and receives IP packets to and from the IP network NW.

The information storage unit 11 is a memory that stores management information, such as the IP address of the distribution server SV to which the conference audio/video is sent and the IP addresses of the speaker terminals A participating in the conference, along with various other information.

The communication control unit 12 performs overall control of the organizer terminal M, controls whether to grant requests from speaker terminals A to join the conference, and controls synthesis in the arithmetic processing unit 4. The communication control unit 12 also manages the speaker terminals A participating in the conference and instructs the packet control unit 8, via the call control unit 9, to transmit the synthesized conference audio/video to those speaker terminals A. In the organizer terminal M of the present embodiment, the communication control unit 12, the call control unit 9, the packet control unit 8, and the communication unit 10 together constitute the transmission unit.

The operation unit 13 has numeric keys for entering telephone numbers, various function keys, and a display screen. It is operated to make settings, to start conference audio/video, and so on. The communication control unit 12 implements the function corresponding to whichever function key of the operation unit 13 is pressed.
(Description of the configuration of speaker terminal A)
Next, the configuration of the speaker terminal A will be described with reference to FIG. 3. FIG. 3 is a block diagram showing the configuration of the attendee terminal of the conference system shown in FIG. 1.

The input/output unit 2, AD/DA unit 3, arithmetic processing unit 4, encoding unit 5, decoding unit 6, channel data processing unit 7, packet control unit 8, call control unit 9, communication unit 10, and operation unit 13 of the speaker terminal A shown in FIG. 3 have the same functions as the corresponding components of the organizer terminal M shown in FIG. 2, so they are given the same reference numerals and their description is omitted.

When no other speaker terminal is connected to it, the speaker terminal A transmits its own audio and video to the organizer terminal M. When another speaker terminal is connected to it, the speaker terminal A synthesizes the audio and video from that cascaded speaker terminal with its own audio and video and transmits the result to the organizer terminal M. The information storage unit 11x of the speaker terminal A stores the IP address of the organizer terminal M, to which the terminal sends its own conference audio and video or the synthesized conference audio/video.

The communication control unit 12x performs overall control of the speaker terminal A, controls requests to the organizer terminal M to join the conference, and controls whether to grant conference participation requests from other speaker terminals A.
(Configuration of distribution server SV)
Next, the distribution server SV will be described with reference to FIG. 4. FIG. 4 is a block diagram showing the configuration of the distribution server of the conference system shown in FIG. 1.

The packet control unit 8, call control unit 9, and communication unit 10 of the distribution server SV shown in FIG. 4 have the same functions as the corresponding components of the organizer terminal M shown in FIG. 2, so they are given the same reference numerals and their description is omitted.

配信サーバＳＶにはマルチキャスト処理部１４と、チャネルコーデック部１５と、記憶装置１６とが設けられている。 The distribution server SV is provided with a multicast processing unit 14, a channel codec unit 15, and a storage device 16.

マルチキャスト処理部１４は、複数の傍聴者端末Ｂをマルチキャストアドレスで指定し、マルチキャストアドレスに対して会議音声／映像をマルチキャスト送信する。 The multicast processing unit 14 designates a plurality of listener terminals B by a multicast address, and multicasts conference audio / video to the multicast address.

配信経路中のマルチキャスト対応ルータは、宛先に応じて会議音声／映像をコピーし、マルチキャストグループへ同報配信をする。このマルチキャスト送信は、ＩＧＭＰ（ｉｎｔｅｒｎｅｔｇｒｏｕｐｍａｎａｇｅｍｅｎｔｐｒｏｔｏｃｏｌ）を使用してマルチキャストグループへ参加した傍聴者端末Ｂへ送信される。 The multicast compatible router in the distribution route copies the conference audio / video according to the destination and broadcasts it to the multicast group. This multicast transmission is transmitted to the listener terminal B that has joined the multicast group using IGMP (Internet group management protocol).

チャネルコーデック部１５は、パケット損失補償（ＰａｃｋｅｔＬｏｓｓＣｏｎｃｅａｌｍｅｎｔ）処理を行う。すなわち、チャネルコーデック部１５は、主催者端末Ｍや発言者端末Ａの符号化部５および復号化部６と同様な符号化および復号化の機能を備え、パケット制御部８からの会議音声／映像を復号化し、音声データおよび映像データが正常であるか否かをチェックする。データに異常が生じている場合、影響を低減できるようなデータに補間する。復号化され補間された音声・映像データは、記憶装置１６へ一旦格納される。チャネルコーデック部１５は、記憶装置１６へ一旦格納した音声・映像データを符号化し、マルチキャスト処理部１４へ渡す。記憶装置１６としては、リアルタイムで会議音声／映像を転送可能なように高速アクセス可能かつ大容量のハードディスク装置が使用される。 The channel codec unit 15 performs packet loss concealment processing. That is, the channel codec unit 15 has the same encoding and decoding functions as the encoding unit 5 and the decoding unit 6 of the organizer terminal M and the speaker terminal A; it decodes the conference audio / video from the packet control unit 8 and checks whether the audio data and the video data are normal. If the data is abnormal, the unit interpolates data that reduces the effect of the abnormality. The decoded and interpolated audio / video data is temporarily stored in the storage device 16. The channel codec unit 15 then encodes the audio / video data stored in the storage device 16 and passes it to the multicast processing unit 14. A large-capacity hard disk device with high-speed access is used as the storage device 16 so that conference audio / video can be transferred in real time.
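The interpolation step described above can be illustrated with a minimal Python sketch. The patent does not specify the interpolation method, so the rule used here (average the neighbouring frames, otherwise repeat the nearest good frame) is an assumption, as are the names `Frame` and `conceal_losses` and the representation of a lost frame as `None`.

```python
from typing import List, Optional

Frame = List[int]  # one decoded frame of linear samples (hypothetical layout)


def conceal_losses(frames: List[Optional[Frame]]) -> List[Frame]:
    """Replace lost frames (None) so that the output sequence has no gaps."""
    out: List[Frame] = []
    for i, frame in enumerate(frames):
        if frame is not None:
            out.append(list(frame))
            continue
        prev = out[-1] if out else None
        # Look ahead for the next frame that actually arrived.
        nxt = next((f for f in frames[i + 1:] if f is not None), None)
        if prev is not None and nxt is not None:
            # Assumed rule: average the two neighbours sample by sample.
            out.append([(a + b) // 2 for a, b in zip(prev, nxt)])
        elif prev is not None:
            out.append(list(prev))   # no later frame: repeat the last good one
        elif nxt is not None:
            out.append(list(nxt))    # no earlier frame: copy the next good one
        else:
            out.append([])           # nothing arrived at all
    return out
```

For example, `conceal_losses([[0, 10], None, [4, 30]])` fills the missing middle frame with `[2, 20]`. A real codec would conceal losses in a codec-specific way; the sketch only shows where the check and the interpolation sit in the flow.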

また、配信サーバＳＶには、情報格納部１１ｙと通信制御部１２ｙとが設けられている。情報格納部１１ｙには、配信サーバＳＶが配信する会議音声／映像の配信先である傍聴者端末Ｂ（Ｂ１〜Ｂｎ）のＩＰアドレスや、この傍聴者端末Ｂを操作する傍聴者の所属、氏名、電話番号、電子メールアドレスが含まれる情報が登録リストとして格納される記憶手段として機能する。 The distribution server SV is also provided with an information storage unit 11y and a communication control unit 12y. The information storage unit 11y functions as a storage means that stores, as a registration list, information including the IP addresses of the listener terminals B (B1 to Bn) to which the distribution server SV distributes conference audio / video, and the affiliation, name, telephone number, and e-mail address of the listener who operates each listener terminal B.

通信制御部１２ｙは、配信サーバＳＶ全体の統括制御をする他、主催者端末Ｍからのマルチキャスト要求に応じて傍聴者端末Ｂへ会議音声／映像を配信するようにマルチキャスト処理部１４へ指示する機能を備えている。また、通信制御部１２ｙは、傍聴者端末Ｂへ会議音声／映像を配信するときには事前に会議の開催を通知して傍聴者へ傍聴開始を促す通知手段として機能する。 The communication control unit 12y performs overall control of the distribution server SV and also instructs the multicast processing unit 14 to distribute conference audio / video to the listener terminals B in response to a multicast request from the organizer terminal M. In addition, the communication control unit 12y functions as a notification unit that, before conference audio / video is distributed to the listener terminals B, announces that the conference is being held and prompts the listeners to start listening.

（傍聴者端末Ｂの構成） (Configuration of the listener terminal B)

次に、傍聴者端末Ｂを、図５に基づいて説明する。図５は、図１に示す会議システムの傍聴者端末の構成を示すブロック図である。 Next, the listener terminal B will be described with reference to FIG. 5. FIG. 5 is a block diagram showing the configuration of a listener terminal of the conference system shown in FIG. 1.

図５に示す傍聴者端末Ｂのチャネルデータ処理部７、復号化部６、呼制御部９、通信部１０は、図２に示す主催者端末Ｍの各構成と同じ機能であるため、同符号を付して説明を省略する。また、傍聴者端末Ｂのチャネルデータ処理部７ｘとパケット制御部８ｘとは、図２に示す主催者端末Ｍのチャネルデータ処理部７およびパケット制御部８の一方向のみの機能を備えたものであるため説明を省略する。 The channel data processing unit 7, the decoding unit 6, the call control unit 9, and the communication unit 10 of the listener terminal B shown in FIG. 5 have the same functions as the corresponding components of the organizer terminal M shown in FIG. 2, so they are given the same reference numerals and their description is omitted. The channel data processing unit 7x and the packet control unit 8x of the listener terminal B provide only one direction of the functions of the channel data processing unit 7 and the packet control unit 8 of the organizer terminal M shown in FIG. 2, so their description is also omitted.

傍聴者端末Ｂに設けられている出力部２ｘは、ＤＡ部３ｘからの会議の出席者の合成された音声・映像を再生するスピーカおよびディスプレイである。ＤＡ部３ｘは、ストリーミング処理部１７からのデジタル信号である非圧縮の線形データをアナログ信号に変換する。 The output unit 2x provided in the listener terminal B is a speaker and a display for reproducing synthesized voice / video of the attendees of the conference from the DA unit 3x. The DA unit 3x converts uncompressed linear data, which is a digital signal from the streaming processing unit 17, into an analog signal.

ストリーミング処理部１７は、復号化部６から非圧縮の線形データをバファリングしながら、ＤＡ部３ｘに対してストリーミングデータとして出力するストリーミング再生手段として機能するものである。 The streaming processing unit 17 functions as a streaming reproducing unit that outputs uncompressed linear data from the decoding unit 6 as streaming data to the DA unit 3x while buffering.
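The buffering behaviour of the streaming processing unit 17 can be sketched as follows. This is a minimal Python illustration, not the patented implementation: the class name `StreamingBuffer`, the frame-count threshold, and the decision to rebuffer after an underrun are all assumptions made for the example.

```python
from collections import deque
from typing import Deque, Optional


class StreamingBuffer:
    """Holds decoded frames back until a start threshold has been reached."""

    def __init__(self, start_threshold: int) -> None:
        # Hypothetical stand-in for the "predetermined capacity", in frames.
        self.start_threshold = start_threshold
        self._frames: Deque[bytes] = deque()
        self._playing = False

    def push(self, frame: bytes) -> None:
        """Accept one decoded frame from the decoding stage."""
        self._frames.append(frame)
        if len(self._frames) >= self.start_threshold:
            self._playing = True

    def pop(self) -> Optional[bytes]:
        """Return the next frame for the DA stage, or None while buffering."""
        if not self._frames:
            self._playing = False  # assumed policy: rebuffer after an underrun
            return None
        if not self._playing:
            return None
        return self._frames.popleft()
```

With a threshold of two frames, the first `pop()` after a single `push()` returns `None`, and playback starts once the second frame has arrived.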

情報格納部１１ｚには、様々な設定情報が格納されている。通信制御部１２ｚは、傍聴者端末Ｂ全体の資源管理などの統括的な制御をする他、配信サーバＳＶに対して会議音声／映像の配信を要求する。 Various setting information is stored in the information storage unit 11z. The communication control unit 12z performs overall control, such as resource management, of the listener terminal B, and also requests the distribution server SV to distribute conference audio / video.

（会議システムの動作および使用状態） (Operation and use of the conference system)

以上のように構成された本発明の実施の形態に係る会議システムの動作および使用状態を図６から図１８に基づいて説明する。 The operation and use of the conference system according to the embodiment of the present invention configured as described above will be described with reference to FIGS. 6 to 18.

（傍聴者端末Ｂの登録） (Registration of the listener terminal B)

まず傍聴者が、開催される会議を傍聴したい場合に行う配信要求の登録を、図６に基づいて説明する。図６は、傍聴者端末の登録を説明するためのシーケンス図である。なお、図６において、傍聴者は傍聴者端末Ｂの操作部１３を操作して配信サーバＳＶへ配信要求の登録を行っているが、他のパーソナルコンピュータから登録操作してもよい。 First, the registration of a distribution request, which a listener performs when wishing to observe a conference to be held, will be described with reference to FIG. 6. FIG. 6 is a sequence diagram for explaining registration of a listener terminal. In FIG. 6, the listener operates the operation unit 13 of the listener terminal B to register the distribution request with the distribution server SV, but the registration may also be performed from another personal computer.

図６に示すように、傍聴者端末Ｂを操作する傍聴者は、傍聴者端末Ｂ上で動作する認証プログラムを起動して認証画面を表示させたり、配信サーバＳＶにウェブアクセスし、傍聴者端末Ｂで動作させたブラウザに配信サーバＳＶから送信された認証画面を表示させたりして、それぞれの傍聴者に割り当てられた識別情報（例えば、ユーザＩＤとパスワードなど）を入力する。傍聴者端末Ｂは、入力された識別情報を配信サーバＳＶへ送信する（ステップＳ１０）。 As shown in FIG. 6, the observer operating the observer terminal B activates an authentication program that operates on the observer terminal B to display an authentication screen, or accesses the distribution server SV via the web, The authentication screen transmitted from the distribution server SV is displayed on the browser operated in B, and identification information (for example, user ID and password) assigned to each listener is input. The observer terminal B transmits the input identification information to the distribution server SV (step S10).

配信サーバＳＶは、傍聴者端末Ｂから送信された識別情報が、情報格納部１１ｙに格納された登録リストに含まれているか否かを判定する。この登録リストには、配信サーバＳＶへのアクセスおよび会議の傍聴が許可されている傍聴者に関する情報が登録されており、傍聴者の氏名、所属、電話番号、ユーザＩＤ、パスワード、傍聴者端末ＢのＩＰアドレス、配信の有効／無効フラグなどが含まれている。傍聴者端末Ｂに入力されたユーザＩＤおよびパスワード（識別情報）が、登録リストに含まれていれば、アクセス許可の通知を傍聴者端末Ｂへ送信する（ステップＳ１１）。なお、新規の傍聴者が認証を得たい場合には、新規の傍聴者は配信サーバＳＶに対して管理者権限を有する管理者に依頼しておく。管理者は傍聴者の認証のための識別情報を管理者権限のＩＤ・パスワードを用いて配信サーバにログインして、予め傍聴者から取得しておいた傍聴者に関する情報を登録する。これで、傍聴者は傍聴者端末Ｂを操作して傍聴可能な状態となる。 The distribution server SV determines whether the identification information transmitted from the listener terminal B is included in the registration list stored in the information storage unit 11y. The registration list holds information on the listeners who are permitted to access the distribution server SV and observe conferences, including each listener's name, affiliation, telephone number, user ID, password, the IP address of the listener terminal B, and a distribution valid / invalid flag. If the user ID and password (identification information) entered at the listener terminal B are included in the registration list, the distribution server SV transmits an access permission notification to the listener terminal B (step S11). A new listener who wishes to be authenticated asks, in advance, an administrator who has administrator authority over the distribution server SV. The administrator logs in to the distribution server with the administrator ID and password and registers the identification information used to authenticate the listener together with the information about the listener obtained from the listener beforehand. The listener can then operate the listener terminal B to observe conferences.

次に、配信サーバＳＶからのアクセス許可を受信した傍聴者端末Ｂは、開催された会議の傍聴を希望することを示す配信要求を送信する（ステップＳ１２）。 Next, the listener terminal B that has received the access permission from the distribution server SV transmits a distribution request indicating that he / she wishes to listen to the held conference (step S12).

配信サーバＳＶは、配信要求が送信された傍聴者端末ＢのＩＰアドレスに関連付けられている配信の有効／無効フラグを有効とする傍聴者端末Ｂの登録を行う（ステップＳ１３）。 The distribution server SV registers the listener terminal B by setting to valid the distribution valid / invalid flag associated with the IP address of the listener terminal B from which the distribution request was transmitted (step S13).
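Steps S10 to S13 can be pictured as operations on the registration list. The following Python sketch is illustrative only: the class and field names are hypothetical, the list is held in memory, and the password is compared as plain text purely to keep the example short.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ListenerEntry:
    name: str
    affiliation: str
    phone: str
    password: str                  # plain text only for brevity (assumption)
    ip_address: str
    delivery_enabled: bool = False  # the distribution valid / invalid flag


class RegistrationList:
    def __init__(self) -> None:
        self._entries: Dict[str, ListenerEntry] = {}  # keyed by user ID

    def add(self, user_id: str, entry: ListenerEntry) -> None:
        """Administrator-side registration of a new listener."""
        self._entries[user_id] = entry

    def authenticate(self, user_id: str, password: str) -> bool:
        """Steps S10-S11: accept only ID / password pairs found in the list."""
        entry = self._entries.get(user_id)
        return entry is not None and entry.password == password

    def request_delivery(self, ip_address: str) -> bool:
        """Steps S12-S13: set the flag of the entry matching the sender's IP."""
        for entry in self._entries.values():
            if entry.ip_address == ip_address:
                entry.delivery_enabled = True
                return True
        return False

    def enabled_listeners(self) -> List[ListenerEntry]:
        """Entries whose flag is valid, i.e. the terminals to be served."""
        return [e for e in self._entries.values() if e.delivery_enabled]
```

Keying the flag update on the IP address follows the description of step S13; a production system would more likely tie the request to the authenticated session.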

傍聴者端末Ｂの登録が完了すると、配信サーバＳＶは登録が完了したことを示す完了通知を傍聴者端末Ｂへ送信する（ステップＳ１４）。 When the registration of the observer terminal B is completed, the distribution server SV transmits a completion notification indicating that the registration is completed to the observer terminal B (step S14).

完了通知を受信した傍聴者端末Ｂは、配信サーバＳＶがマルチキャストで会議音声／映像を配信する際の配信条件を示すマルチキャスト配信条件設定要求を送信する。このマルチキャスト配信条件設定要求は、例えば、会議音声の音質や、スロー再生など設定を配信条件として送信する（ステップＳ１５）。 The listener terminal B that has received the completion notification transmits a multicast distribution condition setting request indicating a distribution condition when the distribution server SV distributes conference audio / video by multicast. This multicast distribution condition setting request transmits, for example, settings such as sound quality of conference audio and slow playback as distribution conditions (step S15).

配信サーバＳＶは、このマルチキャスト条件設定要求を受信すると、会議音声／映像をマルチキャスト送信する際の条件として情報格納部１１ｙに登録する（ステップＳ１６）。 When receiving the multicast condition setting request, the distribution server SV registers it in the information storage unit 11y as a condition for multicast transmission of conference audio / video (step S16).

そして、配信サーバＳＶは、傍聴者端末Ｂの登録が完了したことを示す完了の通知を傍聴者端末Ｂへ送信する（ステップＳ１７）。 The distribution server SV then transmits to the listener terminal B a completion notification indicating that the registration of the listener terminal B is complete (step S17).

（全体の流れ） (Overall flow)

次に、主催者端末Ｍを操作する主催者と、発言者端末Ａ１を操作する出席者とが会議を行っている会議音声／映像を、傍聴者端末Ｂ（Ｂ１〜Ｂｎ）を操作する傍聴者が傍聴する一連の流れについて、図７および図８に基づいて説明する。図７は、会議システムによる会議の全体の流れを説明するためのシーケンス図、図８は、２者会議を行っている会議システムを示す図である。 Next, the sequence in which listeners operating the listener terminals B (B1 to Bn) observe the conference audio / video of a conference held between the organizer operating the organizer terminal M and the attendee operating the speaker terminal A1 will be described with reference to FIGS. 7 and 8. FIG. 7 is a sequence diagram for explaining the overall flow of a conference in the conference system, and FIG. 8 is a diagram showing the conference system during a two-party conference.

なお、図７においては、発言者端末Ａ１と主催者端末Ｍとの間で、会議参加要求および会議通話許可のセッション確立を行う際に、本実施の形態では発言者端末Ａ１と主催者端末Ｍとが直接通信しているが、発言者端末Ａ１はＳＩＰサーバ（図示せず）を介して主催者端末Ｍとセッションを確立させるようにしてもよい。 In FIG. 7, the speaker terminal A1 and the organizer terminal M communicate directly in the present embodiment when establishing a session through the conference participation request and the conference call permission; however, the speaker terminal A1 may instead establish the session with the organizer terminal M via a SIP server (not shown).

会議で発言する出席者は、発言者端末Ａ１を操作して主催者端末Ｍへ会議参加要求を行う（ステップＳ２０）。この会議参加要求は、次のような動作で、発言者端末Ａ１から主催者端末Ｍへ送信される。 The attendee who speaks at the conference operates the speaker terminal A1 and makes a conference participation request to the organizer terminal M (step S20). The conference participation request is transmitted from the speaker terminal A1 to the organizer terminal M by the following operation.

まず、発言者端末Ａ１では、通信制御部１２ｘが、操作部１３の会議参加要求の機能キーの押下を検知すると、情報格納部１１ｘに格納された主催者端末Ｍのアドレスを読み込み、呼制御部９へ会議参加要求の送信を指示する。呼制御部９は、主催者端末Ｍとのセッションを確立するために、パケット制御部８にＩＮＶＩＴＥメッセージの送信を指示する。パケット制御部８は、ＩＮＶＩＴＥメッセージをＩＰパケットとして通信部１０を介して主催者端末Ｍへ送信する。ＩＮＶＩＴＥメッセージには、発言者端末Ａ１と主催者端末Ｍとの間でＲＴＰパケットを交換するセッション確立のために必要なセッション情報であるＳＤＰ情報が含まれている。 First, in the speaker terminal A1, when the communication control unit 12x detects that the function key of the conference participation request of the operation unit 13 is pressed, the address of the organizer terminal M stored in the information storage unit 11x is read and the call control unit 9 is instructed to transmit a conference participation request. In order to establish a session with the organizer terminal M, the call control unit 9 instructs the packet control unit 8 to transmit an INVITE message. The packet control unit 8 transmits the INVITE message as an IP packet to the organizer terminal M via the communication unit 10. The INVITE message includes SDP information, which is session information necessary for establishing a session for exchanging RTP packets between the speaker terminal A1 and the organizer terminal M.

主催者端末Ｍでは、通信部１０を介してＩＮＶＩＴＥメッセージのＩＰパケットをＩＰネットワークＮＷから受信すると、パケット制御部８により呼制御部９へＩＮＶＩＴＥメッセージが通知される。呼制御部９は、ＳＤＰ情報によりＲＴＰパケットの送信先のアドレスが発言者端末Ａ１であることが通信制御部１２へ通知される。また、呼制御部９は、このＩＮＶＩＴＥメッセージに対して、ＩＮＶＩＴＥメッセージが届いたことを示す１００Ｔｒｙｉｎｇメッセージをパケット制御部８へ出力する。パケット制御部８は、この１００ＴｒｙｉｎｇメッセージをＩＰパケットとして通信部１０を介して発言者端末Ａ１へ送信する。また、呼制御部９は、主催者端末Ｍで呼び出し音を鳴らすと共に、相手を呼び出し中であることを示す１８０Ｒｉｎｇｉｎｇメッセージをパケット制御部８へ出力することで、発言者端末Ａ１へ送信する。 When the host terminal M receives an IP packet of the INVITE message from the IP network NW via the communication unit 10, the packet control unit 8 notifies the call control unit 9 of the INVITE message. The call control unit 9 notifies the communication control unit 12 that the destination address of the RTP packet is the speaker terminal A1 based on the SDP information. In response to this INVITE message, the call control unit 9 outputs a 100 Trying message indicating that the INVITE message has arrived to the packet control unit 8. The packet control unit 8 transmits the 100Trying message as an IP packet to the speaker terminal A1 via the communication unit 10. In addition, the call control unit 9 transmits a ringing tone at the organizer terminal M and outputs a 180 Ringing message indicating that the other party is being called to the packet control unit 8 to transmit it to the speaker terminal A1.

主催者端末Ｍを操作する主催者が、会議参加要求を送信した相手を見て問題がない場合には、操作部１３のオフフックボタンを操作する。この操作により会議通話許可が発言者端末Ａ１へ送信される（ステップＳ２１）。この会議通話許可は、次のような動作で、主催者端末Ｍから発言者端末Ａ１へ送信される。 If there is no problem when the organizer operating the organizer terminal M sees the other party who has transmitted the conference participation request, the off-hook button of the operation unit 13 is operated. By this operation, the conference call permission is transmitted to the speaker terminal A1 (step S21). This conference call permission is transmitted from the organizer terminal M to the speaker terminal A1 by the following operation.

主催者端末Ｍでは、通信制御部１２がオフフックボタンの操作を検知すると、呼制御部９に対して会議通話許可を通知する。呼制御部９は、会議通話許可の通知により２００ＯＫメッセージを、ＩＰパケットとしてパケット制御部８へ出力することで、発言者端末Ａ１へ送信する。 In the organizer terminal M, when the communication control unit 12 detects the operation of the off-hook button, the call control unit 9 is notified of permission for the conference call. The call control unit 9 transmits a 200 OK message as an IP packet to the packet control unit 8 in response to the conference call permission notification, thereby transmitting it to the speaker terminal A1.

発言者端末Ａ１では、２００ＯＫメッセージを受信したことを、呼制御部９から通信制御部１２ｘへ通知される。呼制御部９は、２００ＯＫメッセージに対して応答を示すＡＣＫメッセージをパケット制御部８により送信する。 In the speaker terminal A1, the call control unit 9 notifies the communication control unit 12x that the 200 OK message has been received. The call control unit 9 transmits an ACK message indicating a response to the 200 OK message by the packet control unit 8.

主催者端末Ｍでは、このＡＣＫメッセージを受信したことで、発言者端末Ａ１との間でセッションが確立したことを認識する。セッションが確立したことで、通信制御部１２は、発言者端末Ａ１用の通信チャネルを確保したり、演算処理部４に対して自端末と発言者端末Ａ１との会議音声／映像を合成するための通話路を確保したりする。 The organizer terminal M recognizes that a session has been established with the speaker terminal A1 by receiving this ACK message. When the session is established, the communication control unit 12 secures a communication channel for the speaker terminal A1, or synthesizes conference audio / video between the own terminal and the speaker terminal A1 to the arithmetic processing unit 4. Or secure a call path.
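The message exchange of steps S20 and S21 can be summarized in a small Python model. It only records the order of the SIP messages named in the description; it does not build real SIP packets or use any SIP library. The function names are hypothetical, and the behaviour when the organizer does not accept (the log simply ends after `180 Ringing`) is an assumption, since the description covers only the accepted case.

```python
from typing import List, Tuple

Message = Tuple[str, str, str]  # (sender, receiver, SIP message name)


def join_conference(caller: str, callee: str, accept: bool) -> List[Message]:
    """Return the message sequence for one conference participation request."""
    log: List[Message] = [
        (caller, callee, "INVITE"),       # carries the SDP session information
        (callee, caller, "100 Trying"),   # the INVITE has arrived
        (callee, caller, "180 Ringing"),  # the organizer is being alerted
    ]
    if accept:
        log.append((callee, caller, "200 OK"))  # organizer operated off-hook
        log.append((caller, callee, "ACK"))     # session is now established
    return log


def session_established(log: List[Message]) -> bool:
    """The organizer treats the session as established once ACK is received."""
    return bool(log) and log[-1][2] == "ACK"
```

Calling `join_conference("A1", "M", True)` yields the five messages INVITE, 100 Trying, 180 Ringing, 200 OK, and ACK in that order, after which RTP packets can be exchanged.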

主催者端末Ｍと発言者端末Ａ１との間で、セッションが確立すれば、会議通話が開始される（ステップＳ２２）。会議通話はＲＴＰパケットにより行われる。 If a session is established between the organizer terminal M and the speaker terminal A1, a conference call is started (step S22). The conference call is performed using an RTP packet.

発言者端末Ａ１を操作する出席者が発言した音声とその様子の映像は、発言者端末Ａ１の入出力部２から取得され、ＡＤ／ＤＡ部３により音声・映像として演算処理部４へ出力される。この場合、合成される他の音声・映像はないので、そのまま符号化部５へ出力される。符号化部５により符号化され圧縮された会議音声／映像は、チャネルデータ処理部７により主催者端末Ｍへの会議音声／映像としてパケット制御部８へ出力される。パケット制御部８は、この会議音声／映像を主催者端末ＭへのＲＴＰパケットとして通信部１０を介してＩＰネットワークＮＷへ送信する。 The voice spoken by the attendee operating the speaker terminal A1 and the video of the situation are acquired from the input / output unit 2 of the speaker terminal A1, and output to the arithmetic processing unit 4 as voice / video by the AD / DA unit 3. The In this case, since there is no other audio / video to be synthesized, it is output to the encoding unit 5 as it is. The conference audio / video encoded and compressed by the encoding unit 5 is output to the packet control unit 8 as a conference audio / video to the organizer terminal M by the channel data processing unit 7. The packet control unit 8 transmits the conference audio / video as an RTP packet to the organizer terminal M to the IP network NW via the communication unit 10.

主催者端末Ｍでは、発言者端末Ａ１からのＲＴＰパケットが通信部１０を介して受信されると、パケット制御部８にて会議音声／映像が抽出されてチャネルデータ処理部７へ出力される。チャネルデータ処理部７では、発言者端末Ａ１からの会議音声／映像として管理されて、復号化部６へ出力される。また、復号化部６は、発言者端末Ａ１からの会議音声／映像を復号化し、非圧縮の線形データである音声データと映像データに戻す。 In the organizer terminal M, when the RTP packet from the speaker terminal A1 is received via the communication unit 10, the conference audio / video is extracted by the packet control unit 8 and output to the channel data processing unit 7. The channel data processing unit 7 manages the conference audio / video from the speaker terminal A1 and outputs it to the decoding unit 6. In addition, the decoding unit 6 decodes the conference audio / video from the speaker terminal A1, and returns the audio data and video data as uncompressed linear data.

演算処理部４は、通信制御部１２からの制御により、主催者端末Ｍを操作する主催者と、発言者端末Ａ１を操作する出席者との２つの音声と映像を合成するように制御される。つまり、主催者が発言した音声とその様子の映像とを入出力部２により取得し、ＡＤ／ＤＡ部３により音声・映像としたものと、復号化部６から出力された発言者端末Ａ１からの音声・映像とが合成される。そして、合成された音声・映像は、一方は発言者端末Ａ１へ向けて符号化部５により符号化され、チャネルデータ処理部７およびパケット制御部８により会議音声／映像を含むＲＴＰパケットが生成されて通信部１０から送信される。他方では、ＡＤ／ＤＡ部３により合成された音声・映像が音声信号と映像信号に変換され、入出力部２のスピーカおよびディスプレイにより主催者および出席者の発言と映像を合成したものが再生される。 Under control of the communication control unit 12, the arithmetic processing unit 4 synthesizes the audio and video of two parties: the organizer operating the organizer terminal M and the attendee operating the speaker terminal A1. That is, the organizer's speech and the accompanying video, acquired by the input / output unit 2 and converted into audio / video data by the AD / DA unit 3, are synthesized with the audio / video from the speaker terminal A1 output by the decoding unit 6. The synthesized audio / video then takes two paths. On one path, it is encoded by the encoding unit 5 for the speaker terminal A1, the channel data processing unit 7 and the packet control unit 8 generate RTP packets containing the conference audio / video, and the packets are transmitted from the communication unit 10. On the other path, the AD / DA unit 3 converts the synthesized audio / video into an audio signal and a video signal, and the speaker and display of the input / output unit 2 reproduce the combined speech and video of the organizer and the attendee.
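For the audio half of this synthesis, one common approach is to add the uncompressed linear samples of each party and clip the result. The description does not state how the arithmetic processing unit 4 combines the signals, so the Python sketch below is an assumption throughout: it presumes 16-bit signed linear PCM, and the name `mix_pcm` is hypothetical. Video composition is not covered.

```python
from typing import List, Sequence

INT16_MIN, INT16_MAX = -32768, 32767  # assumed sample format: 16-bit signed


def mix_pcm(streams: Sequence[Sequence[int]]) -> List[int]:
    """Mix linear PCM streams by summing per sample and clipping to 16 bits."""
    if not streams:
        return []
    length = min(len(s) for s in streams)  # mix only the overlapping part
    mixed: List[int] = []
    for i in range(length):
        total = sum(s[i] for s in streams)
        mixed.append(max(INT16_MIN, min(INT16_MAX, total)))
    return mixed
```

Here `mix_pcm([organizer_samples, a1_samples])` would correspond to the (a + b) stream that is both encoded for the speaker terminal A1 and played back locally.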

ここで、主催者が会議の内容を、他の関係者にも把握させておく必要があると感じると、操作部１３を操作して他の関係者に対して傍聴許可を与える（ステップＳ２３）。 Here, when the organizer feels that the content of the conference needs to be understood by other parties, the operation unit 13 is operated to give permission to the other parties to listen (step S23). .

この傍聴許可は、傍聴許可を与える操作部１３の機能キーの押下を検知した通信制御部１２は、配信サーバＳＶに向けてマルチキャストによる会議音声／映像の配信要求を、呼制御部９へ指示することによりパケット制御部８からマルチキャスト要求を示すＩＰパケットが送信される（ステップＳ２４）。 The communication control unit 12 that has detected the pressing of the function key of the operation unit 13 that gives the permission to listen instructs the call control unit 9 to distribute the conference audio / video by multicast to the distribution server SV. Thus, an IP packet indicating a multicast request is transmitted from the packet control unit 8 (step S24).

配信サーバＳＶでは、主催者端末Ｍからのマルチキャスト要求のＩＰパケットを、パケット制御部８および呼制御部９を介して受信した通信制御部１２ｙが、情報格納部１１ｙに格納された登録リストのうち、配信の有効／無効フラグが有効となっているものを読み込む（ステップＳ２５）。そして、配信サーバＳＶは、読み込んだ登録リストのうち、傍聴者の氏名、所属、電話番号を登録端末の情報として、主催者端末Ｍへ向けて送信する（ステップＳ２６）。 In the distribution server SV, the communication control unit 12y that receives the IP packet of the multicast request from the organizer terminal M via the packet control unit 8 and the call control unit 9 is included in the registration list stored in the information storage unit 11y. Then, the one in which the delivery valid / invalid flag is valid is read (step S25). Then, the distribution server SV transmits the name, affiliation, and telephone number of the listener in the read registration list to the organizer terminal M as information on the registration terminal (step S26).

主催者端末Ｍでは、登録端末の情報を受信すると、通信制御部１２は操作部１３のディスプレイに表示する。主催者は、操作部１３のディスプレイに表示された登録端末の情報から傍聴者の一覧をチェックする（ステップＳ２７）。チェックした結果、問題がなければ、承認することを示す操作部１３の機能キーを押下する。この機能キーの押下を検知した通信制御部１２は、承認を示す通知を配信サーバＳＶへ送信する（ステップＳ２８）。 When the organizer terminal M receives the information of the registered terminal, the communication control unit 12 displays the information on the display of the operation unit 13. The organizer checks the list of listeners from the information of the registered terminal displayed on the display of the operation unit 13 (step S27). If there is no problem as a result of the check, the function key of the operation unit 13 indicating approval is pressed. The communication control unit 12 that has detected the pressing of the function key transmits a notification indicating approval to the distribution server SV (step S28).

配信サーバＳＶでは、承認を示す通知がパケット制御部８および呼制御部９を介して受信され、通信制御部１２ｙに通知されると、登録リストに登録されている傍聴者端末Ｂに向けて会議開始を示す通知を送信する（ステップＳ２９）。 In the distribution server SV, when a notification indicating approval is received via the packet control unit 8 and the call control unit 9 and notified to the communication control unit 12y, a conference is held for the listener terminal B registered in the registration list. A notification indicating the start is transmitted (step S29).

傍聴者端末Ｂでは、パケット制御部８および呼制御部９を介して会議開始を示す通知が受信され、通信制御部１２ｚに通知されると、操作部１３のディスプレイに会議開始を示すメッセージを、例えばポップアップウィンドウにより表示する。傍聴者は、操作部１３のディスプレイに表示された会議開始を示すメッセージを見て、会議が開催されることを認識することができる。傍聴者は、会議が開催されることを事前に促されるので、会議が開催されることを忘れていても、会議の最初から傍聴をすることができる。 At the listener terminal B, when the notification indicating the start of the conference is received via the packet control unit 8 and the call control unit 9 and passed to the communication control unit 12z, a message indicating the start of the conference is shown on the display of the operation unit 13, for example in a pop-up window. By seeing this message, the listener can recognize that the conference is about to be held. Because the listener is reminded in advance that the conference will be held, the listener can observe it from the beginning even after having forgotten about it.

傍聴者は、会議を傍聴する場合には操作部１３を操作する。この操作を検知した通信制御部１２ｚは、会議の傍聴の要求を示す通知をＩＰパケットとして配信サーバＳＶへ向けて送信する（ステップＳ３０）。また通信制御部１２ｚは、この会議の傍聴の要求を示す通知を配信サーバＳＶへ送信すると共に、この傍聴者端末Ｂが属するローカルグループのマルチキャスト対応ルータに対してマルチキャストグループに参加を要求することを示すホストメンバーシップレポート（ＩＧＭＰＪｏｉｎメッセージ）を送信する。このホストメンバーシップレポートを送信することで、傍聴者端末Ｂはマルチキャストグループに参加することができる。その後、主催者端末Ｍは配信サーバＳＶに向けて会議音声／映像のユニキャスト送信を開始する（ステップＳ３１ａ）。 The observer operates the operation unit 13 in order to observe the conference. The communication control unit 12z that has detected this operation transmits a notification indicating a request for listening to the conference as an IP packet to the distribution server SV (step S30). In addition, the communication control unit 12z transmits a notification indicating a request for hearing of the conference to the distribution server SV, and requests that the multicast-capable router of the local group to which the listener terminal B belongs participate in the multicast group. Send the indicated host membership report (IGMP Join message). By transmitting this host membership report, the listener terminal B can participate in the multicast group. Thereafter, the organizer terminal M starts unicast transmission of conference audio / video toward the distribution server SV (step S31a).
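On a typical host, the IGMP membership report mentioned above is not sent by the application directly; the operating system sends it when a socket joins a multicast group. The Python sketch below shows that mechanism with the standard `socket` module. It is not taken from the patent: the function names are hypothetical, and any group address and port passed in are assumptions chosen by the caller.

```python
import socket


def membership_request(group: str, interface: str = "0.0.0.0") -> bytes:
    """Build the ip_mreq value: group address followed by local interface."""
    return socket.inet_aton(group) + socket.inet_aton(interface)


def open_listener_socket(group: str, port: int) -> socket.socket:
    """Bind a UDP socket and join `group`; the OS then emits the IGMP report."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_DGRAM, socket.IPPROTO_UDP)
    sock.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
    sock.bind(("", port))
    # Joining the group is what triggers the host membership report.
    sock.setsockopt(socket.IPPROTO_IP, socket.IP_ADD_MEMBERSHIP,
                    membership_request(group))
    return sock
```

A listener terminal would call `open_listener_socket` with the group address used by the distribution server and then read the conference packets with `recvfrom`. Leaving the interface as `0.0.0.0` lets the operating system choose the outgoing interface.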

配信サーバＳＶでは、主催者端末Ｍからユニキャストで送信される会議音声／映像を受信すると、パケット制御部８を介してチャネルコーデック部１５へ入力される。チャネルコーデック部１５は、会議音声／映像を復号化して非圧縮の線形データとした後、記憶装置１６に格納する。並行してチャネルコーデック部１５はパケット損失補償処理を行う。また、パケット損失補償処理を行わない場合、チャネルコーデック部１５は、会議音声／映像を復号化せずに、そのままのデータをマルチキャスト処理部１４へ出力する。 When the distribution server SV receives the conference audio / video transmitted from the organizer terminal M by unicast, it is input to the channel codec unit 15 via the packet control unit 8. The channel codec unit 15 decodes the conference audio / video to form uncompressed linear data, and then stores it in the storage device 16. In parallel, the channel codec unit 15 performs packet loss compensation processing. When packet loss compensation processing is not performed, the channel codec unit 15 outputs the data as it is to the multicast processing unit 14 without decoding the conference audio / video.

マルチキャスト処理部１４は、配送経路において配信サーバＳＶの直下に位置するマルチキャストに対応したルータ（図示せず）へ向けて会議音声／映像を送信するようパケット制御部８へ指示する。ルータは、この会議音声／映像の送信により傍聴者端末Ｂ（Ｂ１〜Ｂｎ）に向けてマルチキャスト送信を行う（ステップＳ３１ｂ）。 The multicast processing unit 14 instructs the packet control unit 8 to transmit the conference audio / video to a router (not shown) corresponding to the multicast located directly under the distribution server SV in the delivery route. The router performs multicast transmission toward the listener terminal B (B1 to Bn) by transmitting the conference audio / video (step S31b).

傍聴者端末Ｂでは、通信部１０を介して受信した会議音声／映像を、パケット制御部８によりＩＰパケットから抽出する。そしてチャネルデータ処理部７を経由して復号化部６により伸張して非圧縮の線形データへ変換して音声・映像を生成する。生成された音声・映像は、ストリーミング処理部１７にて所定容量分ほどバッファリングされる。そして所定容量となるとＤＡ部３ｘにて音声信号および映像信号に変換され、出力部２ｘのスピーカから音声が、ディスプレイから映像が再生される。図８に示すように主催者端末Ｍにより取得された主催者の音声および映像である会議音声／映像（ａ）と発言者端末Ａ１により取得された発言者の音声および映像である会議音声／映像（ｂ）とが、主催者端末Ｍにより会議音声／映像（ａ＋ｂ）として合成され、配信サーバＳＶにより配信されることで、傍聴者は会議の様子を傍聴することができる（ステップＳ３２）。 In the listener terminal B, the conference audio / video received via the communication unit 10 is extracted from the IP packet by the packet control unit 8. Then, the data is decompressed by the decoding unit 6 via the channel data processing unit 7 and converted into uncompressed linear data to generate audio / video. The generated audio / video is buffered by the streaming processing unit 17 by a predetermined capacity. When the predetermined capacity is reached, the DA unit 3x converts it into an audio signal and a video signal, and audio is reproduced from the speaker of the output unit 2x and video is reproduced from the display. As shown in FIG. 8, the conference audio / video (a), which is the organizer's audio and video acquired by the organizer terminal M, and the conference audio / video, which is the audio and video of the speaker acquired by the speaker terminal A1. (B) is synthesized as conference audio / video (a + b) by the organizer terminal M and distributed by the distribution server SV, so that the listener can observe the state of the conference (step S32).

主催者および出席者による会議が終了すると、主催者端末Ｍか、または発言者端末Ａ１のいずれかの端末で、会議終了を示す操作部１３の機能キーを押下することで、会議相手へ会議終了を示す通知が送信される（ステップＳ３３）。 When the conference between the organizer and the attendee ends, pressing the function key of the operation unit 13 that indicates the end of the conference, on either the organizer terminal M or the speaker terminal A1, causes a notification indicating the end of the conference to be transmitted to the other party (step S33).

会議が終了したことにより主催者端末Ｍの通信制御部１２は、マルチキャスト終了の要求を示す通知を配信サーバＳＶへ送信する（ステップＳ３４）。 When the conference is ended, the communication control unit 12 of the organizer terminal M transmits a notification indicating a request to end the multicast to the distribution server SV (step S34).

配信サーバＳＶでは、マルチキャスト終了の要求を示す通知を受信することで、了承したことを示すメッセージを主催者端末Ｍへ送信する（ステップＳ３５）と共に、マルチキャスト送信を中止する（ステップＳ３６）。 The distribution server SV receives a notification indicating a request to end multicasting, thereby transmitting a message indicating that the request has been accepted to the organizer terminal M (step S35) and canceling multicast transmission (step S36).

このように主催者端末Ｍは、発言者端末Ａから送信された会議音声／映像と、自らの音声・映像とを合成し、合成された会議音声／映像を、全ての発言者端末Ａに加えて配信サーバに送信するだけでよく、配信サーバＳＶは、主催者端末Ｍや、１台以上の発言者端末Ａへの配信は不要であり、傍聴者が操作する傍聴者端末Ｂへ配信するだけよいので、高い性能は必要ない。よって、傍聴者が操作する通信端末への配信に要する負荷を軽減することで、簡易なハードウェアで構成することが可能である。 In this way, the organizer terminal M only has to synthesize the conference audio / video transmitted from the speaker terminals A with its own audio / video and transmit the synthesized conference audio / video to the distribution server in addition to all the speaker terminals A. The distribution server SV does not need to distribute to the organizer terminal M or to the one or more speaker terminals A; it only has to distribute to the listener terminals B operated by the listeners, so high performance is not required. Reducing the load required for distribution to the communication terminals operated by the listeners therefore makes it possible to build the system from simple hardware.

（会議への参加） (Participation in the conference)

次に、主催者端末Ｍと発言者端末Ａ１，Ａ２とが３者会議を行っているときに、更に発言者端末Ａ３が参加して４者会議を行う手順を、図９から図１３に基づいて説明する。図９は、３者会議を行っている会議システムを示す図、図１０は、３者会議から４者会議への移行を説明するためのシーケンス図、図１１は、発言者端末Ａ３が会議参加要求を主催者端末Ｍへ送信して会議に加わるときの会議システムを示す図、図１２は、主催者端末Ｍからの指示により発言者端末Ａ３が会議参加要求を再通知するときの会議システムを示す図、図１３は、４者会議を行っている会議システムを示す図である。 Next, the procedure by which the speaker terminal A3 joins a three-party conference among the organizer terminal M and the speaker terminals A1 and A2, turning it into a four-party conference, will be described with reference to FIGS. 9 to 13. FIG. 9 is a diagram showing the conference system during a three-party conference. FIG. 10 is a sequence diagram for explaining the transition from a three-party conference to a four-party conference. FIG. 11 is a diagram showing the conference system when the speaker terminal A3 transmits a conference participation request to the organizer terminal M in order to join the conference. FIG. 12 is a diagram showing the conference system when the speaker terminal A3 re-sends the conference participation request as instructed by the organizer terminal M. FIG. 13 is a diagram showing the conference system during a four-party conference.

なお、本実施の形態では、主催者端末Ｍは最大２台の発言者端末Ａからの音声・映像を合成することができるものとする。 In the present embodiment, it is assumed that the organizer terminal M can synthesize audio / video from up to two speaker terminals A.

図９および図１０に示すように、まず、主催者端末Ｍを操作する主催者と、発言者端末Ａ１および発言者端末Ａ２を操作するそれぞれの発言者が会議を行っている。この場合、主催者端末Ｍは、自ら取得した会議音声／映像（ａ）発言者端末Ａ１からの会議音声／映像（ｂ）と、発言者端末Ａ２からの会議音声／映像（ｃ）と、主催者端末Ｍにより合成された会議音声／映像（ａ＋ｂ＋ｃ）は、配信サーバＳＶにより傍聴者端末Ｂ１〜Ｂｎへマルチキャストで配信されている。また、主催者端末Ｍにて合成された会議音声／映像（ａ＋ｂ＋ｃ）は、主催者端末Ｍにて再生される他、発言者端末Ａ１および発言者端末Ａ２へも配信され再生される（ステップＳ４０）。 As shown in FIGS. 9 and 10, the organizer operating the organizer terminal M and the speakers operating the speaker terminals A1 and A2 are first holding a conference. In this case, the organizer terminal M synthesizes the conference audio / video (a) that it acquired itself, the conference audio / video (b) from the speaker terminal A1, and the conference audio / video (c) from the speaker terminal A2, and the resulting conference audio / video (a + b + c) is distributed by multicast to the listener terminals B1 to Bn by the distribution server SV. The conference audio / video (a + b + c) synthesized at the organizer terminal M is reproduced at the organizer terminal M and is also distributed to, and reproduced at, the speaker terminals A1 and A2 (step S40).

Suppose that a speaker who operates the speaker terminal A3 then wishes to join the conference. The speaker operates the operation unit 13 of the speaker terminal A3 to input a conference participation request. In response to this operation, the speaker terminal A3 transmits the conference participation request to the organizer terminal M, as shown in FIG. 11 (step S41).

In the organizer terminal M, the communication control unit 12 determines that conference audio/video from any further speaker terminal A cannot be synthesized, because a conference with the speaker terminal A1 and the speaker terminal A2 is currently in progress and the number of connected terminals has reached the maximum. Based on this determination, the communication control unit 12 reads the speaker terminals A taking part in the conference from the information storage unit 11, selects one of them, and replies to the speaker terminal A3 with an instruction to transmit the conference participation request to the selected speaker terminal A.

In the present embodiment, as shown in FIG. 12, the speaker terminal A1 is selected, so the organizer terminal M replies to the speaker terminal A3 with an instruction to transmit the conference participation request to the speaker terminal A1 (step S42). Having received the conference participation request from the speaker terminal A3, the communication control unit 12 of the organizer terminal M stores, in the information storage unit 11, information indicating that the speaker terminal A3 is connected under the speaker terminal A1.

In response to the reply from the organizer terminal M, the speaker terminal A3 transmits the conference participation request to the speaker terminal A1 (step S43).

Since no other speaker terminal A is currently connected to the speaker terminal A1, the speaker terminal A1 transmits a conference call permission notification to the speaker terminal A3 (step S44). The communication control unit 12x makes this determination by comparing the number of other speaker terminals A currently connected with the number of connections that it can accept.
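The exchange in steps S41 to S44 can be illustrated with a minimal Python sketch. It is not taken from the embodiment: the class and function names, the constant SPEAKER_CAPACITY, and the policy of redirecting to the first connected terminal are assumptions made only for illustration. Each terminal compares its current number of connections with its capacity, then either accepts the request or names a connected terminal to which the request should be re-sent.

```python
from dataclasses import dataclass, field
from typing import List, Optional

SPEAKER_CAPACITY = 1  # assumed number of terminals a speaker terminal can relay


@dataclass
class Terminal:
    """A terminal that accepts a limited number of directly connected terminals."""
    name: str
    capacity: int
    connected: List["Terminal"] = field(default_factory=list)

    def handle_join(self, requester: str) -> Optional["Terminal"]:
        """Return None if the requester is accepted here, else the terminal to ask next."""
        if len(self.connected) < self.capacity:
            # Corresponds to the conference call permission notification.
            self.connected.append(Terminal(requester, SPEAKER_CAPACITY))
            return None
        # At capacity: redirect to one connected terminal (selection policy is assumed).
        return self.connected[0]


def join(organizer: Terminal, requester: str) -> List[str]:
    """Follow redirects until a terminal accepts; return the names of the terminals asked."""
    asked, target = [], organizer
    while target is not None:
        asked.append(target.name)
        target = target.handle_join(requester)
    return asked
```

With an organizer of capacity 2, the calls join(m, "A1") and join(m, "A2") are accepted by M directly, while join(m, "A3") is redirected from M to A1, which accepts it.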

On receiving the conference call permission notification from the speaker terminal A1, the speaker terminal A3 transmits the audio and video that it acquires to the speaker terminal A1 as conference audio/video (d), as shown in FIG. 13. In the four-party conference, the organizer terminal M synthesizes its own conference audio/video (a), the conference audio/video (c) that the speaker terminal A2, like the speaker terminal A3, acquires itself and transmits to the organizer terminal M, and the conference audio/video (b+d) in which the speaker terminal A1 has synthesized its own audio/video with that of the speaker terminal A3. The resulting conference audio/video (a+b+c+d) is distributed to the listener terminals B via the distribution server SV. The conference audio/video (a+b+c+d) synthesized at the organizer terminal M is also played back at the organizer terminal M, distributed to the speaker terminal A1 and the speaker terminal A2, and delivered to the speaker terminal A3 via the speaker terminal A1 (step S45).

The IP packets of conference audio/video from the speaker terminal A3 are synthesized with the conference audio/video of the speaker terminal A1 and transmitted to the organizer terminal M. Physically, each speaker terminal A is individually connected to the network NW and communicates over it, but the speaker terminal A3 communicates as if it were connected in cascade to the organizer terminal M with the speaker terminal A1 interposed.
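The flow of streams in the four-party conference can be sketched as follows. This is only an illustration: the function mix is a hypothetical stand-in for audio/video synthesis that records which sources a stream contains, and does not process any media.

```python
def mix(*streams: frozenset) -> frozenset:
    """Stand-in for synthesis: the mixed stream carries the labels of all its inputs."""
    return frozenset().union(*streams)


# Streams acquired by the organizer terminal M (a) and speaker terminals A1 (b), A2 (c), A3 (d).
a, b, c, d = (frozenset(label) for label in "abcd")

# Speaker terminal A1 mixes its own stream with the one relayed from A3.
from_a1 = mix(b, d)

# The organizer terminal still handles three inputs: its own, A1's (already b+d), and A2's.
organizer_inputs = [a, from_a1, c]
conference = mix(*organizer_inputs)
```

The organizer terminal's input count stays at three although four sources are present in the result, which is the point of the cascade connection.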

The same transition can be made from a four-party conference to a five-party conference in which yet another speaker terminal A joins. In that case, depending on the allowable number of connections of each speaker terminal A, the new speaker terminal A can be cascaded to the speaker terminal A1 or to the speaker terminal A3.

By cascading the speaker terminals A in this way, even when the speaker terminal A3 newly joins the conference, the organizer terminal M only needs to synthesize the conference audio/video transmitted from the speaker terminal A1 with its own audio and video, so the number of streams it synthesizes does not increase. Accordingly, the number of speaker terminals A that can participate can be increased without increasing the load on the organizer terminal M.
(Leaving the conference)
Next, the procedure by which a four-party conference becomes a three-party conference when the speaker terminal A1 leaves will be described with reference to FIG. 14. FIG. 14 is a sequence diagram for explaining the transition from a four-party conference to a three-party conference when the speaker terminal A1 leaves the conference.

As described with reference to FIG. 13, the speaker terminal A3 is cascaded to the speaker terminal A1 with respect to the organizer terminal M, and the organizer terminal M and the speaker terminals A1 to A3 are holding a four-party conference (step S50 in FIG. 14). Suppose that the speaker who operates the speaker terminal A1 then wishes to leave the conference. The speaker operates the operation unit 13 of the speaker terminal A1 to input a conference end notification. In response to this operation, the speaker terminal A1 transmits the conference end notification to the organizer terminal M and to the speaker terminal A3, which transmits its conference audio/video directly to the speaker terminal A1 (step S51).

On recognizing from the conference end notification that the speaker terminal A1 has left the conference, the communication control unit 12 of the organizer terminal M releases the resources reserved for the conference with the speaker terminal A1 and ends the conference with that terminal (step S52). It then notifies the speaker terminal A3, which had been transmitting conference audio/video to the speaker terminal A1, of conference call permission (step S53).

On receiving the conference end notification from the speaker terminal A1 and the conference call permission notification from the organizer terminal M, the speaker terminal A3 transmits the conference audio/video that it had been sending to the speaker terminal A1 directly to the organizer terminal M. This completes the transition from the four-party conference to the three-party conference shown in FIG. 9 (step S54).
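The notifications exchanged in steps S51 and S53 can be listed with a short Python sketch. The function name, the dictionary representation of the cascade, and the message strings are assumptions for illustration. The sketch also assumes that the terminal directly above the leaving terminal issues the permission, which holds in this embodiment because that terminal is the organizer terminal M.

```python
from typing import Dict, List, Tuple


def leave_sequence(upstream: Dict[str, str], leaver: str) -> List[Tuple[str, str, str]]:
    """List (sender, receiver, message) notifications for a terminal leaving the conference.

    `upstream` maps each terminal to the terminal it sends its audio/video to.
    """
    parent = upstream[leaver]
    downstream = [t for t, up in upstream.items() if up == leaver]
    # The leaving terminal notifies the terminal above it and each terminal below it.
    messages = [(leaver, parent, "conference end")]
    messages += [(leaver, t, "conference end") for t in downstream]
    # The terminal above then permits each terminal below to send to it directly.
    messages += [(parent, t, "conference call permission") for t in downstream]
    return messages
```

For the four-party conference, leave_sequence({"A1": "M", "A2": "M", "A3": "A1"}, "A1") yields the end notifications from A1 to M and to A3, followed by the permission from M to A3.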

In the four-party conference, the speaker terminal A1 transmits the conference audio/video of the speaker terminal A3 to the organizer terminal M together with the audio and video that it acquires itself, and the conference audio/video synthesized by the organizer terminal M is transmitted to the speaker terminal A3 via the speaker terminal A1. In the three-party conference after the speaker terminal A1 has left, the speaker terminal A3 transmits directly to the organizer terminal M, and the organizer terminal M transmits to the speaker terminal A3 in place of the speaker terminal A1. Thus, even if the speaker terminal A1, which was interposed between the organizer terminal M and the speaker terminal A3, leaves the conference, the conference can continue without interruption.
(Failure of a speaker terminal A)
Next, the procedure by which the speaker terminal A1 drops out and a three-party conference results, when a failure occurs in the speaker terminal A1 during a four-party conference and it can no longer relay the conference audio/video from the speaker terminal A3, will be described with reference to FIGS. 15 to 18. FIG. 15 is a sequence diagram for explaining the transition from a four-party conference to a three-party conference caused by a failure of the speaker terminal A1, FIG. 16 is a diagram showing the conference system when a failure has occurred in the speaker terminal A1, FIG. 17 is a diagram showing the conference system when the lower speaker terminal A3 notifies the upper organizer terminal M of the failure, and FIG. 18 is a diagram showing the conference system after recovery to a three-party conference.

As described with reference to FIG. 13, the speaker terminal A3 is cascaded to the speaker terminal A1 with respect to the organizer terminal M, and the organizer terminal M and the speaker terminals A1 to A3 are holding a four-party conference (step S60 in FIG. 15).

As shown in FIG. 16, suppose that a failure occurs in the speaker terminal A1 so that it can no longer synthesize conference audio/video or receive or transmit IP packets. In this state, the organizer terminal M transmits to the speaker terminal A2 the conference audio/video (a+c), in which the conference audio/video (c) acquired by the speaker terminal A2 and the conference audio/video (a) acquired by the organizer terminal M are synthesized, but the speaker terminal A3 cannot receive any conference audio/video (step S61).

In the speaker terminal A3, the communication control unit 12x judges that the speaker terminal A1 has failed when it detects that no conference audio/video IP packets have arrived from the speaker terminal A1 within a predetermined time, or when it detects frequent errors in a check code such as the FCS (Frame Check Sequence) appended to packets from the speaker terminal A1, or frequent packet loss. As shown in FIG. 17, on detecting the failure of the speaker terminal A1, the communication control unit 12x of the speaker terminal A3 transmits a failure detection notification for the speaker terminal A1 to the organizer terminal M (step S62).
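The three detection conditions can be combined as in the following Python sketch. The embodiment gives neither threshold values nor a method of detecting packet loss, so the class name, the default values, and the use of sequence numbers to count lost packets are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass
class UpstreamMonitor:
    """Tracks packets from the upstream terminal and judges whether it has failed."""
    timeout_s: float = 3.0         # assumed "predetermined time" without packets
    max_error_ratio: float = 0.5   # assumed threshold for "frequent" errors or loss
    last_rx: float = 0.0
    received: int = 0
    bad_fcs: int = 0
    lost: int = 0
    next_seq: int = 0

    def on_packet(self, now: float, seq: int, fcs_ok: bool) -> None:
        """Record one received packet with its sequence number and check-code result."""
        self.last_rx = now
        self.received += 1
        if not fcs_ok:
            self.bad_fcs += 1
        if seq > self.next_seq:
            self.lost += seq - self.next_seq  # a gap in sequence numbers counts as loss
        self.next_seq = max(self.next_seq, seq + 1)

    def upstream_failed(self, now: float) -> bool:
        """True on timeout, or when check-code errors plus losses exceed the threshold."""
        if now - self.last_rx > self.timeout_s:
            return True
        expected = self.received + self.lost
        if expected == 0:
            return False
        return (self.bad_fcs + self.lost) / expected > self.max_error_ratio
```

A terminal using such a monitor would call upstream_failed periodically and, when it returns True, send the failure detection notification to the organizer terminal M.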

On receiving the failure detection notification from the speaker terminal A3, the organizer terminal M transmits a conference call permission notification to the speaker terminal A3, the sender of that notification (step S63). This notification also carries the destination address to which the speaker terminal A3 is to transmit its conference audio/video. In the present embodiment, the terminal immediately above the speaker terminal A1 is the organizer terminal M, so the IP address of the organizer terminal M is notified as the destination address for the conference audio/video.

On receiving the conference call permission notification from the organizer terminal M, the speaker terminal A3 transmits the conference audio/video that it had been sending to the speaker terminal A1 directly to the organizer terminal M. As shown in FIG. 18, the organizer terminal M in turn transmits the synthesized conference audio/video directly to the speaker terminal A3 rather than to the speaker terminal A1. The conference thereby shifts from a four-party conference to a three-party conference, and recovery from the failure is complete (step S64).

Thus, in the four-party conference, the speaker terminal A1 transmitted the conference audio/video of the speaker terminal A3 to the organizer terminal M together with the audio and video that it acquired itself, and the conference audio/video synthesized by the organizer terminal M was transmitted to the speaker terminal A3 via the speaker terminal A1. When a failure occurs in the speaker terminal A1 and it can no longer pass on the conference audio/video, the speaker terminal A3 transmits directly to the organizer terminal M, and the organizer terminal M transmits to the speaker terminal A3 in place of the speaker terminal A1, so the conference can continue without interruption.

The present embodiment describes the case where a failure occurs in the speaker terminal A1 to which only the speaker terminal A3 is cascaded, but recovery by the same procedure is possible even when other speaker terminals A are cascaded to a speaker terminal A in several stages. Specifically, the organizer terminal M notifies the speaker terminal A one stage below the failed speaker terminal A of the address of the speaker terminal A one stage above the failed speaker terminal A as the destination address. The speaker terminal A one stage below the failed speaker terminal A then transmits its conference audio/video to that destination address, and the speaker terminal A one stage above transmits the synthesized conference audio/video to the speaker terminal A that is now sending it conference audio/video. With this procedure, the conference can continue without problems even if a failure occurs partway along a chain of cascaded speaker terminals A.
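The choice of destination address in the multi-stage case can be sketched in Python as follows. The function name, the dictionary layout, the terminal name A5, and the example addresses (taken from the 192.0.2.0/24 documentation range) are hypothetical and serve only to illustrate the lookup that the organizer terminal M would perform.

```python
from typing import Dict


def recovery_notices(upstream: Dict[str, str], address: Dict[str, str], failed: str) -> Dict[str, str]:
    """Map each terminal one stage below `failed` to its new destination address.

    `upstream` maps each terminal to the terminal one stage above it;
    `address` maps each terminal to its network address.
    """
    new_destination = address[upstream[failed]]  # the terminal one stage above the failed one
    return {t: new_destination for t, up in upstream.items() if up == failed}


# Hypothetical chain: M <- A1 <- A3 <- A5, with A2 connected directly to M.
upstream = {"A1": "M", "A2": "M", "A3": "A1", "A5": "A3"}
address = {"M": "192.0.2.1", "A1": "192.0.2.11", "A2": "192.0.2.12",
           "A3": "192.0.2.13", "A5": "192.0.2.15"}
notices = recovery_notices(upstream, address, "A3")
```

Here the failure of A3 leads to a notice telling A5 to send to the address of A1. If A1 failed instead, A3 would be told to send to the address of the organizer terminal M, as in the embodiment.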

In the present embodiment, as shown in FIG. 1, the conference system 1 connects the organizer terminal M and the speaker terminals A1 to A3 via the intranet N1, and connects the organizer terminal M, the distribution server SV, and the listener terminals B via the Internet N2. The organizer terminal M and the speaker terminals A1 to A4 can therefore be placed within the same company or facility, and conference audio/video can be distributed via the Internet N2 even if the distribution server SV and the listener terminals B are at remote locations.

For example, in the conference system 1x shown in FIG. 19, the distribution server SV is placed in the intranet N1 in addition to the organizer terminal M and the speaker terminals A1 to A4, and the distribution server SV and the listener terminals B are connected via the Internet N2. With this arrangement, a distribution server SV within the facility can be used, and conference audio/video can be distributed via the Internet N2 even if the listener terminals B are at remote locations.

In the present embodiment, the organizer terminal M, which functions as the conference terminal device, is operated by the organizer who hosts the conference. However, it is sufficient that one of the plurality of speakers who speak in the conference can operate it, and a speaker other than the organizer may operate it instead.

The present invention reduces the load required for distribution to the communication terminals operated by listeners and can therefore be configured with simple hardware. It is thus suitable for a conference system and a conference terminal device in which one or more listeners can listen in on a conference where a plurality of speakers speak.

FIG. 1 is a diagram showing the overall configuration of a conference system according to an embodiment of the present invention.
FIG. 2 is a block diagram showing the configuration of the organizer terminal that functions as the conference terminal device of the conference system shown in FIG. 1.
FIG. 3 is a block diagram showing the configuration of an attendee terminal of the conference system shown in FIG. 1.
FIG. 4 is a block diagram showing the configuration of the distribution server of the conference system shown in FIG. 1.
FIG. 5 is a block diagram showing the configuration of a listener terminal of the conference system shown in FIG. 1.
FIG. 6 is a sequence diagram for explaining registration of a listener terminal.
FIG. 7 is a sequence diagram for explaining the overall flow of a conference held with the conference system.
FIG. 8 is a diagram showing the conference system during a two-party conference.
FIG. 9 is a diagram showing the conference system during a three-party conference.
FIG. 10 is a sequence diagram for explaining the transition from a three-party conference to a four-party conference.
FIG. 11 is a diagram showing the conference system when a speaker terminal transmits a conference participation request to the organizer terminal in order to join the conference.
FIG. 12 is a diagram showing the conference system when the speaker terminal re-sends the conference participation request according to an instruction from the organizer terminal.
FIG. 13 is a diagram showing the conference system during a four-party conference.
FIG. 14 is a sequence diagram for explaining the transition from a four-party conference to a three-party conference when a speaker terminal leaves the conference.
FIG. 15 is a sequence diagram for explaining the transition from a four-party conference to a three-party conference caused by a failure of a speaker terminal.
FIG. 16 is a diagram showing the conference system when a failure has occurred in a speaker terminal.
FIG. 17 is a diagram showing the conference system when a lower speaker terminal notifies the upper organizer terminal of the failure.
FIG. 18 is a diagram showing the conference system after recovery to a three-party conference.
FIG. 19 is a diagram showing a conference system according to another embodiment of the present invention.

Explanation of symbols

A, A1 to A3 Speaker terminal
B, B1 to Bn Listener terminal
M Organizer terminal
N1 Intranet
N2 Internet
NW Network
SV Distribution server
1, 1x Conference system
2 Input/output unit
2x Output unit
3 AD/DA unit
3x DA unit
4 Arithmetic processing unit
5 Encoding unit
6 Decoding unit
7, 7x Channel data processing unit
8, 8x Packet control unit
9 Call control unit
10 Communication unit
11, 11x, 11y, 11z Information storage unit
12, 12x, 12y, 12z Communication control unit
13 Operation unit
14 Multicast processing unit
15 Channel codec unit
16 Storage device
17 Streaming processing unit

Claims

A conference system in which a first communication terminal operated by one of a plurality of speakers, at least one second communication terminal operated by another speaker, a distribution server that receives conference voice from the first communication terminal and broadcasts it, and at least one third communication terminal that receives the conference voice broadcast from the distribution server so that a listener can listen to the conference are communicably connected to a network, wherein
the first communication terminal includes a synthesizing unit that synthesizes the conference voice transmitted from the second communication terminal with its own conference voice, and a transmitting unit that transmits the conference voice synthesized by the synthesizing unit to the distribution server and the second communication terminal via the network, and
the distribution server includes a storage unit that stores the third communication terminal as a destination of the broadcast distribution, and a distribution unit that transmits the conference voice transmitted from the first communication terminal to the third communication terminal stored in the storage unit via the network.

The conference system according to claim 1, wherein the synthesizing unit in the first communication terminal synthesizes video in addition to conference voice, and the distribution unit in the distribution server distributes the conference voice and video synthesized by the synthesizing unit as a conference video.

3. The conference system according to claim 2, wherein the third communication terminal includes a streaming playback unit, and the streaming playback unit plays back the conference video distributed from the distribution server.

4. The conference according to claim 1, wherein the distribution server includes a notification unit that notifies the third communication terminal stored in the storage unit of the start of the conference. 5. system.

The conference system according to claim 1, wherein, on receiving a conference participation request from a fourth communication terminal that is to newly join the conference, the first communication terminal instructs the fourth communication terminal to send the conference participation request to the second communication terminal, and
on receiving the conference participation request from the fourth communication terminal, the second communication terminal permits the call, synthesizes the conference voice from the fourth communication terminal with its own voice, and transmits the result to the first communication terminal.

The conference system according to claim 5, wherein, when the second communication terminal leaves the conference, the first communication terminal conducts the conference using the fourth communication terminal, which had been permitted the conference call by the second communication terminal, as a second communication terminal.

When the fourth terminal detects a failure in the second communication terminal, the fourth terminal notifies the first communication terminal of the occurrence of the failure,
6. The conference system according to claim 5, wherein the first communication terminal performs a conference using the fourth communication terminal as a second communication terminal.

The conference system according to claim 5, wherein the first communication terminal, the second communication terminal, and the fourth communication terminal are connected via an intranet, and the first communication terminal, the distribution server, and the third communication terminal are connected via the Internet.

The conference system according to claim 5, wherein the first communication terminal, the second communication terminal, the fourth communication terminal, and the distribution server are connected via an intranet, and the distribution server and the third communication terminal are connected via the Internet.

A conference terminal device operated by one of a plurality of speakers, comprising:
a synthesizing unit that synthesizes the conference voice transmitted from at least one speaker terminal operated by another speaker with its own conference voice; and
a transmitting unit that transmits the conference voice synthesized by the synthesizing unit to the speaker terminal and, via a network, to a distribution server that broadcasts the conference to one or more listener terminals.