WO2005112453A1 - Système de service vidéophonique ip - Google Patents

Système de service vidéophonique ip Download PDF

Info

Publication number
WO2005112453A1
WO2005112453A1 PCT/JP2005/008500 JP2005008500W WO2005112453A1 WO 2005112453 A1 WO2005112453 A1 WO 2005112453A1 JP 2005008500 W JP2005008500 W JP 2005008500W WO 2005112453 A1 WO2005112453 A1 WO 2005112453A1
Authority
WO
WIPO (PCT)
Prior art keywords
videophone
video
distribution
media
scenario
Prior art date
Application number
PCT/JP2005/008500
Other languages
English (en)
Japanese (ja)
Inventor
Arnold Siboro
Weeyee Lim
Original Assignee
Arnold Siboro
Weeyee Lim
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Arnold Siboro, Weeyee Lim filed Critical Arnold Siboro
Publication of WO2005112453A1 publication Critical patent/WO2005112453A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • H04N7/17336Handling of requests in head-ends
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47202End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for requesting content on demand, e.g. video on demand
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/64Addressing
    • H04N21/6408Unicasting
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/64322IP
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/65Transmission of management data between client and server
    • H04N21/658Transmission by the client directed to the server
    • H04N21/6581Reference data, e.g. a movie identifier for ordering a movie or a product identifier in a home shopping application
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8146Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics
    • H04N21/8153Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics comprising still images, e.g. texture, background image
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/816Monomedia components thereof involving special video data, e.g 3D video
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/148Interfacing a video terminal to a particular transmission medium, e.g. ISDN

Definitions

  • the present invention relates to an IP videophone service system for realizing an interactive service using non-real-time video using an IP network.
  • IP Internet Protocol
  • FIG. 3 The following describes conventional IP telephone technology with reference to FIGS. 3 and 4.
  • FIG. 3 The following describes conventional IP telephone technology with reference to FIGS. 3 and 4.
  • the main IP telephone technology used by the IP telephone service is H.323 (International Standard Organization) standard (used in networks where the transmission quality of communication is not guaranteed, such as the Internet and LAN.
  • H.323 International Standard Organization
  • IP telephony technology based on the ITU-T Recommendation (standard) for multimedia compression, multiplexing, demultiplexing, and terminal control, and based on the Session Initiation Protocol (SIP) specified by the Internet Engineering Task Force (IETF)
  • SIP Session Initiation Protocol
  • IETF Internet Engineering Task Force
  • Real-time calls can be made between two IP phones or IP phone software connected over an IP network.
  • FIGS. 3 (a) and 3 (b) show a system configuration diagram of the related art related to the videophone of the present invention described above, and FIG. 4 shows a block diagram of the related art.
  • IP videophones 1 and 2 are connected to the IP network 10 on a one-to-one basis, and communicate with each other under the control of a gatekeeper 20 described later.
  • a plurality of IP videophones l to n are connected to the IP network 10, and communication is possible between the plurality of IP videophones l to n by the gatekeeper 20 and the multipoint connection unit (MCU) 30. And can hold a video conference.
  • MCU multipoint connection unit
  • an IP videophone 1 (2,..., N) encodes a video signal with a video input unit 11 that inputs a video signal from an imaging unit such as a CCD camera.
  • the gatekeeper technology is a technology that provides address translation and LAN access control for H.323 terminals and the like.
  • the gatekeeper 20 shown in FIG. 4 manages a registry of devices in the multimedia network. These devices register with the gatekeeper at startup and request the gatekeeper to admit (accept) the call. [0008]
  • the gatekeeper 20 has the following four essential functions.
  • Admission Control Controls admission of endpoints to the H.323 network. To achieve this function, the gatekeeper uses:
  • the gatekeeper uses the following H.225 RAS message:
  • the gateway manages the zone for all registered endpoints in the zone. For example, the registration process of the endpoint is controlled.
  • the gateway retains information about active calls and uses this information to indicate busy endpoints and redirect calls.
  • This option allows the gateway to deny admission when the required bandwidth is not available.
  • This option allows the gateway to route call signaling messages between H.323 endpoints using the GKRCS (Gatekeeper-Routed Call Signaling) model.
  • GKRCS Gatekeeper-Routed Call Signaling
  • the endpoints can send H.225 call signaling messages directly to each other.
  • MCU Multipoint connection device
  • the multipoint connection device 30 uses means (transfer means 31, encoding means 32, decoding means 33, IP telephone command means 34) such as IP video phones l to n to connect to the IP video phones l to n.
  • means transfer means 31, encoding means 32, decoding means 33, IP telephone command means 34
  • IP video phones l to n to connect to the IP video phones l to n.
  • the multipoint control means 35 and the multi-point processing means 36 necessary for conducting a telephone conference.
  • the multipoint control means 35 negotiates (handshake) with each IP videophone participating in the conference call, determines a common function, and controls conference resources such as multicast.
  • the multipoint processing means 36 receives actual audio and video data from each IP videophone participating in the telephone conference and performs multiplexing and the like.
  • SMIL Synchronized Multimedia Integration Language
  • SMIL (a language that handles multimedia files in an integrated manner) is a language that controls and synchronizes the playback of various types of data, such as moving images, still images, audio, music, and characters, and is described in XML. It was recommended in April 1998 as a W3C standard. You can control what position, when, and how long to display. A product that implements this is RealNetworks' RealSystem G2.
  • XTL is a runtime description language that uses Microsoft (trademark), DirectShow (trademark), and Editing Services (DES), and is described in XML, similar to SMIL.
  • DES uses the core technology of DirectShow, an application programming interface (API), to enable nonlinear video editing and playback of various media formats. Therefore, it can be said that it is basically similar to SMIL.
  • MPEG4 is a multimedia standard that can handle media such as 3D objects, sprites, texts, etc. that can only be used with audio and video.
  • MPEG4 includes 3D objects and animation objects in addition to audio video! This constitutes each scene of MPEG4.
  • XMT Extensible MPEG-4 Textual format
  • Patent Document 1 JP-A-2002-140162
  • Patent Document 2 JP 2004-48551 A
  • IP networks Internet
  • IP network Internet
  • IP telephony the new part of IP telephony is the always-on broadband IP network and the power of only infrastructure!
  • the new infrastructure that users perceive is only effective at low call rates. Therefore, the merit of introducing IP phones for users is very little other than this low price!
  • IP telephony services are truly unprofitable services and are used by large, well-funded companies as a means of attracting customers for other profitable services.
  • IP telephone services This will enable users to find the motivation for introducing IP phones in addition to the price, create a sound business model in the Japanese economy, and make it possible for IP phones to penetrate various parts of their lives.
  • the e-Japan plan enacted an IT policy.
  • the IT industry has undergone major changes, and the spread of broadband (high-speed, large-capacity) has progressed at a rapid pace.
  • the government has invested money and effort, and the industry has also competed, bringing widespread adoption of broadband, such as ADSL, now with the unthinkable introduction and price in 2000, when the e-Japan plan was enacted.
  • ADSL Advanced Driver Assistance Systems
  • killer content is necessary for the further spread of broadband. There are more situations than this, and killer applications that carry killer content are considered indispensable. The killer application must be something that anyone can treat as everyday. Researching and developing such applications will be the role of the industry in the future and will be the key to growth.
  • the IP telephone technology focuses on the real-time communication of the conventional telephone, and the technology for supporting the non-real-time interactive service is very insufficient. It is only used as a substitute for the network (PSTN). Therefore, it can provide only the services that were conventionally provided by telephone (such as one-to-one voice or television calls or conference calls, and message services).
  • An object of the present invention is to provide an IP videophone service system capable of realizing a bidirectional service based on non-real-time video, which has not been able to be provided by conventional telephones, based on the conventional accumulation technology.
  • a distribution server connected to an IP network provided with an IP videophone service performs an IP videophone call according to a preset distribution procedure.
  • Video clips, still images It is characterized by having means for determining various media such as characters and voices and the distribution method thereof, and distributing these media as streaming video to IP videophones.
  • the user of the IP videophone can obtain various information interactively and audiovisually by the conventional telephone operation and telephone call.
  • a second configuration of the present invention is characterized in that the distribution server includes means for distributing only recorded media or a combination of recorded media and live broadcast media. I do.
  • a third configuration of the present invention is characterized in that the distribution procedure describes behaviors such as a distribution time, a distribution time, a position on an IP videophone screen, and a display method of the various media. I do.
  • the distribution procedure can be described to determine the behavior of various media in response to any response from the user based on the time at which the response was received.
  • the distribution server according to the distribution procedure according to the user environment such as an IP television telephone number and a language setting of an IP video telephone service user, distributes the video to be distributed. And means for adjusting the selection and distribution method.
  • the distribution server it is possible to select the display in a language corresponding to the language used by the IP videophone service user, and to adjust the distribution method depending on whether the user is a beginner or an experienced user. .
  • the present invention can also have the following configurations.
  • the IP videophone terminal automatically accepts connections when connecting to the distribution server so as not to be overloaded. Decide the distribution server.
  • the distribution server receives the user's audio and video, and automatically creates a description of the distribution procedure (also with the distribution procedure manual or scenario ⁇ ⁇ ) and the respective media.
  • the distribution server automatically creates a scenario in accordance with an instruction of the IP videophone service user, and receives the voice or video of the IP videophone service user.
  • the means to automatically create each media! / Is characterized.
  • the fifth configuration it is possible to automatically create a scenario and various media using the received voice and video according to the instruction of the IP videophone service user.
  • a sixth configuration of the present invention provides an IP videophone terminal for using an IP videophone service, a distribution server that distributes video to the IP videophone terminal, a video clip to be distributed, and a still image.
  • the distribution server is an IP network.
  • a command means is also in the range and a medium access means for inputting and outputting various media stored in the Huai Rusaba.
  • the user of the IP videophone can obtain various information interactively and audiovisually by the conventional telephone operation and telephone call.
  • the file server includes a scenario database that holds a scenario, and the distribution server further processes the scenario and executes the scenario according to a preset distribution procedure.
  • Means, and the distribution procedure describes behaviors such as a distribution time, a distribution time, a position on an IP videophone screen, and a display method of the various media.
  • the IP telephone number and language setting of the IP telephone service user According to the user's environment, it is possible to adjust the selection of the video to be distributed and the distribution method according to the distribution procedure described above.
  • the present invention can further have the following configurations.
  • the distribution server is provided with multipoint control means and multipoint processing means for converting a plurality of IP phone images into one image, so that only the media recorded on the file server or Once recorded, it is possible to distribute the combined media and live broadcast media.
  • the file server includes a scenario database that holds a scenario, and the distribution server further stores a video composed of various media cards stored in the storage device. It is equipped with a content caching means that is stored in a storage device and read out when necessary, and by describing the distribution time, distribution time, position on the IP videophone screen, and display method of various media as distribution procedures, It is possible to specify how long a certain media is to be delivered at the timing, and in the following procedure, at which position on the IP videophone screen and in what display manner the media is displayed. In addition, the distribution procedure can be described so as to determine the behavior of various media based on the time at which the response is received for any response from the user.
  • the file server includes a scenario database that holds scenarios, a media database that holds various media such as video clips, still images, characters, and audio, and storage.
  • a cache database that holds images created by the distribution server is provided.
  • the distribution server has a user response processing unit and a scenario processing unit that processes a scenario and executes the scenario as it is.
  • the behavior of the various media is determined based on the time at which the response was received. It is possible to provide services such as
  • the gatekeeper has an address conversion unit and a server load management unit, and the distribution server monitors the usage status of the CPU and the memory of the distribution server from time to time.
  • load monitoring means in a service consisting of multiple distribution networks to prepare for a large amount of access, when an IP videophone connects to a distribution server, A distribution server that automatically accepts connections can be determined so as not to be overloaded.
  • the distribution server connected to the IP network distributes the IP videophone service in response to the IP videophone service user according to a preset distribution procedure.
  • Media such as video clips, still images, text, audio, etc., and methods for distributing them, and a means for distributing these media as streaming video to IP TV phones! Users can interact with audio-visually rich information contents through conventional telephone calls and telephone operations.
  • IP videophone technology that is suitable for non-real-time communication, it is possible to solve the problem of being limited to call services by enabling services outside of calls that have been used up to now.
  • FIG. 1 is a system configuration diagram showing an embodiment of the present invention.
  • FIG. 2 is a block diagram showing an embodiment of the present invention.
  • FIG. 3 is a system configuration diagram of a conventional technology.
  • FIG. 4 is a block diagram of a conventional technique.
  • FIG. 5 is a system configuration diagram showing a configuration of Embodiment 1 of the present invention.
  • FIG. 6 is a block diagram illustrating a configuration of a distribution server according to the first embodiment.
  • FIG. 7 is a block diagram showing a configuration of a part of a distribution server according to the first embodiment.
  • FIG. 8 is a block diagram showing a configuration of an interaction engine in Embodiment 1.
  • FIG. 9 is a block diagram showing a configuration of a media engine in Embodiment 1.
  • FIG. 10 is an explanatory diagram showing a creation state of a dialog scenario in the first embodiment.
  • FIG. 11 is an explanatory diagram showing a configuration example of a content according to the first embodiment.
  • FIG. 12a is a list in which the content configuration of FIG. 11 is used as a scenario.
  • FIG. 12b A list in which the content configuration of FIG. 11 is used as a scenario.
  • FIG. 13 is a system diagram showing a media configuration of the scenario shown in FIG. 12.
  • FIG. 14 is a list showing description contents of a media scenario.
  • FIG. 15 is an explanatory diagram of a basic operation in Embodiment 1.
  • FIG. 16 is a time-series flow chart of the operation shown in FIG.
  • FIG. 17 is an explanatory diagram showing a mechanism of a scenario using a combination of recorded media and live broadcast media.
  • FIG. 18 is an explanatory diagram showing a processing flow for realizing the scenario of FIG. 17.
  • FIG. 19 is an image of a screen displayed on a user's IP videophone.
  • Fig. 20 is an explanatory diagram of a scenario in which delivery is instructed according to the environment of the user in the second embodiment.
  • FIG. 21 is an explanatory diagram showing interactive and real-time exchanges between a clerk and a customer in the third embodiment.
  • FIG. 22 is a time chart showing a flow of exchange in Example 3.
  • FIG. 23 is an image of an exchange screen in Embodiment 3.
  • FIG. 24 is a block diagram showing a flow in an IP videophone load distribution service in Embodiment 5.
  • FIG. 25 is an image of a screen of an IP videophone user participation type service according to the sixth embodiment.
  • Call control signaling means distribution server
  • IP telephone command means Multipoint processing means
  • Multipoint control means Scenario processing means
  • Medium access means File server
  • HTTPZFTP command means User authentication means Access transfer means Server load management means Load status DB
  • the embodiment of the present invention includes an IP videophone 1 to n for using the IP videophone service, a distribution server 50 for distributing video to the IP videophone terminal, a video clip to be distributed, a still image, It comprises an IP network 10 composed of a file server 60 for storing various media such as characters and voices and their caches, and a gatekeeper 20 for IP video phone number resolution and bandwidth management.
  • the IP videophones l to n include a video input unit 11 for inputting a video signal of an imaging unit such as a CCD camera, an encoding unit 12 for encoding a video signal, and a video signal together with audio data.
  • a transfer means 13 for transferring the video signal to an IP telephone command means 14, a decoding means 15 for decoding a received video signal, and a video output means 16 for outputting a video signal to a liquid crystal display or the like. .
  • the distribution server 50 includes a transfer unit 51 for transferring data on the IP network, and a decoding unit 52 for decoding data encoded as data of the IP television phones l to n into audio data, video data, and the like.
  • IP phone command means 55 for issuing commands necessary for communication with the IP videophone, and negotiating (handshake) with each IP videophone participating in the teleconference, determining common functions, and conducting conferences such as multicast It receives multi-point processing means 56 for controlling resources and receives actual audio and video data of each IP video telephone participating in the conference call.
  • Multi-point control means 57 for processing, for example, scenario processing means 58 for processing and executing the scenario as it is, medium access means 59 for inputting and outputting various media stored in each database of the file server 60, Content caching means 71, which stores the various media powers stored in the file server 60 in a database and reads it out when necessary, and a server that monitors the usage status of the CPU and memory of the distribution server 50 from time to time load Monitoring means 72, HTTPZFTP command means 73 for generating commands for transferring various media output from the medium access means 59 to the IP network 10, and user authentication means for authenticating the user of the connected IP television telephone. 74 and access transfer means 75 for transferring the IP videophone connected to the distribution server 50 to another distribution server.
  • the file server 60 includes a scenario DB (database) 61 for storing scenarios, a media DB62 for storing various media such as video clips, still images, characters, and audio, and various types of data stored in a storage device.
  • the media is also equipped with a cache DB63 that holds the images created by the processing of the distribution server!
  • the gatekeeper 20 manages the address conversion means 21 for converting between an IP address and a telephone number, the reception control means 22 for accepting the connection of the incoming IP videophone, and the bandwidth request of the IP videophone.
  • Bandwidth control means 23 which performs processing for connecting the IP videophone to the gatekeeper 24, call authentication means 25 which performs authentication to restrict access and policy access, and connection between IP videophones
  • a call management means 26 for managing the situation, a bandwidth management means 27 for rejecting admission when the required bandwidth cannot be secured, and a call for controlling signaling for establishing a connection with an incoming IP videophone.
  • Control signaling means 28 server load management means 81 for managing load status information of each distribution server, and CPU and memory usage rates of each distribution server Having a load situation D B82, a network load monitoring means 83 for constantly monitoring the load status of each distribution server.
  • Embodiment 1 of the present invention relates to an IP videophone service, and will be described with reference to FIG.
  • the IP videophones l to n for using the IP videophone service, the distribution server 50 for distributing video to the IP videophones l to n, the video clips to be distributed, still images, and text ,
  • a file server 60 for storing various media such as audio data and their caches, and a gatekeeper 20 for IP telephone number resolution and bandwidth management.
  • the transfer means 51 includes TCP (Transmission Control Protocol) and UDP (User
  • Data is transferred over the IP network 10 using a protocol such as Datagram Protocol.
  • the data passed to the transfer means 51 can be passed using RTP (ReaH: ime Transport Protocol) or HTTP (Hyper Text Transfer Protocol) technology.
  • the IP telephone command means 55 can use H.323 or SIP (Session Initiation Protocol) technology.
  • H.323 has been adopted as a protocol for Voice over IP (VoIP) for realizing Internet telephony, as well as integrating real-time multimedia data such as video conferencing between multiple locations. It can handle and multicast communication is also possible.
  • H.261 64 kbps to 2 Mbps
  • H.263 64 kbps or less
  • G.711 and G.711 are used as audio compression technologies. 722, G.723.1, etc. Power used.
  • the user of the IP videophone can obtain various information interactively and audiovisually by the conventional telephone operation and telephone call.
  • FIG. 5 shows a configuration example of the first embodiment. This will be described with reference to FIG.
  • the distribution server 50 includes a distribution server 50-1, a distribution server 50-2, and a distribution server 50-3. These distribution servers share the file server 60.
  • IP videophones l to n consist of IP videophone 1, IP videophone 2, and IP videophone 3.
  • FIG. 6 is a diagram for explaining the entire distribution server 50.
  • FIG. 7 is a diagram for explaining the interaction engine and the media engine of the distribution server 50.
  • FIG. 8 is a diagram for explaining the interaction engine.
  • FIG. 9 is a diagram for explaining a media engine.
  • Decoding means 52 and encoding means 53 are realized as audio codec 101, video codec 102, audio channel 103, video channel 104, audio input / output device 105, and video input / output device 106 for video and audio data, respectively. ing.
  • the scenario processing means 58 and the user response processing means 54 are realized as a media engine 107 and an interaction engine 108.
  • the audio input / output device 105 acquires data from the audio buffer 109
  • the video input / output device 106 acquires data from the video buffer 110, respectively, of audio ′ video to be delivered to the IP telephones l to n.
  • the IP telephone command means 55 is realized by the H.323 endpoint I 1 and the H.323 connection 112.
  • the H.323 endpoint 111 receives the incoming call, establishes an H.323 connection 112, and, based on the scenario, an interaction engine 108 and a media engine 107. Have the video (video and audio) created.
  • the video input / output device 106 acquires the created video from the video buffer 110, and the audio input / output device 105 acquires the created audio from the audio buffer 109, and distributes them to the IP videophones l to n through the H.323 connection 112. .
  • the interaction engine 108 performs a process for dialogue with the user based on the scenario DB 113, and causes the media engine 107 to create a video and an audio corresponding thereto.
  • the interaction engine 108 reads the scenario from the scenario DB 113 and processes the scenario, and stores the connection and conversation status with the IP telephones 1 to n in the session information DB 114.
  • the media engine 107 creates a video 'audio' according to the instruction of the interaction engine 108, and distributes it to the IP videophones l to n.
  • the video and audio are stored in the media DB 115, the clip DB 116, and the cache 117.
  • the interaction engine 108 performs a dialog processing centering on the interaction engine core 120.
  • the interaction engine core 120 uses a local media access 121 for a scenario stored on a local disk, and accesses a scenario stored on a remote disk using a remote media access 122.
  • the accessed scenario is processed, and the response recognition engine 123 recognizes various information such as the user environment and the user response transferred from the IP videophones l to n based on the scenario.
  • the response recognition engine 123 recognizes DTMF (numeric input from a videophone button) 124, voice 125, and video 126.
  • the interaction engine core 120 determines a video / audio to be created based on the recognition result based on the scenario, and notifies the media engine 107 of the determined video / audio.
  • the media engine 107 receives the notification from the interaction engine 108 and creates media (video).
  • the notification includes a file name including a description of video and audio constituting the video to be created, and the media engine 107 reads the file, passes the file through the media interpreter 131, and decrypts the file.
  • This description file may be in a native format (native media script 132), or in a format such as XTL133, SMIL134, or MPEG-4135. A media interpreter corresponding to each format is prepared.
  • the media engine 107 causes the video processor 136 to create a video based on the decoding result, and distributes the video to the IP videophones l to n.
  • the distribution procedure manual is a description of the behavior of various media, such as distribution time, distribution time, position on the IP videophone screen, and display method, and is also called a scenario.
  • Scenario DB113 consists of two parts: one for dialogue and one for creating video.
  • a scenario for dialogue is called a dialogue scenario
  • a scenario for creating video is called a media scenario.
  • the media scenario describes how to create a media DB (video) 115 to be created.
  • the clip DB 116 is video or audio data itself having a single content.
  • the interaction scenario references a media scenario.
  • Dialogue scenarios combine multiple media scenarios to build advanced interactive multimedia content.
  • the media scenario refers to the clip DB 116.
  • Media scenarios combine multiple clips to create complex video and audio.
  • one dialog scenario can refer to another dialog scenario.
  • Unreferenced conversation scenarios are called layer 1 conversation scenarios.
  • the layer of the dialog scenario referred to by this dialog scenario is Layer 2.
  • the layer numbers of the dialog scenario are assigned.
  • the layer number is a relative number.
  • Fig. 11 shows the structure of the content.
  • the content also has four main menu capabilities: “Introduction to Kyushu”, “Introduction to Iizuka”, “Watch New Song”, and “End”.
  • “Introduction to Kyushu” has nine menus
  • “Introduction to Iizuka” has five menus
  • “Watch new songs” also has five menus!
  • the number before and after each menu item in FIG. 11 is a number selected by the user, and when that number is selected, the menu is executed.
  • FIG. 12 shows the above content configuration as a scenario.
  • Figure 12 includes a dialog scenario and a media scenario, but the actual media scenario content is stored in a separate file.
  • the “PlayMedia ()” part in the scenario is a part for creating and distributing media (video), which is a part of the media scenario.
  • the interaction engine reads "PlayMedianndex.mml", true); When reading J, it passes index.mml and other interaction scenarios to the media engine, which creates media based on it and sends it to the IP videophone. Store in a buffer for distribution. The delivery server delivers this to the decoded UP videophone.
  • FIG. 13 shows the media configuration of the scenario shown in FIG.
  • the number is the number accepted as a response from the user.
  • Figure 14 shows the contents of index.mml where the media scenario is described.
  • the media combines each clip to create one video.
  • Specify clip with clip A clip is located on a track that can be specified multiple times, and a tract is located in a group.
  • There are two groups, one for video and one for audio. group is located in the timeline.
  • the timeline is the time axis of the whole image.
  • a video is created by combining a video clip called butterfly.avi, menuindex.bmp, V, and a still image, and this is combined with the audio part of butterfly.avi to create a video.
  • a reception request is sent from the IP videophone to the gatekeeper 20.
  • the gatekeeper 20 sends a reception completion notice to the IP videophone (2).
  • This reception completion notification includes the IP addresses of the distribution servers 50-1 to 50-n.
  • the IP videophone sends a connection establishment request to this IP address (3).
  • the distribution server establishes a connection (4).
  • the distribution server refers to the scenario DB61 and determines the video to be distributed first (5). It obtains various clips that are the material for creating the video and creates the video (6). Encode the created video for IP videophone (7). Then the video is distributed to the IP videophone (8).
  • the IP videophone user can send a response to the distribution server. For example, when the first video is distributed from the distribution server to the IP videophone (8), when the user of the IP videophone operates the telephone button, the information of the pressed button is sent to the distribution server (9). ).
  • the scenario processing means 58 having received the information refers to the scenario DB 61 and determines an operation in response to the user's response. In other words, what kind of video should be delivered in response to the user's response?
  • the video created based on index.mml is distributed to IP videophones. Then, when the user of the IP television phone presses 1 with the telephone button, the information of “1” is sent to the distribution server, recognized by the user response processing means 54, and sceneOll.mml is created based on the dialog scenario.
  • the media scenario is read and a video is created based on the media scenario and distributed to the IP TV phone. In this way, the video sent in response to the user's instantaneous response changes.
  • FIG. 16 shows the operation shown in FIG. 15 in chronological order.
  • the left side is the time axis of the server, and the right side is the time axis of the user.
  • the time lapse is from top to bottom.
  • the multi-point processing means 56 and the multi-point control means 57 provided in the distribution server 50 not only the media recorded in the file server 60 but also the recorded media can be used. Combined distribution with live broadcast media is possible.
  • Figure 17 shows the mechanism of a scenario that uses a combination of recorded media and live broadcast media.
  • Figure 19 shows a screen image displayed on the user's IP videophone.
  • the scenario in this case is similar to the scenario shown in FIG. 10, except that the powerful clip is connected to a distribution server that is not the video or audio that actually exists on the disc. Clips obtained directly from the phone have different characteristics.
  • the distribution server acquires the scenario as usual (2), processes it, and reads various clips (2). If the live media is included in the scenario, the distribution server is designated and sends a call to the IP videophone (this IP videophone is called the service provider's IP videophone), and the IP videophone Initiate a connection. Power of the user's IP videophone The video that is sent is distributed to the service provider's IP videophone [4], and the distribution server uses this video as a clip required by the scenario [5]. An image is created with this clip and distributed to IP videophone users [6].
  • the video of the service provider's IP videophone is placed on the user's IP videophone screen along with other clips on the disk according to the scenario.
  • Example 2
  • the second embodiment of the present invention relates to an IP videophone user-oriented service.
  • This service has, in addition to the basic configuration of the first embodiment, a scenario DB 61 for storing a scenario in the file server 60, and a scenario processing in which the distribution server 50 further processes the scenario and executes the scenario according to a preset distribution procedure.
  • Means 58 are provided.
  • the distribution procedure describes the distribution time, distribution time, position on the IP videophone screen, and display method of various media.
  • the distribution procedure is followed according to the user environment such as the IP phone number and language setting of the IP phone service user! ⁇ You can select the video to distribute and adjust the distribution method.
  • Figure 20 shows a scenario that instructs distribution according to the user's environment.
  • a certain shopping content sends different images to users who transmit from two regional powers.
  • Users from region 1 distribute product information limited to region 1
  • users from region 2 distribute product information limited to region 2.
  • some territories are common throughout the region.
  • Language 1 and Language 2 it is necessary to distribute audio that corresponds to the language of the user who sends audio even if the video is the same depending on the user's language. Has become. This mechanism is shown in the scenario configuration shown in Figure 20.
  • region 1 media scenario is used according to the description of the dialogue scenario (1). Furthermore, according to the media scenario of region 1, video clips are used in region 1 (common) (2), but audio clips (audio parts) are in language 1 (3).
  • Example 3 of the present invention relates to an IP videophone conversation service.
  • a file server 60 includes a scenario DB 61 for storing scenarios and media for storing various media such as video clips, still images, characters, and audio.
  • the distribution server 50 includes a user DB 62 and a cache DB 63 that retains various types of media stored in the storage device and images that are obtained by the processing of the distribution server.
  • This has a scenario processing means 58 to be executed.
  • it is possible to determine at what timing and for how long a certain media is to be distributed, and in the following procedure, at what position on the IP videophone screen and what display The ability to display the media in different ways.
  • As a delivery procedure a description is made for any response of the user to determine the behavior of the various media based on the time at which the response was received.
  • Figure 21 shows the interaction between the clerk and the customer in real time.
  • Realizing such a real-time dialogue with a non-real-time clip collection is the exchange of video between the distribution server and the user shown in FIG.
  • the distribution server distributes different images one after another according to the user's intention.
  • Figure 23 shows an image of the screen in Figure 22.
  • a background video 'audio' a menu video showing options and so on.
  • the person (clerk) in the picture behaves like a person on the other side, but is actually a video clip recorded on a disc.
  • Each of these is a collection of clips, and the attributes of each clip are controlled by the scenario!
  • Embodiment 4 of the present invention relates to an IP videophone user-oriented service.
  • this service is a content cache that stores videos composed of various media stored in the storage device in the storage device and reads them when necessary.
  • a certain media can be distributed at which timing and for how long.
  • the following procedure can be used to specify at which position on the IP videophone screen the media should be displayed in the following manner.
  • the distribution procedure can be described so as to determine the behavior of various media based on the time at which any response from the user is received.
  • Embodiment 5 of the present invention relates to an IP videophone load distribution service.
  • the gatekeeper 20 has an address translation unit 21 and a server load management unit 81, and the distribution server 50 manages the usage status of the CPU and the memory of the distribution server.
  • the provision of the server load monitoring means 72 for monitoring from time to time enables the IP videophones l to n to be overloaded when connected to the distribution server 50 in a service composed of a plurality of distribution servers 50 to prepare for a large amount of access. It is possible to determine the distribution server 50 that automatically accepts the connection so as not to be out of order.
  • FIG. 24 shows load distribution of the distribution server using the gatekeeper 20.
  • IP videophone 1 With IP videophone 1 connected to distribution server 50-1 and receiving services, IP videophone 2 makes a request to gatekeeper 20 to start connection (1).
  • the gatekeeper 20 receiving the reception request checks the load status 82 of each server by the server load management means 81 (2).
  • the IP address of the server that will not be overloaded by providing service to this IP videophone 2 is notified (3).
  • the gatekeeper 20 determines the IP address of the distribution server 50-2 that has room. Notify (5). With this, the IP videophone accesses the distribution server 50-2 as usual and the video is sent.
  • Embodiment 6 of the present invention relates to an IP videophone user participation type service.
  • a user response processing means 54 is provided in the force distribution server 50, which can be realized by any of the configurations of the first to fifth embodiments, so that the user can speak normally from the IP videophones l to n.
  • the distribution server 50 has a function of receiving the user's audio and video and automatically creating an IP television telephone scenario and respective media.
  • a screen as shown in Fig. 25 is displayed on the user's IP videophone. Is done.
  • the live video of the user of the IP videophone is displayed together with the video created by the scenario as usual.
  • the user follows the guidance of the guide shown in the video and speaks self-introduction naturally as if talking to a human on the telephone.
  • the video of the IP videophone user is sent to the distribution server and stored as audio and video clips.
  • the scenario is generated according to the scenario for automatic generation. When any IP videophone user accesses this scenario, the content is played back.
  • the present invention can be used as an IP videophone service system that realizes an interactive service using non-real-time video using an IP network.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Computer Graphics (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Il est fourni un système de service vidéophonique susceptible de réaliser un service bidirectionnel basé sur une vidéo non en temps réel qui ne peut pas être fourni par le téléphone classique. Le système inclut un moyen pour décider différents supports tels que la séquence animée, les images fixes, les caractères et le son distribués en réponse d’un utilisateur du service de vidéophonique IP et leur procédé de distribution et la distribution de ces supports en tant que vidéo en continu au vidéophone IP, selon la procédure de distribution préétablie dans un serveur de distribution (50) raccordé à un réseau IP (10) auquel est fourni le service vidéophonique. Ainsi, l’utilisateur du vidéophone IP peut acquérir différentes informations de manière interactive et audiovisuelle uniquement par le fonctionnement du téléphone classique et de la communication téléphonique.
PCT/JP2005/008500 2004-05-14 2005-05-10 Système de service vidéophonique ip WO2005112453A1 (fr)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2004145529A JP2005328394A (ja) 2004-05-14 2004-05-14 Ipテレビ電話サービスシステム
JP2004-145529 2004-05-14

Publications (1)

Publication Number Publication Date
WO2005112453A1 true WO2005112453A1 (fr) 2005-11-24

Family

ID=35394529

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2005/008500 WO2005112453A1 (fr) 2004-05-14 2005-05-10 Système de service vidéophonique ip

Country Status (2)

Country Link
JP (1) JP2005328394A (fr)
WO (1) WO2005112453A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007034550A1 (fr) * 2005-09-22 2007-03-29 Hewlett-Packard Development Company, L.P. Dispositif de distribution d’images dynamiques et procédé associé

Families Citing this family (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8479230B2 (en) 2006-12-19 2013-07-02 At&T Intellectual Property I, Lp System and apparatus for managing media content
US8683527B2 (en) 2007-04-20 2014-03-25 At&T Intellectual Property I, Lp System and apparatus for supplying media content to portable communication devices
US8316410B2 (en) 2007-07-23 2012-11-20 At&T Intellectual Property I, Lp Apparatus and method for communicating media between communication devices
US8555330B2 (en) 2007-10-03 2013-10-08 At&T Intellectual Property I, L.P. System for managing media services
US8266666B2 (en) 2008-09-12 2012-09-11 At&T Intellectual Property I, Lp System for controlling media presentations
US8881205B2 (en) 2008-09-12 2014-11-04 At&T Intellectual Property I, Lp System for controlling media presentation devices
JP6413215B2 (ja) * 2013-09-17 2018-10-31 株式会社リコー 伝送端末及びプログラム
TWI690188B (zh) * 2019-05-02 2020-04-01 新加坡商華康(新加坡)有限公司 以固網電話啟動及執行網路電視遠端互動式客戶服務的系統及其方法

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000137752A (ja) * 1998-10-29 2000-05-16 Recruit Co Ltd 情報配信システム
JP2001168923A (ja) * 1999-12-08 2001-06-22 Toshiba Corp マルチメディア提供システム、マルチメディア変換サーバ、およびマルチメディア端末
JP2001197207A (ja) * 2000-01-17 2001-07-19 Mega Chips Corp 多言語応対装置
JP2001211437A (ja) * 2000-01-28 2001-08-03 Mitsubishi Electric Corp マルチメディアctiシステム
JP2001218181A (ja) * 2000-02-04 2001-08-10 Megafusion Corp 双方向通信方法
JP2003338830A (ja) * 2002-03-12 2003-11-28 Matsushita Electric Ind Co Ltd メディア送信方法、メディア受信方法、メディア送信装置及びメディア受信装置
JP2004356897A (ja) * 2003-05-28 2004-12-16 Neu & Computing Inc ゲートウェイ装置およびそれを用いた情報提供システム

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2000137752A (ja) * 1998-10-29 2000-05-16 Recruit Co Ltd 情報配信システム
JP2001168923A (ja) * 1999-12-08 2001-06-22 Toshiba Corp マルチメディア提供システム、マルチメディア変換サーバ、およびマルチメディア端末
JP2001197207A (ja) * 2000-01-17 2001-07-19 Mega Chips Corp 多言語応対装置
JP2001211437A (ja) * 2000-01-28 2001-08-03 Mitsubishi Electric Corp マルチメディアctiシステム
JP2001218181A (ja) * 2000-02-04 2001-08-10 Megafusion Corp 双方向通信方法
JP2003338830A (ja) * 2002-03-12 2003-11-28 Matsushita Electric Ind Co Ltd メディア送信方法、メディア受信方法、メディア送信装置及びメディア受信装置
JP2004356897A (ja) * 2003-05-28 2004-12-16 Neu & Computing Inc ゲートウェイ装置およびそれを用いた情報提供システム

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2007034550A1 (fr) * 2005-09-22 2007-03-29 Hewlett-Packard Development Company, L.P. Dispositif de distribution d’images dynamiques et procédé associé
JPWO2007034550A1 (ja) * 2005-09-22 2009-03-19 ヒューレット−パッカード デベロップメント カンパニー エル.ピー. 動画像配信装置およびその方法
US8269815B2 (en) 2005-09-22 2012-09-18 Hewlett-Packard Development Company, L.P. Dynamic image distribution device and method thereof

Also Published As

Publication number Publication date
JP2005328394A (ja) 2005-11-24

Similar Documents

Publication Publication Date Title
US9300705B2 (en) Methods and systems for interfacing heterogeneous endpoints and web-based media sources in a video conference
Deshpande et al. A real-time interactive virtual classroom multimedia distance learning system
WO2005112453A1 (fr) Système de service vidéophonique ip
RU2398362C2 (ru) Соединение независимых мультимедийных источников в конференц-связь
US20070294263A1 (en) Associating independent multimedia sources into a conference call
EP1868348B1 (fr) Contrôle de présentation de conférence et protocole de contrôle
US20120086769A1 (en) Conference layout control and control protocol
US20170048286A1 (en) Live broadcast system
US20080192736A1 (en) Method and apparatus for a multimedia value added service delivery system
US20140118471A1 (en) Video Conferencing Method and Device Thereof
JP2005318535A (ja) 帯域幅制御をして会議を開催する方法及び装置
JP2005318534A (ja) ストリーム選択を行う会議開催方法及び装置
NO325064B1 (no) Kommunikasjonsklient
US9398257B2 (en) Methods and systems for sharing a plurality of encoders between a plurality of endpoints
CN103269408A (zh) 一种融合通信系统
JP6719166B2 (ja) ライブ放送システム
CN105553967A (zh) 网络会议方法及装置
WO2007035109A1 (fr) Procede de diffusion en continu de garde-porte
CN102438119B (zh) 一种数字电视的音视频通讯系统
Patrick The human factors of MBone videoconferences: Recommendations for improving sessions and software
KR20020050903A (ko) 인터넷을 이용한 교육,방송, 다자간 화상 회의,채팅서비스 제공 방법
US20100020156A1 (en) Method and device for simultaneous multipoint distributing of video, voice and data
Andberg Video conferencing in distance education
JP2006345103A (ja) メディア会議システム、その制御方法、プログラムおよび記憶媒体
JP5239756B2 (ja) 映像共有時のメディア同期方法

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BW BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE EG ES FI GB GD GE GH GM HR HU ID IL IN IS KE KG KM KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NA NG NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SM SY TJ TM TN TR TT TZ UA UG US UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): BW GH GM KE LS MW MZ NA SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IS IT LT LU MC NL PL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

WWW Wipo information: withdrawn in national office

Country of ref document: DE

122 Ep: pct application non-entry in european phase