WO2007082433A1 - Appareil, dispositif de réseau et procédé de transmission de signaux audio et vidéo - Google Patents

Appareil, dispositif de réseau et procédé de transmission de signaux audio et vidéo Download PDF

Info

Publication number
WO2007082433A1
WO2007082433A1 PCT/CN2006/002757 CN2006002757W WO2007082433A1 WO 2007082433 A1 WO2007082433 A1 WO 2007082433A1 CN 2006002757 W CN2006002757 W CN 2006002757W WO 2007082433 A1 WO2007082433 A1 WO 2007082433A1
Authority
WO
WIPO (PCT)
Prior art keywords
video
module
audio
audio signal
signal
Prior art date
Application number
PCT/CN2006/002757
Other languages
English (en)
French (fr)
Inventor
Qingyu Zeng
Original Assignee
Huawei Technologies Co., Ltd.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Huawei Technologies Co., Ltd. filed Critical Huawei Technologies Co., Ltd.
Priority to EP06804975A priority Critical patent/EP1976290A4/en
Priority to CN200680011737A priority patent/CN100579196C/zh
Publication of WO2007082433A1 publication Critical patent/WO2007082433A1/zh

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/147Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/222Secondary servers, e.g. proxy server, cable television Head-end
    • H04N21/2225Local VOD servers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/4302Content synchronisation processes, e.g. decoder synchronisation
    • H04N21/4305Synchronising client clock from received content stream, e.g. locking decoder clock with encoder clock, extraction of the PCR packets
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/434Disassembling of a multiplex stream, e.g. demultiplexing audio and video streams, extraction of additional data from a video stream; Remultiplexing of multiplex streams; Extraction or processing of SI; Disassembling of packetised elementary stream
    • H04N21/4341Demultiplexing of audio and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/438Interfacing the downstream path of the transmission network originating from a server, e.g. retrieving MPEG packets from an IP network
    • H04N21/4382Demodulation or channel decoding, e.g. QPSK demodulation
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/633Control signals issued by server directed to the network components or client
    • H04N21/6332Control signals issued by server directed to the network components or client directed to client
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/643Communication protocols
    • H04N21/64322IP
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/65Transmission of management data between client and server
    • H04N21/654Transmission by server directed to the client
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/14Systems for two-way working
    • H04N7/141Systems for two-way working between two video terminals, e.g. videophone
    • H04N7/148Interfacing a video terminal to a particular transmission medium, e.g. ISDN

Definitions

  • the present invention relates to video and audio signal transmission technologies, and in particular, to a device for video audio signal transmission, a network device, and a video and audio signal transmission. Methods. Background of the invention
  • Multimedia applications for IP networks are becoming more mature and popular.
  • This multimedia network is a network application that integrates an audio transmission system and a video transmission system into a data transmission network. It is based on the IP network as the basic transmission network, on the basis of which a network structure of audio and video transmission systems is constructed. This network structure provides us with many multimedia applications, such as audio calls, video conferencing, IPTV, electronic white pages and so on.
  • the networking of a general multimedia network is shown in FIG. 1.
  • the user terminal is a communication and on-demand tool used by the user, and may be a videophone, a set top box, or the like.
  • the streaming media servers in Figure 1 belong to various application servers and are used to support services such as video conferencing and IPTV.
  • Step 23 The two parties exchange device information.
  • the structure of the video and audio receiving module of the videophone is as shown in FIG. 3, wherein the video decoding module and the audio decoding module can respectively decode one video stream and one audio stream, and then transmit the video and audio data to Play on the display and speakers.
  • the user can watch the video on demand using the terminal T1 shown in FIG. 3, the view of the terminal T1.
  • the process of establishing the frequency on-demand broadcast is shown in Figure 4, which is divided into the following two steps:
  • Step 41 The terminal T1 establishes a connection with the streaming media server, and the two parties exchange the on-demand information.
  • the present invention provides an apparatus for video and audio signal transmission, the apparatus comprising:
  • the receiving decoding module is configured to receive the call video audio signal and the streaming media video audio signal transmitted through the IP network, and respectively decode the received call video audio signal and the streaming media video audio signal, and the decoded video signal is input to the video synchronization. Module, the decoded audio signal is input to the mixer;
  • the video synchronization module is configured to superimpose and synchronize the received video signal
  • the mixer is used to mix the received audio signals.
  • the present invention also provides a network device, including: a receiving decoding module, a video synchronization module, and a mixer;
  • the receiving decoding module is configured to receive the call video audio signal and the streaming media video audio signal transmitted through the IP network, and respectively decode the received call video audio signal and the streaming media video audio signal, and the decoded video signal is input to the video synchronization. Module, decoded The audio signal is input to the mixer;
  • the video synchronization module is configured to superimpose and synchronize the received video signal
  • the present invention also provides a method for video audio signal transmission, the method comprising: transmitting a video audio signal sent to a visual terminal in a call, and a video audio signal sent by the streaming media server to the visual terminal, After video overlay synchronization and audio mixing, the user is played through the visual terminal.
  • the device, the network device and the method for transmitting video and audio signals provided by the invention can be used in the IP network communication, and the user terminal does not need to hang up the streaming media being played for IP calling while playing the streaming media;
  • the terminal screen can be displayed by picture-in-picture or split screen, and the end user can simultaneously hear the audio signal of the streaming media and the audio of the other party in the call, thereby ensuring that the user can watch the streaming media information while watching.
  • the audio and video chat is performed by the peer user.
  • FIG. 2 is a schematic diagram of establishing an IP multimedia communication in the prior art
  • FIG. 3 is a schematic diagram of a multimedia video and audio receiving module in a prior art videophone
  • FIG. 4 is a schematic diagram of a user viewing a streaming media through a videophone in the prior art
  • FIG. 5 is a block diagram of a multimedia video and audio receiving module in a videophone according to a first embodiment of the present invention
  • FIG. 6 is a schematic structural diagram of a video synchronization module according to a first embodiment of the present invention.
  • FIG. 7 is a schematic diagram of a call initiated during playback of a streaming media according to a first embodiment of the present invention
  • FIG. 8 is a network diagram of a data synthesizing server scheme according to a second embodiment of the present invention.
  • FIG. 11 is a flow chart showing a call initiated by a user terminal when playing streaming media according to a second embodiment of the present invention.
  • BEST MODE FOR CARRYING OUT THE INVENTION The present invention will be further described in detail below with reference to the accompanying drawings.
  • the main idea of the present invention is to perform superimposed synchronization and mixing processing on the streaming video and audio signals and the call video and audio signals transmitted to the visual terminal through the video synchronization module and the mixer, so that the user can watch and listen through the visual terminal. Streaming the program and making a call at the same time.
  • the present invention can implement the above-mentioned superimposition synchronization and mixing processing on the terminal side, and can also implement the above-described superimposition synchronization and mixing processing on the network side, which will be described below by way of specific embodiments.
  • the above-described superimposition synchronization and mixing processing is implemented on the terminal side.
  • a new video and audio receiving module is set in the videophone.
  • the multimedia video and audio receiving module in the videophone includes the following submodules:
  • the receiving decoding module is configured to receive the call video audio signal and the streaming media video audio signal transmitted through the IP network, and respectively decode the received call video audio signal and the streaming media video audio signal, and decode the two video data inputs.
  • the decoded two channels of audio data are input to the mixer;
  • a video synchronization module configured to superimpose and synchronize two received video data; and a mixer for mixing the received two audio data.
  • the receiving and decoding module may include:
  • Interface module Video and audio data used to transmit the IP network collected by each port of the videophone device, including the call video signal and the call audio signal in the call stream, and the streaming media
  • the streaming video signal and the streaming audio signal in the body code stream are respectively transmitted to corresponding decoding modules.
  • the interface module can include four ports: PORT01, PORT02, PORT03 and PORT04, respectively for receiving the above-mentioned streaming video signal, streaming audio signal, call video signal and call audio signal.
  • the first video decoding module is configured to: decode the call video signal sent by the call peer end from the interface module, and output the decoded image data to the video synchronization module.
  • the second video decoding module is configured to: decode the streaming video signal from the interface module, and output the decoded image data to the video synchronization module.
  • the first audio decoding module is configured to: decode the call audio signal sent by the call peer from the interface module, and output the decoded audio data to the mixer.
  • the second audio decoding module is configured to decode the streaming audio signal from the interface module, and output the decoded audio data to the mixer.
  • the video synchronization module superimposes and synchronizes two pieces of image data from the first video decoding module and the second video decoding module, and outputs the playable video image to the display screen.
  • the specific structure of the video synchronization module can be as shown in FIG. 6, which includes two video frame registers and an adder, and the two video frame registers are respectively used for storing the call end image data from the first video decoding module, and from the second Streaming image data of the video decoding module. If any video frame register is refreshed, the image data in the two frame registers is superimposed and output by the adder; if no new data is passed to the frame register, the frame register always saves the previous image data.
  • This design can solve the problem that the frame rate of a certain video signal will appear on the display when the frame rates of the two video signals are not equal.
  • the image data superimposed by the superimposer can be output to the display screen by means of split screen display or picture-in-picture display, so that the display screen simultaneously displays image data from the opposite end of the call and streaming image data from the streaming media server; Sends an instruction to the adder to control the output mode of the overlay, that is, whether to use the split screen display mode or the picture-in-picture display mode.
  • the above mixer mixes the audio data from the first audio decoding module and the audio data of the second audio decoding module, and outputs the playable audio data to the sound playing device.
  • the sound playback device here is usually a speaker.
  • the proportion of each stream is mixed by the user, that is, the user can send a command to the mixer to control whether the voice of the call is large or the sound of the stream is large, or the sound of one stream is completely listened to.
  • FIG. 7 is a flow chart of a call initiated during streaming of media.
  • Tl, ⁇ 2 are two user terminals.
  • the videophone, T1 initiates a call during the playing of the streaming media.
  • the process of the ⁇ 2 includes the following steps: Step 701: The T1 and the streaming media server perform information exchange to establish a connection, and the two parties exchange the information.
  • Step 702 The T1 receives the multimedia information from the streaming media server, that is, the video audio code stream, and performs decoding and playing.
  • the step specifically includes: T1 receives the video audio signal stream transmitted by the streaming media server in PORT01 and PORT02 of the interface module, and the interface module transmits the code stream of the PORT01 to the second video decoding module, and transmits the code stream of the PORT02.
  • the second audio decoding module is provided.
  • the second video decoding module and the second audio decoding module start to work, respectively output displayable image data and playable audio data, and the output video data and audio data are respectively transmitted to the video synchronization module and the mixer, according to the user's Choose to pass data to the display and speakers for playback.
  • T1 can choose whether to play the video and whether to play the audio.
  • Step 704 ⁇ 2 replies, in step 705, T1 and ⁇ 2 perform information exchange, exchange device information, and then, in step 706, after the call is connected, T1 and ⁇ 2 perform multimedia communication, and T1 receives video and audio messages from the call end ⁇ 2. Number stream.
  • the terminal T1 performs multiplex decoding, then performs video synchronization, superimposition, and audio mixing, and plays the video audio signal.
  • the T1 receives the chirp frequency and audio signal stream sent by the opposite end T2 in the PORT03 and PORT04 of the interface module, and the interface module transmits the code stream of the PORT03 to the first video decoding module, and transmits the code of the PORT04.
  • the first video decoding module and the first audio decoding module start to work, and output displayable image data and playable audio data, respectively, to the video synchronization module and the mixer.
  • the two channels of video data transmitted to the video synchronization module are: video data from the streaming server and video data from the T2, and the two video data are superimposed and synchronized by the video synchronization module and sent to the display;
  • the device includes two audio data: audio data from the streaming server and audio data from T2, and the two audio data is mixed by the mixer and sent to the speaker.
  • the superimposing manner of the video synchronizing module may be the default mode selected by T1.
  • the mixing mode of the mixer may also be the default mode selected by T1.
  • T1 can configure the working mode of the video synchronization module and the mixer in real time through the user interface, that is, sending a video synchronization instruction to the video synchronization module or sending a mixing mode instruction to the mixing, then the video synchronization
  • the module or mixer determines the mode of operation based on the received instructions.
  • the first video decoding module and the first audio decoding module stop working, and T1 only plays the video audio data from the streaming server. Then if T1 turns off the streaming media, the second video decoding module and the second audio decoding module stop working, and T1 stops playing the video audio data from the streaming server. The user can then turn off the display and the speaker.
  • T1 can also close the streaming media first, then T1 only plays the video and audio data from the opposite end of the call T2, and then stops playing the video and audio data from the opposite end of the call after the call ends.
  • the steps for the user to watch the streaming media during the call are similar to the above steps, and will not be described here.
  • FIG. 8 The system networking of the second embodiment of the present invention is as shown in FIG. Compared to Figure 1, a device added in Figure 8 is a data synthesis server.
  • the user terminal cannot provide multiple video and audio decoding modules due to cost and other factors, only one video and audio decoding module is provided, and the decoding device-data synthesizing server can be added on the network side.
  • the media redirection function can be utilized to enable the streaming media stream of the streaming media server and the video audio code stream of the call peer end to be sent to the data synthesizing server, and the data synthesizing server performs multi-channel decoding and video.
  • the work of superimposing, audio mixing, and the like is sent to the user terminal by using one video code stream and one audio code stream respectively.
  • the user terminal adopts the structure of FIG. This also achieves the purpose of playing streaming media while making video calls.
  • the structure of the data synthesizing server is as shown in FIG. 9, and includes a multimedia video audio receiving module, a video encoding module, an audio encoding module, and an interface module.
  • the structure of the multimedia video and audio receiving module is shown in FIG.
  • the multimedia video audio receiving module shown in FIG. 10 includes an interface module, a first video decoding module, a second video decoding module, a first audio decoding module, a second audio decoding module, a video synchronization module, and a mixer, wherein the interface module and
  • the functions of the interface modules in the first embodiment are basically the same, and the functions of the other modules have also been described in detail in the first embodiment.
  • FIG. 10 The structure of the data synthesizing server is as shown in FIG. 9, and includes a multimedia video audio receiving module, a video encoding module, an audio encoding module, and an interface module.
  • the structure of the multimedia video and audio receiving module is shown in FIG.
  • the multimedia video audio receiving module shown in FIG. 10 includes an interface module
  • the video data output by the video synchronization module is output to the video encoding module, and the video encoding module encodes the video data, and then sends the interface module to the interface module through the interface module; the audio data output by the mixer is output to the audio encoding.
  • the module encodes the audio data by the audio encoding module and then sends the interface module to the terminal through the interface module.
  • Step 1101 The user terminal videophone T1 communicates normally with the streaming media server, and successfully views the streaming media.
  • Step 1102 T1 initiates a call to the user terminal videophone T2, and at the same time commands T2 to send the video audio code stream to the data synthesizing server.
  • Step 1103 notifies the data synthesizing server to start working, and notifies the streaming media server to send the video audio data to the data synthesizing server.
  • T1 can report the working mode of the video synchronization module and the mixer selected by the user to the data synthesizing server at the same time.
  • the interface module in the data synthesizing server After receiving the working mode of the video synchronizing module and the mixer selected by the user, the interface module in the data synthesizing server will The corresponding control signals are transmitted to the video synchronizing module and the mixer to perform superimposition and synchronization of the video signals and mixing of the audio signals according to the mode of operation selected by the user.
  • Step 1104 the data composition server starts to receive the video audio data sent by the T2 and the streaming server.
  • the workflow of the data synthesizing server specifically includes: after receiving the startup command sent by the terminal, starting to work to receive the video audio code stream; after receiving the video audio code stream sent by the terminal and the streaming media server,
  • the multimedia video audio receiving module processes the video and audio data that can be used for playing to the video encoding module and the audio encoding module; the video encoding module and the audio encoding module encode the received data, and send the data to the user terminal through the interface module.
  • Step 1105 T1 sends the local video and audio data to T2.
  • Step 1106 receives the synthesized video and audio data sent by the data synthesizing server, and displays and plays.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Telephonic Communication Services (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Description

用于视频音频信号传输的装置、 网络设备及方法 技术领域 本发明涉及视频音频信号传输技术, 特别涉及一种用于视频音频信 号传输的装置、 一种网絡设备和一种用于视频音频信号传输的方法。 发明背景
IP网络的多媒体应用日益成熟和普及。这种多媒体网络是将音频传 输系统、 视频传输系统集成到数据传输网络的一种网络应用。 它是以 IP 网络为基本传输网络, 在此基础上构建音频、 视频传输系统的一种网络 结构。 这种网络结构给我们提供了许多的多媒体应用, 比如音频通话, 视频会议, IPTV, 电子白版等。 一般多媒体网络的组网如图 1所示, 其 中用户终端是用户使用的通讯和点播工具, 可以是可视电话、 机顶盒等 等。图 1中的流媒体服务器都属于各种应用服务器,用于支持视频会议, IPTV等业务。
在多媒体网络的众多业务中, 有一种业务使用户通过可视电话可以 进行呼叫通话。 多媒体通讯的呼叫通话建立如图 2所示, 分为如下四个 步骤:
步據 21 , 终端 T1发起呼叫;
步骤 22, 终端 T2作为被叫应答 T1 ;
步驟 23 , 双方交换设备信息;
步骤 24, 通讯建立成功, 双方交换多媒体数据。
在多媒体通讯过程中, 可视电话的视频音频接收模块结构如图 3所 示, 其中的视频解码模块和音频解码模块分别可以解码一路视频码流和 一路音频码流, 然后将视频音频数据传到显示屏和扬声器上进行播放。
用户可以使用图 3所示的终端 T1观看视频点播,所述终端 T1的视 频点播的建立过程如图 4所示, 分为如下两个步骤:
步驟 41 , 终端 T1与流媒体服务器建立连接, 双方交换点播信息; 步骤 42, 终端 T1接收流媒体传来的影音数据。
如上所述的现有技术只能同时进行一路视频码流和一路音频码流 的解码, 当终端用户处于通话状态时就不能同时观看流媒体; 当终端用 户进行视频点播, 即观看流媒体时, 如有来电, 必须断开流媒体播放, 才能接听来电。 发明内容 有鉴于此, 本发明的主要目的在于提供一种用于视频音频信号传输 的装置、 网络设备及方法, 以解决用户不能使用一台终端同时播放流媒 体和进行呼叫通话的问题。
为了达到上述目的, 本发明提供了一种用于视频音频信号传输的装 置, 该装置包括:
接收解码模块、 视频同步模块和混音器; 其中,
接收解码模块用于接收通过 IP 网络传送来的通话视频音频信号和 流媒体视频音频信号 , 并分别对接收的通话视频音频信号和流媒体视频 音频信号进行解码, 解码后的视频信号输入到视频同步模块, 解码后的 音频信号输入到混音器;
视频同步模块用于对接收的视频信号进行叠加和同步;
混音器用于对接收的音频信号进行混合。
本发明还提供了一种网络设备, 该网络设备包括,. 接收解码模块、 视频同步模块和混音器; 其中,
接收解码模块用于接收通过 IP 网络传送来的通话视频音频信号和 流媒体视频音频信号, 并分别对接收的通话视频音频信号和流媒体视频 音频信号进行解码, 解码后的视频信号输入到视频同步模块, 解码后的 音频信号输入到混音器;
视频同步模块用于对接收的视频信号进行叠加和同步;
混音器用于对接收的音频信号进行混合。
本发明还提供了一种用于视频音频信号传输的方法, 该方法包括: 将呼叫中发送给可视终端的视频音频信号, 与流媒体服务器发送给 所述可视终端的视频音频信号, 进行视频叠加同步和音频混音后, 通过 所述可视终端播放给用户。
釆用本发明所提供的用于视频音频信号传输的装置、 网络设备及方 法, 可以在 IP网络通讯中,用户终端在播放流媒体的同时不需要挂断正 在播放的流媒体进行 IP呼叫;在通话过程中,终端屏幕可采用画中画或 分屏等方式显示, 且终端用户能同时听到流媒体的音频信号和呼叫中对 方的音频, 从而保证了用户可以一边观看欣赏流媒体信息一边与对端用 户进行音频视频聊天。 附图简要说明 图 1为现有技术中 BP多媒体通讯组网示意图;
图 2为现有技术中 IP多媒体通讯建立示意图;
图 3为现有技术可视电话中多媒体视频音频接收模块架图; 图 4为现有技术中用户通过可视电话观看流媒体示意图;
图 5为本发明第一实施例的可视电话中多媒体视频音频接收模块框 架图;
图 6为本发明第一实施例中视频同步模块的组成示意图;
图 7为本发明第一实施例中播放流媒体期间发起呼叫示意图; 图 8为本发明第二实施例中数据合成服务器方案组网图;
图 9为本发明第二实施例中数据合成服务器的组成结构示意图; 图 10为本发明第二实施例中数据合成服务器视频音频处理模块结 构示意图;
图 11 为本发明第二实施例中用户终端播放流媒体时发起呼叫的流 程图。 实施本发明的方式 为使本发明的目的、 技术方案和优点更加清楚, 下面结合附图对本 发明作进一步的详细描述。
本发明的主要思想是, 通过视频同步模块和混音器对发送给可视终 端的流媒体视音频信号和通话视音频信号进行叠加同步和混音处理, 从 而使用户通过可视终端可以收看收听流媒体节目并同时进行通话。
本发明可以在终端侧实现上述叠加同步和混音处理, 也可以在网络 侧实现上述叠加同步和混音处理, 以下分别通过具体实施例进行说明。
本发明的第一实施例中 , 在终端侧实现上述叠加同步和混音处理。 以可视终端为可视电话为例, 本实施例中在可视电话中设置新的视频音 频接收模块, 如图 5为所示, 可视电话中的多媒体视频音频接收模块包 括以下子模块:
接收解码模块, 用于接收通过 IP网络传送来的通话视频音频信号和 流媒体视频音频信号 , 并分别对接收的通话视频音频信号和流媒体视频 音频信号进行解码, 解码后的两路视频数据输入到视频同步模块, 解码 后的两路音频数据输入到混音器;
视频同步模块, 用于对接收的两路视频数据进行叠加和同步; 混音器, 用于对接收的两路音频数据进行混合。
具体地, 接收解码模块中可以包括:
接口模块: 用于把可视电话设备各端口采集到的 IP网络传来的视频 音频数据, 包括通话码流中的通话视频信号和通话音频信号, 以及流媒 体码流中的流媒体视频信号和流媒体音频信号, 分别传送到对应的解码 模块。 接口模块中可包括四个端口: PORT01、 PORT02, PORT03和 PORT04, 分别用来接收上述的流媒体视频信号、 流媒体音频信号、 通 话视频信号和通话音频信号。
第一视频解码模块: 用于对来自接口模块的通话对端发送的通话视 频信号进行解码, 并将解码后的图像数据输出到视频同步模块。
第二视频解码模块: 用于对来自接口模块的流媒体视频信号进行解 码, 并将解码后的图像数据输出到视频同步模块。
第一音频解码模块: 用于对来自接口模块的通话对端发送的通话音 频信号进行解码, 并将解码后的音频数据输出到混音器。
第二音频解码模块: 用于对来自接口模块的流媒体音频信号进行解 码, 并将解码后的音频数据输出到混音器。
则上述视频同步模块叠加和同步来自第一视频解码模块和第二视 频解码模块的两路图像数据 , 并输出可播放的视频图像到显示屏。
视频同步模块的具体结构可以如图 6所示, 其中包括两个视频帧寄 存器和叠加器, 两个视频帧寄存器分别用于存放来自第一视频解码模块 的通话对端图像数据, 和来自第二视频解码模块的流媒体图像数据。 任 一视频帧寄存器被刷新, 便通过叠加器将两个帧寄存器中的图像数据进 行一次叠加并输出; 若没有新数据传入帧寄存器, 帧寄存器始终保存以 前的图像数据。 这样设计可解决两路视频信号帧率不相等时, 显示屏上 会出现某路视频信号缺帧现象的问题。 叠加器叠加后的图像数据可以通 过分屏显示或画中画显示的方式, 输出到显示屏, 使显示屏同时显示来 自通话对端的图像数据和来自流媒体服务器的流媒体图像数据; 用户可 以通过向叠加器发送指令来控制叠加器的输出方式, 即选择采用分屏显 示方式还是采用画中画显示方式。 而上述混音器混合来自第一音频解码模块的音频数据和第二音频 解码模块的音频数据两路码流, 并输出可播放的音频数据到声音播放设 备。 这里的声音播放设备通常为扬声器。 这里每路码流混合的比例由用 户选择, 即用户可以通过向混音器发送指令控制是通话的声音大还是流 媒体的声音大, 或者完全只听一路码流的声音。
图 7为播放流媒体期间发起呼叫的流程图。 Tl、 Τ2为两个用户终端 可视电话, T1在播放流媒体期间发起呼叫 Τ2的流程具体包括如下步骤: 步骤 701, T1与流媒体服务器进行信息交互建立连接, 双方交换点 播信息。
步骤 702, T1接收来自流媒体服务器的多媒体信息, 即视频音频码 流, 并进行解码播放。
本步驟具体包括: T1在自身接口模块的 PORT01和 PORT02, 接收由 流媒体服务器传来的视频音频信号码流, 接口模块将 PORT01的码流传 送给第二视频解码模块, 将 PORT02的码流传送给第二音频解码模块。 第二视频解码模块和第二音频解码模块开始工作, 分别输出可显示的图 像数据和可播放的音频数据, 该输出的视频数据和音频数据分别传入视 频同步模块和混音器, 依据用户的选择把数据传入显示屏和扬声器进行 播放。 此时 T1可以选择是否播放视频, 以及选择是否播放音频。
步骤 703, T1呼叫 Τ2。
步骤 704, Τ2应答, 在步骤 705, T1和 Τ2进行信息交互, 交换设备信 息, 然后在步骤 706, 呼叫接通后, T1和 Τ2进行多媒体通讯, T1接收来 自通话对端 Τ2的视频和音频信号码流。
此后, 终端 T1进行多路解码, 然后进行视频同步、 叠加, 以及音频 混合, 并播放视频音频信号。 具体包括: T1在自身接口模块的 PORT03和 PORT04, 接收由通话对 端 T2发送来的枧频和音频信号码流, 接口模块将 PORT03的码流传送给 第一视频解码模块, 将 PORT04的码流传送给第一音频解码模块。 第一 视频解码模块和第一音频解码模块开始工作, 分别输出可显示的图像数 据和可播放的音频数据, 传送给视频同步模块和混音器。
则传送给视频同步模块的包括两路视频数据: 来自流媒体服务器的 视频数据和来自 T2的视频数据, 两路视频数据通过视频同步模块进行叠 加和同步后发送到显示屏; 而传送给混音器的包括两路音频数据: 来自 流媒体服务器的音频数据和来自 T2的音频数据, 两路音频数据通过混音 器混音后发送到扬声器。 这里, 视频同步模块的叠加方式可以是由 T1选 择的默认方式, 类似地, 混音器的混音方式也可以是由 T1选择的默认方 式。
此外, 在通话过程中, T1可通过用户界面实时地对视频同步模块和 混音器的工作方式进行配置, 即向视频同步模块发送视频同步指令或向 混音发送混音方式指令, 则视频同步模块或混音器根据接收的指令确定 工作方式。
在通话结束, T1挂断呼叫后, 第一视频解码模块和第一音频解码模 块停止工作, T1只播放来自流媒体服务器的视频音频数据。 然后如果 T1 关闭流媒体, 第二视频解码模块和第二音频解码模块停止工作, T1停止 播放来自流媒体服务器的视频音频数据。 此后用户可关闭显示屏和扬声 器。
当然, T1也可以先关闭流媒体, 则 T1只播放来自通话对端 T2的视频 音频数据, 而后在通话结束后, 再停止播放来自通话对端的视频音频数 据。 通话期间用户观看流媒体的步骤与上述步骤类似, 这里就不再叙述 了。
本发明的第二个实施例的系统組网如图 8所示。 相对于图 1 , 图 8中 增加了一个设备是数据合成服务器。 当用户终端由于成本等因素不能提 供多个视频音频解码模块, 仅提供视频音频解码模块各一个, 可在网络 侧增加解码设备一一数据合成服务器。 在同时进行流媒体播放和通话 时, 可利用媒体改向功能, 使流媒体服务器的流媒体码流和通话对端的 视频音频码流发送到数据合成服务器, 由数据合成服务器进行多路解 码、 视频叠加、 音频混合等工作, 再分别以一路视频码流和一路音频码 流发送给用户终端, 本实施例中用户终端采用图 3的结构。 这样也可以 达到播放流媒体同时进行视频通话的目的。
数据合成服务器的结构如图 9所示, 包括多媒体视频音频接收模块、 视频编码模块和音频编码模块和接口模块。 其中多媒体视频音频接收模 块的结构如图 10所示。 图 10所示的多媒体视频音频接收模块包括接口模 块、 第一视频解码模块、 第二视频解码模块、 第一音频解码模块、 第二 音频解码模块、 视频同步模块和混音器, 其中接口模块与第一实施例中 接口模块的功能基本相同, 其他各个模块的功能也已在第一实施例中进 行了详细说明。 在图 9中, 视频同步模块输出的视频数据输出到视频编 码模块, 由视频编码模块对视频数据进行编码, 然后通过接口模块发送 到终端的接口模块; 混音器输出的音频数据输出到音频编码模块, 由音 频编码模块对音频数据进行编码, 然后通过接口模块发送到终端的接口 模块。
本实施例方案中用户终端播放流媒体时发起呼叫的流程如图 11所 示, 步骤如下: 步骤 1101 , 用户终端可视电话 Tl与流媒体服务器正常通信, 并成功 观看流媒体。
步骤 1102, T1向用户终端可视电话 T2发起呼叫, 同时命令 T2将视频 音频码流发送到数据合成服务器。
步骤 1103 , T1通知数据合成服务器开始工作, 并通知流媒体服务器 将视频音频数据发送到数据合成服务器。
这里, T1可以同时将用户选择的视频同步模块和混音器的工作方式 上报给数据合成服务器, 数据合成服务器中的接口模块接收到用户选择 的视频同步模块和混音器的工作方式后, 将对应的控制信号传送到视频 同步模块和混音器, 从而根据用户所选择的工作方式执行视频信号的叠 加和同步以及音频信号的混音操作。
步骤 1104, 数据合成服务器开始接收由 T2和流媒体服务器发送过来 的视频音频数据。
本实施例中, 数据合成服务器的工作流程具体包括: 收到终端发来 的启动命令后, 开始工作准备接收视频音频码流; 接收到终端和流媒体 服务器发送来的视频音频码流后, 经过多媒体视频音频接收模块处理后 输出可用于播放的视频和音频数据到视频编码模块和音频编码模块; 视 频编码模块和音频编码模块对接收到的数据进行编码, 通过接口模块 , 发送给用户终端。
步骤 1105, T1向 T2发送本端的视频音频数据。
步骤 1106, T1接收由数据合成服务器发送过来的合成视频音频数 据, 并显示与播放。
通话期间终端观看流媒体的步驟与上述步骤类似, 这里就不再叙述 了。
以上所述仅为本发明的较佳实施例而 , 并不用以限制本发明, 凡在本发明的精神和原则之内,所做的任何修改、等同替换、改进等 均应包含在本发明的保护范围之内。

Claims

权利要求书
1、 一种用于视频音频信号传输的装置, 其特征在于, 该装置包括: 接收解码模块、 视频同步模块和混音器; 其中,
接收解码模块用于接收通过 IP 网络传送来的通话视频音频信号和 流媒体视频音频信号, 并分别对接收的通话视频音频信号和流媒体视频 音频信号进行解码, 解码后的视频信号输入到 ¾L频同步模块, 解码后的 音频信号输入到混音器;
视频同步模块用于对接收的视频信号进行叠加和同步;
混音器用于对接收的音频信号进行混合。
2、 根据权利要求 1 所述的装置, 其特征在于, 所述的接收解码模 块包括: 接口模块、 第一视频解码模块、 第二视频解码模块、 第一音频 解码模块、 第二音频解码模块;
接口模块用于把 IP网络传来的通话视频音频信号 ,以及流媒体视频 音频信号传送到对应的解码模块;
第一视频解码模块用于对来自接口模块的通话视频信号进行解码, 并将解码后的视频信号输出到视频同步模块;
第二视频解码模块用于对来自接口模块的流媒体视频信号进行解 码, 并将解码后的视频信号输出到视频同步模块;
第一音频解码模块用于对来自接口模块的通话音频信号进行解码, 并将解码后的音频信号输出到混音器;
第二音频解码模块用于对来自接口模块的流媒体音频信号进行解 码, 并将解码后的音频信号输出到混音器。
3、 根据权利要求 1或 2所述的装置, 其特征在于, 所述视频同步 模块包括分别用来存放流媒体视频信号和通话视频信号的视频帧寄存 器, 所述任一帧寄存器被刷新, 便进行一次叠加输出; 若没有新数据输 入帧寄存器, 帧寄存器始终保存以前的视频信号。
4、 一种网络设备, 其特征在于, 该网络设备包括: 接收解码模块、 视频同步模块和混音器; 其中,
接收解码模块用于接收通过 IP 网络传送来的通话视频音频信号和 流媒体视频音频信号, 并分别对接收的通话视频音频信号和流媒体视频 音频信号进行解码, 解码后的视频信号输入到视频同步模块, 解码后的 音频信号输入到混音器;
视频同步模块用于对接收的视频信号进行叠加和同步;
混音器用于对接收的音频信号进行混合。
5、 根据权利要求 4所述的网络设备, 其特征在于, 该网络设备为 可视终端。
6、 根据权利要求 5 所述的网络设备, 其特征在于, 所述可视终端 中进一步包括: 显示屏和声音播放设备;
则所述视频同步模块将经过叠加同步的视频信号输入可视终端的 显示屏; 所述的混音器将经过混音的音频信号输入可视终端的声音播放 设备。
7、 根据权利要求 6所述的网络设备, 其特征在于, 所述视频同步 模块根据用户输入的指令选择同步叠加方式; 所述的混音器根据用户输 入的指令选择混音方式。
8、 根据权利要求 4所述的网络设备, 其特征在于, 所述网络设备 设置于网络侧。
9、 根据权利要求 8 所述的网络设备, 其特征在于, 该网络设备中 进一步包括: 视频编码模块、 音频编码模块和接口模块;
所述视频同步模块输出的视频信号输入视频编码模块; 所述混音器 输出的音频信号输入音频编码模块; 所述的视频编码模块和音频编码模 块分别将所输出的视频信号和音频信号通过接口模块发送到可视终端。
10、 根据权利要求 4所述的网络设备, 其特征在于, 所述的接收解 码模块包括: 接口模块、 第一视频解码模块、 第二视频解码模块、 第一 音频解码模块、 第二音频解码模块;
接口模块用于把 ip网络传来的通话视频音频信号,以及流媒体视频 音频信号传送到对应的解码模块;
第一视频解码模块用于对来自接口模块的通话视频信号进行解码, 并将解码后的视频信号输出到视频同步模块;
第二视频解码模块用于对来自接口模块的流媒体视频信号进行解 码, 并将解码后的视频信号输出到视频同步模块;
第一音频解码模块用于对来自接口模块的通话音频信号进行解码, 并将解码后的音频信号输出到混音器;
第二音频解码模块用于对来自接口模块的流媒体音频信号进行解 码, 并将解码后的音频信号输出到混音器。
11、 根据权利要求 4或 10所述的网络设备, 其特征在于, 所述视 频同步模块包括分别用来存放流媒体视频信号和通话视频信号的视频 帧寄存器, 所述任一帧寄存器被刷新, 便进行一次叠加输出; 若没有新 数据输入帧寄存器, 帧寄存器始终保存以前的视频信号。
12、一种用于视频音频信号传输的方法, 其特征在于, 该方法包括: 将呼叫中发送给可视终端的视频音频信号, 与流媒体服务器发送给 所述可视终端的视频音频信号, 进行视频叠加同步和音频混音后, 通过 所述可视终端播放给用户。
13、 根据权利要求 12所述的方法, 其特征在于, 所述将呼叫中发 送给可视终端的视频音频信号, 与流媒体服务器发送给所述可视终端的 视频音频信号, 进行视频叠加同步和音频混音为:
所述可视终端接收来自通话对端的视频音频信号, 与流媒体服务器 发送给所述可视终端的视频音频信号, 对接收的两路视频信号进行叠加 和同步, 并对接收的两路音频信号进行混音。
14、 根据权利要求 13 所述的方法, 其特征在于, 该方法进一步包 括: 用户向所述可视终端发送工作方式指令, 所述可视终端根据接收的 工作方式指令确定工作方式。
15、 根据权利要求 12所述的方法, 其特征在于, 所述将呼叫中发 送给可视终端的视频音频信号, 与流媒体服务器发送给所述可视终端的 视频音频信号, 进行视频叠加同步和混音为:
呼叫中的所述可视终端通知通话对端将呼叫中的视频音频信号发 送给网络侧设置的数据合成服务器, 通知流媒体服务器将发送给所述可 视终端的视频音频信号发送给数据合成服务器;
所述数据合成服务器接收呼叫中的视频音频信号与流媒体服务器 发送给所述可视终端的视频音频信号, 并对接收的两路视频信号进行叠 加和同步, 对接收的两路音频信号进行混音;
则所述通过可视终端播放给用户包括:
所述数据合成服务器将叠加和同步后的视频信号以及混音后的音 频信号, 分别进行编码后, 发送给所述可视终端, 所述可视终端将接收 到视频信号和音频信号播放给用户。
16、 才 M居权利要求 15 所述的方法, 其特征在于, 该方法进一步包 括: 用户通过可视终端向所述数据合成服务器发送工作方式指令, 所述 数据合成服务器根据接收的工作方式指令确定工作方式。
PCT/CN2006/002757 2006-01-18 2006-10-18 Appareil, dispositif de réseau et procédé de transmission de signaux audio et vidéo WO2007082433A1 (fr)

Priority Applications (2)

Application Number Priority Date Filing Date Title
EP06804975A EP1976290A4 (en) 2006-01-18 2006-10-18 APPARATUS, NETWORK DEVICE, AND METHOD FOR TRANSMITTING AUDIO AND VIDEO SIGNALS
CN200680011737A CN100579196C (zh) 2006-01-18 2006-10-18 用于视频音频信号传输的装置、网络设备及方法

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNB2006100331692A CN100531382C (zh) 2006-01-18 2006-01-18 一种用于可视电话视频音频信号传输的装置及方法
CN200610033169.2 2006-01-18

Publications (1)

Publication Number Publication Date
WO2007082433A1 true WO2007082433A1 (fr) 2007-07-26

Family

ID=37298422

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2006/002757 WO2007082433A1 (fr) 2006-01-18 2006-10-18 Appareil, dispositif de réseau et procédé de transmission de signaux audio et vidéo

Country Status (5)

Country Link
US (1) US7973859B2 (zh)
EP (1) EP1976290A4 (zh)
CN (2) CN100531382C (zh)
FR (1) FR2896372B1 (zh)
WO (1) WO2007082433A1 (zh)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014023184A1 (zh) * 2012-08-09 2014-02-13 腾讯科技(深圳)有限公司 音视频点播方法、服务器、终端以及系统
CN110730362A (zh) * 2019-10-22 2020-01-24 中国农业科学院农业信息研究所 一种低流量视频通讯传输系统及方法

Families Citing this family (26)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8856371B2 (en) * 2006-08-07 2014-10-07 Oovoo Llc Video conferencing over IP networks
KR101434616B1 (ko) * 2007-04-04 2014-08-27 삼성전자주식회사 통신방법 및 이를 적용한 네트워크 디스플레이 장치
EP2020795B1 (en) * 2007-08-03 2017-11-22 Nokia Solutions and Networks Oy Method and network equipment for maintaining a media stream through another network equipment while suspending an associated media stream connection in a communication network
KR100867004B1 (ko) 2007-09-19 2008-11-10 한국전자통신연구원 시청자 참여를 위한 양방향성 iptv 방송 서비스 방법및 그 시스템
KR101487434B1 (ko) * 2007-11-14 2015-01-29 삼성전자 주식회사 디스플레이장치 및 그 제어방법
CN101282386B (zh) * 2008-05-22 2010-11-10 中山大学 一种voip服务器端同步混音转发方法
JP5410720B2 (ja) * 2008-09-25 2014-02-05 日立コンシューマエレクトロニクス株式会社 ディジタル情報信号送受信装置、およびディジタル情報信号送受信方法
CN101924903B (zh) * 2009-06-17 2013-03-20 华为技术有限公司 实现视频通话的方法、装置和系统
US11711592B2 (en) 2010-04-06 2023-07-25 Comcast Cable Communications, Llc Distribution of multiple signals of video content independently over a network
US10448083B2 (en) 2010-04-06 2019-10-15 Comcast Cable Communications, Llc Streaming and rendering of 3-dimensional video
CN102111650A (zh) * 2011-02-28 2011-06-29 中兴通讯股份有限公司 一种在可视固定电话上实现移动电视功能的方法和装置
US8683013B2 (en) * 2011-04-18 2014-03-25 Cisco Technology, Inc. System and method for data streaming in a computer network
US9191413B2 (en) * 2011-11-01 2015-11-17 T-Mobile Usa, Inc. Synchronizing video and audio over heterogeneous transports
CN102523416A (zh) * 2011-11-21 2012-06-27 苏州希图视鼎微电子有限公司 多段媒体流的无缝播放方法
CN102833520A (zh) * 2012-08-16 2012-12-19 华为技术有限公司 一种视频会议信号处理的方法、视频会议服务器及系统
CN103680550B (zh) * 2012-09-17 2017-02-08 扬智科技股份有限公司 多视窗架构下的音频播放方法、音频播放装置与系统
US9232176B2 (en) * 2013-03-04 2016-01-05 Janus Technologies, Inc. Method and apparatus for securing computer video and audio subsystems
CN104038805B (zh) * 2013-03-06 2017-06-27 富泰华工业(深圳)有限公司 显示视频图像的电视机及显示视频图像的方法
US9071798B2 (en) 2013-06-17 2015-06-30 Spotify Ab System and method for switching between media streams for non-adjacent channels while providing a seamless user experience
US10097604B2 (en) 2013-08-01 2018-10-09 Spotify Ab System and method for selecting a transition point for transitioning between media streams
US9529888B2 (en) 2013-09-23 2016-12-27 Spotify Ab System and method for efficiently providing media and associated metadata
US9917869B2 (en) 2013-09-23 2018-03-13 Spotify Ab System and method for identifying a segment of a file that includes target content
US9063640B2 (en) 2013-10-17 2015-06-23 Spotify Ab System and method for switching between media items in a plurality of sequences of media items
CN112533056B (zh) * 2019-09-17 2022-10-28 海信视像科技股份有限公司 一种显示设备及声音再现方法
CN111356009B (zh) * 2020-02-26 2022-05-31 北京大米科技有限公司 音频数据的处理方法、装置、存储介质以及终端
CN113840161B (zh) * 2020-06-23 2023-07-25 龙芯中科技术股份有限公司 流媒体传输方法、接收方法、装置、电子设备及储存介质

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07184174A (ja) * 1993-12-24 1995-07-21 Sharp Corp テレビ電話機能内蔵テレビジョン受像機
JP2005151446A (ja) * 2003-11-19 2005-06-09 Sharp Corp 携帯端末、クレードル、携帯端末の制御方法、クレードルの制御方法、制御プログラム、および該プログラムを記録した記録媒体
JP2005252774A (ja) * 2004-03-05 2005-09-15 Matsushita Electric Ind Co Ltd テレビ付き携帯電話機
JP2005269508A (ja) * 2004-03-22 2005-09-29 Casio Comput Co Ltd 通信端末装置および通信端末処理プログラム
WO2006003852A1 (en) * 2004-07-02 2006-01-12 Matsushita Electric Industrial Co., Ltd. Av stream reproducing apparatus, decoder switching method, method program, program storage medium, and integrated circuit

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5642498A (en) * 1994-04-12 1997-06-24 Sony Corporation System for simultaneous display of multiple video windows on a display device
KR0137699B1 (ko) * 1994-12-24 1998-05-15 김광호 픽쳐인픽쳐를이용한화상전화기의화면처리회로및방법
US5671226A (en) * 1995-02-09 1997-09-23 Mitsubishi Denki Kabushiki Kaisha Multimedia information processing system
US5900908A (en) * 1995-03-02 1999-05-04 National Captioning Insitute, Inc. System and method for providing described television services
CN2269035Y (zh) 1996-09-23 1997-11-26 颜森辉 多输入多输出内含子母画面a/v多用途切换装置
US6067126A (en) * 1998-01-05 2000-05-23 Intel Corporation Method and apparatus for editing a video recording with audio selections
US6573905B1 (en) * 1999-11-09 2003-06-03 Broadcom Corporation Video and graphics system with parallel processing of graphics windows
US20020089602A1 (en) * 2000-10-18 2002-07-11 Sullivan Gary J. Compressed timing indicators for media samples
KR100374646B1 (ko) * 2001-03-10 2003-03-03 삼성전자주식회사 픽쳐 인 픽쳐 기능과 프레임 속도 변환을 동시에 수행하기위한 영상 처리 장치 및 방법
US6963612B2 (en) * 2001-08-31 2005-11-08 Stmicroelectronic, Inc. System for detecting start codes in MPEG video streams and method of operating the same
US20030112248A1 (en) * 2001-12-19 2003-06-19 Koninklijke Philips Electronics N.V. VGA quad device and apparatuses including same
US20040150748A1 (en) * 2003-01-31 2004-08-05 Qwest Communications International Inc. Systems and methods for providing and displaying picture-in-picture signals
US7453829B2 (en) * 2003-10-29 2008-11-18 Tut Systems, Inc. Method for conducting a video conference
KR100557135B1 (ko) 2004-04-13 2006-03-03 삼성전자주식회사 텔레비전 영상신호 수신 기능을 구비한 휴대용 단말기에서다수 채널 표시 및 채널변경 방법
US20050281417A1 (en) * 2004-06-18 2005-12-22 Gregory Toprover Media device
WO2006091740A2 (en) * 2005-02-24 2006-08-31 Humanizing Technologies, Inc. User-configuration multimedia presentation system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH07184174A (ja) * 1993-12-24 1995-07-21 Sharp Corp テレビ電話機能内蔵テレビジョン受像機
JP2005151446A (ja) * 2003-11-19 2005-06-09 Sharp Corp 携帯端末、クレードル、携帯端末の制御方法、クレードルの制御方法、制御プログラム、および該プログラムを記録した記録媒体
JP2005252774A (ja) * 2004-03-05 2005-09-15 Matsushita Electric Ind Co Ltd テレビ付き携帯電話機
JP2005269508A (ja) * 2004-03-22 2005-09-29 Casio Comput Co Ltd 通信端末装置および通信端末処理プログラム
WO2006003852A1 (en) * 2004-07-02 2006-01-12 Matsushita Electric Industrial Co., Ltd. Av stream reproducing apparatus, decoder switching method, method program, program storage medium, and integrated circuit

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2014023184A1 (zh) * 2012-08-09 2014-02-13 腾讯科技(深圳)有限公司 音视频点播方法、服务器、终端以及系统
CN110730362A (zh) * 2019-10-22 2020-01-24 中国农业科学院农业信息研究所 一种低流量视频通讯传输系统及方法

Also Published As

Publication number Publication date
CN100579196C (zh) 2010-01-06
US20070169156A1 (en) 2007-07-19
FR2896372A1 (fr) 2007-07-20
FR2896372B1 (fr) 2011-05-27
EP1976290A4 (en) 2009-12-02
US7973859B2 (en) 2011-07-05
CN100531382C (zh) 2009-08-19
EP1976290A1 (en) 2008-10-01
CN1859566A (zh) 2006-11-08
CN101156444A (zh) 2008-04-02

Similar Documents

Publication Publication Date Title
WO2007082433A1 (fr) Appareil, dispositif de réseau et procédé de transmission de signaux audio et vidéo
EP2154885B1 (en) A caption display method and a video communication control device
GB2319434A (en) Audio/video conferencing and telephony
WO2011050690A1 (zh) 用于录制和回播多媒体会议的方法和系統
WO2008113269A1 (fr) Procédé et dispositif pour réaliser une conversation privée dans une session multipoint
WO2010034254A1 (zh) 视频及音频处理方法、多点控制单元和视频会议系统
KR101096541B1 (ko) 영상 통화를 구현하기 위한 방법, 장치 및 시스템
CN102404547B (zh) 一种实现视频会议级联的方法及终端
US9088690B2 (en) Video conference system
WO2015127799A1 (zh) 协商媒体能力的方法和设备
WO2012175025A1 (zh) 远程呈现会议系统、远程呈现会议的录制与回放方法
JP2002058005A (ja) テレビ会議・テレビ電話システム、送信装置、受信装置、画像通信システム、通信装置、通信方法、記録媒体、プログラム
KR101585871B1 (ko) 이동통신 시스템에서 화이트 보드 서비스 제공을 위한 장치 및 방법
US20210218932A1 (en) Video conference server capable of providing video conference by using plurality of terminals for video conference, and method for removing audio echo therefor
CN106412646A (zh) 一种实现同步播放的方法和装置
US8515042B2 (en) Method for indicating call progress state, conference control device, and conference system
WO2010094213A1 (zh) 多路媒体流传输和接收的方法、装置及系统
CN101141615B (zh) 会议电视终端支持双流的外置实现方法
JP4572697B2 (ja) Ip電話機能に基づく呼接続中に映像コンテンツデータを再生する方法、端末及びプログラム
WO2014026478A1 (zh) 一种视频会议信号处理的方法、视频会议服务器及系统
CN101110946A (zh) 对会话初始化协议终端的音频和视频通信进行切换的方法
US20120075408A1 (en) Technique for providing in-built audio/video bridge on endpoints capable of video communication over ip
JP3178509B2 (ja) ステレオ音声テレビ会議装置
JP2009100378A (ja) テレビ電話機能付き携帯端末、画像転送方法及びプログラム
JP2006093883A (ja) テレビ通話装置

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 200680011737.7

Country of ref document: CN

NENP Non-entry into the national phase

Ref country code: DE

WWE Wipo information: entry into national phase

Ref document number: 2006804975

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2006804975

Country of ref document: EP