WO2014057809A1 - Motion video transmission system and method - Google Patents

Motion video transmission system and method Download PDF

Info

Publication number
WO2014057809A1
WO2014057809A1 PCT/JP2013/075965 JP2013075965W WO2014057809A1 WO 2014057809 A1 WO2014057809 A1 WO 2014057809A1 JP 2013075965 W JP2013075965 W JP 2013075965W WO 2014057809 A1 WO2014057809 A1 WO 2014057809A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
screen
block
drawing command
image data
Prior art date
Application number
PCT/JP2013/075965
Other languages
French (fr)
Japanese (ja)
Inventor
一範 小澤
Original Assignee
日本電気株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 日本電気株式会社 filed Critical 日本電気株式会社
Publication of WO2014057809A1 publication Critical patent/WO2014057809A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams, manipulating MPEG-4 scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/65Transmission of management data between client and server
    • H04N21/654Transmission by server directed to the client
    • H04N21/6543Transmission by server directed to the client for forcing some client operations, e.g. recording
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding

Definitions

  • the present invention relates to transmission of moving images over a data communication network.
  • a terminal such as a mobile terminal accesses a server remotely via a network such as a mobile network and operates a virtual client on the server
  • the screen of the virtual client is transmitted from the server to the terminal. It relates to the transmission of moving images.
  • Some systems of this type have a configuration in which a screen is always compressed and encoded using an image codec regardless of the screen status.
  • the amount of data per unit time transferred from the server to the terminal is approximately proportional to the screen size of the terminal. For this reason, when the network bandwidth is narrow relative to the screen size of the terminal, data transfer is likely to be delayed, resulting in delays in screen update on the terminal side and reaction delays on the terminal side. May occur.
  • the time variation of the network bandwidth is likely to increase and the above-described delay is likely to occur. .
  • a method is also known in which a screen drawing command for generating a screen at a terminal is transferred from the server without transferring the screen itself from the server.
  • this method when the screen movement is small, the amount of data transferred from the server to the terminal can be reduced compared to the former case, but when the screen is complicated and the movement is large, the data is transferred from the server to the terminal. Since the number of commands is greatly increased, the processing load on the terminal is greatly increased compared to the former in order to draw the screen on the terminal. For this reason, the processing time in a terminal increases and responsiveness falls. Also, the amount of power used by the terminal increases due to an increase in processing load at the terminal.
  • Patent Document 1 is cited as a document describing a technique related to the present invention.
  • Patent Document 1 describes a server that instructs another client device to transmit drawing information for updating the screen of one client device.
  • Patent Document 2 is cited as a document in which other techniques related to the present invention are described.
  • Japanese Patent Application Laid-Open No. 2004-228561 describes a method in which a screen is divided into a plurality of blocks, compression-coded for each block, and transmitted to a client device with priority from a block having a small image size after compression coding.
  • the present invention has been made in view of such a situation, and the problem to be solved by the present invention is based on the communication status of the network and the nature of the image to be transmitted when transmitting an image constituting a moving image. Accordingly, it is possible to perform transmission in an appropriate transmission form.
  • the present invention has, as one aspect thereof, a transmission-side device that transmits a moving image for each screen via a data communication network and a reception-side device that receives the screen via the data communication network.
  • the transmission side device is any one of means for dividing the screen into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block Selecting means, means for encoding a block into image data, means for determining a drawing command for drawing the block from the block, and receiving the image data and drawing command via the data communication network Means for transmitting to the side device, the receiving side device from the transmitting side device to the data communication network.
  • Means for discriminating the image data and drawing command received via the network means for decoding the image data and outputting as an image signal constituting part of the screen, and part of the screen according to the drawing command
  • a moving image transmission system comprising means for outputting an image signal of an image drawn.
  • the present invention provides, as another aspect, means for dividing one screen of a moving image into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block
  • a data communication apparatus comprising means for transmitting to another data communication apparatus.
  • the present invention corresponds to one of a plurality of blocks obtained by dividing one screen of a moving image, and an image received from the other data communication apparatus via the data communication network.
  • Means for discriminating data or a drawing command, means for decoding the image data and outputting it as an image signal constituting a part of the screen, and an image signal of an image obtained by drawing a part of the screen according to the drawing command Provided is a data communication device comprising a means for outputting.
  • the present invention provides, as another aspect, means for dividing one screen of a moving image into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block
  • a program for causing a computer to function as means for transmitting to another data communication apparatus is provided.
  • the transmitting side device divides one screen of the moving image into a plurality of blocks, and the transmitting side device uses the image data of the block for one of the plurality of blocks. Selecting any one of encoding to and determination of a drawing command for drawing the block, executing one of the above in response to the selection at the transmitting side device, the image data, and A step of transmitting any of the drawing commands from the transmission side device to the reception side device via the data communication network, and the reception side device according to the decoding of the image data and the drawing command A moving image transmission method comprising the step of generating an image signal constituting a part of the screen by performing any one of the drawn images.
  • one block of a screen when transmitted, it is transmitted as either image data or a drawing command, so that the block can be transmitted in a transmission form according to the situation.
  • FIG. 1 is a block diagram of a moving picture transmission system 1 according to the first embodiment of the present invention.
  • FIG. 2 is a block diagram of a thin client system 100 according to the second embodiment of the present invention.
  • FIG. 3 is a block diagram of the server device 110 of the thin client system 100.
  • FIG. 4 is a block diagram of the determination unit 185 of the server device 110.
  • FIG. 5 is a block diagram of the image encoding unit 186 of the server apparatus 110.
  • FIG. 6 is a block diagram of the portable terminal 170 of the thin client system 100.
  • the moving image transmission system 1 is a system that transmits a moving image for each screen from the transmission side device 3 to the reception side device 4 via the network 2.
  • the network 2 is a data communication network, and is particularly suitable for a network whose communication status changes according to time, such as a mobile communication network such as a mobile phone network as at least a part thereof.
  • the transmission side device 3 is a device that transmits a time-changing screen to the reception side device 4.
  • a server of a thin client system as in the second embodiment to be described later, that is, a server that executes a virtual client and provides a screen of the virtual client to the thin client is a preferable example of the transmission side device 3.
  • the transmission side device 3 includes a dividing unit 5, a determination unit 6, an image encoder 7, a drawing command encoder 8, and a transmission unit 9.
  • the dividing unit 5 divides a screen input from a moving image photographing function (not shown) into a plurality of areas, for example, m ⁇ n (m and n are natural numbers) areas.
  • each divided area is referred to as a block.
  • the discriminating unit 6 selects one of the image encoder 7 and the drawing command encoder 8 according to the block itself input from the dividing unit 5 and the communication status of the network 2, and passes the block to the selected encoder.
  • the image encoder 7 encodes the transferred block into image data of a predetermined format.
  • the image encoder 7 may encode the block according to any of a plurality of types of compression encoding methods.
  • the drawing command encoder 8 obtains a drawing command group necessary for drawing the input block, and encodes the obtained drawing command group according to a predetermined compression encoding method.
  • the transmission unit 9 transmits the block encoded by either the image encoder 7 or the drawing command encoder 8 to the reception side device 4 via the network 2.
  • the receiving side device 4 is a device that receives and displays a screen from the transmitting side device 3.
  • the terminal of the thin client system of the second embodiment to be described later that is, a terminal that receives a virtual client screen from a server that executes the virtual client, particularly a mobile communication terminal such as a mobile phone terminal, although it is a suitable example, it may be applied to other devices that receive and display a moving image from a network, such as an information processing device, a workstation, a server, or a personal computer having a data communication function, and a moving image display function.
  • the receiving side device 4 includes a receiving unit 10, an image decoder 11, a drawing command decoder 12, a screen display unit 13, and a screen drawing unit 14.
  • the receiving unit 10 determines whether the encoded data is encoded as image data of a predetermined format or the drawing command group, and the former In this case, the image is output to the image decoder 11, and in the latter case, the image is output to the drawing command decoder 12.
  • the image decoder 11 decodes the image data according to the encoding method used by the image encoder 7 for encoding, and displays the image data in a corresponding area on the screen of the screen display unit 13.
  • the corresponding area is an area occupied by the block in the screen before the screen including the block corresponding to the image data is input to the dividing unit 5.
  • the drawing command decoder 12 decodes the encoded drawing command group and outputs it.
  • the screen drawing unit 14 draws an image in a corresponding area on the screen of the screen display unit 13 in accordance with the input drawing command group.
  • the screen display unit 13 is a display device, and specifically, there are a liquid crystal display device, a plasma display, a cathode ray tube, and the like, but the type thereof is not limited. In a configuration in which a block is transmitted only as image data regardless of the block or communication status as in the past, for example, even a block that can be drawn with a simple drawing command group is transmitted as image data.
  • the amount of data increases, and the display on the screen display unit 13 may be delayed depending on the communication status of the network.
  • the block and communication status in the configuration in which the block is transmitted only as a drawing command, the number of commands is greatly increased when converting a block of a screen that is particularly complex and intensely moving into a drawing command. Not only does the amount of data to be transmitted increase, but also the processing time required to execute all the increased drawing commands in the screen drawing unit 14 becomes longer.
  • the block is encoded using either the image encoder 7 or the drawing command encoder 8 according to the communication status of the network 2 and the block itself to be transmitted.
  • a thin client system 100 according to a second embodiment of the present invention will be described with reference to FIG.
  • a mobile network 150 is used as the network.
  • the structure in the case of using a SGSN / GGSNN apparatus as a packet transfer apparatus is shown.
  • the SGSN / GGSN device means a device in which an SGSN (Serving GPRS Support Node) device and a GGSN (Gateway GPRS Support Node) device are integrated.
  • SGSN Serving GPRS Support Node
  • GGSN Gateway GPRS Support Node
  • the thin client server device 110 is arranged on the cloud network 130 and the cloud network 130 and the mobile network 150 are connected.
  • the end user connects the mobile terminal 170 to the virtual client of the server apparatus 110 arranged in the cloud network 130 and operates the virtual client as if operating the real terminal.
  • the client software of the mobile terminal 170 transmits a packet storing the operation signal to the server device 110 via the base station 194, the RNC device 195, and the SGSN / GGSN device 190 on the mobile network 150.
  • the operation signal means a signal transmitted from the mobile terminal 170 to the server device 110 by an operation such as a key operation on the mobile terminal 170, a touch operation on the screen, character input, or scrolling.
  • the operation signal packet transmitted from the packet transmission unit in the client software installed in the mobile terminal 170 is transmitted to the cloud network 130 via the base station device 194, the RNC device 195, and the SGSN / GGSN device 190 on the mobile network 150.
  • the server device 110 receives the operation signal.
  • a well-known protocol can be used for sending the operation signal, but UDP / IP is used, but TCP / IP or the like can also be used.
  • FIG. 3 is a block diagram illustrating a configuration of the server device 110.
  • the operation signal packet receiving unit 182 receives a packet storing the operation signal from the mobile terminal 170 via the base station 194, the RNC device 195, and the SGSN / GGSN device 190.
  • the operation signal packet receiving unit 182 extracts an operation signal from the received operation signal UDP / IP packet and outputs the operation signal to the virtual client unit 211.
  • the virtual client unit 211 includes application software corresponding to various services, a control unit, a screen generation unit, a cache memory, and the like. In addition, the application software can be easily updated from the outside of the server device 110.
  • the virtual client unit 211 analyzes the operation signal input from the operation signal packet receiving unit 182, activates the application software specified by the operation signal, and displays a screen drawn by the application software and the OS as a predetermined screen. The image is generated at the resolution and output to the screen capture unit 180. Further, a drawing command executed when generating and drawing the screen is output to the drawing command collecting unit 181.
  • the drawing command collection unit 181 collects the drawing command group output from the virtual client unit 211 for each screen, temporarily saves it for each screen, and outputs it to the determination unit 185.
  • the screen capture unit 180 captures and outputs a screen at a predetermined screen resolution and frame rate.
  • the dividing unit 184 divides the captured screen into a plurality of blocks having a predetermined size.
  • the block size is, for example, 16 pixels ⁇ 16 lines, but other sizes, for example, 8 pixels ⁇ 8 lines, 4 pixels ⁇ 4 lines, and the like can be used. The smaller the block size, the better the discrimination accuracy in the discriminator, but the processing amount increases.
  • the dividing unit 184 outputs the divided blocks to the determining unit 185.
  • the configuration of the determination unit 185 is shown in FIG.
  • the determination unit 185 determines whether to transfer the screen based on at least one of the feature amount obtained from the image signal of the screen and the network delay amount, or to transfer the drawing command. Determine either.
  • a motion vector is used as the image feature amount used by the determination unit 185.
  • other well-known feature amounts for example, the sum of absolute values of difference values between frames, the motion compensation prediction residual error, and the like are used. An absolute value sum can also be used.
  • the motion vector calculation unit 201 performs, for example, D in the following Expression 1 for each block.
  • k Vector V to minimize k (Dx, dy) is calculated.
  • f n, k (Xi, Yj), f n-1, k (Xi, Yj) represents a pixel that enters the kth block of the nth frame and a pixel that enters the kth block of the (n-1) th frame.
  • the motion vector calculation unit 201 obtains the magnitude and direction of the motion vector for each block by the following formulas 2 and 3, and outputs these to the discrimination selection unit 202.
  • V k Indicates the magnitude of the motion vector in the kth block
  • ⁇ k Indicates the angle (direction) of the motion vector in the k-th block.
  • the discrimination / selection unit 202 applies V to a plurality of consecutive blocks.
  • the moving image area is shaped so as to be a rectangular area, and the area range includes the number of pixels in the horizontal direction and the number of lines in the vertical direction of the rectangular area, and the blocks included in the area. Number and block size.
  • the network bandwidth estimation unit 203 periodically inputs information necessary for network delay estimation from the first packet transmission / reception unit 176 of FIG.
  • Necessary information includes, for example, the data size transmitted from the server, the time transmitted from the server, the time received by the terminal, and the like. Based on these pieces of information, for example, the network band estimated value B is periodically calculated based on the following equations 4 and 5.
  • D (j) indicates the data size of the jth packet transmitted from the first packet transmitting / receiving unit 176 to the mobile terminal 170
  • R (j) is the jth packet received by the mobile terminal 170. Is the time of reception.
  • the band estimation value W calculated by Expression 4 is temporally smoothed using Expression 5 below.
  • B (n) represents a network band estimation value after smoothing at the nth time
  • is a constant in a range of 0 ⁇ ⁇ 1.
  • the network bandwidth estimation unit 203 periodically calculates B (n) and outputs it to the discrimination selection unit 202.
  • the discrimination / selection unit 202 obtains the ratio ⁇ between the area of the moving image area and the area of the entire screen by the following Expression 6.
  • the discrimination / selection unit 202 inputs the network band estimation value ⁇ from the network band estimation unit 203. If ⁇ exceeds a predetermined threshold value Th1, it is determined that the screen is transferred, the discrimination flag F is set to 0, and the discrimination flag, the moving picture area information, and the movement area information are sent to the image encoding unit 186 in FIG. Alternatively, information on the static area is output.
  • Th1 a predetermined threshold value
  • Th2 it is determined that the command is transferred, the determination flag F is set to 1, and the determination flag F and the drawing command group are set as shown in FIG. Output to the drawing command encode 183.
  • the discrimination selection unit 202 outputs the discrimination flag F to the first packet transmission / reception unit 176.
  • the configuration of the image encoding unit 186 will be described with reference to FIG.
  • a first image encoder 227 inputs an image signal of a moving image area and a determination flag, and when the determination flag is 0, that is, in the case of screen transfer, a compression code using a predetermined moving image encoder is used for the image signal.
  • the compressed bit stream is output to the first packet transmitting / receiving unit 176 in FIG.
  • a predetermined moving image encoder H.264 is used.
  • H.264 is used, but other well-known moving image codecs, for example, MPEG-4 is also available for HEVC (High Efficiency Video Coding).
  • the region information of the moving image region is output to the first packet transmitting / receiving unit 176 in FIG.
  • the second image encoder unit 228 inputs a captured image from the image capture unit 180, inputs a determination flag from the determination unit 185, and when the determination flag is 0, other regions (for example, a moving region or a still region)
  • the range is input, and in the case of a still image, the image is compressed and encoded using a still image codec and output to the first packet transmitting / receiving unit 176 in FIG.
  • JPEG2000 is used as the still image codec, but other well-known codecs such as JPEG can also be used. Further, information on other areas is also output to the first packet transmission unit 176 in FIG.
  • a bit stream obtained by compression-encoding an image before movement with a still image codec and one representative motion vector are output to the first packet transmitting / receiving unit 176 in FIG. Further, information on other areas is also output to the first packet transmitting / receiving unit 176 in FIG.
  • the discrimination flag is 1, that is, when the drawing command is transferred
  • the drawing command encoding unit 183 inputs a drawing command group for each screen, performs lossless encoding of the drawing command group by a predetermined compression method, and results of compression encoding Is output to the first packet transmitting / receiving unit 176.
  • the predetermined compression method a known lossless encoding method such as a Zip compression method can be used.
  • the audio encoding unit 187 inputs an audio signal attached to the screen from the screen capture unit 180, performs compression encoding with the audio encoder, and performs the second packet transmission unit 177 in FIG. Output to.
  • MPEG-4 AAC is used as the audio encoder, but other known audio encoders can also be used.
  • the first packet transmission / reception unit 176 inputs flag information from the determination unit. When the flag information is 0, the first packet transmission / reception unit 176 inputs the compression encoded bitstream and region information from the image encoding unit 186.
  • a compression-encoded drawing command is input from the drawing command encoding unit 183, and flag information, area information, a compression-encoded bit stream, and a compression-encoded drawing command are stored in the payload portion of the packet.
  • a packet according to a predetermined protocol is constructed and sent to the SGSN / GGSN apparatus 190 in FIG.
  • a well-known protocol such as RTP / UDP / IP, UDP / IP, TCP / IP, or the like can be used as a predetermined protocol, but UDP / IP is used as an example.
  • the area information can be stored in an unused area of the RTP header part or the UDP header part.
  • the first packet transmission / reception unit 176 extracts information necessary for delay amount calculation from the response signal received from the mobile terminal 170 and outputs the information to the network bandwidth estimation unit 203 in FIG.
  • the second packet transmission unit 177 stores the compression-coded bit stream for the audio signal in the payload of the packet, constructs a packet according to a predetermined protocol, and transmits the packet to the SGSN / GGSN device 190.
  • a well-known protocol such as RTP / UDP / IP, UDP / IP, TCP / IP, or the like can be used as a predetermined protocol, but UDP / IP is used as an example.
  • the SGSN / GGSN device 190 tunnels the packet received from the server device 110 using the GTP-U protocol and transfers the packet to the RNC device 195.
  • the RNC device 195 transmits the packet to the mobile terminal 170 wirelessly through the base station device 194.
  • client software for sending to the mobile terminal 170 an operation signal when the user operates the terminal to the server, receiving packets from the server, and decoding and displaying the compressed encoded stream is displayed. 171 is mounted.
  • the configuration of the client software 171 is shown in FIG.
  • the first packet transmission / reception / selection unit 250 receives a packet, and extracts a discrimination flag, compression-encoded bitstream and area information, and compression-encoded drawing command information stored in the packet.
  • the size of the moving image area and the compression-encoded bit stream encoded by the first image encoder are selected and extracted as moving image area information, and the first image decoder 252 Output.
  • the other region information and the compression-encoded bit stream encoded by the other encoder in the region are output to the second image decoder 253.
  • the first image decoder 252 receives the moving image area information and the compression-encoded stream, decodes the compression-encoded stream, and outputs the decoded stream to the screen display unit 256.
  • the moving image area information is also output to the screen display unit 256.
  • the first image decoder for example, H.264 is used.
  • the H.264 decoder is used, but other well-known image decoders such as an MPEG-4 decoder can also be used. However, it goes without saying that the same type as the first image encoder 227 in the server is used.
  • the second image decoder 253 receives the other region information and the compression-encoded stream, decodes the compression-encoded stream for the other region, and outputs the decoded image to the screen display unit 256. Further, other area information is output to the screen display unit 256.
  • the screen display unit 256 receives the first area information and the image signal in the first area from the first image decoder 252, and receives the other area information and the image signal in the other area from the second image decoder 253.
  • the output image from the first image decoder 252 is displayed in the first area using the first area information, and the other area information is used to display the output image from the second image decoder 253 in the other area.
  • a display screen is generated by combining the image signals of the respective regions and output.
  • the first packet transmission / reception / selection unit 250 selects and extracts the compression-coded drawing command information and outputs it to the coder 259 with the drawing command.
  • the drawing command decoder 259 performs, for example, Zip compression decoding and outputs a drawing command group to the screen drawing unit 260 for each screen.
  • the screen drawing unit 260 inputs a drawing command group for each screen, draws and generates a screen, and outputs the generated screen to the screen display unit 256. Further, the first packet transmission / reception / selection unit 250 sends a response signal packet for the received packet to the network. Information on data size, reception time, and transmission time is described in the response signal packet.
  • the second packet receiving unit 251 receives the packet, extracts a compression-coded bit stream related to audio stored in the packet, and outputs the compressed bit stream to the audio decoder 255.
  • the audio decoder 255 receives and decodes the compression-coded stream, and outputs it in synchronization with the screen portion.
  • the operation signal generation unit 257 detects an operation input by the user to the mobile terminal 170, for example, screen touch, screen scroll, icon touch, character input, and the like, generates an operation signal for each, and sends the operation signal to the packet transmission unit 258. Output.
  • the packet transmission unit 258 receives an operation signal, stores it in a packet according to a predetermined protocol, and transmits it to the network.
  • TCP / IP, UDP / IP, etc. can be used as a predetermined protocol.
  • an image feature amount is obtained for a screen generated on the server side, and at least one of the feature amount and a network bandwidth estimation value or a network delay amount is obtained. Based on one, it is determined whether to transfer the screen or drawing command, and these are switched and transferred, so it is realized by balancing the amount of data, the amount of network delay, and the load on the terminal As a result, it is possible to prevent a significant increase in data acid, a significant delay in response time, and a significant increase in processing load at the terminal. While the present invention has been described with reference to the embodiment, the present invention is not limited to this. There may be three or more types of screen areas determined by the determination / selection unit 202.
  • a feature quantity other than a motion vector can be used as an image feature quantity used for region determination, or a plurality of types of feature quantities can be used in combination.
  • Other known methods can be used as the bandwidth estimation method in the network bandwidth estimation unit 203.
  • the delay amount can be estimated, and the delay selection unit 202 can use the delay amount estimation value to determine whether the screen transfer or the drawing command transfer.
  • Other known methods can be used as the determination method in the determination / selection unit 202.
  • the mobile network 150 can be a mobile LTE / EPC network, or can be a WiMax network or a WiFi network. Further, it can be a fixed network, an NGN network, or an Internet network.
  • the server device is arranged in the cloud network, but can be arranged in the Internet network.
  • the server device can be arranged in the company network.
  • the server device 110 can be arranged on the mobile network 150, a fixed network, or an NGN network.
  • a transmission-side device that transmits a moving image for each screen via a data communication network, and a reception-side device that receives the screen via the data communication network
  • the transmitting device is: Means for dividing the screen into a plurality of blocks; Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block; Means for encoding the block into image data; Means for determining a drawing command for drawing the block from the block; and Means for transmitting the image data and the drawing command to the receiving side device via the data communication network;
  • the receiving side device Means for discriminating the image data and drawing command received from the transmission side device via the data communication network; Means for decoding the image data and outputting as an image signal constituting a part of the screen; and Means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command;
  • a video transmission system characterized by the above.
  • the transmitting device operates as the server of a thin client system including a server that executes a virtual client and provides a screen of a virtual client to a client terminal, and a client terminal that displays a screen provided from the server.
  • the receiving device operates as the client terminal;
  • the data communication network includes a mobile communication network
  • the moving image transmission system according to supplementary note 1, wherein: (Appendix 3) The system according to any one of Supplementary Note 1 and Supplementary Note 2, wherein the means for selecting one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. .
  • the transmission side device includes a plurality of image encoders having different encoding methods as means for encoding the image data
  • the receiving-side apparatus includes a plurality of image decoders corresponding to the encoding methods of the plurality of image encoders as means for decoding the image data.
  • the transmission-side apparatus includes, as means for encoding the image data, a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area
  • the receiving-side apparatus includes, as means for decoding the image data, a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area.
  • Appendix 7 A way to divide one screen of a video into multiple blocks, Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block; Means for encoding the block into image data; Means for determining a drawing command for drawing the block from the block; and A data communication apparatus comprising means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
  • the computer operates as the server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server,
  • the other data communication device operates as the client terminal
  • the data communication network includes a mobile communication network
  • the data communication device according to appendix 7, wherein (Appendix 9) The data according to any one of appendix 7 and appendix 8, wherein the means for selecting one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. Communication device. (Appendix 10) 10.
  • the data communication apparatus according to any one of appendix 7 to appendix 9, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network.
  • Appendix 11 11.
  • Appendix 12 Additional means 11 for encoding the image data functions as a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area.
  • (Appendix 13) Means corresponding to one of a plurality of blocks formed by dividing one screen of a moving image, and means for discriminating image data or a drawing command received from the other data communication device via a data communication network; Means for decoding the image data and outputting as an image signal constituting a part of the screen; and Means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command; A data communication device.
  • the other data communication device is a server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server.
  • the data communication device operates as the client terminal;
  • the data communication network includes a mobile communication network 14.
  • (Appendix 16) (Supplementary note 15)
  • the means for decoding the image data includes a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area.
  • Appendix 17 A way to divide one screen of a video into multiple blocks, Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block; Means for encoding the block into image data; Means for determining a drawing command for drawing the block from the block; and A program for causing a computer to function as means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
  • the computer operates as the server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server,
  • the other data communication device operates as the client terminal,
  • the data communication network includes a mobile communication network
  • Appendix 20 The program according to any one of appendix 17 to appendix 19, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network.
  • Appendix 21 The program according to any one of appendix 17 to appendix 20, wherein the means for encoding the image data functions as a plurality of image encoders having different encoding methods.
  • Appendix 22 The appendix 21 is characterized in that the means for encoding the image data functions as a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area. The listed program.
  • (Appendix 23) Means corresponding to one of a plurality of blocks formed by dividing one screen of a moving image, and means for discriminating image data or a drawing command received from the other data communication device via a data communication network; Means for decoding the image data and outputting as an image signal constituting a part of the screen; and A program for causing a computer to function as means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command.
  • the other data communication device is a server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server.
  • the data communication device operates as the client terminal;
  • the data communication network includes a mobile communication network
  • (Appendix 25) 25 The program according to any one of Supplementary Note 23 and Supplementary Note 24, wherein the means for decoding the image data functions as a plurality of image decoders corresponding to a plurality of image encoder encoding methods.
  • the listed program wherein the means for decoding the image data functions as a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area
  • (Appendix 27) Dividing one screen of the video into a plurality of blocks at the transmission side device; In the transmitting device, for one of the plurality of blocks, selecting one of encoding of the block into image data and determination of a drawing command for drawing the block; Performing any one of the above in response to the selection at the transmitting device; Transmitting either the image data or the drawing command from the transmitting device to the receiving device via the data communication network; and A step of generating an image signal constituting a part of the screen by performing any one of decoding of the image data and drawing of an image according to the drawing command in the receiving side device.
  • a moving picture transmission method comprising: (Appendix 28)
  • the transmitting device operates as the server of a thin client system including a server that executes a virtual client and provides a screen of a virtual client to a client terminal, and a client terminal that displays a screen provided from the server.
  • the receiving device operates as the client terminal;
  • the data communication network includes a mobile communication network
  • the method according to appendix 27, wherein: (Appendix 29) The step of selecting either one of the encoding to the image data and the determination of the drawing command is performed according to one or more image feature amounts obtained from the block.
  • the method according to any one. (Appendix 30) 30.
  • the method of. (Appendix 31)
  • the image data is encoded using one image encoder selected from a plurality of image encoders having different encoding methods,
  • the decoding of the image data is performed using an image decoder corresponding to the encoding method of the selected one image encoder. 31.

Abstract

A transmission-side device segments a screen into a plurality of blocks, and transmits as either image data which encodes the block or a render command which renders the block. With a receiving-side device, an image signal of a block is generated by rendering according to either the decoding of the image data or the render command.

Description

動画伝送システム及び方法Video transmission system and method
 本発明はデータ通信ネットワークでの動画の伝送に関する。本発明は、特に、携帯端末などの端末がモバイルネットワークなどのネットワークを介してリモートでサーバにアクセスしサーバ上の仮想クライアントを操作する、シンクライアントシステムにおいて、仮想クライアントの画面をサーバから端末に伝送する際の動画の伝送に関する。 The present invention relates to transmission of moving images over a data communication network. In particular, in a thin client system in which a terminal such as a mobile terminal accesses a server remotely via a network such as a mobile network and operates a virtual client on the server, the screen of the virtual client is transmitted from the server to the terminal. It relates to the transmission of moving images.
 近年、特に企業などでは、高度なセキュリティを確保するために、シンクライアントの使用が普及し始めている。これは、端末から、あたかも実端末を操作するようにサーバ上の仮想クライアントを操作し仮想端末上でアプリケーションを動作させて画面情報を生成し当該画面情報を端末に転送し端末で表示する技術であり、端末にはデータを一切残さないため、端末を紛失しても秘密情報や企業情報などが外部に漏れる恐れがないという利点がある。
 サーバ側でアプリケーションを動作させて画面を生成し、当該画面を圧縮して端末に転送し、端末で画面を復号して表示する方式に基づくシンクライアントシステムがある。この種のシステムの中には、画面の状況にかかわらず、常に画像コーデックを用いて画面を圧縮符号化して転送する構成を有するものがある。こうした構成では、サーバから端末に転送する単位時間当たりのデータ量は、概ね端末の画面サイズに比例したものとなる。このため、端末の画面サイズに対してネットワークの帯域幅が狭い場合、データの転送に遅延が発生しやすくなり、その結果、端末側での画面更新の遅延や、端末側での反応の遅延が発生する恐れがある。特に、携帯電話機をはじめとする移動通信端末のように、無線区間を含むネットワークを介して端末とサーバを接続するシステムでは、ネットワークの帯域幅の時間変動が大きくなりやすく上述の遅延が発生しやすい。
 一方、サーバから画面そのものを転送せずに、端末で当該画面を生成するための画面描画コマンドを、サーバから転送する方法も知られている。本方法の場合、画面の動きが少ない場合は、サーバから端末に転送するデータ量を前者の場合に比べ削減することができるが、画面が複雑で動きが大きい場合は、サーバから端末に転送するコマンドが大幅に増えるため、端末で当該画面を描画させるために端末での処理負荷が前者に比べ大幅に増大する。このため、端末での処理時間が増大し応答性が低下する。また、端末での処理負荷の増大により端末の電力使用量が増大する。
 本発明に関連する技術が記載された文献として特許文献1を挙げる。特許文献1には、一のクライアント装置の画面を更新するための描画情報の送信を、他のクライアント装置に対して指示するサーバが記載されている。
 また、本発明に関連する他の技術が記載された文献として特許文献2を挙げる。特許文献2には、画面を複数のブロックに分割し、ブロック毎に圧縮符号化し、圧縮符号化後のブロックの画像サイズが小さいものから優先的にクライアント装置に送信するものが記載されている。
In recent years, the use of thin clients has begun to spread in order to ensure a high level of security, particularly in companies. This is a technology that operates a virtual client on the server as if operating a real terminal from a terminal, operates an application on the virtual terminal to generate screen information, transfers the screen information to the terminal, and displays it on the terminal. In addition, since no data is left in the terminal, there is an advantage that even if the terminal is lost, there is no possibility that secret information or company information is leaked to the outside.
There is a thin client system based on a system in which an application is operated on the server side to generate a screen, the screen is compressed and transferred to a terminal, and the screen is decoded and displayed on the terminal. Some systems of this type have a configuration in which a screen is always compressed and encoded using an image codec regardless of the screen status. In such a configuration, the amount of data per unit time transferred from the server to the terminal is approximately proportional to the screen size of the terminal. For this reason, when the network bandwidth is narrow relative to the screen size of the terminal, data transfer is likely to be delayed, resulting in delays in screen update on the terminal side and reaction delays on the terminal side. May occur. In particular, in a system in which a terminal and a server are connected via a network including a wireless zone, such as a mobile communication terminal such as a mobile phone, the time variation of the network bandwidth is likely to increase and the above-described delay is likely to occur. .
On the other hand, a method is also known in which a screen drawing command for generating a screen at a terminal is transferred from the server without transferring the screen itself from the server. In the case of this method, when the screen movement is small, the amount of data transferred from the server to the terminal can be reduced compared to the former case, but when the screen is complicated and the movement is large, the data is transferred from the server to the terminal. Since the number of commands is greatly increased, the processing load on the terminal is greatly increased compared to the former in order to draw the screen on the terminal. For this reason, the processing time in a terminal increases and responsiveness falls. Also, the amount of power used by the terminal increases due to an increase in processing load at the terminal.
Patent Document 1 is cited as a document describing a technique related to the present invention. Patent Document 1 describes a server that instructs another client device to transmit drawing information for updating the screen of one client device.
Patent Document 2 is cited as a document in which other techniques related to the present invention are described. Japanese Patent Application Laid-Open No. 2004-228561 describes a method in which a screen is divided into a plurality of blocks, compression-coded for each block, and transmitted to a client device with priority from a block having a small image size after compression coding.
特開2012−079222号公報JP 2012-079222 A 特開2008−211639号公報JP 2008-211639 A
 本発明はこのような状況に鑑みてなされたものであり、本発明が解決しようとする課題は、動画を構成する画像を伝送する際、ネットワークの通信状況や、送信しようとしている画像の性質に応じて適切な伝送形態での伝送を可能とすることである。 The present invention has been made in view of such a situation, and the problem to be solved by the present invention is based on the communication status of the network and the nature of the image to be transmitted when transmitting an image constituting a moving image. Accordingly, it is possible to perform transmission in an appropriate transmission form.
 上述の課題を解決するため、本発明は、その一態様として、データ通信ネットワークを介して動画を画面毎に送信する送信側装置と、前記データ通信ネットワークを介して前記画面を受信する受信側装置とを備え、前記送信側装置は、前記画面を複数のブロックに分割する手段、前記複数のブロックそれぞれについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、ブロックを画像データに符号化する手段、ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、前記画像データ及び描画コマンドを、前記データ通信ネットワークを介して前記受信側装置に送信する手段を備え、前記受信側装置は、前記送信側装置から前記データ通信ネットワークを介して受信した前記画像データ及び描画コマンドを判別する手段、前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備えることを特徴とする、動画伝送システムを提供する。
 また、本発明は、他の一態様として、動画の一画面を複数のブロックに分割する手段、前記複数のブロックそれぞれについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、ブロックを画像データに符号化する手段、ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段を備えるデータ通信装置を提供する。
 また、本発明は、他の一態様として、動画の一画面を分割してなる複数のブロックのひとつに対応するものであって、当該他のデータ通信装置からデータ通信ネットワークを介して受信した画像データまたは描画コマンドを判別する手段、前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備えることを特徴とするデータ通信装置を提供する。
 また、本発明は、他の一態様として、動画の一画面を複数のブロックに分割する手段、前記複数のブロックそれぞれについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、ブロックを画像データに符号化する手段、ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段としてコンピュータを機能させるためのプログラムを提供する。
 また、本発明は、他の一態様として、送信側装置にて動画の一画面を複数のブロックに分割する段階、前記送信側装置にて、前記複数のブロックのひとつについて、当該ブロックの画像データへの符号化、及び、当該ブロックを描画する描画コマンドの決定のいずれか一方を選択する段階、前記送信側装置にて前記選択に応じて前記いずれかの一方を実行する段階、前記画像データ及び描画コマンドのいずれかを、前記データ通信ネットワークを介して、送信側装置から受信側装置に送信する段階、及び、前記受信側装置にて、前記画像データの復号化、及び、前記描画コマンドに応じた画像の描画のいずれか一方を行なうことにより、前記画面の一部を構成する画像信号を生成する段階を含むことを特徴とする動画伝送方法を提供する。
In order to solve the above-described problems, the present invention has, as one aspect thereof, a transmission-side device that transmits a moving image for each screen via a data communication network and a reception-side device that receives the screen via the data communication network. And the transmission side device is any one of means for dividing the screen into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block Selecting means, means for encoding a block into image data, means for determining a drawing command for drawing the block from the block, and receiving the image data and drawing command via the data communication network Means for transmitting to the side device, the receiving side device from the transmitting side device to the data communication network. Means for discriminating the image data and drawing command received via the network, means for decoding the image data and outputting as an image signal constituting part of the screen, and part of the screen according to the drawing command There is provided a moving image transmission system comprising means for outputting an image signal of an image drawn.
Further, the present invention provides, as another aspect, means for dividing one screen of a moving image into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block A means for selecting any one of the above, a means for encoding the block into image data, a means for determining a drawing command for drawing the block from the block, and the image data and the drawing command via the data communication network. A data communication apparatus comprising means for transmitting to another data communication apparatus is provided.
In another aspect, the present invention corresponds to one of a plurality of blocks obtained by dividing one screen of a moving image, and an image received from the other data communication apparatus via the data communication network. Means for discriminating data or a drawing command, means for decoding the image data and outputting it as an image signal constituting a part of the screen, and an image signal of an image obtained by drawing a part of the screen according to the drawing command Provided is a data communication device comprising a means for outputting.
Further, the present invention provides, as another aspect, means for dividing one screen of a moving image into a plurality of blocks, image data obtained by encoding the block for each of the plurality of blocks, and a drawing command for drawing the block A means for selecting any one of the above, a means for encoding the block into image data, a means for determining a drawing command for drawing the block from the block, and the image data and the drawing command via the data communication network. A program for causing a computer to function as means for transmitting to another data communication apparatus is provided.
According to another aspect of the present invention, as a further aspect, the transmitting side device divides one screen of the moving image into a plurality of blocks, and the transmitting side device uses the image data of the block for one of the plurality of blocks. Selecting any one of encoding to and determination of a drawing command for drawing the block, executing one of the above in response to the selection at the transmitting side device, the image data, and A step of transmitting any of the drawing commands from the transmission side device to the reception side device via the data communication network, and the reception side device according to the decoding of the image data and the drawing command A moving image transmission method comprising the step of generating an image signal constituting a part of the screen by performing any one of the drawn images.
 本発明によれば、画面の一ブロックを伝送する際に、画像データ及び描画コマンドのいずれかとして伝送するので、状況に応じた伝送形態にてブロックを伝送することが可能となる。 According to the present invention, when one block of a screen is transmitted, it is transmitted as either image data or a drawing command, so that the block can be transmitted in a transmission form according to the situation.
 図1は本発明の第一の実施の形態である動画伝送システム1のブロック図である。
 図2は本発明の第二の実施の形態であるシンクライアントシステム100のブロック図である。
 図3はシンクライアントシステム100のサーバ装置110のブロック図である。
 図4はサーバ装置110の判別部185のブロック図である。
 図5はサーバ装置110の画像エンコード部186のブロック図である。
 図6はシンクライアントシステム100の携帯端末170のブロック図である。
FIG. 1 is a block diagram of a moving picture transmission system 1 according to the first embodiment of the present invention.
FIG. 2 is a block diagram of a thin client system 100 according to the second embodiment of the present invention.
FIG. 3 is a block diagram of the server device 110 of the thin client system 100.
FIG. 4 is a block diagram of the determination unit 185 of the server device 110.
FIG. 5 is a block diagram of the image encoding unit 186 of the server apparatus 110.
FIG. 6 is a block diagram of the portable terminal 170 of the thin client system 100.
 本発明の第一の実施の形態である動画伝送システム1について図1を参照して説明する。動画伝送システム1はネットワーク2を介して送信側装置3から受信側装置4に動画を一画面毎に送信するシステムである。ネットワーク2はデータ通信ネットワークであり、特に、その少なくとも一部として携帯電話ネットワーク等の移動体通信ネットワークのように、通信状況が時間に応じて変化するネットワークに特に好適である。
 送信側装置3は時間変化する画面を受信側装置4に送信する装置である。後述する第二の実施の形態のようなシンクライアントシステムのサーバ、即ち、仮想クライアントを実行し、シンクライアントに仮想クライアントの画面を提供するサーバが、送信側装置3の好適な例であるが、シンクライアントサーバ以外の手段により生成された画面の伝送に本発明を用いてもよい。例えば、動画撮影機能及びデータ通信機能を備える情報処理装置、より具体的には例えば動画撮影機能を備える携帯電話端末、無線通信機能を備えるデジタルビデオカメラであってもよい。
 送信側装置3は分割部5、判別部6、画像エンコーダ7、描画コマンドエンコーダ8、送信部9を備える。分割部5は不図示の動画撮影機能から入力された画面を複数の領域、例えばm×n(m、nは自然数)の領域に分割する。以下、分割した各領域をブロックと呼ぶものとする。判別部6は分割部5から入力されたブロック自身と、ネットワーク2の通信状況に応じて、画像エンコーダ7及び描画コマンドエンコーダ8のいずれか一方を選択して、そのブロックを選択したエンコーダに渡す。画像エンコーダ7は渡されたブロックを所定のフォーマットの画像データに符号化する。画像エンコーダ7は複数種類の圧縮符号化方式のいずれかに従ってブロックを符号化することとしてもよい。描画コマンドエンコーダ8は入力されたブロックを描画するために必要な描画コマンド群を求め、求めた描画コマンド群を予め定められた圧縮符号化方式に従って符号化する。送信部9は画像エンコーダ7及び描画コマンドエンコーダ8のいずれかによって符号化されたブロックを、ネットワーク2を介して受信側装置4に送信する。
 受信側装置4は送信側装置3から画面を受信して表示する装置である。後述する第二の実施の形態のシンクライアントシステムのクライアント、即ち、仮想クライアントを実行するサーバから、仮想クライアントの画面を受信する端末、特に携帯電話端末等の移動通信端末が、受信側装置4の好適な例であるが、ネットワークから動画を受信して表示する他の装置に適用してもよく、例えば、データ通信機能を備える情報処理装置、ワークステーション、サーバ、パーソナルコンピュータであり、動画表示機能を備える携帯電話端末その他の携帯情報処理装置であってもよい。
 受信側装置4は受信部10、画像デコーダ11、描画コマンドデコーダ12、画面表示部13、画面描画部14を備える。受信部10は、ネットワーク2を介して送信部9から符号化したブロックを受信すると、所定フォーマットの画像データとして符号化されたものか、描画コマンド群を符号化したものかを判別し、前者の場合は画像デコーダ11に出力し、後者の場合は描画コマンドデコーダ12に出力する。画像デコーダ11は画像エンコーダ7が符号化の際に用いた符号化方式に従って画像データを復号化し、その画像データを画面表示部13のスクリーン上の対応する領域に表示する。対応する領域とは、その画像データに対応するブロックを含む画面が分割部5に入力される前の時点で、そのブロックがその画面内で占めていた領域である。描画コマンドデコーダ12は符号化された描画コマンド群を復号して出力する。画面描画部14は、入力された描画コマンド群に従って、画面表示部13のスクリーン上の対応する領域に、画像を描画する。画面表示部13はディスプレイ装置であり、具体的には液晶ディスプレイ装置、プラズマディスプレイ、ブラウン管などがあるが、その種類を問わない。
 従来のように、ブロックや通信状況にかかわらず、ブロックを画像データとしてのみ送信する構成では、例えば単純な描画コマンド群で描画可能なブロックであっても画像データとして送信するので、ネットワークを伝送するデータ量が大きくなり、ネットワークの通信状況によっては画面表示部13での表示に遅延が生じる恐れがある。逆に、ブロックや通信状況にかかわらず、ブロックを描画コマンドとしてのみ送信する構成では、特に複雑で動きが激しい画面のブロックを描画コマンド化する際にコマンド数が大幅に増加して、ネットワーク2を伝送するデータ量が増えるだけではなく、更に、増加した描画コマンドすべてを画面描画部14にて実行するために要する処理時間も長時間化する。これに対して本実施の形態では、ネットワーク2の通信状況や送信しようとしているブロック自身に応じて、画像エンコーダ7、描画コマンドエンコーダ8のいずれかを用いてそのブロックをエンコードするので、ブロックと通信状況に適合した送信形態にてブロックを送信することが可能となり、画面表示部13での表示の遅延を抑制することができる。
 本発明の第二の実施の形態であるシンクライアントシステム100について図2を参照して説明する。本実施の形態では、ネットワークとして、モバイルネットワーク150を用いる例を示している。また、パケット転送装置としてSGSN/GGSNN装置を用いる場合の構成を示している。ここで、SGSN/GGSN装置とはSGSN(Serving GPRS Support Node)装置とGGSN(Gateway GPRS Support Node)装置を一体化した装置であることを意味する。また、図2では、一例として、シンクライアントのサーバ装置110はクラウド網130上に配置されており、クラウド網130とモバイルネットワーク150とが接続されている構成を示している。
 図2において、エンドユーザは携帯端末170を、クラウド網130に配置されたサーバ装置110の仮想クライアントに接続し、仮想クライアントをあたかも実端末を操作するように操作する。そのために、携帯端末170のクライアントソフトから、モバイルネットワーク150上の基地局194、RNC装置195、SGSN/GGSN装置190を経由して、サーバ装置110に対し、操作信号を格納したパケット送出する。ここで、操作信号は、携帯端末170でのキー操作、画面へのタッチ操作、文字入力、スクロールなどの操作により、携帯端末170からサーバ装置110に送出される信号を意味する。
 携帯端末170に搭載されたクライアントソフトにおけるパケット送信部から送出された操作信号パケットは、モバイルネットワーク150上にある、基地局装置194、RNC装置195、SGSN/GGSN装置190を介し、クラウド網130上のサーバ装置110に到達し、サーバ装置110が前記操作信号を受信する。ここで、操作信号を送出するときのプロトコルには周知のものを使用することができるが、UDP/IPを用いることにするが、TCP/IPなどを用いることもできる。
 図3は、サーバ装置110の構成を示すブロック図である。操作信号パケット受信部182は、携帯端末170から、操作信号を格納したパケットを、基地局194、RNC装置195、SGSN/GGSN装置190を経由して受信する。操作信号パケット受信部182は、受信した操作信号UDP/IPパケットから操作信号を抽出し、仮想クライアント部211に出力する。
 仮想クライアント部211は、各種サービスに対応したアプリケーションソフト、制御部、画面生成部、キャッシュメモリ、などを有している。また、サーバ装置110の外部から前記アプリケーションソフトの更新を容易に行える構成となっている。仮想クライアント部211は、操作信号パケット受信部182から入力した操作信号を解析し、前記操作信号で指定されたアプリケーションソフトを起動し、アプリケーションソフトならびにOSにより描画される画面を、あらかじめ定められた画面解像度で生成し、画面キャプチャ部180に出力する。さらに、画面を生成・描画する際に実行される描画コマンドを描画コマンド収集部181に出力する。
 描画コマンド収集部181は、仮想クライアント部211から出力される描画コマンド群を画面毎に収集し、画面毎に一旦保存した上で、判別部185に出力する。
 画面キャプチャ部180は、あらかじめ定められた画面解像度およびフレームレートで画面をキャプチャして出力する。
 分割部184は、キャプチャした画面をあらかじめ定められたサイズの複数個のブロックに分割する。ここでは、ブロックのサイズは、たとえば、16画素×16ラインとするが、他のサイズ、例えば8画素×8ラインや4画素×4ラインなどを用いることができる。ブロックのサイズが小さいほうが判別部での判別精度が向上するが処理量は増大する。分割部184は分割したブロックを判別部185に出力する。
 判別部185の構成を図4に示す。ここで、本実施の形態においては、判別部185は、画面の画像信号から求めた特徴量およびネットワークの遅延量のうちの少なくとも一つにもとづき画面を転送するか、描画コマンドを転送するかのいずれかを判別する。なお、判別部185で用いる画像特徴量として、ここでは、動きベクトルを使用することとするが、他の周知な特徴量、例えばフレーム間での差分値の絶対値和、動き補償予測残差の絶対値和を用いこともできる。
 図4において、動きベクトル算出部201は、ブロック毎に、例えば次の式1のDを最小にする動きベクトルV(dx,dy)を算出する。
Figure JPOXMLDOC01-appb-I000001
ここで、fn,k(Xi,Yj)、fn−1,k(Xi,Yj)は、n番目のフレームのk番目のブロックに入る画素、n−1番目のフレームのk番目のブロックに入る画素、をそれぞれ示す。
 次に、動きベクトル算出部201は、各ブロック毎に、動きベクトルの大きさ、方向を次の式2、式3により求め、これらを判別選択部202に出力する。
Figure JPOXMLDOC01-appb-I000002
ここで、Vはk番目のブロックでの動きベクトルの大きさを示し、θはk番目のブロックでの動きベクトルの角度(方向)を示す。
 次に、判別選択部202は、複数の連続するブロックに対しVとθを調べ、連続する複数のブロックで、Vがあらかじめ定められたしきい値を越えかつ、θがばらつく場合に、これらのブロックに対し、動画領域であると判断する。
 なお、連続する複数のブロックで、Vがしきい値を越えるがθがほぼ同じ角度を示す場合は、動画領域とは見なさず、画面スクロールなどによる移動領域と判断する。また、Vがしきい値を越えない場合は静止領域と判断する。ここで、動画領域としては、矩形の領域となるように整形したものとし、領域の範囲とは、当該矩形領域の水平方向の画素数と垂直方向のライン数ならびに、当該領域に含まれるブロックの番号とブロックのサイズとする。
 次に、ネットワーク帯域推定部203は、図2の第1のパケット送受信部176から、ネットワーク遅延推定に必要な情報を定期的に入力する。必要な情報としては、例えば、サーバから送信したデータサイズ、サーバから送信した時刻、端末で受信した時刻、などである。これらの情報をもとに、例えば、次の式4、式5にもとづき、ネットワーク帯域推定値Bを定期的に計算する。
Figure JPOXMLDOC01-appb-I000003
D(j)は、第1のパケット送受信部176から携帯端末170に向け送出した、j番目のパケットのデータサイズを示し、R(j)は、携帯端末170で前記j番目のパケットを受信したときの受信時刻である。これらの情報は携帯端末170から第1のパケット送受信部176に返信され、第1のパケット送受信部176が受信した情報である。
 次に、式4で計算した帯域推定値Wを次の式5を用いて時間的に平滑化する。
Figure JPOXMLDOC01-appb-I000004
ここで、B(n)は第n時刻の平滑化後の、ネットワーク帯域推定値を示し、βは0<β<1の範囲の定数である。ネットワーク帯域推定部203は、定期的にB(n)を算出し、判別選択部202に出力する。
 次に、判別選択部202は、動画領域の面積と画面全体の面積との比率γを次の式6により求める。
γ=動画領域の面積/画面全体の面積 … 式6
 次に、判別選択部202は、ネットワーク帯域推定部203から、ネットワーク帯域推定値τを入力する。γがあらかじめ定められたしきい値Th1を越える場合は、画面転送と判断し、判別フラグFを0として、図3の画像エンコード部186に対し、判別フラグと動画領域の情報ならびに移動領域の情報または静止領域の情報を出力する。
 また、γがTh1未満で、B(n)があらかじめ定められたしきい値Th2未満の場合は、コマンド転送と判断し、判別フラグFを1として、判別フラグFと描画コマンド群を図3の描画コマンドエンコード183に出力する。また、判別選択部202は判別フラグFを第1のパケット送受信部176に出力する。
 画像エンコード部186の構成を図5を用いて説明する。図5において、第1の画像エンコーダ227は、動画領域の画像信号と判別フラグを入力し、判別フラグが0、すなわち画面転送の場合、画像信号に対しあらかじめ定められた動画エンコーダを用いて圧縮符号化した上で圧縮後のビットストリームを図3の第1のパケット送受信部176に出力する。ここでは、あらかじめ定められた動画エンコーダとして、H.264を用いることとするが、他の周知な動画コーデック、例えばMPEG−4はHEVC(High Efficieny Video Coding)など、を用いることもできる。さらに、動画領域の領域情報を図3の第1のパケット送受信部176に出力する。
 第2の画像エンコーダ部228は、画像キャプチャ部180からキャプチャ画像を入力し、判別部185から判別フラグを入力し、判別フラグが0のとき、その他の領域(例えば、移動領域や静止領域)の範囲を入力し、静止画の場合は、静止画コーデックを用いて画像を圧縮符号化し図3の第1のパケット送受信部176に出力する。ここでは、静止画コーデックとしてJPEG2000を用いることとするが、他の周知なコーデック、たとえば、JPEGなどを用いることもできる。さらに、その他の領域の情報も図3の第1のパケット送信部176に出力する。
 一方、移動領域の場合は、移動前の画像を静止画コーデックで圧縮符号化したビットストリームと、代表的な動きベクトルを1種類、図3の第1のパケット送受信部176に出力する。さらに、その他の領域の情報も図3の第1のパケット送受信部176に出力する。
 描画コマンドエンコード部183は、判別フラグが1、すなわち描画コマンド転送の場合は、画面毎に描画コマンド群を入力し、あらかじめ定められた圧縮方式により描画コマンド群をロスレス符号化し、圧縮符号化した結果を第1のパケット送受信部176に出力する。ここで、あらかじめ定められた圧縮方式としては、Zip圧縮方式など、周知のロスレス符号化方式を用いることが出来る。
 次に、画面にオーディオが付随する場合は、オーディオエンコード部187は、画面キャプチャ部180から画面に付随するオーディオ信号を入力し、オーディオエンコーダにより圧縮符号化して図3の第2のパケット送信部177に出力する。ここでは、オーディオエンコーダとして、MPEG−4 AACを用いることとするが、他の周知なオーディオエンコーダを用いることもできる。
 図3にもどって、第1のパケット送受信部176は、判別部からフラグ情報を入力し、フラグ情報が0のときは画像エンコード部186から圧縮符号化ビットストリームと領域情報を入力し、フラグ情報が1のときは描画コマンドエンコード部183から圧縮符号化された描画コマンドを入力し、フラグ情報、領域情報、圧縮符号化されたビットストリーム、圧縮符号化された描画コマンドをパケットのペイロード部に格納し、あらかじめ定められたプロトコルによるパケットを構築して、図2のSGSN/GGSN装置190に送出する。ここでは、あらかじめ定められたプロトコルとして、周知のプロトコル、RTP/UDP/IP、UDP/IP、TCP/IPなど、を用いることができるが、一例として、UDP/IPを用いることとする。なお、領域の情報はRTPヘッダ部やUDPヘッダ部の未使用領域に格納することもできる。
 さらに、第1のパケット送受信部176は、携帯端末170から受信した応答信号から遅延量計算に必要な情報を抽出して図4のネットワーク帯域推定部203に出力する。
 第2のパケット送信部177は、オーディオ信号に対する圧縮符号化ビットストリームをパケットのペイロードに格納しあらかじめ定められたプロトコルによるパケットを構築して、SGSN/GGSN装置190に送出する。ここでは、あらかじめ定められたプロトコルとして、周知のプロトコル、RTP/UDP/IP、UDP/IP、TCP/IPなど、を用いることができるが、一例として、UDP/IPを用いることとする。
 SGSN/GGSN装置190は、サーバ装置110から受信したパケットを、GTP−UプロトコルによりトネリングしてRNC装置195に転送し、RNC装置195は基地局装置194を通して無線により携帯端末170に送出する。
 本発明では、携帯端末170には、ユーザが端末を操作したときの操作信号をサーバに送出しするとともに、サーバからのパケットを受信し圧縮符号化ストリームをデコードして表示させるための、クライアントソフトウェア171を搭載する。クライアントソフトウェア171の構成を図6に示す。
 図6において、第1のパケット送受信/選択部250は、パケットを受信し、パケットに格納された、判別フラグ、圧縮符号化ビットストリームと領域の情報、圧縮符号化された描画コマンド情報を取り出す。判別フラグが0、つまり画面転送を示す場合は、動画領域情報として動画領域のサイズならびに、第1の画像エンコーダにてエンコードされた圧縮符号化ビットストリームを選択抽出し、第1の画像デコーダ252に出力する。また、その他の領域情報ならびに、当該領域においてその他のエンコーダにてエンコードされた圧縮符号化ビットストリームを第2の画像デコーダ253に出力する。
 第1の画像デコーダ252は、動画領域情報ならびに圧縮符号化されたストリームを入力し、圧縮符号化されたストリームをデコードして画面表示部256に出力する。さらに動画領域情報も画面表示部256に出力する。ここでは、第1の画像デコーダとしては、例えば、H.264デコーダを用いることとするが、他の周知な画像デコーダ、例えば、MPEG−4デコーダなど、を用いることもできる。ただし、サーバでの第1の画像エンコーダ227と同一種類を用いるのは言うまでもない。
 第2の画像デコーダ253は、その他の領域情報ならびに圧縮符号化されたストリームを入力し、その他の領域に対して圧縮符号化されたストリームをデコードして画面表示部256に出力する。さらにその他の領域情報を画面表示部256に出力する。
 画面表示部256は、第1の画像デコーダ252から、第1の領域情報と第1の領域における画像信号を入力し、第2の画像デコーダ253から、その他の領域情報とその他の領域における画像信号を入力し、第1の領域情報を用いて第1の領域に第1の画像デコーダ252からの出力画像を表示させ、その他の領域情報を用いてその他の領域に第2の画像デコーダ253からの出力画像を表示させることにより、各々の領域の画像信号を組み合わせて表示画面を生成し、出力する。
 次に、第1のパケット送受信/選択部250は、判別フラグが1の場合、つまり描画コマンド転送の場合は、圧縮符号号化された描画コマンド情報を選択抽出して描画コマンドでコーダ259に出力する。
 描画コマンドデコーダ259は、例えばZip圧縮復号して画面毎に描画コマンド群を画面描画部260に出力する。
 画面描画部260は、画面毎に描画コマンド群を入力し画面を描画生成し当該生成した画面を画面表示部256に出力する。
 さらに、第1のパケット送受信/選択部250は、受信したパケットに対する応答信号パケットをネットワークに送出する。応答信号パケットには、データサイズ、受信時刻、送信時刻の情報を記載しておく。
 第2のパケット受信部251は、パケットを受信し、パケットに格納されたオーディオに関する圧縮符号化ビットストリームを取り出し、オーディオデコーダ255に出力する。
 オーディオデコーダ255は、前記圧縮符号化ストリームを入力しデコードした上で、画面部分と同期をとって出力する。ここでオーディオデコーダとしては、例えばMPEG−4 AACを使うことができるが、他の周知なオーディオデコーダを使用してもよい。ただし、サーバ装置でのオーディオエンコーダと同一種類を用いることは言うまでもない。
 操作信号生成部257は、ユーザが携帯端末170に対し入力した操作、例えば、画面タッチ、画面スクロール、アイコンタッチ、文字入力、などを検出し、各々に対し操作信号を生成しパケット送信部258に出力する。
 パケット送信部258は、操作信号を入力し、あらかじめ定められたプロトコルによるパケットに格納し、ネットワークに送出する。ここで、あらかじめ定められたプロトコルとしては、TCP/IP、UDP/IPなどを用いることができる。
 本実施の形態によれば、ネットワークを介してシンクライアントを使う場合、サーバ側で生成した画面に対し画像特徴量を求め、前記特徴量およびネットワークの帯域推定値またはネットワークによる遅延量のうちの少なくとも一つにもとづき、画面を転送するか描画コマンドを転送するかのいずれかを判別しこれらを切り替えて転送するので、データ量、ネットワーク遅延量、端末での負荷量のバランスをとって実現することが可能となるという効果があり、これにより、データ酸の大幅な増加、応答時間の大幅な遅延、端末での処理負荷の大幅な増加を防ぐにとができるという効果がある。
 以上、本発明を実施の形態に即して説明したが、本発明はこれに限定されるものではない。
 判別選択部202で判別する画面の領域の種類は3種類以上とすることもできる。また、領域の判別に用いる画像特徴量には、動きベクトル以外の特徴量を使用することもできるし、複数種類の特徴量を組み合わせて使用することもできる。
 ネットワーク帯域推定部203での帯域推定の手法は他の周知な手法を用いることも出来る。また、帯域推定のかわりに遅延量を推定し判別選択部202で前記遅延量推定値を画面転送か描画コマンド転送かの判別に用いることも出来る。
 判別選択部202での判別の手法は、他の周知な手法を用いることも出来る。
 図2で、モバイルネットワーク150は、モバイルLTE/EPCネットワークとすることもできるし、WiMax網やWiFi網とすることもできる。さらに、固定網やNGN網やインターネット網とすることもできる。ただしこれらの場合は、携帯端末ではなく固定端末やPCからの接続となる。
 図2では、サーバ装置をクラウド網に配置したが、インターネット網に配置することもできる。また、企業にシンクライアントのサーバを置く場合はサーバ装置を企業網に配置することもできる。また、別の構成として、通信事業者自らがシンクライアントのサーバを置く場合は、モバイルネットワーク150上や固定網上やNGN網上に、サーバ装置110を配置することもできる。
 上記の実施形態の一部又は全部は以下の付記のようにも記載されうるが、これらに限定されるものではない。
(付記1)
 データ通信ネットワークを介して動画を画面毎に送信する送信側装置と、前記データ通信ネットワークを介して前記画面を受信する受信側装置とを備え、
 前記送信側装置は、
 前記画面を複数のブロックに分割する手段、
 前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
 ブロックを画像データに符号化する手段、
 ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
 前記画像データ及び描画コマンドを、前記データ通信ネットワークを介して前記受信側装置に送信する手段を備え、
 前記受信側装置は、
 前記送信側装置から前記データ通信ネットワークを介して受信した前記画像データ及び描画コマンドを判別する手段、
 前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、
 前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備える
ことを特徴とする、動画伝送システム。
(付記2)
 前記送信側装置は、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記受信側装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記1に記載の動画伝送システム。
(付記3)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、ブロックから求めた一乃至複数の画像特徴量に応じて選択を行なうことを特徴とする付記1及び付記2のいずれかに記載のシステム。
(付記4)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、前記データ通信ネットワークの通信状況に応じて選択を行なうことを特徴とする付記1乃至付記3のいずれかに記載のシステム。
(付記5)
 前記送信側装置は、前記画像データを符号化する手段として、符号化方式が互いに異なる複数の画像エンコーダを備え、
 前記受信側装置は、前記画像データを復号化する手段として、前記複数の画像エンコーダの符号化方式に対応する複数の画像デコーダを備える
ことを特徴とする付記1乃至付記4のいずれかに記載のシステム。
(付記6)
 前記送信側装置は、前記画像データを符号化する手段として、動画領域を符号化するための第1の画像エンコーダと、非動画領域を符号化するための第2の画像エンコーダとを備え、
 前記受信側装置は、前記画像データを復号化する手段として、動画領域を復号化するための第1の画像エンコーダと、非動画領域を復号化するための第2の画像エンコーダとを備えることを特徴とする付記5に記載のシステム。
(付記7)
 動画の一画面を複数のブロックに分割する手段、
 前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
 ブロックを画像データに符号化する手段、
 ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
 前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段を備えるデータ通信装置。
(付記8)
 前記コンピュータは、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記他のデータ通信装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記7に記載のデータ通信装置。
(付記9)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、ブロックから求めた一乃至複数の画像特徴量に応じて選択を行なうことを特徴とする付記7及び付記8のいずれかに記載のデータ通信装置。
(付記10)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、前記データ通信ネットワークの通信状況に応じて選択を行なうことを特徴とする付記7乃至付記9のいずれかに記載のデータ通信装置。
(付記11)
 前記画像データを符号化する手段は、符号化方式が互いに異なる複数の画像エンコーダとして機能することを特徴とする付記7乃至付記10のいずれかに記載のデータ通信装置。
(付記12)
 前記画像データを符号化する手段は、動画領域を符号化するための第1の画像エンコーダと、非動画領域を符号化するための第2の画像エンコーダとして機能することを特徴とする付記11に記載のデータ通信装置。
(付記13)
 動画の一画面を分割してなる複数のブロックのひとつに対応するものであって、当該他のデータ通信装置からデータ通信ネットワークを介して受信した画像データまたは描画コマンドを判別する手段、
 前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、
 前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備える
ことを特徴とするデータ通信装置。
(付記14)
 前記他のデータ通信装置は、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記データ通信装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記13に記載のデータ通信装置。
(付記15)
 前記画像データを復号化する手段として、複数の画像エンコーダの符号化方式に対応する複数の画像デコーダを備えることを特徴とする付記13及び付記14のいずれかに記載のデータ通信装置。
(付記16)
 前記画像データを復号化する手段として、動画領域を復号化するための第1の画像エンコーダと、非動画領域を復号化するための第2の画像エンコーダとを備えることを特徴とする付記15に記載のデータ通信装置。
(付記17)
 動画の一画面を複数のブロックに分割する手段、
 前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
 ブロックを画像データに符号化する手段、
 ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
 前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段としてコンピュータを機能させるためのプログラム。
(付記18)
 前記コンピュータは、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記他のデータ通信装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記17に記載のプログラム。
(付記19)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、ブロックから求めた一乃至複数の画像特徴量に応じて選択を行なうことを特徴とする付記17及び付記18のいずれかに記載のプログラム。
(付記20)
 前記画像データ及び描画コマンドのいずれか一方を選択する手段は、前記データ通信ネットワークの通信状況に応じて選択を行なうことを特徴とする付記17乃至付記19のいずれかに記載のプログラム。
(付記21)
 前記画像データを符号化する手段は、符号化方式が互いに異なる複数の画像エンコーダとして機能することを特徴とする付記17乃至付記20のいずれかに記載のプログラム。
(付記22)
 前記画像データを符号化する手段は、動画領域を符号化するための第1の画像エンコーダと、非動画領域を符号化するための第2の画像エンコーダとして機能することを特徴とする付記21に記載のプログラム。
(付記23)
 動画の一画面を分割してなる複数のブロックのひとつに対応するものであって、当該他のデータ通信装置からデータ通信ネットワークを介して受信した画像データまたは描画コマンドを判別する手段、
 前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、
 前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段としてコンピュータを機能させるためのプログラム。
(付記24)
 前記他のデータ通信装置は、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記データ通信装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記23に記載のプログラム。
(付記25)
 前記画像データを復号化する手段を、複数の画像エンコーダの符号化方式に対応する複数の画像デコーダとして機能させることを特徴とする付記23及び付記24のいずれかに記載のプログラム。
(付記26)
 前記画像データを復号化する手段を、動画領域を復号化するための第1の画像エンコーダと、非動画領域を復号化するための第2の画像エンコーダとして機能させることを特徴とする付記25に記載のプログラム。
(付記27)
 送信側装置にて動画の一画面を複数のブロックに分割する段階、
 前記送信側装置にて、前記複数のブロックのひとつについて、当該ブロックの画像データへの符号化、及び、当該ブロックを描画する描画コマンドの決定のいずれか一方を選択する段階、
 前記送信側装置にて前記選択に応じて前記いずれかの一方を実行する段階、
 前記画像データ及び描画コマンドのいずれかを、前記データ通信ネットワークを介して、送信側装置から受信側装置に送信する段階、及び、
 前記受信側装置にて、前記画像データの復号化、及び、前記描画コマンドに応じた画像の描画のいずれか一方を行なうことにより、前記画面の一部を構成する画像信号を生成する段階
を含むことを特徴とする動画伝送方法。
(付記28)
 前記送信側装置は、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
 前記受信側装置は前記クライアント端末として動作し、
 前記データ通信ネットワークは移動体通信ネットワークを含む
ことを特徴とする付記27に記載の方法。
(付記29)
 前記画像データへの符号化及び描画コマンドの決定のいずれか一方を選択する段階は、ブロックから求めた一乃至複数の画像特徴量に応じて選択を行なうことを特徴とする付記27及び付記28のいずれかに記載の方法。
(付記30)
 前記画像データへの符号化及び描画コマンドの決定のいずれか一方を選択する段階は、前記データ通信ネットワークの通信状況に応じて選択を行なうことを特徴とする付記27乃至付記29のいずれかに記載の方法。
(付記31)
 前記画像データの符号化を、符号化方式が互いに異なる複数の画像エンコーダの中から選択した一の画像エンコーダを用いて行ない、
 前記画像データの復号化を、前記選択した一の画像エンコーダの符号化方式に対応する画像デコーダを用いて行なう
ことを特徴とする付記27乃至付記30のいずれかに記載の方法。
(付記32)
 前記画像データの符号化を、動画領域を符号化するための第1の画像エンコーダ、及び、非動画領域を符号化するための第2の画像エンコーダのいずれかを用いて行い、
 前記画像データの復号化を、動画領域を復号化するための第1の画像エンコーダと、非動画領域を復号化するための第2の画像エンコーダのいずれかを用いて行なうことを特徴とする付記31に記載の方法。
 この出願は、2012年10月12日に出願された日本出願特願第2012−226654号を基礎とする優先権を主張し、その開示のすべてをここに取り込むものである。
A moving picture transmission system 1 according to a first embodiment of the present invention will be described with reference to FIG. The moving image transmission system 1 is a system that transmits a moving image for each screen from the transmission side device 3 to the reception side device 4 via the network 2. The network 2 is a data communication network, and is particularly suitable for a network whose communication status changes according to time, such as a mobile communication network such as a mobile phone network as at least a part thereof.
The transmission side device 3 is a device that transmits a time-changing screen to the reception side device 4. A server of a thin client system as in the second embodiment to be described later, that is, a server that executes a virtual client and provides a screen of the virtual client to the thin client is a preferable example of the transmission side device 3. You may use this invention for transmission of the screen produced | generated by means other than a thin client server. For example, it may be an information processing apparatus having a moving image shooting function and a data communication function, more specifically, a mobile phone terminal having a moving image shooting function, or a digital video camera having a wireless communication function.
The transmission side device 3 includes a dividing unit 5, a determination unit 6, an image encoder 7, a drawing command encoder 8, and a transmission unit 9. The dividing unit 5 divides a screen input from a moving image photographing function (not shown) into a plurality of areas, for example, m × n (m and n are natural numbers) areas. Hereinafter, each divided area is referred to as a block. The discriminating unit 6 selects one of the image encoder 7 and the drawing command encoder 8 according to the block itself input from the dividing unit 5 and the communication status of the network 2, and passes the block to the selected encoder. The image encoder 7 encodes the transferred block into image data of a predetermined format. The image encoder 7 may encode the block according to any of a plurality of types of compression encoding methods. The drawing command encoder 8 obtains a drawing command group necessary for drawing the input block, and encodes the obtained drawing command group according to a predetermined compression encoding method. The transmission unit 9 transmits the block encoded by either the image encoder 7 or the drawing command encoder 8 to the reception side device 4 via the network 2.
The receiving side device 4 is a device that receives and displays a screen from the transmitting side device 3. The terminal of the thin client system of the second embodiment to be described later, that is, a terminal that receives a virtual client screen from a server that executes the virtual client, particularly a mobile communication terminal such as a mobile phone terminal, Although it is a suitable example, it may be applied to other devices that receive and display a moving image from a network, such as an information processing device, a workstation, a server, or a personal computer having a data communication function, and a moving image display function. It may be a mobile phone terminal or other mobile information processing device equipped with
The receiving side device 4 includes a receiving unit 10, an image decoder 11, a drawing command decoder 12, a screen display unit 13, and a screen drawing unit 14. When receiving the encoded block from the transmitting unit 9 via the network 2, the receiving unit 10 determines whether the encoded data is encoded as image data of a predetermined format or the drawing command group, and the former In this case, the image is output to the image decoder 11, and in the latter case, the image is output to the drawing command decoder 12. The image decoder 11 decodes the image data according to the encoding method used by the image encoder 7 for encoding, and displays the image data in a corresponding area on the screen of the screen display unit 13. The corresponding area is an area occupied by the block in the screen before the screen including the block corresponding to the image data is input to the dividing unit 5. The drawing command decoder 12 decodes the encoded drawing command group and outputs it. The screen drawing unit 14 draws an image in a corresponding area on the screen of the screen display unit 13 in accordance with the input drawing command group. The screen display unit 13 is a display device, and specifically, there are a liquid crystal display device, a plasma display, a cathode ray tube, and the like, but the type thereof is not limited.
In a configuration in which a block is transmitted only as image data regardless of the block or communication status as in the past, for example, even a block that can be drawn with a simple drawing command group is transmitted as image data. The amount of data increases, and the display on the screen display unit 13 may be delayed depending on the communication status of the network. On the other hand, regardless of the block and communication status, in the configuration in which the block is transmitted only as a drawing command, the number of commands is greatly increased when converting a block of a screen that is particularly complex and intensely moving into a drawing command. Not only does the amount of data to be transmitted increase, but also the processing time required to execute all the increased drawing commands in the screen drawing unit 14 becomes longer. On the other hand, in this embodiment, the block is encoded using either the image encoder 7 or the drawing command encoder 8 according to the communication status of the network 2 and the block itself to be transmitted. Blocks can be transmitted in a transmission form suitable for the situation, and display delay on the screen display unit 13 can be suppressed.
A thin client system 100 according to a second embodiment of the present invention will be described with reference to FIG. In this embodiment, an example is shown in which a mobile network 150 is used as the network. Moreover, the structure in the case of using a SGSN / GGSNN apparatus as a packet transfer apparatus is shown. Here, the SGSN / GGSN device means a device in which an SGSN (Serving GPRS Support Node) device and a GGSN (Gateway GPRS Support Node) device are integrated. In FIG. 2, as an example, the thin client server device 110 is arranged on the cloud network 130 and the cloud network 130 and the mobile network 150 are connected.
In FIG. 2, the end user connects the mobile terminal 170 to the virtual client of the server apparatus 110 arranged in the cloud network 130 and operates the virtual client as if operating the real terminal. For this purpose, the client software of the mobile terminal 170 transmits a packet storing the operation signal to the server device 110 via the base station 194, the RNC device 195, and the SGSN / GGSN device 190 on the mobile network 150. Here, the operation signal means a signal transmitted from the mobile terminal 170 to the server device 110 by an operation such as a key operation on the mobile terminal 170, a touch operation on the screen, character input, or scrolling.
The operation signal packet transmitted from the packet transmission unit in the client software installed in the mobile terminal 170 is transmitted to the cloud network 130 via the base station device 194, the RNC device 195, and the SGSN / GGSN device 190 on the mobile network 150. The server device 110 receives the operation signal. Here, a well-known protocol can be used for sending the operation signal, but UDP / IP is used, but TCP / IP or the like can also be used.
FIG. 3 is a block diagram illustrating a configuration of the server device 110. The operation signal packet receiving unit 182 receives a packet storing the operation signal from the mobile terminal 170 via the base station 194, the RNC device 195, and the SGSN / GGSN device 190. The operation signal packet receiving unit 182 extracts an operation signal from the received operation signal UDP / IP packet and outputs the operation signal to the virtual client unit 211.
The virtual client unit 211 includes application software corresponding to various services, a control unit, a screen generation unit, a cache memory, and the like. In addition, the application software can be easily updated from the outside of the server device 110. The virtual client unit 211 analyzes the operation signal input from the operation signal packet receiving unit 182, activates the application software specified by the operation signal, and displays a screen drawn by the application software and the OS as a predetermined screen. The image is generated at the resolution and output to the screen capture unit 180. Further, a drawing command executed when generating and drawing the screen is output to the drawing command collecting unit 181.
The drawing command collection unit 181 collects the drawing command group output from the virtual client unit 211 for each screen, temporarily saves it for each screen, and outputs it to the determination unit 185.
The screen capture unit 180 captures and outputs a screen at a predetermined screen resolution and frame rate.
The dividing unit 184 divides the captured screen into a plurality of blocks having a predetermined size. Here, the block size is, for example, 16 pixels × 16 lines, but other sizes, for example, 8 pixels × 8 lines, 4 pixels × 4 lines, and the like can be used. The smaller the block size, the better the discrimination accuracy in the discriminator, but the processing amount increases. The dividing unit 184 outputs the divided blocks to the determining unit 185.
The configuration of the determination unit 185 is shown in FIG. Here, in this embodiment, the determination unit 185 determines whether to transfer the screen based on at least one of the feature amount obtained from the image signal of the screen and the network delay amount, or to transfer the drawing command. Determine either. Here, a motion vector is used as the image feature amount used by the determination unit 185. However, other well-known feature amounts, for example, the sum of absolute values of difference values between frames, the motion compensation prediction residual error, and the like are used. An absolute value sum can also be used.
In FIG. 4, the motion vector calculation unit 201 performs, for example, D in the following Expression 1 for each block. k Vector V to minimize k (Dx, dy) is calculated.
Figure JPOXMLDOC01-appb-I000001
Where f n, k (Xi, Yj), f n-1, k (Xi, Yj) represents a pixel that enters the kth block of the nth frame and a pixel that enters the kth block of the (n-1) th frame.
Next, the motion vector calculation unit 201 obtains the magnitude and direction of the motion vector for each block by the following formulas 2 and 3, and outputs these to the discrimination selection unit 202.
Figure JPOXMLDOC01-appb-I000002
Where V k Indicates the magnitude of the motion vector in the kth block, and θ k Indicates the angle (direction) of the motion vector in the k-th block.
Next, the discrimination / selection unit 202 applies V to a plurality of consecutive blocks. k And θ k And in a plurality of consecutive blocks, V k Exceeds a predetermined threshold value and θ k If there is variation, it is determined that these blocks are moving image areas.
In addition, in a plurality of consecutive blocks, V k Exceeds the threshold but θ k Are substantially the same angle, it is not regarded as a moving image area, but is determined as a moving area by screen scrolling or the like. Also, V k If the threshold does not exceed the threshold value, it is determined as a still area. Here, the moving image area is shaped so as to be a rectangular area, and the area range includes the number of pixels in the horizontal direction and the number of lines in the vertical direction of the rectangular area, and the blocks included in the area. Number and block size.
Next, the network bandwidth estimation unit 203 periodically inputs information necessary for network delay estimation from the first packet transmission / reception unit 176 of FIG. Necessary information includes, for example, the data size transmitted from the server, the time transmitted from the server, the time received by the terminal, and the like. Based on these pieces of information, for example, the network band estimated value B is periodically calculated based on the following equations 4 and 5.
Figure JPOXMLDOC01-appb-I000003
D (j) indicates the data size of the jth packet transmitted from the first packet transmitting / receiving unit 176 to the mobile terminal 170, and R (j) is the jth packet received by the mobile terminal 170. Is the time of reception. These pieces of information are information that is returned from the portable terminal 170 to the first packet transmission / reception unit 176 and received by the first packet transmission / reception unit 176.
Next, the band estimation value W calculated by Expression 4 is temporally smoothed using Expression 5 below.
Figure JPOXMLDOC01-appb-I000004
Here, B (n) represents a network band estimation value after smoothing at the nth time, and β is a constant in a range of 0 <β <1. The network bandwidth estimation unit 203 periodically calculates B (n) and outputs it to the discrimination selection unit 202.
Next, the discrimination / selection unit 202 obtains the ratio γ between the area of the moving image area and the area of the entire screen by the following Expression 6.
γ = Area of moving image area / area of entire screen Equation 6
Next, the discrimination / selection unit 202 inputs the network band estimation value τ from the network band estimation unit 203. If γ exceeds a predetermined threshold value Th1, it is determined that the screen is transferred, the discrimination flag F is set to 0, and the discrimination flag, the moving picture area information, and the movement area information are sent to the image encoding unit 186 in FIG. Alternatively, information on the static area is output.
When γ is less than Th1 and B (n) is less than a predetermined threshold Th2, it is determined that the command is transferred, the determination flag F is set to 1, and the determination flag F and the drawing command group are set as shown in FIG. Output to the drawing command encode 183. Further, the discrimination selection unit 202 outputs the discrimination flag F to the first packet transmission / reception unit 176.
The configuration of the image encoding unit 186 will be described with reference to FIG. In FIG. 5, a first image encoder 227 inputs an image signal of a moving image area and a determination flag, and when the determination flag is 0, that is, in the case of screen transfer, a compression code using a predetermined moving image encoder is used for the image signal. The compressed bit stream is output to the first packet transmitting / receiving unit 176 in FIG. Here, as a predetermined moving image encoder, H.264 is used. H.264 is used, but other well-known moving image codecs, for example, MPEG-4 is also available for HEVC (High Efficiency Video Coding). Further, the region information of the moving image region is output to the first packet transmitting / receiving unit 176 in FIG.
The second image encoder unit 228 inputs a captured image from the image capture unit 180, inputs a determination flag from the determination unit 185, and when the determination flag is 0, other regions (for example, a moving region or a still region) The range is input, and in the case of a still image, the image is compressed and encoded using a still image codec and output to the first packet transmitting / receiving unit 176 in FIG. Here, JPEG2000 is used as the still image codec, but other well-known codecs such as JPEG can also be used. Further, information on other areas is also output to the first packet transmission unit 176 in FIG.
On the other hand, in the case of a moving area, a bit stream obtained by compression-encoding an image before movement with a still image codec and one representative motion vector are output to the first packet transmitting / receiving unit 176 in FIG. Further, information on other areas is also output to the first packet transmitting / receiving unit 176 in FIG.
When the discrimination flag is 1, that is, when the drawing command is transferred, the drawing command encoding unit 183 inputs a drawing command group for each screen, performs lossless encoding of the drawing command group by a predetermined compression method, and results of compression encoding Is output to the first packet transmitting / receiving unit 176. Here, as the predetermined compression method, a known lossless encoding method such as a Zip compression method can be used.
Next, when audio is attached to the screen, the audio encoding unit 187 inputs an audio signal attached to the screen from the screen capture unit 180, performs compression encoding with the audio encoder, and performs the second packet transmission unit 177 in FIG. Output to. Here, MPEG-4 AAC is used as the audio encoder, but other known audio encoders can also be used.
Returning to FIG. 3, the first packet transmission / reception unit 176 inputs flag information from the determination unit. When the flag information is 0, the first packet transmission / reception unit 176 inputs the compression encoded bitstream and region information from the image encoding unit 186. When 1 is 1, a compression-encoded drawing command is input from the drawing command encoding unit 183, and flag information, area information, a compression-encoded bit stream, and a compression-encoded drawing command are stored in the payload portion of the packet. Then, a packet according to a predetermined protocol is constructed and sent to the SGSN / GGSN apparatus 190 in FIG. Here, a well-known protocol such as RTP / UDP / IP, UDP / IP, TCP / IP, or the like can be used as a predetermined protocol, but UDP / IP is used as an example. The area information can be stored in an unused area of the RTP header part or the UDP header part.
Further, the first packet transmission / reception unit 176 extracts information necessary for delay amount calculation from the response signal received from the mobile terminal 170 and outputs the information to the network bandwidth estimation unit 203 in FIG.
The second packet transmission unit 177 stores the compression-coded bit stream for the audio signal in the payload of the packet, constructs a packet according to a predetermined protocol, and transmits the packet to the SGSN / GGSN device 190. Here, a well-known protocol such as RTP / UDP / IP, UDP / IP, TCP / IP, or the like can be used as a predetermined protocol, but UDP / IP is used as an example.
The SGSN / GGSN device 190 tunnels the packet received from the server device 110 using the GTP-U protocol and transfers the packet to the RNC device 195. The RNC device 195 transmits the packet to the mobile terminal 170 wirelessly through the base station device 194.
In the present invention, client software for sending to the mobile terminal 170 an operation signal when the user operates the terminal to the server, receiving packets from the server, and decoding and displaying the compressed encoded stream is displayed. 171 is mounted. The configuration of the client software 171 is shown in FIG.
In FIG. 6, the first packet transmission / reception / selection unit 250 receives a packet, and extracts a discrimination flag, compression-encoded bitstream and area information, and compression-encoded drawing command information stored in the packet. When the determination flag is 0, that is, when screen transfer is indicated, the size of the moving image area and the compression-encoded bit stream encoded by the first image encoder are selected and extracted as moving image area information, and the first image decoder 252 Output. In addition, the other region information and the compression-encoded bit stream encoded by the other encoder in the region are output to the second image decoder 253.
The first image decoder 252 receives the moving image area information and the compression-encoded stream, decodes the compression-encoded stream, and outputs the decoded stream to the screen display unit 256. Furthermore, the moving image area information is also output to the screen display unit 256. Here, as the first image decoder, for example, H.264 is used. The H.264 decoder is used, but other well-known image decoders such as an MPEG-4 decoder can also be used. However, it goes without saying that the same type as the first image encoder 227 in the server is used.
The second image decoder 253 receives the other region information and the compression-encoded stream, decodes the compression-encoded stream for the other region, and outputs the decoded image to the screen display unit 256. Further, other area information is output to the screen display unit 256.
The screen display unit 256 receives the first area information and the image signal in the first area from the first image decoder 252, and receives the other area information and the image signal in the other area from the second image decoder 253. , The output image from the first image decoder 252 is displayed in the first area using the first area information, and the other area information is used to display the output image from the second image decoder 253 in the other area. By displaying the output image, a display screen is generated by combining the image signals of the respective regions and output.
Next, when the determination flag is 1, that is, when the drawing command is transferred, the first packet transmission / reception / selection unit 250 selects and extracts the compression-coded drawing command information and outputs it to the coder 259 with the drawing command. To do.
The drawing command decoder 259 performs, for example, Zip compression decoding and outputs a drawing command group to the screen drawing unit 260 for each screen.
The screen drawing unit 260 inputs a drawing command group for each screen, draws and generates a screen, and outputs the generated screen to the screen display unit 256.
Further, the first packet transmission / reception / selection unit 250 sends a response signal packet for the received packet to the network. Information on data size, reception time, and transmission time is described in the response signal packet.
The second packet receiving unit 251 receives the packet, extracts a compression-coded bit stream related to audio stored in the packet, and outputs the compressed bit stream to the audio decoder 255.
The audio decoder 255 receives and decodes the compression-coded stream, and outputs it in synchronization with the screen portion. Here, for example, MPEG-4 AAC can be used as the audio decoder, but other known audio decoders may be used. However, it goes without saying that the same type as the audio encoder in the server device is used.
The operation signal generation unit 257 detects an operation input by the user to the mobile terminal 170, for example, screen touch, screen scroll, icon touch, character input, and the like, generates an operation signal for each, and sends the operation signal to the packet transmission unit 258. Output.
The packet transmission unit 258 receives an operation signal, stores it in a packet according to a predetermined protocol, and transmits it to the network. Here, TCP / IP, UDP / IP, etc. can be used as a predetermined protocol.
According to the present embodiment, when a thin client is used via a network, an image feature amount is obtained for a screen generated on the server side, and at least one of the feature amount and a network bandwidth estimation value or a network delay amount is obtained. Based on one, it is determined whether to transfer the screen or drawing command, and these are switched and transferred, so it is realized by balancing the amount of data, the amount of network delay, and the load on the terminal As a result, it is possible to prevent a significant increase in data acid, a significant delay in response time, and a significant increase in processing load at the terminal.
While the present invention has been described with reference to the embodiment, the present invention is not limited to this.
There may be three or more types of screen areas determined by the determination / selection unit 202. In addition, a feature quantity other than a motion vector can be used as an image feature quantity used for region determination, or a plurality of types of feature quantities can be used in combination.
Other known methods can be used as the bandwidth estimation method in the network bandwidth estimation unit 203. Further, instead of band estimation, the delay amount can be estimated, and the delay selection unit 202 can use the delay amount estimation value to determine whether the screen transfer or the drawing command transfer.
Other known methods can be used as the determination method in the determination / selection unit 202.
In FIG. 2, the mobile network 150 can be a mobile LTE / EPC network, or can be a WiMax network or a WiFi network. Further, it can be a fixed network, an NGN network, or an Internet network. However, in these cases, connection is made from a fixed terminal or a PC instead of a portable terminal.
In FIG. 2, the server device is arranged in the cloud network, but can be arranged in the Internet network. In addition, when a thin client server is installed in a company, the server device can be arranged in the company network. As another configuration, when the telecommunications carrier itself places a thin client server, the server device 110 can be arranged on the mobile network 150, a fixed network, or an NGN network.
A part or all of the above embodiments may be described as in the following supplementary notes, but is not limited thereto.
(Appendix 1)
A transmission-side device that transmits a moving image for each screen via a data communication network, and a reception-side device that receives the screen via the data communication network,
The transmitting device is:
Means for dividing the screen into a plurality of blocks;
Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
Means for encoding the block into image data;
Means for determining a drawing command for drawing the block from the block; and
Means for transmitting the image data and the drawing command to the receiving side device via the data communication network;
The receiving side device
Means for discriminating the image data and drawing command received from the transmission side device via the data communication network;
Means for decoding the image data and outputting as an image signal constituting a part of the screen; and
Means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command;
A video transmission system characterized by the above.
(Appendix 2)
The transmitting device operates as the server of a thin client system including a server that executes a virtual client and provides a screen of a virtual client to a client terminal, and a client terminal that displays a screen provided from the server. ,
The receiving device operates as the client terminal;
The data communication network includes a mobile communication network
The moving image transmission system according to supplementary note 1, wherein:
(Appendix 3)
The system according to any one of Supplementary Note 1 and Supplementary Note 2, wherein the means for selecting one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. .
(Appendix 4)
The system according to any one of appendix 1 to appendix 3, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network.
(Appendix 5)
The transmission side device includes a plurality of image encoders having different encoding methods as means for encoding the image data,
The receiving-side apparatus includes a plurality of image decoders corresponding to the encoding methods of the plurality of image encoders as means for decoding the image data.
The system according to any one of appendix 1 to appendix 4, wherein
(Appendix 6)
The transmission-side apparatus includes, as means for encoding the image data, a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area,
The receiving-side apparatus includes, as means for decoding the image data, a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area. The system according to appendix 5, which is characterized.
(Appendix 7)
A way to divide one screen of a video into multiple blocks,
Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
Means for encoding the block into image data;
Means for determining a drawing command for drawing the block from the block; and
A data communication apparatus comprising means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
(Appendix 8)
The computer operates as the server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server,
The other data communication device operates as the client terminal,
The data communication network includes a mobile communication network
The data communication device according to appendix 7, wherein
(Appendix 9)
The data according to any one of appendix 7 and appendix 8, wherein the means for selecting one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. Communication device.
(Appendix 10)
10. The data communication apparatus according to any one of appendix 7 to appendix 9, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network.
(Appendix 11)
11. The data communication apparatus according to any one of appendix 7 to appendix 10, wherein the means for encoding the image data functions as a plurality of image encoders having different encoding methods.
(Appendix 12)
Additional means 11 for encoding the image data functions as a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area. The data communication device described.
(Appendix 13)
Means corresponding to one of a plurality of blocks formed by dividing one screen of a moving image, and means for discriminating image data or a drawing command received from the other data communication device via a data communication network;
Means for decoding the image data and outputting as an image signal constituting a part of the screen; and
Means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command;
A data communication device.
(Appendix 14)
The other data communication device is a server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server. Work,
The data communication device operates as the client terminal;
The data communication network includes a mobile communication network
14. The data communication device according to appendix 13, wherein
(Appendix 15)
15. The data communication apparatus according to any one of supplementary note 13 and supplementary note 14, comprising a plurality of image decoders corresponding to a plurality of image encoder encoding methods as means for decoding the image data.
(Appendix 16)
(Supplementary note 15) The means for decoding the image data includes a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area. The data communication device described.
(Appendix 17)
A way to divide one screen of a video into multiple blocks,
Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
Means for encoding the block into image data;
Means for determining a drawing command for drawing the block from the block; and
A program for causing a computer to function as means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
(Appendix 18)
The computer operates as the server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server,
The other data communication device operates as the client terminal,
The data communication network includes a mobile communication network
The program according to appendix 17, characterized by:
(Appendix 19)
The program according to any one of appendix 17 and appendix 18, wherein the means for selecting one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. .
(Appendix 20)
The program according to any one of appendix 17 to appendix 19, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network.
(Appendix 21)
The program according to any one of appendix 17 to appendix 20, wherein the means for encoding the image data functions as a plurality of image encoders having different encoding methods.
(Appendix 22)
The appendix 21 is characterized in that the means for encoding the image data functions as a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area. The listed program.
(Appendix 23)
Means corresponding to one of a plurality of blocks formed by dividing one screen of a moving image, and means for discriminating image data or a drawing command received from the other data communication device via a data communication network;
Means for decoding the image data and outputting as an image signal constituting a part of the screen; and
A program for causing a computer to function as means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command.
(Appendix 24)
The other data communication device is a server of a thin client system including a server that executes a virtual client and provides a screen of the virtual client to the client terminal, and a client terminal that displays the screen provided from the server. Work,
The data communication device operates as the client terminal;
The data communication network includes a mobile communication network
The program according to appendix 23, which is characterized by the above.
(Appendix 25)
25. The program according to any one of Supplementary Note 23 and Supplementary Note 24, wherein the means for decoding the image data functions as a plurality of image decoders corresponding to a plurality of image encoder encoding methods.
(Appendix 26)
Appendix 25 wherein the means for decoding the image data functions as a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area The listed program.
(Appendix 27)
Dividing one screen of the video into a plurality of blocks at the transmission side device;
In the transmitting device, for one of the plurality of blocks, selecting one of encoding of the block into image data and determination of a drawing command for drawing the block;
Performing any one of the above in response to the selection at the transmitting device;
Transmitting either the image data or the drawing command from the transmitting device to the receiving device via the data communication network; and
A step of generating an image signal constituting a part of the screen by performing any one of decoding of the image data and drawing of an image according to the drawing command in the receiving side device.
A moving picture transmission method comprising:
(Appendix 28)
The transmitting device operates as the server of a thin client system including a server that executes a virtual client and provides a screen of a virtual client to a client terminal, and a client terminal that displays a screen provided from the server. ,
The receiving device operates as the client terminal;
The data communication network includes a mobile communication network
The method according to appendix 27, wherein:
(Appendix 29)
The step of selecting either one of the encoding to the image data and the determination of the drawing command is performed according to one or more image feature amounts obtained from the block. The method according to any one.
(Appendix 30)
30. The method according to any one of appendixes 27 to 29, wherein the step of selecting one of the encoding to the image data and the determination of the drawing command is performed according to a communication status of the data communication network. the method of.
(Appendix 31)
The image data is encoded using one image encoder selected from a plurality of image encoders having different encoding methods,
The decoding of the image data is performed using an image decoder corresponding to the encoding method of the selected one image encoder.
31. The method according to any one of appendix 27 to appendix 30, wherein
(Appendix 32)
The image data is encoded using either a first image encoder for encoding a moving image region and a second image encoder for encoding a non-moving region,
The decoding of the image data is performed using either a first image encoder for decoding a moving image area or a second image encoder for decoding a non-moving area. 31. The method according to 31.
This application claims the priority on the basis of Japanese application Japanese Patent Application No. 2012-226654 for which it applied on October 12, 2012, and takes in those the indications of all here.

Claims (10)

  1.  データ通信ネットワークを介して動画を画面毎に送信する送信側装置と、前記データ通信ネットワークを介して前記画面を受信する受信側装置とを備え、
     前記送信側装置は、
     前記画面を複数のブロックに分割する手段、
     前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
     ブロックを画像データに符号化する手段、
     ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
     前記画像データ及び描画コマンドを、前記データ通信ネットワークを介して前記受信側装置に送信する手段を備え、
     前記受信側装置は、
     前記送信側装置から前記データ通信ネットワークを介して受信した前記画像データ及び描画コマンドを判別する手段、
     前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、
     前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備える
    ことを特徴とする、動画伝送システム。
    A transmission-side device that transmits a moving image for each screen via a data communication network, and a reception-side device that receives the screen via the data communication network,
    The transmitting device is:
    Means for dividing the screen into a plurality of blocks;
    Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
    Means for encoding the block into image data;
    Means for determining a drawing command for drawing the block from the block; and
    Means for transmitting the image data and the drawing command to the receiving side device via the data communication network;
    The receiving side device
    Means for discriminating the image data and drawing command received from the transmission side device via the data communication network;
    Means for decoding the image data and outputting as an image signal constituting a part of the screen; and
    A moving picture transmission system comprising: means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command.
  2. 前記送信側装置は、仮想クライアントを実行してクライアント端末に対して仮想クライアントの画面を提供するサーバと、前記サーバから提供された画面を表示するクライアント端末を備えるシンクライアントシステムの前記サーバとして動作し、
     前記受信側装置は前記クライアント端末として動作し、
     前記データ通信ネットワークは移動体通信ネットワークを含む
    ことを特徴とする請求項1に記載の動画伝送システム。
    The transmitting device operates as the server of a thin client system including a server that executes a virtual client and provides a screen of a virtual client to a client terminal, and a client terminal that displays a screen provided from the server. ,
    The receiving device operates as the client terminal;
    The moving image transmission system according to claim 1, wherein the data communication network includes a mobile communication network.
  3.  前記画像データ及び描画コマンドのいずれか一方を選択する手段は、ブロックから求めた一乃至複数の画像特徴量に応じて選択を行なうことを特徴とする請求項1及び請求項2のいずれかに記載の動画伝送システム。 The means for selecting any one of the image data and the drawing command performs selection according to one or more image feature amounts obtained from the block. Video transmission system.
  4.  前記画像データ及び描画コマンドのいずれか一方を選択する手段は、前記データ通信ネットワークの通信状況に応じて選択を行なうことを特徴とする請求項1乃至請求項3のいずれかに記載の動画伝送システム。 4. The moving picture transmission system according to claim 1, wherein the means for selecting one of the image data and the drawing command performs selection according to a communication status of the data communication network. .
  5.  前記送信側装置は、前記画像データを符号化する手段として、符号化方式が互いに異なる複数の画像エンコーダを備え、
     前記受信側装置は、前記画像データを復号化する手段として、前記複数の画像エンコーダの符号化方式に対応する複数の画像デコーダを備える
    ことを特徴とする請求項1乃至請求項4のいずれかに記載の動画伝送システム。
    The transmission side device includes a plurality of image encoders having different encoding methods as means for encoding the image data,
    The receiving apparatus includes a plurality of image decoders corresponding to encoding methods of the plurality of image encoders as means for decoding the image data. The video transmission system described.
  6.  前記送信側装置は、前記画像データを符号化する手段として、動画領域を符号化するための第1の画像エンコーダと、非動画領域を符号化するための第2の画像エンコーダとを備え、
     前記受信側装置は、前記画像データを復号化する手段として、動画領域を復号化するための第1の画像エンコーダと、非動画領域を復号化するための第2の画像エンコーダとを備えることを特徴とする請求項5に記載の動画伝送システム。
    The transmission-side apparatus includes, as means for encoding the image data, a first image encoder for encoding a moving image area and a second image encoder for encoding a non-moving area,
    The receiving-side apparatus includes, as means for decoding the image data, a first image encoder for decoding a moving image area and a second image encoder for decoding a non-moving area. 6. The moving image transmission system according to claim 5, wherein
  7.  動画の一画面を複数のブロックに分割する手段、
     前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
     ブロックを画像データに符号化する手段、
     ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
     前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段を備えるデータ通信装置。
    A way to divide one screen of a video into multiple blocks,
    Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
    Means for encoding the block into image data;
    Means for determining a drawing command for drawing the block from the block; and
    A data communication apparatus comprising means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
  8.  動画の一画面を分割してなる複数のブロックのひとつに対応するものであって、当該他のデータ通信装置からデータ通信ネットワークを介して受信した画像データまたは描画コマンドを判別する手段、
     前記画像データを復号化し、前記画面の一部を構成する画像信号として出力する手段、及び、
     前記描画コマンドに従って前記画面の一部を描画した画像の画像信号を出力する手段を備える
    ことを特徴とするデータ通信装置。
    Means corresponding to one of a plurality of blocks formed by dividing one screen of a moving image, and means for discriminating image data or a drawing command received from the other data communication device via a data communication network;
    Means for decoding the image data and outputting as an image signal constituting a part of the screen; and
    A data communication apparatus comprising: means for outputting an image signal of an image obtained by drawing a part of the screen in accordance with the drawing command.
  9.  動画の一画面を複数のブロックに分割する手段、
     前記複数のブロックの少なくともひとつについて、当該ブロックを符号化した画像データ、及び、当該ブロックを描画する描画コマンドのいずれか一方を選択する手段、
     ブロックを画像データに符号化する手段、
     ブロックから当該ブロックを描画するための描画コマンドを決定する手段、及び、
     前記画像データ及び描画コマンドを、データ通信ネットワークを介して他のデータ通信装置に送信する手段としてコンピュータを機能させるためのプログラム。
    A way to divide one screen of a video into multiple blocks,
    Means for selecting at least one of the plurality of blocks, either image data obtained by encoding the block, or a drawing command for drawing the block;
    Means for encoding the block into image data;
    Means for determining a drawing command for drawing the block from the block; and
    A program for causing a computer to function as means for transmitting the image data and the drawing command to another data communication apparatus via a data communication network.
  10.  送信側装置にて動画の一画面を複数のブロックに分割する段階、
     前記送信側装置にて、前記複数のブロックのひとつについて、当該ブロックの画像データへの符号化、及び、当該ブロックを描画する描画コマンドの決定のいずれか一方を選択する段階、
     前記送信側装置にて前記選択に応じて前記いずれかの一方を実行する段階、
     前記画像データ及び描画コマンドのいずれかを、前記データ通信ネットワークを介して、送信側装置から受信側装置に送信する段階、及び、
     前記受信側装置にて、前記画像データの復号化、及び、前記描画コマンドに応じた画像の描画のいずれか一方を行なうことにより、前記画面の一部を構成する画像信号を生成する段階
    を含むことを特徴とする動画伝送方法。
    Dividing one screen of the video into a plurality of blocks at the transmission side device;
    In the transmitting device, for one of the plurality of blocks, selecting one of encoding of the block into image data and determination of a drawing command for drawing the block;
    Performing any one of the above in response to the selection at the transmitting device;
    Transmitting either the image data or the drawing command from the transmitting device to the receiving device via the data communication network; and
    Including a step of generating an image signal constituting a part of the screen by performing any one of decoding of the image data and drawing of an image according to the drawing command in the receiving-side apparatus. A video transmission method characterized by the above.
PCT/JP2013/075965 2012-10-12 2013-09-18 Motion video transmission system and method WO2014057809A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2012226654 2012-10-12
JP2012-226654 2012-10-12

Publications (1)

Publication Number Publication Date
WO2014057809A1 true WO2014057809A1 (en) 2014-04-17

Family

ID=50477280

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2013/075965 WO2014057809A1 (en) 2012-10-12 2013-09-18 Motion video transmission system and method

Country Status (1)

Country Link
WO (1) WO2014057809A1 (en)

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004312584A (en) * 2003-04-10 2004-11-04 Matsushita Electric Ind Co Ltd Image processing method and image display system
JP2010232891A (en) * 2009-03-26 2010-10-14 Nec Personal Products Co Ltd Server, remote control system, transmission scheme control method, program, and recording medium

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2004312584A (en) * 2003-04-10 2004-11-04 Matsushita Electric Ind Co Ltd Image processing method and image display system
JP2010232891A (en) * 2009-03-26 2010-10-14 Nec Personal Products Co Ltd Server, remote control system, transmission scheme control method, program, and recording medium

Similar Documents

Publication Publication Date Title
KR101443070B1 (en) Method and system for low-latency transfer protocol
US10756997B2 (en) Bandwidth adjustment for real-time video transmission
JP5983761B2 (en) Server device, terminal, thin client system, screen transmission method and program
US20170094301A1 (en) Initial Bandwidth Estimation For Real-time Video Transmission
WO2011142311A1 (en) Remote mobile communication system, server device and remote mobile communication system control method
US10516892B2 (en) Initial bandwidth estimation for real-time video transmission
CN104509048B (en) Server unit, communication system and communication means
US20170094296A1 (en) Bandwidth Adjustment For Real-time Video Transmission
WO2013030166A2 (en) Method for transmitting video signals from an application on a server over an ip network to a client device
JP5854247B2 (en) Image information transmission method and packet communication system
JPWO2012029648A1 (en) Server apparatus, video quality measurement system, video quality measurement method and program
WO2014057809A1 (en) Motion video transmission system and method
KR101632012B1 (en) Communication system, server apparatus, server apparatus controlling method and computer readable storage medium storing program
CN116134474A (en) Method and apparatus for performing rendering using delay-compensated gesture prediction with respect to three-dimensional media data in a communication system supporting mixed reality/augmented reality
WO2014042137A1 (en) Communication system and method, and server device and terminal
KR101251879B1 (en) Apparatus and method for displaying advertisement images in accordance with screen changing in multimedia cloud system

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13844726

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 13844726

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP