EP0886968A1 - Method and systems for asynchronous and progressive transmission of multimedia data
Info
- Publication number
- EP0886968A1 EP97902559A
- Authority
- EP
- European Patent Office
- Prior art keywords
- data
- blocks
- digital
- client
- database
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Withdrawn
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/231—Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion
- H04N21/23106—Content storage operation, e.g. caching movies for short term storage, replicating data over plural servers, prioritizing data for deletion involving caching operations
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/21—Server components or server architectures
- H04N21/222—Secondary servers, e.g. proxy server, cable television Head-end
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/23—Processing of content or additional data; Elementary server operations; Server middleware
- H04N21/234—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
- H04N21/2343—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
- H04N21/234318—Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into objects, e.g. MPEG-4 objects
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/40—Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
- H04N21/47—End-user applications
- H04N21/472—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
- H04N21/47205—End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/14—Systems for two-way working
- H04N7/141—Systems for two-way working between two video terminals, e.g. videophone
- H04N7/147—Communication arrangements, e.g. identifying the communication as a video-communication, intermediate storage of the signals
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N7/00—Television systems
- H04N7/16—Analogue secrecy systems; Analogue subscription systems
- H04N7/173—Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
- H04N7/17309—Transmission or handling of upstream communications
- H04N7/17318—Direct or substantially direct transmission and handling of requests
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/30—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
- H04N19/39—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability involving multiple description coding [MDC], i.e. with separate layers being structured as independently decodable descriptions of input picture data
Definitions
- the present invention relates to methods and systems for encoding digital multimedia data for transmission over a network.
- When using various media such as video, audio, text and images, a user generally retrieves the media from a storage device or "server" connected via a network to many computers or users. The server downloads the media to the network and transmits it to the user at the user's request.
- the present invention relates to the second limitation.
- One example of such a system includes a CD ROM drive and personal computer which may be located at the same site.
- Another example includes a network connecting Internet servers and users' personal computers. Such networks are installed in order to facilitate convenient data transmission between users and data distribution from the server to the users' computers.
- bandwidth limitations affect the amount of time required to transmit a video frame from the server to the user, and thus limit the video frame rate.
- When dealing with object movies and panoramas, the files being transmitted are extremely large, so that overcoming bandwidth limitations is a critical enabling factor, even for high bandwidth networks.
- the first is to compress the video frame sequence, thereby speeding up transmission time at the cost of additional downstream processing to decompress the frames prior to display.
- the second is to copy the entire sequence to an intermediate storage device, such as a user's hard disk, to which the user has higher bandwidth access, at the cost of delaying the viewing of the video until the entire sequence has been delivered.
- Known network applications involve streaming data from a server to a client computer (hereinafter also referred to as "client").
- “Streaming” refers to serial or parallel transmission of digital data between two computers, by transmitting sequences of bit packets. For example, installation executables on a network server stream files to a client computer performing the installation.
- Servers with large amounts of memory are used to archive digital movies, which are streamed to a client computer for viewing upon demand.
- Digital video is broadcast from cable stations to subscribers using streaming.
- Internet browsers, such as Netscape and Microsoft Explorer, are used to stream data from a server on the web to a client.
- Internet web sites can contain enormous databases, such as phone directories for all of the cities in the U.S., photographs from art galleries and museums around the world, voluminous encyclopedias, and even copies of all patents ever issued by the U.S. Patent & Trademark Office.
- Clients using the Internet can search these databases and then request the server to download specific information. This request initiates a streaming event.
- the present invention seeks to provide an improved method and system for transmitting digital data representing the original over plural transmission links at least some of which have limited bandwidth.
- the present invention relates to scalable encoding, which enables two or more clients, connected to a server by lines of differing bandwidth, to begin playing the multimedia data on-line at the same time, almost immediately after the start of streaming; at first, the lower bandwidth client receives lower quality media than the higher bandwidth client. As the media is replayed in the foreground and bandwidth is freed, more data streams in the background, and the quality of the media is enhanced.
- a client of an Internet application must wait until the requested data arrives, at whatever rate its network line provides. A client with a 14.4 Kbs modem line, for example, would have to wait twice as long as a client with a 28.8 Kbs modem line.
- the 14.4 Kbs client would never be able to achieve live playback, since there would be an ever-increasing lag in the data stream.
- the 28.8 Kbs client would receive unnecessarily poor quality media.
- the additional data block arriving in a 14.4 Kbs stream combines with the previous data block which arrived in a 14.4 Kbs stream, to produce a 28.8 Kbs streamed version; all that is being sent is the incremental data necessary for the upgrade.
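The arithmetic behind this upgrade path can be sketched in a few lines of Python. This is only an illustration, assuming each data block adds a fixed increment of bandwidth to the blocks before it; the function name `blocks_for_line` and the block rates are hypothetical and do not come from the patent text.

```python
def blocks_for_line(block_rates_kbps, line_rate_kbps):
    """Return how many leading blocks fit within a client's line rate.

    Assumption: block i adds block_rates_kbps[i] to the cumulative rate
    of the version built from blocks 0..i.
    """
    total = 0.0
    count = 0
    for rate in block_rates_kbps:
        # Small tolerance so that 14.4 + 14.4 compares cleanly against 28.8.
        if total + rate > line_rate_kbps + 1e-9:
            break
        total += rate
        count += 1
    return count


# Two blocks, each sized for a 14.4 Kbs stream (illustrative values).
block_rates = [14.4, 14.4]
slow_client_blocks = blocks_for_line(block_rates, 14.4)
fast_client_blocks = blocks_for_line(block_rates, 28.8)
```

Under these assumptions the slower client plays from the first block alone, while the faster client plays from both blocks integrated together; the slower client can still fetch the second block later in the background.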
- the progressive form of the encoding itself provides the ability to achieve scalability
- Another shortcoming of non-scalable encoding as in the prior art is the inability to preview a video sequence. Often a client would like to play a quick preview of a video clip before deciding whether or not to download it.
- the scalable representation of the present invention can be used to deliver the video in a preview mode, as the first data blocks. If the client continues to download the video after previewing, the first data block already transmitted is progressively integrated with additional data blocks to create the full viewing video.
- the present invention can also be applied to enhance delivery of large still images for multi-resolution gazing.
- Current technology transmits such images as large files, and carries out extensive computations for sub-sampling to lower resolution and zooming in to areas of interest for gazing. This makes it very time consuming to interact with large images, and as a result it is currently impractical to produce high resolution images for Internet browsing.
- producers simply sub-sample them to fit entirely within a computer monitor screen, and store the resulting low resolution images on web servers.
- producers can deliver high resolution images over the Internet for rapid interactive gazing.
- the present invention seeks to provide a scalable representation of multimedia data, enabling the data to be (a) progressively streamed, (b) transmitted asynchronously to clients at different bandwidths and (c) played back interactively on-line.
- the representation is two-dimensional, with one dimension (block number) being characterized by progressiveness in quality, and the second dimension (frame number) being characterized by interactivity.
- the representation comprises data blocks which are integrated with one another to produce successively higher bandwidth versions of the media, the data blocks comprising encoded frames.
- the first data block corresponds to the lowest bandwidth, and enables the client with this bandwidth to play back the media on-line at the lowest quality.
- the second data block when integrated with the first block, corresponds to the next higher bandwidth, and enables the client with this bandwidth to play back the media on-line at the next highest quality, and similarly for each successive data block.
- a client with the lowest bandwidth who played the media at the low quality and freed the bandwidth can continue in background to receive successive data blocks and integrate them with previously received data blocks, resulting in successively higher quality media each time it is replayed.
- the modular form of the data representation thus makes it possible to both accommodate different bandwidths and progressively update media quality.
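The two-dimensional representation described above (block number for quality, frame number for interactivity) might be modeled as follows. This is a minimal sketch, not the patent's data layout: the class name `ProgressiveDatabase`, its methods, and the rule that a frame's quality equals the number of consecutive blocks held for it are all assumptions made for illustration.

```python
from collections import defaultdict


class ProgressiveDatabase:
    """Toy two-dimensional store indexed by (block number, frame number)."""

    def __init__(self):
        # _parts[frame][block] holds the encoded part of `frame`
        # that was carried by `block`.
        self._parts = defaultdict(dict)

    def store(self, block, frame, payload):
        self._parts[frame][block] = payload

    def quality(self, frame):
        """Count consecutive blocks, starting at block 0, held for `frame`."""
        parts = self._parts.get(frame, {})
        level = 0
        while level in parts:
            level += 1
        return level


db = ProgressiveDatabase()
db.store(0, 0, b"frame0-coarse")   # block 0 covers both frames
db.store(0, 1, b"frame1-coarse")
db.store(1, 1, b"frame1-refine")   # only frame 1 has been upgraded so far
```

Here frame 1 can be shown at a higher quality level than frame 0, which mirrors the idea that quality and frame selection are independent dimensions.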
- a production tool makes it possible for a producer to control modularity and quality settings.
- a method for providing on-line virtual reality movies comprising inputting a cyclic movie sequence into an encoder, determining the number of portions that each frame of said movie is divided into, and forming partial frames, specifying hot-spots and independent objects for interaction within a partial frame, transmitting the partial frames part by part to a user's asynchronous database, and displaying said frames on a user's interface
- a system for producing virtual reality (VR) movies comprising an encoder for preparing the VR movie for transmission, and a server including a repository for the VR movie and a transceiver for transmitting the movie, part by part to a user, upon request
- An essential feature of the present invention is the use of a two-dimensional interactive progressive database to represent multimedia data, and the storage of this database in three different forms for streaming, processing and playback purposes
- the database is calibrated in data blocks of roughly equal size, to deliver the media for on-line playback at a selected range of bandwidths, and in such a way that the higher bandwidth versions are built by integrating data blocks with the lower bandwidth versions. Thus, rather than discard the lower bandwidth data, it is saved and used directly to upgrade from low to high bandwidth quality.
- the data blocks themselves are comprised of frames which can be randomly accessed, thus giving a second dimension (namely, frames) to the progressive database. Thus it is possible to selectively build higher quality versions of some frames and not others.
- the mechanism determining which frames to send within each block may be controlled interactively by the user.
- the present invention operates by creating three copies of the progressive database.
- a first copy is stored on the server serially.
- a second copy which mirrors the server database, is built on the client, with random accessibility. These first two copies are in encoded form.
- the frames in the data blocks from the second copy are decoded and stored in the third copy.
- the third copy is dynamically updated and contains the frames to be displayed in either raw bitmap form, or in intermediate compressed form whereby the decompression is fast enough to keep up with real time interactive display in response to user commands.
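The three copies described above can be illustrated with a small Python sketch. It rests on several assumptions that are not in the patent text: `zlib` stands in for the real codec, frames are short lists of byte values, and "integrating" a refinement block means adding it element-wise to what has already been decoded. All names are hypothetical.

```python
import zlib


def encode(values):
    # Stand-in codec (assumption): compress a list of 0..255 integers.
    return zlib.compress(bytes(values))


def decode(blob):
    return list(zlib.decompress(blob))


# Copy 1: the server copy, stored serially as (block, frame, encoded) records.
server_stream = [
    (0, 0, encode([10, 20])),
    (0, 1, encode([30, 40])),
    (1, 0, encode([1, 2])),
]

# Copy 2: the client mirror of the encoded data, randomly accessible.
client_encoded = {}

# Copy 3: decoded frames, kept ready for display.
display_frames = {}


def receive(record):
    """Mirror one record on the client and fold it into the display copy."""
    block, frame, blob = record
    client_encoded[(block, frame)] = blob
    part = decode(blob)
    current = display_frames.get(frame)
    if current is None:
        display_frames[frame] = part
    else:
        # Assumed integration rule: additive refinement.
        display_frames[frame] = [a + b for a, b in zip(current, part)]


for record in server_stream:
    receive(record)
```

After the loop, frame 0 has absorbed its block 1 refinement while frame 1 is still at its block 0 quality, and the encoded mirror remains available for random access.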
- a by-product of the present invention when applied to video clips, is the ability to deliver a preview of the video using the first data blocks. This enables the client to play the preview almost immediately after the transmission begins, and then to quickly decide whether or not to proceed with the download. Moreover, if the client does continue with the download, then the first data block already transmitted is integrated with additional data blocks being downloaded, to form the full view version of the video. Thus, rather than discard the data transmitted for the preview, it is saved and used to create the full view frame sequence.
- the present invention can also be applied to efficiently deliver large still images at multiple resolutions for interactive gazing.
- Each block of the progressive database stores various tiles of the image at different resolutions. Smaller tiles are stored at higher resolution. Hot spots are used to link tiles at lower resolution to smaller tiles contained within them at higher resolution. When a viewer clicks on a hot spot to gaze, the display quickly brings up the tile linked to by the hot spot, giving the effect of an instant zoom in.
- the totality of multi-resolution tiles may comprise the "frames" in this application, and these frames form the interactivity dimension of the database.
- the first tile consists of the lowest resolution version of the full image, with hot spots encoded within it.
- the user can at once begin gazing at the higher resolution tiles, even though the image is of low quality.
- additional data blocks are being delivered and decoded in background, and the quality of the tiles is being upgraded as time progresses.
- the zoomed in portion of the image being gazed at gets displayed almost immediately after the streaming begins, but at low quality.
- the quality improves with time.
- the higher resolution tiles correspond to hierarchical "areas of interest" in the image. The choice of which areas to mark with hot spots as areas of interest is in the hands of the producer.
- This use of the invention is particularly efficient in the case where there is a relatively small number of areas of interest in the full image, so that relatively few tiles are encoded at the higher resolutions. Without the present invention, the client would have to wait to receive the full image at higher resolution before viewing any part of it at this resolution, even though only small parts of it are of interest. Moreover, each zoom in and out would be both processor and memory demanding. Viewer interactivity would be painfully slow.
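The hot-spot linkage between tiles described above could be sketched as below. The `Tile` and `HotSpot` types, their fields, and the rectangular hit test are illustrative assumptions; the text only says that hot spots link a lower resolution tile to smaller, higher resolution tiles contained within it.

```python
from dataclasses import dataclass, field


@dataclass
class HotSpot:
    x0: int
    y0: int
    x1: int
    y1: int
    target: str  # name of the higher resolution tile this spot links to


@dataclass
class Tile:
    name: str
    scale: int  # relative resolution; larger means more detail
    hot_spots: list = field(default_factory=list)

    def click(self, x, y):
        """Return the name of the linked tile under (x, y), or None."""
        for spot in self.hot_spots:
            if spot.x0 <= x < spot.x1 and spot.y0 <= y < spot.y1:
                return spot.target
        return None


tiles = {
    "overview": Tile("overview", 1, [HotSpot(10, 10, 20, 20, "detail")]),
    "detail": Tile("detail", 4),
}
zoom_target = tiles["overview"].click(12, 15)
miss = tiles["overview"].click(0, 0)
```

Because the lookup is a simple table access rather than a resampling of the full image, a click inside a hot spot can bring up the linked tile quickly, at whatever quality has been received so far.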
- a system for transmitting digital data representing the original over plural transmission links at least some of which have limited bandwidth including: a digital data source storing digital data representing an original; a digital data receiver receiving the digital data representing an original via one of the plural transmission links having limited bandwidth; and a digital data transmitter operative to transmit the digital data representing an original to the receiver over a transmission link having a limited bandwidth in plural blocks which are sequentially transmitted at a rate determined by the limited bandwidth, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by the receiver even when less than all of the plural blocks have been received, receipt of subsequent blocks by the receiver being used to cumulatively improve the quality of the digital data viewed by the receiver.
- a digital data transmitter actuator comprising: an organizer operative, when actuated, to access digital data representing an original which is organized in plural blocks for subsequent transmission, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by a receiver even when less than all of the plural blocks have been received; and a receiver instruction interface responsive to interactive inputs from a receiver for actuating the organizer to select a given block and at least one given partial frame within the given block for transmission.
- a digital data receiver including: a data receipt interface receiving digital data representing an original in a plurality of sequential blocks, each block being an incomplete collection of data which includes parts of multiple frames; a block accumulator for combining plural blocks as they are received for viewing by the recipient; and a viewer including a recipient interface which permits each frame to be viewed in an order selected by the recipient, even when less than all of the plural blocks have been received, combining of plural blocks by the block accumulator being used to improve the quality of the digital data viewed by the recipient.
- a method for transmitting digital data representing an original over plural transmission links at least some of which have limited bandwidth including the steps of: storing digital data representing an original; receiving at a receiver the digital data representing an original via one of the plural transmission links having limited bandwidth; and transmitting the digital data representing an original to the receiver over a transmission link having a limited bandwidth in plural blocks which are sequentially transmitted at a rate determined by the limited bandwidth, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by the receiver even when less than all of the plural blocks have been received, receipt of subsequent blocks by the receiver being used to cumulatively improve the quality of the digital data viewed by the receiver.
- a method for digital data transmission including: organizing digital data representing an original into plural blocks for subsequent transmission, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by a receiver even when less than all of the plural blocks have been received; responsive to interactive inputs from a receiver for actuating the organizer, selecting a given block and at least one given partial frame within the given block for transmission; and transmitting the selected given block and at least one given partial frame to a user.
- the block accumulator is operative to combine plural blocks which are distinguished from each other by their respective frequency bands.
- the digital data receiver includes a fractal decompression engine.
- the data receipt interface is operative to initially receive a first plurality of blocks containing relatively low frequency data and thereafter receive a second plurality of blocks containing relatively high frequency data and the block accumulator is operative to reconstitute the digital data representing an original from the blocks representing relatively high frequency and relatively low frequency data.
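One way to picture low-frequency blocks followed by high-frequency blocks is a single-level Haar split, shown below. The choice of Haar averaging and differencing is an assumption made for illustration; the text does not name a specific frequency decomposition here, and the function names are hypothetical.

```python
def haar_split(samples):
    """Split an even-length signal into low and high frequency halves."""
    pairs = list(zip(samples[0::2], samples[1::2]))
    low = [(a + b) / 2 for a, b in pairs]    # pairwise averages
    high = [(a - b) / 2 for a, b in pairs]   # pairwise differences
    return low, high


def reconstruct(low, high=None):
    """Rebuild the signal; without `high`, return a smooth approximation."""
    if high is None:
        high = [0.0] * len(low)
    out = []
    for avg, diff in zip(low, high):
        out.extend([avg + diff, avg - diff])
    return out


frame = [4, 2, 8, 10]
low_block, high_block = haar_split(frame)

preview = reconstruct(low_block)            # only the first block received
full = reconstruct(low_block, high_block)   # both blocks accumulated
```

With only the low-frequency block the receiver gets a blurred but usable approximation; once the high-frequency block arrives, the accumulator recovers the original samples.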
- the block accumulator is operative to combine plural blocks having different sampling.
- the sampling rate of a combined plurality of blocks is equal to the sum of the sampling rates of individual ones of the plurality of blocks.
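The statement that sampling rates add when blocks are combined can be illustrated with interleaved subsampling. This sketch assumes each block carries every fourth sample at a distinct offset (phase); that scheme and the names `split_phases` and `combine` are hypothetical, chosen only to show rates summing.

```python
def split_phases(samples, phases):
    """Block k carries samples k, k + phases, k + 2 * phases, ..."""
    return {k: samples[k::phases] for k in range(phases)}


def combine(blocks, phases, length):
    """Merge received blocks; positions not yet received stay None."""
    merged = [None] * length
    for phase, values in blocks.items():
        merged[phase::phases] = values
    return merged


signal = list(range(10, 18))        # eight samples: 10, 11, ..., 17
blocks = split_phases(signal, 4)    # each block samples at 1/4 of full rate

# The receiver holds blocks 0 and 2 only, so the combined rate is 1/4 + 1/4.
partial = combine({0: blocks[0], 2: blocks[2]}, 4, len(signal))
known = [v for v in partial if v is not None]

complete = combine(blocks, 4, len(signal))
```

Each block contributes two of the eight samples, two blocks together contribute four, and all four blocks restore the full sequence.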
- the digital data receiver includes a wavelet decoder.
- the block accumulator includes a dequantizer which combines blocks each of which contain quantized data of a different order, such that accumulation of multiple blocks provides combined data of greater precision than that contained in any single block.
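Accumulating quantized data of different orders resembles successive refinement by bit planes, sketched below. The split of an 8-bit value into groups of 4, 2 and 2 bits, and the function names, are assumptions for illustration rather than the patent's quantization scheme.

```python
def quantize(value, widths):
    """Split `value` into bit groups, most significant group first."""
    shift = sum(widths)
    blocks = []
    for width in widths:
        shift -= width
        blocks.append((value >> shift) & ((1 << width) - 1))
    return blocks


def dequantize(blocks, widths):
    """Combine however many leading blocks have arrived so far."""
    shift = sum(widths)
    value = 0
    for bits, width in zip(blocks, widths):
        shift -= width
        value |= bits << shift
    return value


widths = [4, 2, 2]                   # illustrative 8-bit split
blocks = quantize(181, widths)       # 181 == 0b10110101

# Precision grows as more blocks are accumulated.
approximations = [dequantize(blocks[:n], widths) for n in (1, 2, 3)]
```

The first block alone gives a coarse value, and each further block narrows the error until the original value is recovered.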
- a digital data transmitter actuator including: an organizer operative, when actuated, to access digital data representing an original which is organized in plural blocks for subsequent transmission, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by a receiver even when less than all of the plural blocks have been received; and a receiver instruction interface responsive to interactive inputs from a receiver for actuating the organizer to select a given block and at least one given partial frame within the given block for transmission.
- a first one of the plural blocks contains digital data which represents a first approximation to the original.
- additional ones of the plural blocks, when combined with the first one of the plural blocks provide additionally accurate approximations to the original.
- each of the multiple frames includes a portion of data which can be independently and interactively manipulated.
- the system also includes a block generator operative to receive digital data representing the original and to provide the plural blocks.
- a block generator including: a producer interface; and a digital data compressor, operative in response to producer control parameters received via the producer interface for receiving digital data representing an original and providing plural blocks, each block being an incomplete collection of data which includes parts of multiple frames.
- the block generator is operative to provide plural blocks which are distinguished from each other by their respective frequency bands.
- the block generator includes a fractal compression engine.
- the block generator is operative to decompose the digital data representing an original in relatively high frequency and relatively low frequency digital data portions, and wherein a first plurality of blocks containing the relatively low frequency portion is transmitted by the data transmitter prior to transmission of a second plurality of blocks containing the relatively high frequency portion.
- the block generator is operative to provide plural blocks by sampling the received digital data.
- the sampling rate of a plurality of blocks is equal to the sum of the sampling rates of individual ones of the plurality of blocks.
- the block generator includes a wavelet encoder.
- the block generator includes a quantizer which produces blocks each of which contain quantized data of a different order, such that accumulation of multiple blocks provides combined data of greater precision than that contained in any single block.
- a method for encoding original digital video data to be stored on a server computer for on-line delivery to client computers including the steps of: encoding the digital video into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the video for on-line playback; storing the database on a server computer; processing a request by a client computer for on-line delivery of the video in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; transmitting the necessary data blocks to the client; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital video; and playing the reconstructed video on the client computer.
- the step of encoding includes a bit-rate control device enabling the producer to pre-select the sequence of bandwidths or quality levels for the
- the step of encoding is performed in such a way that the first blocks of the database correspond to previews of the video.
- the steps of transmitting, decoding, integrating and playing are repeated in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the video while it is replayed.
- a method for encoding original digital audio data to be stored on a server computer for on-line delivery to client computers including the steps of: encoding the digital audio into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the audio for on-line playback; storing the database on a server computer; processing a request by a client computer for on-line delivery of the audio in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; transmitting the necessary data blocks to the client; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital audio; and playing the reconstructed audio on the client computer.
- the step of encoding includes a bit-rate control device enabling the producer to pre-select the sequence of bandwidths or quality levels for the database.
- the steps of transmitting, decoding, integrating and playing are repeated in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the audio while it is replayed.
- a method for encoding original digital object movie data to be stored on a server computer for on-line delivery to client computers, comprising the steps of: encoding the digital object movie into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the object movie for on-line playback; storing the database on a server computer; processing a request by a client computer for on-line delivery of the object movie in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; transmitting the necessary data blocks to the client; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital object movie; and playing the reconstructed object movie on the client computer.
- the step of encoding includes a bit-rate control device enabling the producer to pre-select the sequence of bandwidths or quality levels for the database.
- the steps of transmitting, decoding, integrating and playing are repeated in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the object movie while it is replayed.
- a method for encoding an original digital panorama to be stored on a server computer for on-line delivery to client computers including the steps of: encoding the digital panorama into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the panorama for on-line playback; storing the database on a server computer; processing a request by a client computer for on-line delivery of the panorama in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; transmitting the necessary data blocks to the client; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital panorama; and playing the reconstructed panorama on the client computer.
- the step of encoding includes a bit-rate control device enabling the producer to pre-select the sequence of bandwidths or quality levels for the database.
- the steps of transmitting, decoding, integrating and playing are repeated in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the panorama while it is replayed.
- a method for encoding original digital large still image data to be stored on a server computer for on-line delivery to client computers including the steps of: encoding the large digital image into a database including a series of encoded data blocks, each block including a sequence of encoded multi-resolution tiles of the image, with the property that successive blocks when decoded and integrated together provide successively higher quality versions of the tiles for display; storing the database on a server computer; processing a request by a client computer for on-line delivery of the image in order to determine which data blocks to transmit; transmitting the necessary data blocks to the client; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original multi-resolution image tiles; and interactively displaying the reconstructed tiles on the client computer.
- the step of encoding includes a compression control device enabling the producer to pre-select the sequence of quality levels for the database.
- the step of encoding operates on a plurality of images forming an animation, and each encoded data block is comprised of multi-resolution tiles from the plurality of images.
- a video processing system operative on digital video data for encoding the digital video, storing it on a server computer and delivering it to client computers on-line upon request
- an encoder for compressing the digital video into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the video for on-line playback
- a storage device for archiving the database on a server computer
- a processing unit for accepting a request by a client computer for on-line delivery of the video and determining which data blocks to transmit, so as to accommodate the client bandwidth
- a transmitter for delivering the necessary data blocks to the client
- a decoder for decompressing the data blocks back into video data on the client computer
- an accumulator for integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital video
- a player on the client computer for playing the reconstructed digital video.
- the encoder includes a bit-rate controller enabling the user to pre-select the sequence of bandwidths or quality levels for the database. Moreover, in accordance with a preferred embodiment of the present invention, the encoder compresses the digital video in such a way that the first blocks of the database correspond to previews of the video.
- the transmitter, decoder, accumulator and player repeatedly operate in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the video while it is being replayed.
- an audio processing system operative on digital audio data for encoding the digital audio, storing it on a server computer and delivering it to client computers on-line upon request
- an encoder for compressing the digital audio into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the audio for on-line playback
- a storage device for archiving the database on a server computer
- a processing unit for accepting a request by a client computer for on-line delivery of the audio and determining which data blocks to transmit, so as to accommodate the client bandwidth
- a transmitter for delivering the necessary data blocks to the client
- a decoder for decompressing the data blocks back into audio data on the client computer
- an accumulator for integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital audio
- a player on the client computer for playing the reconstructed digital audio
- the transmitter, decoder, accumulator and player repeatedly operate in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the audio while it is being replayed.
- an object movie processing system operative on digital object movie data for encoding the digital object movie, storing it on a server computer and delivering it to client computers on-line upon request
- an encoder for compressing the digital object movie into a database including a series of encoded data blocks, each block comprising a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the object movie for on-line playback
- a storage device for archiving the database on a server computer
- a processing unit for accepting a request by a client computer for on-line delivery of the object movie and determining which data blocks to transmit, so as to accommodate the client bandwidth
- a transmitter for delivering the necessary data blocks to the client
- a decoder for decompressing the data blocks back into object movie data on the client computer
- an accumulator for integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital object movie
- a player on the client computer for playing the reconstructed digital object movie.
- the encoder includes a bit-rate controller enabling the user to pre-select the sequence of bandwidths or quality levels for the database.
- the transmitter, decoder, accumulator and player repeatedly operate in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the object movie while it is being replayed.
- a panorama processing system operative on digital panorama data for encoding the digital panorama, storing it on a server computer and delivering it to client computers on-line upon request
- an encoder for compressing the digital panorama into a database including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the panorama for on-line playback
- a storage device for archiving the database on a server computer
- a processing unit for accepting a request by a client computer for on-line delivery of the panorama and determining which data blocks to transmit, so as to accommodate the client bandwidth
- a transmitter for delivering the necessary data blocks to the client
- a decoder for decompressing the data blocks back into panorama data on the client computer
- an accumulator for integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital panorama
- a player on the client computer for playing the reconstructed digital panorama.
- the encoder includes a bit-rate controller enabling the user to pre-select the sequence of bandwidths or quality levels for the database.
- the transmitter, decoder, accumulator and player repeatedly operate in succession a number of times in order to transmit additional data blocks to the client, thereby upgrading the quality of the panorama while it is being replayed.
- an image processing system operative on large digital image data for encoding the digital image, storing it on a server computer and delivering it to client computers on-line upon request
- an encoder for compressing the large digital image into a database including a series of encoded data blocks, each block including a sequence of encoded multi-resolution tiles of the image, with the property that successive blocks when decoded and integrated together provide successively higher quality versions of the image tiles
- a storage device for archiving the database on a server computer
- a processing unit for accepting a request by a client computer for on-line delivery of the image and determining which data blocks to transmit; a transmitter for delivering the necessary data blocks to the client; a decoder for decompressing the data blocks back into image tile data on the client computer; an accumulator for integrating the data blocks together on the client computer to reconstruct an appropriate version of the original multi-resolution image tiles; and an interactive viewer on the client computer for displaying the reconstructed image tiles.
- the encoder includes a compression controller enabling the user to pre-select the sequence of quality levels for the database.
- each encoded data block is comprised of multi-resolution tiles from the plurality of images.
- a method for caching of data which gets transmitted from servers to clients on a central hub within a network including the steps of: encoding digital multimedia data into databases including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the media for on-line playback; storing the databases on a multitude of server computers; managing within the hub requests by client computers for on-line delivery of media stored on server computers in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; transmitting the necessary data blocks from the server and from the hub to the client; storing the data blocks delivered by the server in the cache residing in the central hub; processing within the hub the data blocks it receives; decoding the data blocks on the client computer; integrating the data blocks together on the client computer to reconstruct an appropriate version of the original digital media; and playing the reconstructed media on the client computer.
- the step of managing is performed by: setting inventory flags to indicate which data blocks are currently stored in the hub.
- the step of managing further including the steps of: communicating with the servers to monitor which media data is outdated; removing from cache the blocks corresponding to the media data which is outdated; and resetting the inventory flags to indicate that the above blocks are no longer stored in the cache.
- the step of processing comprising the steps of: decoding the data blocks received; integrating the data blocks together to reconstruct appropriate versions of the original digital media; and encoding the reconstructed media versions into an intermediate database for future transmission to the clients.
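The inventory-flag bookkeeping in the managing steps above can be illustrated with a small sketch. It is only one possible reading of those steps: the class name `HubCache`, the `(media_id, block_index)` keying and the `fetch_from_server` callback are hypothetical names introduced here, not taken from the text.

```python
from typing import Callable, Dict, Set, Tuple

BlockKey = Tuple[str, int]  # (media id, block index); hypothetical keying


class HubCache:
    """Toy model of a hub cache whose contents are tracked by inventory flags."""

    def __init__(self, fetch_from_server: Callable[[str, int], bytes]) -> None:
        self._fetch = fetch_from_server      # assumed server-side lookup
        self._blocks: Dict[BlockKey, bytes] = {}
        self._flags: Set[BlockKey] = set()   # flag present == block held in hub

    def is_cached(self, media_id: str, index: int) -> bool:
        return (media_id, index) in self._flags

    def get_block(self, media_id: str, index: int) -> bytes:
        """Serve a block from the hub, fetching from the server on a miss."""
        key = (media_id, index)
        if key not in self._flags:
            self._blocks[key] = self._fetch(media_id, index)
            self._flags.add(key)             # set the inventory flag
        return self._blocks[key]

    def invalidate(self, media_id: str) -> int:
        """Drop all blocks of outdated media and reset their flags."""
        stale = [key for key in self._flags if key[0] == media_id]
        for key in stale:
            del self._blocks[key]
            self._flags.discard(key)
        return len(stale)
```

In this sketch a repeated request for the same block is answered from the hub without contacting the server, and `invalidate` models the removal of outdated media followed by resetting the flags.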
- a proxy system operative on a server/client network for caching of data which gets transmitted from servers to clients on a central hub, including: an encoder for compressing digital multimedia data into databases including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the media for on-line playback; server communication lines from the servers to the hub for sending data blocks; client communication lines from the hub to the clients for sending digital data; storage devices for archiving the databases on a multitude of server computers; a management unit within the hub to process requests by client computers for on-line delivery of media stored on server computers in order to determine which data blocks to transmit, so as to accommodate the client bandwidth; a transmitter for delivering the necessary data blocks on the server communication lines from the server to the hub, and on the client communication lines from the hub to the client; a storage device for saving the data blocks delivered by the server communication lines in the cache residing in the central hub;
- the management unit operates by setting inventory flags to indicate which data blocks are currently stored in the hub.
- the management unit operates by monitoring from the servers which media data is outdated, removing from cache the blocks corresponding to the media data which is outdated, and resetting the inventory flags to indicate that the above blocks are no longer stored in the cache.
- the processing unit comprising: a decoder for decompressing the data blocks received; an accumulator for integrating the data blocks together to reconstruct appropriate versions of the original digital media; and an encoder for compressing the reconstructed media versions into an intermediate database for future transmission to the clients.
- a multi-casting unit (MCU) system operative on a broadcasting network for caching of data which gets transmitted from stations to viewers, including: an encoder for compressing digital multimedia data into databases including a series of encoded data blocks, each block including a sequence of encoded frames, with the property that successive blocks when decoded and integrated together provide successively higher bandwidth versions of the media for on-line playback; station communication lines from the stations to the MCU for sending data blocks; viewer communication lines from the MCU to the viewers for sending data; viewer receiver units for receiving the data sent by the MCU; storage devices for archiving the databases on a multitude of station computers; a management unit within the MCU to process requests by viewers for on-line delivery of media stored on station computers in order to determine which data blocks to transmit, so as to accommodate the viewer bandwidth; a transmitter for delivering the necessary data blocks on the station communication lines from the station to the MCU, and on the viewer communication lines from the MCU to the viewer receiver units; a storage device for
- the management unit operates by setting inventory flags to indicate which data blocks are currently stored in the MCU.
- the management unit operates by monitoring from the stations which media data is outdated, removing from cache the blocks corresponding to the media data which is outdated, and resetting the inventory flags to indicate that the above blocks are no longer stored in the cache.
- the processing unit within the MCU including: a decoder for decompressing the data blocks received; an accumulator for integrating the data blocks together to reconstruct appropriate versions of the original digital media; and an encoder for compressing the reconstructed media versions into an intermediate database for future transmission to the viewers.
- a method for streaming multimedia data over a network including the steps of: encoding the media into a progressive database indexed according to frame and progressive block numbers; serializing the encoded database; storing the serialized database on a server; streaming the serialized database to a client upon request; creating a mirror copy of the encoded database on the client computer from the data which streams in; and decoding the encoded database on the client computer into a sequence of frames for real time display.
- a multimedia network streaming system including: an encoder for compressing the media into a progressive database indexed according to frame and progressive block numbers; a sequencer for serializing the encoded database; a storage device for archiving the serialized database on a server; a transmitter for streaming the serialized database to a client upon request; a processor for creating a mirror copy of the encoded database on the client computer from the data which streams in; and a decoder for decompressing the encoded database on the client computer into a sequence of frames for real time display.
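The serialize, stream and mirror steps can be sketched as below. This is an illustrative assumption rather than the format used by the invention: the record layout (three big-endian 32-bit integers for block number, frame number and payload length, followed by the payload) and the function names `serialize` and `mirror` are hypothetical.

```python
import struct
from typing import Dict, Tuple

# (block number, frame number) -> encoded partial-frame payload
Database = Dict[Tuple[int, int], bytes]

# Hypothetical record header: block, frame, payload length.
_HEADER = struct.Struct(">III")


def serialize(db: Database) -> bytes:
    """Flatten the indexed database, block by block, frames in order."""
    out = bytearray()
    for block, frame in sorted(db):
        payload = db[(block, frame)]
        out += _HEADER.pack(block, frame, len(payload))
        out += payload
    return bytes(out)


def mirror(stream: bytes) -> Database:
    """Rebuild the indexed database on the client from the bytes received so far."""
    db: Database = {}
    offset = 0
    while offset + _HEADER.size <= len(stream):
        block, frame, length = _HEADER.unpack_from(stream, offset)
        start = offset + _HEADER.size
        if start + length > len(stream):
            break  # the last record has not fully arrived yet
        db[(block, frame)] = stream[start:start + length]
        offset = start + length
    return db
```

Because every record carries its own frame and block numbers, the client copy stays indexed in the same way as the server database even when only a prefix of the stream has arrived.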
- a system for transmitting model based data representations of three dimensional images over plural transmission links having limited bandwidth including: a digital data source storing model based data representations of three dimensional images; an image processor for rendering views of said model based data representations into raster bitmap format; a digital data receiver receiving said digital data in said raster bitmap format over a one of the plural transmission links having limited bandwidth; and a digital data transmitter operative to transmit the digital data in said raster bitmap format to said receiver over a transmission link having a limited bandwidth in plural blocks which are sequentially transmitted at a rate determined by the limited bandwidth, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by said receiver even when less than all of the plural blocks have been received, receipt of subsequent blocks by the receiver being used to cumulatively improve the quality of the digital data viewed by the receiver.
- model based data representations comprise VRML representations.
- model based data representations comprise CAD-CAM representations.
- the image processor is operative to render only views which are selected by a user.
- a method for transmitting model based data representations of three dimensional images over plural transmission links having limited bandwidth including: storing model based data representations of three dimensional images; rendering views of the model based data representations into raster bitmap format; receiving the digital data in said raster bitmap format over one of said plural transmission links having limited bandwidth; and transmitting the digital data in said raster bitmap format to the receiver over a transmission link having a limited bandwidth in plural blocks which are sequentially transmitted at a rate determined by the limited bandwidth, each block being an incomplete collection of data which includes parts of multiple frames, each frame being viewable in a selectable order by the receiver even when less than all of the plural blocks have been received, receipt of subsequent blocks by the receiver being used to cumulatively improve the quality of the digital data viewed by the receiver.
- model based data representations comprise VRML representations, and CAD-CAM representations.
- RESOLUTION The relationship between the number of digital samples per unit of an original and the number of digital samples per unit in a rendered version thereof. Specifically, when dealing with images, resolution refers to the relationship between the number of pixels per unit area of an original image or scene and the number of pixels per unit area in a displayed image. Specifically, when dealing with audio, resolution refers to the relationship between the number of samples per unit time of an original sound and the number of samples per unit time in a played sound.
- QUALITY The degree to which a rendered version of an original is faithful to the original.
- quality refers to the degree to which the displayed image is faithful to the original image or scene. Normally this is expressed as the degree to which the approximation of pixel values in the displayed image approaches the correct pixel values in the original image or scene
- quality refers to the degree to which a played sound is faithful to the original sound.
- FRAME A portion of an original which can be independently and interactively manipulated.
- frame refers to a portion of an image or of a collection of images which can be independently and interactively manipulated.
- frame refers to a portion of a sound which is delimited in time and can be independently and interactively manipulated.
- BLOCK A sequentially transmitted collection of partial data which is used to build multiple frames.
- the frames are built up of one or more sequentially transmitted blocks, whose contents are accumulated.
- the block contains image data.
- the block contains audio data.
- PARTIAL FRAME The part of a frame which is contained in a given block.
- TILE A window-sized pixel array of a predetermined size forming part of an image. For example, tiles partition an image into a plurality of arrays, each of which contains an identical number of pixels.
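As an illustration of the tile definition, the sketch below partitions an image into equal tiles and returns the top-left corner of each. The function name `tile_origins` is hypothetical, and the requirement that the image dimensions be exact multiples of the tile size is an assumption made here so that every tile holds the same number of pixels; the text does not say how edge tiles are handled.

```python
from typing import List, Tuple


def tile_origins(width: int, height: int,
                 tile_w: int, tile_h: int) -> List[Tuple[int, int]]:
    """Return the (x, y) top-left corner of every tile, in row-major order."""
    if min(width, height, tile_w, tile_h) <= 0:
        raise ValueError("all dimensions must be positive")
    # Assumption: no padding policy is defined, so uneven splits are rejected.
    if width % tile_w or height % tile_h:
        raise ValueError("image dimensions must be multiples of the tile size")
    return [(x, y)
            for y in range(0, height, tile_h)
            for x in range(0, width, tile_w)]
```

For instance, a 1024 by 768 image with 256 by 256 tiles yields a grid of 4 by 3 tiles.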
- Fig. 1 and Fig. 2 are simplified block diagrams illustrating a system for scalable representation of multimedia data for progressive asynchronous transmission, constructed and operative in accordance with a preferred embodiment of the present invention;
- Fig. 3A is a simplified schematic diagram of the database structure of the present invention which includes three databases, embodied within the client-server system of the present invention;
- Fig. 3B is a simplified diagram of a database structure particularly useful in the client database of Fig. 3A, illustrating its two-dimensional nature;
- Fig. 4 is an illustration of the operation of a preferred embodiment of the present invention.
- Fig. 5 is a simplified schematic diagram of a production tool for converting a digital multimedia file into a progressive scalable database representation for storage on a server computer in accordance with a preferred embodiment of the present invention
- Fig. 6 is a simplified schematic diagram of the structure of a block within the server database, partitioned into frames which can be accessed randomly in accordance with a preferred embodiment of the present invention
- Fig. 7 is a simplified schematic diagram of a decoder for receiving and integrating data blocks from a scalable database, to form a version of a digital multimedia object for playback in accordance with a preferred embodiment of the present invention
- Fig. 8 is a simplified schematic diagram of a scalable progressive database for a video clip in which the first data blocks are used for previewing the video in accordance with a preferred embodiment of the present invention
- Fig. 9 and Fig. 10 are simplified schematic diagrams of a system for incorporating a scalable progressive database into a time-based video sequence of frames indexed by two time scales: a macro and micro scale, in accordance with a preferred embodiment of the present invention
- Fig. 11 is a simplified block diagram of a proxy system used to cache in a central hub multimedia data which is transmitted from servers to clients in accordance with a preferred embodiment of the present invention
- Fig. 12 is a simplified block diagram of a system for generating a scalable database from digital media data, by running a compressor in a feedback loop in accordance with a preferred embodiment of the present invention
- Fig. 13 is a simplified block diagram of a decoder for the database generated by the system of Fig. 12 in accordance with a preferred embodiment of the present invention
- Fig. 14 is a simplified diagram illustrating a scalable progressive database useful for a large still image in accordance with a preferred embodiment of the present invention
- Fig. 15 is a simplified diagram illustrating a virtual reality system constructed and operative in accordance with a preferred embodiment of the present invention.
- Fig. 16 is a simplified flowchart illustrating operation of the system of Fig. 15.
- Fig. 17 is an illustration of the operation of a preferred embodiment of the present invention illustrated in Figs. 15 and 16. Permission to reproduce Fig. 17 was granted by Tecnomatix, Ltd.
- the present invention provides a novel method for representing multimedia data.
- the invention provides a scalable representation, so that the data can be asynchronously transmitted to clients having different bandwidth connections, played on-line almost immediately after the transmission begins, interactively controlled, and also progressively upgraded as it is replayed.
- Video players play at standard rates such as thirty frames per second (fps), and require the images for display to be available at this rate. If the images are already stored on a local hard disk, then all that is necessary is disk access, which is very fast. On the other hand, if the images are streamed in from a server, then in order for on-line playback to be possible before a full download is finished, the rate of transmission must be great enough to supply the frames at thirty fps. This does not mean that the network link has to transmit the data equivalent to thirty full frames every second. Due to compression, it suffices if the network transmits thirty compressed frames every second.
- if the compression achieved is 10:1, then it suffices to transmit data at a rate equivalent to three uncompressed frames per second, provided that the client CPU can decompress thirty compressed frames into full frames every second.
- compression is the mediator between the video player and the bandwidth. The player does not slow down when bandwidth is low; rather, the compression ratio has to be greater. Should a bottleneck arise, and a frame is not available when the player needs it, then the player simply skips that frame, but continues to expect frames at the thirty fps rate.
- the video can be preset at the outset for lower rates than thirty fps, but not much lower, since slow video playback breaks the continuity between frames, and thus loses the effect of motion.
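The arithmetic behind the 10:1 example can be written out as a short sketch. The function names and the raw frame size used in the usage note are hypothetical; only the thirty fps rate and the 10:1 ratio come from the text.

```python
def full_frame_equivalent_rate(playback_fps: float,
                               compression_ratio: float) -> float:
    """Full-frame equivalents the link must carry per second."""
    if playback_fps <= 0 or compression_ratio <= 0:
        raise ValueError("rates and ratios must be positive")
    return playback_fps / compression_ratio


def required_link_kbits_per_second(playback_fps: float,
                                   raw_frame_kbits: float,
                                   compression_ratio: float) -> float:
    """Link rate needed for on-line playback, given an assumed raw frame size."""
    rate = full_frame_equivalent_rate(playback_fps, compression_ratio)
    return rate * raw_frame_kbits
```

With thirty fps playback and 10:1 compression, `full_frame_equivalent_rate(30, 10)` gives 3.0. Assuming, purely for illustration, raw frames of 600 kilobits, the link would need to carry 1800 kilobits per second.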
- the media data is comprised of $m$ frames $F_1, F_2, \ldots, F_m$.
- a frame can be, for example, an individual frame of a movie sequence, a piece of a panoramic view, an individual segment of an audio signal, or even a sub-sampled version of a large still image. It can also be a group of such frames, such as for example, a group of inter-frames between key frames in a video segment, in a case where an H.263 codec is being used.
- frames are units of interactivity. For example, in object movies where interactivity means frame advance, a frame unit is an individual still image, whereas in gazing applications where interactivity means zooming in and out, the frame units are multi-resolution tiles.
- the representation encodes the media data into $n$ data blocks $B_1, B_2, \ldots, B_n$, preferably of roughly equal size.
- Each encoded data block $B_j$ contains $m$ compressed frame units $F_1^j, F_2^j, \ldots, F_m^j$.
- the database is arranged in two dimensions, corresponding to blocks and frames.
- the dimension used for blocks is for achieving progressiveness, and the dimension used for frames is for achieving interactivity.
- the frame data can be transmitted in a selective order, but the blocks must be transmitted in sequence, since they build cumulatively. This is an essential feature of the subject invention.
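The two-dimensional arrangement and the cumulative nature of the blocks can be sketched as follows. This is a toy model under stated assumptions: the class name `ClientDatabase` is hypothetical, indices are zero-based, and each partial frame is represented by a number that is simply added to the previous ones, standing in for whatever decoding and integration the real codec performs.

```python
from typing import List, Optional


class ClientDatabase:
    """Grid of partial frames: one axis for blocks, one for frames."""

    def __init__(self, num_frames: int, num_blocks: int) -> None:
        self.num_frames = num_frames
        self.num_blocks = num_blocks
        # grid[block][frame] is None until that partial frame has arrived.
        self.grid: List[List[Optional[float]]] = [
            [None] * num_frames for _ in range(num_blocks)
        ]

    def store(self, block: int, frame: int, partial: float) -> None:
        self.grid[block][frame] = partial

    def usable_depth(self, frame: int) -> int:
        """Number of consecutive blocks, starting at the first, held for a frame."""
        depth = 0
        for block in range(self.num_blocks):
            if self.grid[block][frame] is None:
                break  # a gap: later blocks cannot be integrated yet
            depth += 1
        return depth

    def reconstruct(self, frame: int) -> float:
        """Integrate the usable partial frames (stand-in: plain addition)."""
        depth = self.usable_depth(frame)
        return sum(self.grid[block][frame] for block in range(depth))
```

Frames can be filled in any order along the frame axis, but a partial frame from a later block contributes nothing until all earlier blocks for that frame are present, which mirrors the requirement that blocks build cumulatively.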
- Data block $B_1$ is used to deliver the media at the lowest bandwidth, say $f_1$ Kbs; data blocks $B_1$ and $B_2$, when integrated together, are used to deliver the media at bandwidth $f_2$
- within each block the frames can be accessed randomly and delivered selectively, so that the user can vary the quality level among the frames.
- a viewer who wants to gaze at frame #3 may instruct the database to send frame #3 data from the first ten blocks, but only one block of data for all of the other frames
- the viewer selection is carried out interactively, through the use of keyboard presses and mouse clicks, as the media is being played. Whereas for some applications it may be most natural to transmit the entire blocks in sequence, for other applications it may be more effective to first deliver as much data as possible for specific frames at the expense of lowering the quality of other frames.
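The selective delivery described above (for example, ten blocks of data for frame #3 but a single block for every other frame) can be sketched as a transmission plan. The function name `delivery_plan`, the zero-based indices and the block-major ordering are assumptions made for this illustration.

```python
from typing import Dict, List, Tuple


def delivery_plan(num_frames: int,
                  depth_by_frame: Dict[int, int],
                  default_depth: int = 1) -> List[Tuple[int, int]]:
    """Return (block, frame) pairs in transmission order.

    depth_by_frame maps a frame index to the number of blocks requested for
    it; all other frames receive default_depth blocks.
    """
    depths = [depth_by_frame.get(frame, default_depth)
              for frame in range(num_frames)]
    plan: List[Tuple[int, int]] = []
    # Walk the blocks in sequence so each frame receives its blocks in order.
    for block in range(max(depths, default=0)):
        for frame, depth in enumerate(depths):
            if block < depth:
                plan.append((block, frame))
    return plan
```

For five frames with `depth_by_frame={3: 10}`, the plan first sends the first block for all five frames and then nine further blocks for frame 3 only. A frame-first ordering, which delivers all requested data for the selected frame before anything else, would be an equally valid variant for applications that favor specific frames.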
- the scalable representation that is the subject of the present invention is embodied in a production tool which enables the producer to control the bandwidth parameters $f_k$, or equivalently, the qualities of the media versions obtained by integrating blocks $B_1, B_2, \ldots, B_k$.
- it is not necessary that the blocks be of equal size, nor that the frequencies $f_k$ be given by $k f_1$ (integer multiples of the lowest bandwidth), although this is the preferred embodiment.
- the production tool also enables the producer to control the final quality of the highest bandwidth version, or equivalently, the total number, n, of data blocks in the representation
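One way to picture how the bandwidth parameters are used when a client request is processed is the sketch below. It assumes the parameters form a non-decreasing list in which entry k is the bandwidth needed to play the version built from the first k blocks; the function names and the numbers in the usage note are hypothetical.

```python
from typing import List, Sequence


def uniform_bandwidths(base_kbs: float, n: int) -> List[float]:
    """Bandwidths for n blocks when block k needs k times the base bandwidth."""
    return [base_kbs * k for k in range(1, n + 1)]


def blocks_for_bandwidth(client_kbs: float,
                         block_bandwidths: Sequence[float]) -> int:
    """Largest number of leading blocks whose bandwidth fits the client link.

    Returns 0 when even the lowest-bandwidth version exceeds the link, in
    which case on-line playback is not possible without waiting.
    """
    count = 0
    for needed in block_bandwidths:  # assumed non-decreasing
        if needed > client_kbs:
            break
        count += 1
    return count
```

With a base bandwidth of 16 Kbs and four blocks, a client on a 56 Kbs link would be sent the first three blocks for on-line playback, and could receive the fourth afterwards to upgrade quality on replay.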
- if the media data representation is not scalable, but is encoded instead for a specific bandwidth $f$, then only clients with bandwidth $f$ or greater can play the media on-line as it is being downloaded. A client with a lower bandwidth than $f$ would have to download the entire data stream to memory in order to begin playback, which can take a great deal of time on account of the large file sizes typically used in multimedia production. A client with a higher bandwidth connection than $f$ would not be able to take advantage of it to receive higher quality media. Moreover, there would be no means of upgrading media quality, even for clients with high bandwidth connections, other than to transmit an entirely new data stream from the server side, and discard the previously downloaded data.
- Applications of the invention include, inter alia, scalable audio and video transmission, video previewing, progressively rendered object movies and panoramas, large still images, efficient proxy or multi-casting unit (MCU) management for web and other hubs, and VRML transmission, as described hereinafter in greater detail.
- Scalable audio transmission: Digital audio data can be progressively encoded into a scalable database for asynchronous transmission at different bandwidths.
- a client connected with a low bandwidth line can receive a low quality version of the audio, which can be played back on-line at the low bandwidth as the data streams in. After the audio is played, additional data blocks can continue to be received and integrated with the previous blocks, so that the audio is upgraded to higher quality for replay.
- Scalable video transmission: Similar to the description above for the audio transmission, digital video data can be encoded into a scalable database for asynchronous delivery and progressive quality upgrade.
- the first time scale (hereinafter referred to as the "major scale") is used to advance from one frame to the next, based on major units of time.
- the second time scale (hereinafter referred to as the "minor scale") is a sub-division of the major scale into smaller time units, and is used to incorporate small changes or fluctuations into the frame being displayed.
- the major scale can be advancing through a movie of a bird flying and the minor scale can be adding fluttering to the bird's wings.
- the advantage of such a two-scale player is that the decoder, which does the intensive processing to supply the frames, need only run at the slower rate, e.g. 3 frames per second (fps), governed by the major scale, whereas the viewer, doing the less intensive processing, is playing at the faster rate, e.g. 30 fps, governed by the minor scale.
- the subject invention can be incorporated into a system having two time scales as described hereinabove by using the minor scale in a way different from the way that was originally intended. Instead of being used to introduce fluctuations, it is used to display progressively rendered versions of a frame.
- the player displays the latest version available of the frame indexed by the major scale. For example, suppose there are ten minor time units within the major time unit during which frame #4 is to be displayed.
- the player initially displays the version of frame #4 which it has available from the first data blocks already processed. As additional blocks of data are accumulated and higher quality versions of frame #4 become available, the player displays those frames at successive minor time units.
- For the user to be able to view the video immediately, without waiting for the entire file to download, the production tool must store the encoded video in the order of successive blocks. Each partial frame must be handled as if it were an entire frame. That is, the production tool must treat the movie as if there were a total of m × n distinct frames being encoded. On the other hand, each frame is sent only once to the codec for encoding, and is returned as a series of encoded partial frames. Thus it is necessary to post-process the encoded data file, to rearrange the data items from a frame-dominated order to a block-dominated order. This rearrangement process is referred to as "flattening" in the art.
- the player in turn, however, must know that although it is receiving what appears to be m × n data items, there are really only a total of m frames. It must decode and accumulate every successive sequence of m data items with the previous ones, to update the frames.
- the combined effect of the flattening on the production side and the player's interpretation on the client side enables seamless integration of the scalable progressive database within a non-progressive video interface. That is, the incorporation of progressive blocks does not require any modifications to the existing interface.
- Video previewing: When encoding digital video data into a scalable database, the first data blocks can be used to generate a preview of the video, restricted to selected frames. The preview can be played back by the client almost immediately after the streaming begins. Moreover, additional data blocks received are integrated with the data blocks from the preview, to form full view versions of the video.
- Object movies: Advertising agencies are using object movies to produce interactive 3-D virtual reality presentations of merchandise on the Internet. The user can rotate and zoom the 3-D object, and examine it from different viewing angles. Using the methodology of the current invention, object movies can be progressively encoded so that the viewer can download and begin playing them almost immediately after the streaming begins. Initially the movie will scale to a quality commensurate with the bandwidth of the user's network connection, but as the data blocks are received and the user interacts with the movie, additional data blocks are delivered and integrated with the previous blocks, resulting in a higher and higher quality movie.
- An important feature of the invention is that, regardless of bandwidth, the user can begin playback and interaction almost immediately, and does not need to wait for the complete download, as the first version of the movie delivered scales itself to the native bandwidth. As playback continues additional data streams in the background and the movie version is upgraded to higher and higher quality.
- Panoramas are very large images which the user cannot view in their entirety, but rather sees within a restricted viewing window. By panning in various directions, and zooming in and out, the user navigates through the panorama. The continuous change in viewing window gives the effect of movement within a scene. Similar to the description above for object movies, panoramas can be progressively encoded so that the viewer can download and begin navigating through them almost immediately after the streaming begins. Initially the panorama scales to the client bandwidth, and after the first data blocks are received, additional data blocks are streamed in background while the panorama is playing, to provide higher and higher image quality.
- the frames can be small image tiles within the full image at different resolutions, the smaller tiles having higher resolution than the larger ones.
- the locations of the tiles can be marked as hot spots.
- the database delivers that tile at a higher resolution, giving the effect of a zoom in. Within the higher resolution tile there can be more hot spots, and the zooming can continue through the database.
- the first frame within a data block may contain the full image sub-sampled by 4:1, for example, in each dimension.
- the next set of frames within the data block may contain (some subset of) the four quadrants of the full image sub-sampled by 2:1 in each dimension.
- the next set of frames within the data block may contain (some subset of) the sixteen quadrants within the above four quadrants at the original resolution.
- a viewer could see the 4:1 reduction of the original image (the first block), click on one of the quadrants and then see that quadrant at a 2:1 reduction (a frame from the second set), and click further on one of its quadrants and then see it at full scale (a frame from the third set).
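The three zoom levels described above can be sketched in a few lines. This is only an illustration under stated assumptions: the image is a square list of rows whose side is divisible by 4, sub-sampling simply keeps every n-th pixel, and the names `build_pyramid` and `zoom` are hypothetical, not taken from the patent.

```python
def subsample(image, step):
    """Keep every `step`-th pixel in each dimension (image is a list of rows)."""
    return [row[::step] for row in image[::step]]


def crop(image, top, left, size):
    """Cut a size x size square out of the image."""
    return [row[left:left + size] for row in image[top:top + size]]


def build_pyramid(image):
    """Return {(level, row, col): tile} for the three zoom levels.

    Level 0: the full image sub-sampled 4:1.
    Level 1: the four quadrants sub-sampled 2:1.
    Level 2: the sixteen sub-quadrants at the original resolution.
    Every tile ends up with the same pixel dimensions (side / 4).
    """
    side = len(image)
    tiles = {(0, 0, 0): subsample(image, 4)}
    for level, step in ((1, 2), (2, 1)):
        per_dim = 2 ** level            # 2 tiles per dimension, then 4
        size = side // per_dim
        for r in range(per_dim):
            for c in range(per_dim):
                region = crop(image, r * size, c * size, size)
                tiles[(level, r, c)] = subsample(region, step)
    return tiles


def zoom(key, quadrant_row, quadrant_col):
    """Key of the tile reached by clicking a quadrant (0 or 1 per axis)."""
    level, r, c = key
    return (level + 1, 2 * r + quadrant_row, 2 * c + quadrant_col)


# An 8 x 8 test image whose pixel value encodes its position.
image = [[r * 8 + c for c in range(8)] for r in range(8)]
tiles = build_pyramid(image)
first_click = zoom((0, 0, 0), 1, 0)     # lower-left quadrant at 2:1
second_click = zoom(first_click, 0, 1)  # one of its quadrants at full scale
```

Because every tile has the same pixel size, the viewer window never changes; only the region it covers shrinks with each click.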
- Proxies are large storage devices, located as hubs within networks, used as large caches for data being delivered from servers to clients.
- MCUs are large storage devices used as caches for data being delivered from broadcasting stations to viewers, such as cable TV.
- the proxy or MCU stores the data in a central hub so that it is available for delivery at a high bandwidth if requested again by any of the clients connected to the hub. It plays a similar role to paging files on a local computer disk, but on a much larger scale and for a much larger clientele.
- the scalable representation of the subject invention is particularly well suited for proxies and MCUs which operate in asynchronous environments.
- Server/client connections and broadcast transmissions can be of many different bandwidths, and so the proxy or MCU can be accumulating versions of the same multimedia data corresponding to different qualities. Without scalability these versions are all independent of one another, and cannot be combined to achieve quality levels other than those originally preset or combined to save space.
- the proxy or MCU can be optimized to cache the various progressive building blocks. This affords great flexibility in being able to create versions of different quality levels, and reduces the space requirements.
- a first user with a low bandwidth connection to the server who demands a multimedia file downloads data block B_1, which is then also saved to cache on the proxy, and the user receives a low quality version (quality level 1) of the media.
- a second user with a higher bandwidth connection to the server who demands the same multimedia file can download data blocks B_2 and B_3 from the server, and can access block B_1 directly from the proxy.
- the three data blocks are integrated and the second user receives a very high quality version (quality level 3) of the media.
- Data blocks B_2 and B_3 would then also be stored on the proxy.
- a third user with a direct connection to the proxy of bandwidth
- the proxy or MCU inherits the scalability from the servers, giving it a great deal more flexibility in its media delivery to the clients than would be possible in a non-scalable environment.
- VRML transmission: Virtual reality modeling language (VRML) is a descriptive language for representing and rendering three-dimensional objects.
- the objects are modeled as collections of polygonal elements, the description of which forms a VRML database.
- the individual elements are processed and the desired view of the object is rendered into a raster bitmap for display.
- the VRML representation is rich enough to encapsulate all possible views of the object. In fact, there is an infinity of possible variations in viewing parameters.
- a user interacts with the VRML object by adjusting viewing parameters, through mouse clicks and keyboard presses.
- VRML was first popularized by Silicon Graphics. Their top Iris workstations, for example, can render on the order of a million polygonal elements per second.
- VRML images are characterized by their sharp photo-realistic attributes.
- the present invention can be applied to efficiently deliver VRML imagery over a server/client network, for on-line interaction.
- two problems arise when transmitting VRML databases over a network for on-line interaction.
- First there is the bandwidth limitation, which inhibits the rate of transmission.
- the present invention can be used to mitigate the problem by allowing the rendering to be done on the server computer without requiring enormous memory, and yet enable the client to freely interact with the VRML object in an on-line interactive setting.
- This is one of many examples involving real-time encoding.
- the invention operates by receiving the viewing parameters from the user, rendering the corresponding image on the server into a raster bitmap image, encoding the bitmap into progressive partial frames and inserting them into a two-dimensional server database.
- the encoded data within the server database is continually streamed from server to client, enabling the client to begin viewing a low quality image as soon as the first partial frame data arrives.
- additional bitmaps are rendered, encoded and inserted into the server database.
- the server does not need to render the same bitmaps again. Rather, the streaming simply continues in background, and the quality of the image on the client side is enhanced as additional partial frames are integrated. Similarly, if the user stays focused on a single view, then the bitmap being displayed is enhanced as additional partial frames stream in. Once all of the partial frames are integrated, the image has the same sharp photo-realistic quality as is characteristic of VRML images. On the other hand, the user does not have to wait for all of the data to arrive in order to interact with the object, nor does the client computer have to do the intensive processing to render the VRML database into bitmaps.
- Fig. 1 shows a block diagram of a system for providing on-line virtual reality (VR) movies.
- the system includes a production workstation 3 for receiving input images and processing same, as will be described hereinafter.
- Such input images may be constituted by photographs which are scanned into the workstation.
- the output from production workstation 3, being a raw VR movie, is fed into an encoder 5 for preparing the movie for transmission and in turn, applied to a server 7 essentially used for storage and transmission of the movie to clients, namely, subscribers or user units 9.
- Seen is a transceiver 34, an asynchronous memory/database 35, a decoder 36 and a user's workstation 40.
- Typical operation of a preferred embodiment of the present invention is now described with reference to Figs. 1 and 2.
- Selected images are introduced in production workstation 3 in which the VR movie is produced in accordance with a certain script.
- the producer at the workstation determines the number and size (for example, in bytes) of the partial frames and also defines the various available interactions between the frames by defining hot spots and objects at 21, using auxiliary standard devices for producing movies, such as a keyboard, a mouse, speakers and a CPU all designated by 23.
- the product obtained is a raw VR movie, which is a complete VR movie that has not been reformatted for transmission.
- the preparation of the movie for transmission is effected in encoder 5 where partial frames are generated through an iterative process.
- the partial frames are generated by encoder 5 as controlled by the controller 25, as follows A partial resolution frame or a partial resolution slice of each frame of the VR movie sequence is generated.
- One example of such a partial resolution frame is sub-sampled scan lines, e.g., the removal of every 10th line of a 150 line frame, or a compression encoded frame, which partial resolution results in a blurry display.
- the partial resolution frame is then subtracted from the original frame by a partial frame subtractor 27, yielding a residual frame or a remainder frame. This process can be repeated on the residual frame, generating a second partial resolution frame.
- the procedure is also repeated time and time again, until the number (which is determined by the producer) of partial resolution frames is generated.
- the net result is a set of partial resolution frames that can be recombined into the original full resolution frame.
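The iterative subtract-and-repeat procedure can be illustrated with a small sketch. It is not the encoder of the patent: a frame is reduced to a flat list of integer samples, and a coarse quantizer stands in for the sub-sampling or compression step that produces each partial resolution frame. The function names and the step sizes are assumptions chosen for the example.

```python
def quantize(samples, step):
    """Coarse approximation: round each sample down to a multiple of `step`."""
    return [(v // step) * step for v in samples]


def encode_partial_frames(frame, steps):
    """Split `frame` into one partial frame per quantization step.

    Each pass approximates what is left of the frame, then subtracts that
    approximation to obtain the residual for the next pass. The last step
    should be 1 so that the final residual is captured exactly.
    """
    residual = list(frame)
    partials = []
    for step in steps:
        partial = quantize(residual, step)
        partials.append(partial)
        residual = [r - p for r, p in zip(residual, partial)]
    return partials


def integrate(partials):
    """Sum the partial frames received so far into one displayable frame."""
    return [sum(samples) for samples in zip(*partials)]


frame = [200, 37, 129, 64]
partials = encode_partial_frames(frame, steps=(64, 8, 1))
coarse = integrate(partials[:1])   # after the first partial frame only
better = integrate(partials[:2])   # after two partial frames
exact = integrate(partials)        # all partial frames recombined
```

With these inputs the first partial frame alone gives a rough version of the frame, the second brings it closer, and the sum of all three reproduces the original samples exactly.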
- the partial resolution frames are transmitted.
- the order of transmission follows the script given by the producer, and commonly, the first partial resolution frame of each frame is transmitted, followed by the second partial resolution frame, and so on. This sequence of transmission allows for the whole sequence to be viewed in a partial resolution format that progressively comes into focus.
- Partial resolution frames may be optionally compression encoded, possibly taking into account similarities between various frames. This is effected in the compressor 29.
- Encoded and compressed VR movie parts are passed to a server 7 where the movie parts are stored in a database 31 and transmitted to a user's unit 9 part by part, by means of a transceiver 33.
- a user's transceiver 34 receives movie parts and transmits requests for additional information.
- a user's database 35 is progressively updated with requested images or, alternatively, may be progressively updated by the server 7.
- User's database 35 functions asynchronously, supplying the frames to the user via a decoder 36 by request independent of data transmission.
- Upon receiving the frames, the decoder initially decompresses the frames as indicated at 37 (if compression took place) and then decodes and recombines them by means of a partial frame integrator 38. Following this, the partial frames are stored in the user's database where the frames may be stored in a compressed format, effected by a compressor 39.
- a user's workstation 40 enables the user to view and interact with the VR movie.
- the user utilizes the workstation for sending requests for images to the decoder which retrieves (and decompresses, if necessary), the images from the user's database and sends requests for particular images which may not yet have been transmitted to the server database. Furthermore, the user's workstation actuates any script produced in the production workstation 3. Hence the user's workstation also includes the standard devices included in the workstation 3 and designated by the number 23 (Fig. 1).
- the progressive scalable database which is a subject of the current invention preferably is stored in three databases within the server/client system.
- Fig. 3A shows a preferred database structure in accordance with a preferred embodiment of the present invention. It is a particular feature of the present invention that three databases are employed: a server database 41, which is arranged in a serial form and contains multiple data blocks, each including multiple partial frame data; a client database 42, which is arranged in a two-dimensional structure, conceptually illustrated in Fig. 3B; and an interactive database 43, which contains a single data block including multiple frames and is dynamically updated from the client database 42.
- the client database could be eliminated.
- Server database 41 which is archived on a server constitutes a first database of the progressive scalable database.
- the server database 41 includes a plurality of data blocks in encoded form.
- the progressive scalable database is two-dimensional in nature. It has a progressive dimension indexed by block number, and an interactive dimension indexed by frame number but it is serialized for streaming and can only be accessed sequentially.
- Server database 41 is streamed from server to client via the transmission and buffering protocol of the Internet browser.
- Client database 42 the second database, is built up on the client side as the information streams in, to mirror the server database 41.
- Client database 42 is truly two-dimensional, with random access capability within the data blocks.
- the data blocks within it are also in encoded form.
- Interactive database 43 the third database, is created by decoding the data from the client database 42.
- This interactive database 43 is one-dimensional, and contains only one sequence of frames, but it is dynamically updated. As additional block data is integrated, these frames are updated, with the previous versions over-written.
- Interactive database 43 is controlled by the user interface through keyboard presses and mouse clicks
- the creation and update of interactive database 43 from client database 42 is done in background time slices, while the client CPU is idle
- Interactive database 43 may store the frames in either raw bitmap form or in an intermediate compressed form, as long as the intermediate compression is such that the frames can be decompressed in real time for display
- An advantage of using an intermediate compression is to confine interactive database 43 to internal RAM, which has fast access time, rather than swap to hard disk memory, which has slow access time
- the swapping in itself is a drain on processing speed.
- When the user requests a frame to be displayed, interactive database 43 displays that frame immediately, if it is available. In case the frame is not available, interactive database 43 passes a message back to client database 42 requesting that frame. Client database 42 accesses the specific frame requested from its first encoded data block, if it is available, and sends it to the decoder for decompression and integration, and subsequent incorporation into interactive database 43. Once a frame is incorporated, interactive database 43 displays the frame at once. If client database 42 has not yet received the requested frame from the server stream, then it must wait until the encoded frame arrives, since the streaming is sequential. If the streaming were instead random access, client database 42 would be able to directly request the specific frame it needs from server database 41.
- the server database is two-dimensional but serialized for sequential streaming; the client database is two-dimensional with random access within blocks; and the interactive database is one-dimensional but dynamically updated. In the interactive database, the progressive database dimension is actually being represented as time rather than space.
- This "three database strategy," using three different databases, (i) two-dimensional serialized, (ii) two-dimensional, (iii) one space and one time dimension, is a key to the present invention, and to the detailed discussion of the figures which follows.
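The interplay of the three databases can be sketched as follows. This is a toy model, not the system of the patent: encoded items are plain integers, "decoding" is the identity, accumulation is addition, and the class and method names are hypothetical. The server database is represented only by the order in which items are handed to `receive`, i.e. block after block, frame after frame.

```python
class ClientSide:
    """Client database (2-D, encoded) feeding an interactive database (1-D)."""

    def __init__(self, num_frames):
        self.num_frames = num_frames
        self.client_db = {}                        # (block, frame) -> encoded item
        self.interactive_db = [None] * num_frames  # latest version of each frame
        self.integrated = [0] * num_frames         # blocks integrated per frame
        self.received = 0                          # items taken from the stream

    def receive(self, encoded_item):
        """Store the next item of the serialized (block-major) server stream."""
        block, frame = divmod(self.received, self.num_frames)
        self.client_db[(block, frame)] = encoded_item
        self.received += 1

    def integrate_next(self, frame):
        """Decode the next block available for `frame` and accumulate it.

        Returns False when that block has not arrived yet.
        """
        key = (self.integrated[frame], frame)
        if key not in self.client_db:
            return False
        decoded = self.client_db[key]              # stand-in for real decoding
        previous = self.interactive_db[frame] or 0
        self.interactive_db[frame] = previous + decoded  # overwrite old version
        self.integrated[frame] += 1
        return True

    def request(self, frame):
        """Frame to display now, or None if the stream must be awaited."""
        if self.interactive_db[frame] is None:
            self.integrate_next(frame)
        return self.interactive_db[frame]


# Two blocks of three frames each, serialized block by block.
stream = [10, 20, 30, 1, 2, 3]
client = ClientSide(num_frames=3)
before_any_data = client.request(1)
client.receive(stream[0])
client.receive(stream[1])
low_quality = client.request(1)
not_yet_streamed = client.request(2)
for item in stream[2:]:
    client.receive(item)
upgraded = client.integrate_next(1)   # done in background time slices
high_quality = client.request(1)
```

In the interactive database each frame occupies a single slot that is overwritten as blocks are integrated, which is how the progressive dimension turns into time rather than space.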
- progressiveness means quality
- the encoder builds the blocks of the database based on achieving the best quality at given bit rates
- progressiveness means cumulative integration.
- Progressiveness within the client database is a computational property.
- the accumulator on the client computer integrates frames from successive blocks with those from previous blocks.
- progressiveness means time. As time progresses, the frames are dynamically updated as more blocks have been accumulated. The transmission from the server database to the client database is streamed serially, and this is where the progressive dimension is effectively converted from "space" to "time.”
- the progressiveness manifests itself in bandwidth during the streaming.
- the transmission from the client database to the interactive database is asynchronous.
- the client database is being created in the background while the interactive database is being played, and the former acts as a buffer for the latter.
- the client can interact with the media almost immediately after the streaming begins, and does not have to wait for the client database to be constructed entirely.
- the interactive dimension of the database corresponds to whatever functionality the user interface allows. For example, it can manifest itself as frame advance for videos and object movies, navigating for panoramas, and gazing for large still images.
- FIG. 4 illustrates one application of the three database structure described hereinabove in Figs. 3A and 3B.
- each image in Fig. 4 is built up of corresponding partial frames in successive data blocks which are cumulatively received.
- the five images in a first horizontal row correspond to five interactively viewable frames in a first block of data, each successive frame typically illustrating a successive position of an imaged model.
- a limited bandwidth user first receives the first row of images and is immediately able to interact therewith. Over time, depending on the bandwidth available to that user, successive data blocks are received, each cumulatively enhancing the quality. It is a particular feature of the invention that during receipt of successive blocks of data, the user is able to fully interact with the images.
- a production tool 71 which accepts as an input a sequence of original digital frame units 72, integrated into a digital multimedia file 73.
- the production tool 71 includes an encoder unit 74 which operates by partitioning and compressing the digital multimedia file 73 into a scalable progressive database 75 comprised of data blocks 76. Successive blocks combine together to form higher bandwidth versions a to n of the media.
- Database 75 is stored on server 77.
- the production tool 71 enables the producer to control the bandwidth or quality granularity through control parameters 78. These parameters are used to calculate the data block sizes and compression settings within encoder unit 74.
- Fig. 6 showing the structure of data block 76 in scalable progressive database 75.
- If a random access server is available, selected encoded frames 72 from block 76 are accessed at 79 on the server 77 database, based on interactive requests coming from the client.
- the encoded frames are transmitted from server 77 to a client computer 80 and integrated within client database 81, to mirror server database 75. It is appreciated that both sequential and random access servers may be advantageously employed in the present invention, although random access servers are preferred.
- Fig. 7 showing the decoder on a client computer 80.
- the client computer 80 receives from server 77 (Fig. 5) into a buffer 82 a series of data blocks 76 from scalable database 75. As the blocks are received, a client database, which mirrors the server database, is built up.
- a decoder unit 83 decompresses the blocks and an accumulator unit 84 integrates them to form a suitable low quality version 85 of the multimedia file 73 (Fig. 5), which is stored in a buffer 86, thus building up an interactive database.
- the operations of the decoder 83 and accumulator 84 are governed by a CPU 87. They may operate in either order; i.e., the decoding may be carried out before the accumulation, or the accumulation may be carried out before the decoding.
- the multimedia file 73 is played on a player unit 88 in response to interactive user commands. As the user interactively requests specific frames to be played, the buffer 86 supplies the highest quality version which it possesses. If the desired frame is not available, the buffer 86 sends back a request to buffer 82 to decode and accumulate that frame. If the frame is also not available in buffer 82, then that buffer 82 sends back a request to the server database 75 to transmit the frame. As playback continues and the bandwidth frees, additional data blocks 76 are received and integrated with the previously received blocks into higher quality versions 89 of the multimedia file.
- The description of Figs. 1 - 7 has been directed towards the overall system and method provided by the present invention. The description which follows is directed principally to particular applications of the system and method described hereinabove.
- the progressive database 75 comprises data blocks, where the first ones of data blocks 76 are used to create a preview 90 of the video, and the second ones of data blocks 76 are accumulated with the first blocks to create a full view 91 of the video.
- the views are stored in the interactive database buffer 86 and played in player unit 88, in response to interactive user commands.
- FIG. 9 showing a system for incorporating a progressive scalable database into a time-based video frame sequence with a macro and micro time scale, such as the one used in Apple's QUICKTIME® movies, in accordance with a preferred embodiment of the present invention.
- Individual frames 72 are arranged according to a macro time scale, denoted by major axis markings 92 in Fig. 9. Each frame is displayed at the respective macro times indicated by markings 92.
- the major time scale can be displaying a bird flying
- the minor time scale can be used to add fluttering to the bird's wings.
- the fluctuations typically involve only a small portion of the image area, and are displayed in rapid succession, according to minor axis markings 93 in Fig. 9.
- Such a time based sequence allows the decoder, which does the intensive processing to supply the frames 72, to run at a slow rate; e.g. 3 fps, whereas the viewer, doing the less intensive processing, can be playing at a fast rate; e.g. 30 fps.
- the fluctuations must be simple enough, though, that they can be rendered at the full 30 fps rate.
- the present invention includes a novel use of such a two-scale time based system, to enable progressive streaming.
- the minor time scale 93 is used to display progressive versions of a frame, rather than to display fluctuations, as originally conceived in the prior art.
- the player displays the latest version of the frame which is available. For example, in Fig. 9 the low quality version 94 of frame #4, corresponding to the first block 76, is displayed over a duration of three minor axis marks, by which time the second block 76 has been accumulated to form the medium quality version 95.
- This medium quality version is then displayed over a duration of two minor axis marks, by which time the third block 76 has been accumulated to form the high quality version 96 of the media.
- This high quality version is then displayed for a duration of three further minor axis marks, following which the frame advances to frame #5.
- the cycle then repeats for progressive display of frame #5. If additional blocks arrive and are accumulated for frame #4, they are displayed when the video sequence is replayed.
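The display schedule for frame #4 can be reproduced with a short sketch. It assumes that the major time unit is divided into a fixed number of minor ticks and that the tick at which each block finishes accumulating is known; the function name and the tick values are illustrative, not taken from Fig. 9 itself.

```python
def versions_displayed(block_ready_ticks, ticks_per_frame):
    """Quality version on screen at each minor tick of one major time unit.

    block_ready_ticks[k] is the minor tick at which block k + 1 has been
    accumulated for this frame (assumed non-decreasing, since blocks are
    integrated in order). The value returned for a tick is the number of
    blocks integrated into the version shown; 0 means nothing to show yet.
    """
    shown = []
    for tick in range(ticks_per_frame):
        available = sum(1 for ready in block_ready_ticks if ready <= tick)
        shown.append(available)
    return shown


# Block 1 ready at once, block 2 after three ticks, block 3 after five.
schedule = versions_displayed([0, 3, 5], ticks_per_frame=8)

# A third block that arrives only after the frame has advanced is not
# shown in this pass; it would appear when the sequence is replayed.
late_block = versions_displayed([0, 3, 12], ticks_per_frame=8)
```

The first call yields three ticks of the low quality version, two of the medium and three of the high, matching the durations described for frame #4.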
- the progressive dimension of the database can be incorporated within the interactive dimension, through the use of a minor time scale, which is situated within the major time scale used for advancing the frames.
- the major time scale is the interactive axis, and the minor time scale becomes the progressive axis.
- the overall accomplishment is to enable viewing of the video before the full media stream has been downloaded. Initially, the low quality frames are displayed, and while additional blocks are downloaded and accumulated in background, the quality of the frames steadily improves.
- For the user to be able to view the video immediately, without waiting for the entire file to download, a production tool must store the encoded video in the order of successive blocks. Once the first block is downloaded, the video can already be viewed.
- the order of the encoded data items must therefore be:
- each row in this order comprises one entire block.
- each partial frame must be handled as if it were an entire frame. That is, the production tool must treat the movie as if there were a total of m × n distinct frames being encoded. The player in turn, however, must know that although it is receiving what appears to be m × n encoded frames, there are really only a total of m frames. It must decode and accumulate every successive sequence of m data items (i.e., each row of the above sequence) with the previous ones, to update the frames.
- the frames 72 are supplied in sequence to the production tool, which produces a series of partial blocks 76 for each frame.
- the partial blocks are reordered, as indicated by the mapping in Fig. 10, into a single file stream 97.
- the first partial blocks of each frame form the first m data units in the file 97
- the second set of partial blocks of each frame form the next m data units, etc.
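The reordering and the player's matching interpretation can be sketched as below. The sketch assumes that the codec returns, for each of the m frames, a list of n partial items, and it uses integers with addition as a stand-in for decoding and accumulation; the function names are hypothetical.

```python
def flatten(frame_major):
    """Reorder frame_major[i][k] (frame i, partial k) into block-major order.

    The result lists the first partial of every frame, then the second
    partial of every frame, and so on, as a single stream.
    """
    num_blocks = len(frame_major[0])
    return [frame[k] for k in range(num_blocks) for frame in frame_major]


def accumulate_stream(stream, num_frames):
    """Player side: every run of `num_frames` items is one block.

    Each item is accumulated into the frame it belongs to; the stream may
    be cut off at any point and the frames received so far are returned.
    """
    frames = [0] * num_frames
    for index, item in enumerate(stream):
        frames[index % num_frames] += item
    return frames


# m = 2 frames, each encoded as n = 3 partial items.
encoded = [[100, 20, 3],
           [400, 50, 6]]
stream = flatten(encoded)
after_first_block = accumulate_stream(stream[:2], num_frames=2)
after_all_blocks = accumulate_stream(stream, num_frames=2)
```

The player only needs to know m: the position of an item modulo m identifies its frame, so the stream looks like m × n ordinary frames to the existing interface.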
- Fig. 11 shows a proxy system 98 for caching in a central hub 99 multimedia data which streams from servers 77 to clients 80.
- the servers 77 store their multimedia data in the progressive scalable representation described above in the server database 75, with each media encoded into data blocks 76 B_1, B_2, ..., B_n
- a proxy computational unit 100 first determines, based on the server bandwidth, which data blocks are to be transmitted to the client. If those data blocks are not already cached in the hub, then the proxy 98 retrieves the required blocks 76 from the server along low bandwidth communication channels 101, at the appropriate bandwidth, and delivers them to the client 80 along high bandwidth communication channels 102. The blocks 76 are also cached in the hub 99.
- an inventory flag database 103 on the hub which keeps a record of which data blocks are available. If some of the required data blocks 76 are already cached, then the proxy computational unit 100 computes which data blocks must be delivered from the server 77 in order to transmit to the client the highest quality version of the media possible, within the bandwidth constraint. The proxy 98 then retrieves the required blocks 76 from the server 77 along low bandwidth communication channels 101, stores them in its cache and delivers them to the client 80 along high bandwidth communication channels 102. A decoder unit 83 on the client computer decodes the data blocks received, and an accumulator unit 84 integrates them.
- the proxy 98 may also have its own decoder unit 83 and accumulator unit 84, which converts the data from its original compressed form on the server database 77 to an intermediate compressed form on the hub database 104, one which is faster to decompress than that of the client database.
- the proxy accumulation unit 84 may perform the necessary data block accumulation to store on the hub all possible versions of the multimedia in its intermediate compressed form, in which case the client accumulator unit 84 is unnecessary.
- An update communication line 105 links servers 77 to the proxy 98, through which servers 77 can notify proxy 98 if any of the multimedia files have been updated. If proxy 98 receives such notification, then it clears its cache of any data blocks associated with those updated files, and resets its inventory flags 103, so that upon future client requests it will know that it has to retrieve the updated files from the server again.
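The caching and invalidation behaviour of the proxy can be sketched as below. This is a simplified model under stated assumptions: blocks are keyed by a hypothetical `(media_id, block_index)` pair, a Python set stands in for the inventory flag database 103, and `fetch_from_server` is a caller-supplied placeholder for the low bandwidth channel 101. None of these names appear in the patent.

```python
from typing import Callable, Dict, List, Set, Tuple

BlockKey = Tuple[str, int]  # (media id, block index); an assumed keying scheme


class ProxyCache:
    """Hub-side cache that fetches only the blocks it does not already hold."""

    def __init__(self, fetch_from_server: Callable[[str, int], bytes]) -> None:
        self._fetch = fetch_from_server
        self._blocks: Dict[BlockKey, bytes] = {}
        self._inventory: Set[BlockKey] = set()  # stands in for inventory flags 103

    def get_blocks(self, media_id: str, wanted: List[int]) -> List[bytes]:
        """Return the wanted blocks, retrieving missing ones from the server."""
        out = []
        for idx in wanted:
            key = (media_id, idx)
            if key not in self._inventory:
                self._blocks[key] = self._fetch(media_id, idx)
                self._inventory.add(key)
            out.append(self._blocks[key])
        return out

    def invalidate(self, media_id: str) -> None:
        """Handle an update notification: drop cached blocks and reset flags."""
        stale = [k for k in self._inventory if k[0] == media_id]
        for key in stale:
            self._inventory.discard(key)
            self._blocks.pop(key, None)
```

A second request for blocks 0 to 2 after a first request for blocks 0 and 1 only contacts the server for block 2; after `invalidate`, every block of that media is fetched again on demand.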
- the sizes of the compressed data blocks are such that the first consecutive compressed data blocks B_1, B_2, ..., B_k, when transmitted at the corresponding bandwidth, suffice to enable on-line playback of the media version.
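One way to read the block-size condition is as a bit budget: the first k blocks must fit within what the channel can carry during playback. The sketch below rests on that assumption (budget = bandwidth × clip duration), which the source does not spell out; `blocks_for_bandwidth` and its parameters are hypothetical names.

```python
from itertools import accumulate
from typing import Sequence


def blocks_for_bandwidth(block_sizes_bits: Sequence[int],
                         bandwidth_bps: float,
                         duration_s: float) -> int:
    """Return the largest k such that blocks 1..k fit in the bit budget.

    The budget is modelled as bandwidth * duration, an assumed simplification.
    """
    budget = bandwidth_bps * duration_s
    k = 0
    for cumulative in accumulate(block_sizes_bits):
        if cumulative > budget:
            break
        k += 1
    return k
```

For a 10-second clip with block sizes of 280,000, 280,000 and 560,000 bits, a 28.8 kbit/s channel can play back from the first block only, a 56 kbit/s channel from the first two, and a 128 kbit/s channel from all three.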
- Fig. 12 shows a system for generating a progressive scalable database from digital media data by cascading compressors in tandem.
- the user selects bit-rate and quality control parameters 106 for the encoding.
- the original multimedia data file 73 is input to a compressor 107 along with the user-selected control parameters 106, resulting in compressed data 108 adapted to a user-selected bandwidth.
- the compressed data 108 is transmitted to the scalable database 75 as the first data block 76. It is also transmitted to decompressor 109, which reconstructs the media as it would be generated on the client side.
- the reconstructed data 110 is subtracted from the original data 73 to arrive at a residual 111.
- the residual 111 is fed back to the compressor 107 in a feedback loop 112, and compressed by compressor 107 with bit-rate control so that the compressed data 108 is adapted to the difference between the first and second user-selected bandwidths.
- the compressed data is transmitted to the progressive scalable database 75 as the second data block 76, and the loop continues repeatedly until the user-selected final quality is achieved.
- it is not necessary for the compressor 107 to use the same compression method each time it operates. Rather, it can use a block identifier 113 as a parameter for switching between methods.
- the first block could be encoded using a low quality version of H.263 and successive blocks could be encoded using spatial vector quantization and temporal wavelets.
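The compress, reconstruct, subtract and re-compress loop of Fig. 12 can be illustrated with a toy model. In this sketch uniform quantization stands in for the real compressor 107, and a list of quantization steps stands in for the user-selected bit-rate controls; both are assumptions chosen to keep the example short, not the codecs named in the patent.

```python
from typing import List, Sequence


def compress(samples: Sequence[int], step: int) -> List[int]:
    """Toy lossy compressor: uniform quantization with the given step."""
    return [round(s / step) for s in samples]


def decompress(codes: Sequence[int], step: int) -> List[int]:
    """Inverse of the toy compressor, as the client would run it."""
    return [c * step for c in codes]


def encode_progressive(original: Sequence[int],
                       steps: Sequence[int]) -> List[List[int]]:
    """Produce one data block per step, each coding the remaining residual."""
    blocks = []
    residual = list(original)
    for step in steps:
        codes = compress(residual, step)
        blocks.append(codes)
        reconstructed = decompress(codes, step)
        # What the client still lacks after receiving this block.
        residual = [r - q for r, q in zip(residual, reconstructed)]
    return blocks


def decode_progressive(blocks: Sequence[Sequence[int]],
                       steps: Sequence[int]) -> List[int]:
    """Accumulate the decoded blocks received so far."""
    total = [0] * len(blocks[0])
    for codes, step in zip(blocks, steps):
        total = [t + d for t, d in zip(total, decompress(codes, step))]
    return total
```

Encoding `[13, 250, 97, 64]` with steps `[32, 8, 1]` gives three blocks; decoding only the first yields a coarse version, and each further block reduces the error until the final step of 1 reproduces the input exactly.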
- the progressive JPEG standard allows the encoder to segment the compression into spectral selection and successive approximation scans. In spectral selection the DCT coefficients are grouped into spectral bands, and in successive approximation the bits used to represent them are divided into lower and higher precision information.
- Progressive JPEG is described by Pennebaker, W. B. and Mitchell, J. L. in JPEG: Still Image Data Compression, Van-Nostrand Reinhold, New York, 1993, the disclosure of which is hereby incorporated by reference.
- Fig. 13 shows a decoder for the scalable database 75 on the client side, corresponding to the encoder from Fig. 12.
- Blocks 76 are successively transmitted from the server to the client and are decompressed in decoder 83.
- the decoded data is integrated in an accumulator 84, which converts it into a form compatible with the player 88, and stores it in the interactive buffer 86. While the player is showing the media, additional blocks 76 are received in the background, decoded and accumulated, so that the media quality is upgraded when it is replayed.
- the player 88 requests frames from buffer 86, and if they are not available then buffer 86 requests them from database 75
- a block identifier 113 provides an input to the decoder as each block is decoded, so that the decoder can apply the appropriate decompression method for that data block, corresponding to the compression method which was performed in compressor 107
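A decoder that switches decompression method per block, followed by an accumulator, might look like the sketch below. The two toy payload formats, the integer method identifiers and the `DECODERS` table are all assumptions made for illustration; the patent only states that a block identifier selects the matching decompression method.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Decoder = Callable[[bytes], List[int]]


def decode_raw(payload: bytes) -> List[int]:
    """Toy method 0: each byte is an unsigned sample value."""
    return list(payload)


def decode_offset(payload: bytes) -> List[int]:
    """Toy method 1: each byte is a signed refinement stored with bias 128."""
    return [b - 128 for b in payload]


# Hypothetical mapping from block identifier to decompression method.
DECODERS: Dict[int, Decoder] = {0: decode_raw, 1: decode_offset}


class Accumulator:
    """Integrates successive decoded blocks into the current frame data."""

    def __init__(self, size: int) -> None:
        self.frame = [0] * size

    def add(self, decoded: List[int]) -> None:
        self.frame = [a + b for a, b in zip(self.frame, decoded)]


def receive(blocks: Iterable[Tuple[int, bytes]], size: int) -> List[int]:
    """Decode each (identifier, payload) block and accumulate the result."""
    acc = Accumulator(size)
    for block_id, payload in blocks:
        acc.add(DECODERS[block_id](payload))
    return acc.frame
```

Receiving a raw block `[10, 20, 30]` followed by a refinement block of offsets `[+1, -1, 0]` leaves `[11, 19, 30]` in the accumulator.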
- the first encoded frame unit of a data block consists of the full image sub-sampled 4:1 in each dimension.
- the second set of frames consists of four encoded frame units corresponding to each of the quadrants of the full image, sub-sampled at 2:1.
- although this second set of frames appears to be four times the size of the first frame, it can be stored using only three times as much data, since the decoder can also accumulate data from the previously displayed frame.
- the third set of frames includes sixteen encoded frame units, each corresponding to a quadrant of a quadrant of the full image, but at the original scale. Again, this third set of frames stores three times as much data as the second set.
- These three sets of frames comprise the entire encoded data block, and their sum total is, of course, the same amount of data as the full image at the original scale. (The effect of the compression is being ignored here.)
- the frames are all arranged sequentially in the encoded data block 76
- the mapping from multi-resolution image tiles to sequential frames 72 is shown in
- the client CPU looks for the desired frame #2 in its interactive database. If the frame is already present, it is delivered to the viewer for immediate display; otherwise the interactive buffer sends the request back to the progressive database.
- the progressive database can access and send the specific encoded frame required from the second set of frames, without sending all of the frames.
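The size bookkeeping for the three sets of frames can be checked with a short calculation, together with one possible way to address a single tile in the sequential block. Two assumptions are made here: sizes are expressed as fractions of the full-resolution image with compression ignored, and tiles within a level are numbered in row-major order. The ordering and the names `level_sizes` and `frame_index` are illustrative and are not taken from the patent figures.

```python
from fractions import Fraction
from typing import List, Tuple


def level_sizes(levels: int = 3) -> List[Tuple[int, Fraction, Fraction]]:
    """Return (tile count, apparent size, stored size) for each level.

    Sizes are fractions of the full image. The stored size of a level is its
    apparent size minus what the previous level already supplied.
    """
    out = []
    previous = Fraction(0)
    for level in range(levels):
        factor = 2 ** (levels - 1 - level)      # sub-sampling per dimension
        apparent = Fraction(1, factor * factor)
        out.append((4 ** level, apparent, apparent - previous))
        previous = apparent
    return out


def frame_index(level: int, row: int, col: int) -> int:
    """Sequential index of a tile, assuming row-major order within a level."""
    side = 2 ** level
    if not (0 <= row < side and 0 <= col < side):
        raise ValueError("tile coordinates out of range for this level")
    return (4 ** level - 1) // 3 + row * side + col
```

With three levels this gives 1, 4 and 16 tiles with stored sizes 1/16, 3/16 and 3/4 of the full image, which sum to exactly 1. The second set stores three times the first, and the third set stores three times the apparent size of the second (3/4 against 1/4), which is one reading of the "three times" statement above. Under the assumed ordering, the 21 frames are indexed 0 to 20, so a request for one quadrant tile maps to a single frame index.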
- FIG. 15 illustrates a system for transmitting VRML images over a server/client network.
- a VRML database 121 is stored on a server.
- a client 80 interactively controls the VRML viewing parameters 122 through use of a mouse and keyboard 23.
- the viewing parameters 122 are used in conjunction with the VRML database 121 to render a raster bitmap image 123 of the VRML object on the server computer. If the bitmap image corresponding to the viewing parameters was already rendered previously, then the frame data for the bitmap in the client database 42 is used for display by the player 88.
- the raster bitmap is encoded into partial frames by encoder 74, and the partial frames are inserted into a server two-dimensional database 41.
- the server database is continually streamed to the client, building up the client database 42.
- the client database 42 is used to provide the frames for display.
- Fig. 16 depicts a flowchart for the VRML application shown in Fig. 15.
- the user interacts with the mouse and keyboard, and the client computer updates the viewing parameters
- the client computer checks whether or not those viewing parameters have already been processed
- since the viewing parameters can vary continuously, a preferred embodiment of the present invention discretizes them to a finite number of settings; for example, 10° resolution for angles. This makes it likely that the user will navigate back to the same settings used earlier.
- if the viewing parameters are new, then they are sent to the server computer, which renders the VRML database into a raster bitmap corresponding to the specific viewing parameters selected, at step 133.
- the bitmap is encoded into partial frames, and incorporated into the server database.
- Step 135 is continually operative to transmit additional encoded data from the server to the client
- the client database is built up at step 136
- the client database generates the latest version of the bitmap on demand, and displays it. This step is also carried out whenever step 132 results in confirmation that the viewing parameters have already been processed.
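The client-side part of this flow (discretize the viewing parameters, check whether they were already processed, otherwise ask the server to render) can be sketched as follows. The sketch is deliberately simplified: it treats each rendered view as a single byte string rather than a progressively streamed set of partial frames, models the view as two angles only, and uses a caller-supplied `render_on_server` placeholder. All names are hypothetical.

```python
from typing import Callable, Dict, Tuple

View = Tuple[int, int]  # (yaw, pitch) in whole degrees; an assumed parameter set


def discretize(yaw_deg: float, pitch_deg: float, step: int = 10) -> View:
    """Snap continuous angles to a finite grid, e.g. 10 degree resolution."""
    return (round(yaw_deg / step) * step % 360,
            round(pitch_deg / step) * step % 360)


class ViewClient:
    """Keeps already rendered views so repeated settings need no server trip."""

    def __init__(self, render_on_server: Callable[[View], bytes]) -> None:
        self._render = render_on_server
        self._frames: Dict[View, bytes] = {}

    def show(self, yaw_deg: float, pitch_deg: float) -> bytes:
        key = discretize(yaw_deg, pitch_deg)
        if key not in self._frames:
            # New viewing parameters: the server renders and returns the view.
            self._frames[key] = self._render(key)
        return self._frames[key]
```

Because 12° and 8.5° both snap to 10°, the second call is served from the client database without contacting the server, which is the effect the discretization is meant to achieve.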
- Fig. 17 illustrates a typical two-dimensional array of CAD/CAM images, indexed vertically according to progressive coordinate and horizontally according to interactive coordinate. It is appreciated that the images in a given horizontal row may be viewed interactively. Each successive horizontal row of images is built up over time at a rate determined by bandwidth availability and has increased quality inasmuch as it is based on an increasing number of data blocks.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Human Computer Interaction (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Applications Claiming Priority (7)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
IL11713396 | 1996-02-14 | ||
IL11713396A IL117133A (en) | 1996-02-14 | 1996-02-14 | Method and system for providing on-line virtual reality movies |
IL11965596A IL119655A (en) | 1996-11-20 | 1996-11-20 | Method and system for scaleable representation of multimedia data for progressive asynchronous transmission |
IL11965596 | 1996-11-20 | ||
US78883097A | 1997-01-06 | 1997-01-06 | |
US788830 | 1997-01-06 | ||
PCT/IL1997/000055 WO1997030551A1 (fr) | 1996-02-14 | 1997-02-13 | Procede et systemes de transmission asynchrone et progressive de donnees multimedia |
Publications (2)
Publication Number | Publication Date |
---|---|
EP0886968A1 (fr) | 1998-12-30 |
EP0886968A4 (fr) | 1999-09-22 |
Family
ID=27271754
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
EP97902559A Withdrawn EP0886968A4 (fr) | 1996-02-14 | 1997-02-13 | Procede et systemes de transmission asynchrone et progressive de donnees multimedia |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0886968A4 (fr) |
JP (1) | JP2000504906A (fr) |
AU (1) | AU1616597A (fr) |
WO (1) | WO1997030551A1 (fr) |
Families Citing this family (32)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6331869B1 (en) * | 1998-08-07 | 2001-12-18 | Be Here Corporation | Method and apparatus for electronically distributing motion panoramic images |
US6721952B1 (en) | 1996-08-06 | 2004-04-13 | Roxio, Inc. | Method and system for encoding movies, panoramas and large images for on-line interactive viewing and gazing |
JP3407287B2 (ja) | 1997-12-22 | 2003-05-19 | 日本電気株式会社 | 符号化復号システム |
IL127793A0 (en) * | 1998-05-28 | 1999-10-28 | Ibm | Internet server |
FI113124B (fi) | 1999-04-29 | 2004-02-27 | Nokia Corp | Tiedonsiirto |
CA2280662A1 (fr) * | 1999-05-21 | 2000-11-21 | Joe Toth | Serveur de media a compression evolutive multidimensionnelle des donnees |
AU5484200A (en) * | 1999-06-18 | 2001-01-09 | Intel Corporation | Systems and methods for enhanced visual presentation using interactive video streams |
US6314452B1 (en) * | 1999-08-31 | 2001-11-06 | Rtimage, Ltd. | System and method for transmitting a digital image over a communication network |
US7028096B1 (en) * | 1999-09-14 | 2006-04-11 | Streaming21, Inc. | Method and apparatus for caching for streaming data |
WO2001041437A2 (fr) * | 1999-12-03 | 2001-06-07 | Ourworld Live, Inc. | Systemes d'acces aux consommateurs et leurs procedes de fourniture |
IL148431A0 (en) | 1999-12-30 | 2002-09-12 | Swisscom Mobile Ag | Method for the transmission of image data |
WO2001097520A2 (fr) * | 2000-06-15 | 2001-12-20 | France Telecom | Installation d'interface video, systeme de distribution et procede permettant de transferer des programmes et sequences video codes via un reseau longue distance |
US6766376B2 (en) | 2000-09-12 | 2004-07-20 | Sn Acquisition, L.L.C | Streaming media buffering system |
US8595372B2 (en) | 2000-09-12 | 2013-11-26 | Wag Acquisition, Llc | Streaming media buffering system |
US7716358B2 (en) | 2000-09-12 | 2010-05-11 | Wag Acquisition, Llc | Streaming media buffering system |
US8924506B2 (en) | 2000-12-27 | 2014-12-30 | Bradium Technologies Llc | Optimized image delivery over limited bandwidth communication channels |
US20020107988A1 (en) * | 2001-02-05 | 2002-08-08 | James Jordan | In-line compression system for low-bandwidth client-server data link |
FR2826823B1 (fr) | 2001-06-27 | 2003-10-10 | Canon Kk | Procede et dispositif de traitement d'un signal numerique code |
FR2831728B1 (fr) | 2001-10-25 | 2004-03-12 | Canon Kk | Procede et dispositif de formation d'un signal numerique derive a partir d'un signal numerique compresse |
US20030098869A1 (en) * | 2001-11-09 | 2003-05-29 | Arnold Glenn Christopher | Real time interactive video system |
US20060235883A1 (en) | 2005-04-18 | 2006-10-19 | Krebs Mark S | Multimedia system for mobile client platforms |
FR2893470B1 (fr) | 2005-11-16 | 2008-02-15 | Canon Res Ct France Soc Par Ac | Procede et dispositif de creation d'une sequence video representative d'une sequence video numerique et procedes et dispositifs de transmission et reception de donnees video associes |
US8099476B2 (en) | 2008-12-31 | 2012-01-17 | Apple Inc. | Updatable real-time or near real-time streaming |
US8805963B2 (en) | 2010-04-01 | 2014-08-12 | Apple Inc. | Real-time or near real-time streaming |
GB201105502D0 (en) | 2010-04-01 | 2011-05-18 | Apple Inc | Real time or near real time streaming |
US9691430B2 (en) | 2010-04-01 | 2017-06-27 | Microsoft Technology Licensing, Llc | Opportunistic frame caching |
TWI451279B (zh) | 2010-04-07 | 2014-09-01 | Apple Inc | 即時或接近即時串流傳輸之內容存取控制 |
US8856283B2 (en) | 2011-06-03 | 2014-10-07 | Apple Inc. | Playlists for real-time or near real-time streaming |
US8843586B2 (en) | 2011-06-03 | 2014-09-23 | Apple Inc. | Playlists for real-time or near real-time streaming |
US20140040201A1 (en) * | 2012-08-01 | 2014-02-06 | Redigi, Inc. | Transfer of Digital Media Objects Via Migration |
WO2018142159A1 (fr) | 2017-02-03 | 2018-08-09 | Tv One Limited | Procédé de transmission et d'affichage vidéo |
CN114475713B (zh) * | 2022-01-24 | 2023-07-25 | 电子科技大学 | 高速磁悬浮车地通信系统多源数据融合分接系统和方法 |
Citations (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5481312A (en) * | 1994-09-12 | 1996-01-02 | At&T Corp. | Method of and apparatus for the transmission of high and low priority segments of a video bitstream over packet networks |
WO1996001528A1 (fr) * | 1994-07-01 | 1996-01-18 | Commonwealth Scientific And Industrial Research Organisation | Representation fractale de donnees |
WO1996002895A1 (fr) * | 1994-07-14 | 1996-02-01 | Johnson Grace Company | Procede et appareil pour comprimer des images |
EP0739140A2 (fr) * | 1995-04-18 | 1996-10-23 | Sun Microsystems, Inc. | Codeur pour un système de distribution vidéo point-à point à échelle variable |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5421031A (en) * | 1989-08-23 | 1995-05-30 | Delta Beta Pty. Ltd. | Program transmission optimisation |
US5528281A (en) * | 1991-09-27 | 1996-06-18 | Bell Atlantic Network Services | Method and system for accessing multimedia data over public switched telephone network |
JP2521016B2 (ja) * | 1991-12-31 | 1996-07-31 | インターナショナル・ビジネス・マシーンズ・コーポレイション | マルチメディア・デ―タ処理システム |
US5414455A (en) * | 1993-07-07 | 1995-05-09 | Digital Equipment Corporation | Segmented video on demand system |
US5544313A (en) * | 1994-05-11 | 1996-08-06 | International Business Machines Corporation | Baton passing optimization scheme for load balancing/configuration planning in a video-on-demand computer system |
US5512934A (en) * | 1994-12-29 | 1996-04-30 | At&T Corp. | System and method for transmission of programming on demand |
US5561791A (en) * | 1995-04-10 | 1996-10-01 | Digital Equipment Corporation | Method and apparatus for conditioning timed program independent of transport timing |
1997
- 1997-02-13 JP JP9529159A patent/JP2000504906A/ja active Pending
- 1997-02-13 WO PCT/IL1997/000055 patent/WO1997030551A1/fr not_active Application Discontinuation
- 1997-02-13 AU AU16165/97A patent/AU1616597A/en not_active Abandoned
- 1997-02-13 EP EP97902559A patent/EP0886968A4/fr not_active Withdrawn
Non-Patent Citations (3)
Title |
---|
PENTLAND A ET AL: "A PRACTICAL APPROACH TO FRACTAL-BASED IMAGE COMPRESSION" DATA COMPRESSION CONFERENCE, 8 April 1991 (1991-04-08), pages 176-185, XP000611138 * |
See also references of WO9730551A1 * |
VANDENDORPE L: "HIERARCHICAL TRANSFORM AND SUBBAND CODING OF VIDEO SIGNALS*" SIGNAL PROCESSING. IMAGE COMMUNICATION, vol. 4, no. 3, 1 June 1992 (1992-06-01), pages 245-262, XP000270232 ISSN: 0923-5965 * |
Also Published As
Publication number | Publication date |
---|---|
EP0886968A4 (fr) | 1999-09-22 |
AU1616597A (en) | 1997-09-02 |
JP2000504906A (ja) | 2000-04-18 |
WO1997030551A1 (fr) | 1997-08-21 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US6536043B1 (en) | Method and systems for scalable representation of multimedia data for progressive asynchronous transmission | |
EP0886968A1 (fr) | Procede et systemes de transmission asynchrone et progressive de donnees multimedia | |
US6721952B1 (en) | Method and system for encoding movies, panoramas and large images for on-line interactive viewing and gazing | |
US7751628B1 (en) | Method and apparatus for progressively deleting media objects from storage | |
US5968120A (en) | Method and system for providing on-line interactivity over a server-client network | |
US6139197A (en) | Method and system automatically forwarding snapshots created from a compressed digital video stream | |
US6745226B1 (en) | Method and system for progressive encoding in an active desktop environment | |
US8259788B2 (en) | Multimedia stream compression | |
JP2003099358A (ja) | サーバー−クライアントネットワークを介するオンライン対話性を改善する方法及びシステム | |
US20120291080A1 (en) | Image delivery system with image quality varying with frame rate | |
EP1056273A2 (fr) | Procédé et système pour délivrer des images de haute qualité à partir d'un train de vidéo numérique | |
WO1998037699A1 (fr) | Systeme et procede permettant d'envoyer et de recevoir une video comme montage de diapositives sur un reseau d'ordinateurs | |
US7954057B2 (en) | Object movie exporter | |
Chang et al. | Development of Columbia's video on demand testbed | |
Haynes et al. | Visualcloud demonstration: A dbms for virtual reality | |
CN101395909A (zh) | 用于组合编辑信息和媒体内容的方法和系统 | |
IL125643A (en) | Method and systems for advanced asynchronous fragments of multimedia data | |
GB2348074A (en) | Encoding movies, panoramas and large images for on-line interactive viewing and gazing | |
WO2002028085A2 (fr) | Reutilisation de donnees multimedia decodees pour utilisateurs multiples | |
IL173679A (en) | Providing compressed video | |
IL141104A (en) | Remote computer access |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
| PUAI | Public reference made under Article 153(3) EPC to a published international application that has entered the European phase | Free format text: ORIGINAL CODE: 0009012 |
| 17P | Request for examination filed | Effective date: 19980909 |
| AK | Designated contracting states | Kind code of ref document: A1; Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
| A4 | Supplementary search report drawn up and despatched | Effective date: 19990806 |
| AK | Designated contracting states | Kind code of ref document: A4; Designated state(s): AT BE CH DE DK ES FI FR GB GR IE IT LI LU MC NL PT SE |
| RIC1 | Information provided on IPC code assigned before grant | Free format text: 6H 04N 7/14 A, 6H 04N 7/173 B, 6H 04N 7/26 B |
| STAA | Information on the status of an EP patent application or granted EP patent | Free format text: STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN |
| 18D | Application deemed to be withdrawn | Effective date: 20040901 |