WO2000010329A1 - Client-side digital television authoring system - Google Patents

Client-side digital television authoring system

Info

Publication number
WO2000010329A1
Authority
WO
WIPO (PCT)
Prior art keywords
program
video
client system
client
data
Prior art date
Application number
PCT/US1999/018292
Other languages
English (en)
Inventor
Stephen Hartford
Michael Richard Young Moore
Stephan D. Schaem
Thomas A. Riso
Steven R. Kell
J. Paul Montgomery
Original Assignee
Play, Inc.
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Play, Inc.
Priority to AU54793/99A
Publication of WO2000010329A1

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/433Content storage operation, e.g. storage operation in response to a pause request, caching operations
    • H04N21/4331Caching operations, e.g. of an advertisement for later insertion during playback
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/02Editing, e.g. varying the order of information signals recorded on, or reproduced from, record carriers
    • G11B27/031Electronic editing of digitised analogue information signals, e.g. audio or video signals
    • G11B27/034Electronic editing of digitised analogue information signals, e.g. audio or video signals on discs
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/262Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists
    • H04N21/26258Content or additional data distribution scheduling, e.g. sending additional data at off-peak times, updating software modules, calculating the carousel transmission frequency, delaying a video stream transmission, generating play-lists for generating a list of items to be played back in a given order, e.g. playlist, or scheduling item distribution according to such list
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/4508Management of client data or end-user data
    • H04N21/4532Management of client data or end-user data involving end-user characteristics, e.g. viewer profile, preferences
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/45Management operations performed by the client for facilitating the reception of or the interaction with the content or administrating data related to the end-user or to the client device itself, e.g. learning user preferences for recommending movies, resolving scheduling conflicts
    • H04N21/462Content or additional data management, e.g. creating a master electronic program guide from data received from the Internet and a Head-end, controlling the complexity of a video stream by scaling the resolution or bit-rate based on the client capabilities
    • H04N21/4622Retrieving content or additional data from different sources, e.g. from a broadcast channel and the Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/472End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content
    • H04N21/47205End-user interface for requesting content, additional data or services; End-user interface for interacting with content, e.g. for content reservation or setting reminders, for requesting event notification, for manipulating displayed content for manipulating displayed content, e.g. interacting with MPEG-4 objects, editing locally
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/47End-user applications
    • H04N21/478Supplemental services, e.g. displaying phone caller identification, shopping application
    • H04N21/4782Web browsing, e.g. WebTV
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/61Network physical structure; Signal processing
    • H04N21/6106Network physical structure; Signal processing specially adapted to the downstream path of the transmission network
    • H04N21/6125Network physical structure; Signal processing specially adapted to the downstream path of the transmission network involving transmission via Internet
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • H04N7/17318Direct or substantially direct transmission and handling of requests
    • GPHYSICS
    • G11INFORMATION STORAGE
    • G11BINFORMATION STORAGE BASED ON RELATIVE MOVEMENT BETWEEN RECORD CARRIER AND TRANSDUCER
    • G11B27/00Editing; Indexing; Addressing; Timing or synchronising; Monitoring; Measuring tape travel
    • G11B27/10Indexing; Addressing; Timing or synchronising; Measuring tape travel
    • G11B27/34Indicating arrangements 

Definitions

  • This invention relates generally to multimedia production, and more particularly to producing low bandwidth, high quality multimedia content programs in client-side systems.
  • The ITU-R 601 standard used in professional digital video production provides for an NTSC frame of 720 pixels x 486 scan lines, with eight-bit 4:2:2 sampling of the Y, R-Y, and B-Y color components. At sixty fields per second, this frame structure results in a 20 megabyte per second data stream, which far exceeds the rates currently available in common networks.
  • Fast modems connect to the Internet at up to 56 kilobits per second. Transferring a high-quality compressed (for instance, 5:1) ITU-R 601 signal would require a further compression of 750:1. Compression of this magnitude would render the underlying signal unrecognizable.
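The figures above can be checked with a little arithmetic. The sketch below is illustrative only: it assumes two bytes per pixel for eight-bit 4:2:2 sampling and treats sixty fields per second as thirty full frames per second. The 44.8 kbit/s effective rate is an assumption introduced here to show which link speed corresponds to 750:1; the text does not state it.

```python
# Frame geometry and sampling taken from the text; everything else is an assumption.
WIDTH, HEIGHT = 720, 486
BYTES_PER_PIXEL = 2        # 8-bit 4:2:2: one luma byte plus one shared chroma byte per pixel
FRAMES_PER_SECOND = 30     # assumption: 60 fields/s counted as 30 full frames/s


def raw_bytes_per_second() -> int:
    """Uncompressed data rate of the frame structure described above."""
    return WIDTH * HEIGHT * BYTES_PER_PIXEL * FRAMES_PER_SECOND


def further_compression(link_bits_per_second: float, first_stage_ratio: float = 5.0) -> float:
    """Extra compression ratio needed after a first-stage codec to fit the link."""
    compressed_bits = raw_bytes_per_second() * 8 / first_stage_ratio
    return compressed_bits / link_bits_per_second


print(raw_bytes_per_second())               # 20995200 bytes/s, roughly 20 megabytes per second
print(round(further_compression(56_000)))   # 600 at the nominal 56 kbit/s modem rate
print(round(further_compression(44_800)))   # 750 at an assumed 44.8 kbit/s effective rate
```

Under these assumptions the nominal modem rate gives about 600:1, and the quoted 750:1 matches an effective throughput somewhat below the nominal 56 kilobits per second.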
  • Digital networks have similar bandwidth limitations. Examples of digital networks include modem-to-modem over the conventional telephone network, the Internet with modems, the Internet with an Integrated Services Digital Network (ISDN) line at one end, the Internet with ISDN lines at both ends, and a corporate 10Base-T Local Area Network (LAN).
  • Latency presents additional problems for digital networks.
  • The Internet architecture does not guarantee that any particular transfer bandwidth will be sustained over time.
  • Existing video streaming architectures attempt to solve this problem by filling a buffer on the client (receiver) side with future video and audio frames so that, when a gap occurs in the Internet transmission, video is played from the buffer in the hope that the transmission will resume before the buffer is emptied.
  • The transmission often does not resume in time, which, as the frame rate drops, can cause a pause in the video stream accompanied by an audio glitch.
  • The buffering techniques also require that the buffer be filled before the video program begins playing. Consequently, as the size of the buffer increases, the time delay before the video playback begins also increases. Additionally, avoiding glitches in playback requires that the buffer be re-filled before the next transmission interruption.
  • The World Wide Web (the "web") is a collection of millions of electronic documents consisting primarily of text and still images, linked together and accessible by anyone with a suitable Internet browser. This huge database of information is typically navigated, or "surfed," by using various search tools to find the desired information. Surfing, or navigating, the Internet is referred to as an "active" task in which the user actively engages in, and indeed controls, the exploration and acquisition of information.
  • The present invention provides a multimedia system and method for producing a client-side low-bandwidth television (LBTV) program.
  • The invention first creates scripts of the programs in an authoring system and stores them on a server, which, upon receiving a request from a user to view a program, transfers the corresponding Elastic Edit Decision List (EDL) and a small amount of preliminary content to the user's client system.
  • The EDL serves as the script and includes instructions for the client system to produce and orchestrate the program in real time on the client's hardware.
  • The client system then executes the EDL for viewing the program with as little delay as possible. Even if only a portion of the program script is received before all of the content has been transferred, the client system plays that portion of the script. While the client system is running one program segment, the program searches for data for subsequent segments, which may be locally available in the client system or be transferred from the server. If the data is not available, then the program may ask the client system to generate the data, use another piece of data having similar functions, or perform other functions to keep displaying the program continuously and smoothly, including prolonging display of an image, redisplaying an image that has been displayed earlier, or playing some graphic animation. Further, while the program is running, the server sends in the background additional raw content that is not available in the client system.
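The lookup order described above (local data, then data received from the server, then generated or substitute content) might be sketched as follows. This is a minimal illustration, not the patented implementation; the function name, the dictionaries, and the source labels are all hypothetical.

```python
from typing import Callable, Dict, Optional


def resolve_segment(name: str,
                    local_cache: Dict[str, bytes],
                    received_from_server: Dict[str, bytes],
                    generators: Dict[str, Callable[[], bytes]],
                    previous: Optional[bytes]) -> tuple:
    """Return (source, data) for the next segment without ever blocking playback."""
    if name in local_cache:                 # already stored on the client
        return ("local", local_cache[name])
    if name in received_from_server:        # arrived in the background
        return ("server", received_from_server[name])
    if name in generators:                  # client can generate the data itself
        return ("generated", generators[name]())
    if previous is not None:                # prolong or redisplay an earlier image
        return ("held", previous)
    return ("animation", b"")               # last resort: play a graphic animation


print(resolve_segment("image_b", {}, {}, {}, b"image_a_pixels"))
# ('held', b'image_a_pixels')
```

Every branch returns immediately, which reflects the requirement that the program keeps displaying continuously rather than waiting for content.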
  • Because the invention allows the program to be produced and raw program content to be received in real time, it is advantageous over prior art techniques in which the final program, including the video and audio tracks, is transmitted in its entirety.
  • To use transmission bandwidth effectively, the invention sends only a description of how to recreate a given scene, rather than sending every video frame. Consequently, the invention enables the use of low bandwidth media to broadcast high-quality imagery, at least as good as in a television broadcast.
  • FIG. 1 is a schematic overview of a network comprising the authoring system and clients of the preferred embodiment of the invention;
  • FIG. 2 is a block diagram illustrating the architecture of the authoring system;
  • FIG. 3 is a schematic showing a timeline editor which is used to produce the event-driven storyboard for an LBTV program;
  • FIG. 4 shows an exemplary storyboard representing an LBTV program created in accordance with the invention;
  • FIG. 5 shows an LBTV program memory map created from the storyboard of FIG. 4;
  • FIG. 6 is a schematic overview of an architecture of a client system in FIG. 1;
  • FIG. 7 is a schematic showing a playback engine of a client system;
  • FIG. 8 is a flowchart illustrating the method of the invention from the step of creating the LBTV scripts through the step of playing the LBTV program.
  • FIG. 1 shows a schematic overview of a network 100 of the preferred embodiment, which includes an authoring system 104, a server 108, and a plurality of client systems ("clients") 110A to 110N, all of which interconnect via the Internet 114.
  • Authoring system 104 enables generation of Low Bandwidth Television (LBTV) software program scripts, which include raw data and instructions to construct a "client-side" video/television program from raw data.
  • Server 108 stores the scripts created by authoring system 104 and transmits the scripts to clients 110.
  • Program scripts preferably reside as disk files on server 108.
  • Other network distribution systems such as corporate intranets, Local Area Networks (LANs), Wide Area Networks (WANs), 10Base-T networks, and terrestrial digital Radio Frequency (RF) broadcasting, or modem (point-to-point) distribution systems are also effective.
  • Server 108 transfers the program script, consisting of an Elastic Edit Decision List (EDL) file, along with a small amount of preliminary content, to client 110.
  • The Elastic EDL file serves as the script by which client 110 will orchestrate the final production of the video.
  • Client 110 begins executing the EDL file with as little delay as possible. Additional raw content that is not already stored on client 110 is sent from the server 108 in temporal order over the Internet 114.
  • The program sequentially displays the media content according to the script. Unlike many conventional computer media systems, however, program execution does not stop and wait for the availability of content.
  • Elastic EDLs, developed by Play, Inc., a corporation having headquarters in Rancho Cordova, are an improvement over the traditional EDL, which consists of an inflexible list of video clip segments for producing the video in the order of the segments.
  • An Elastic EDL is "elastic," that is, it contains information and instructions to create the LBTV program on the fly, including information on whether to adjust the program during display, such as by prolonging a video segment or substituting the segment with another segment. Consequently, an Elastic EDL allows the program to be scaled when required to compensate for bandwidth variations.
  • An Elastic EDL can take into account a user's previous playing of a program, may contain the user's preferences, and allows subsequent playing based on these preferences.
  • Although FIG. 1 illustrates a preferred embodiment in which the authoring system 104 and client 110 are separated by the Internet 114, a single application program may alternatively contain the authoring system 104.
  • The application program is loaded by a user on client 110, which then can create, edit, and execute program scripts.
  • Server 108 may contain content which is utilized by the program scripts, or all necessary program content may be loaded onto client 110 as part of the application program.
  • Program scripts created by one client 110 can be executed by other clients 110 in such an embodiment.
  • FIG. 2 is a block diagram illustrating the architectural details of the authoring system 104.
  • Authoring system 104 includes a data bus 201 connecting a Central Processing Unit (CPU) 204 to a plurality of memories, which store data and application programs that run on CPU 204. These memories and applications include a timeline editor 208, a music sound track module 212, an audio effects module 216, a video content module 220, a video effects module 224, a text generation module 228, and a communication unit 236.
  • Timeline editor 208 is used to generate an event-driven storyboard for an LBTV program from music sound tracks, audio effects, video, video effects, and text stored in respective modules 212, 216, 220, 224, and 228.
  • Timeline editor 208 can also retrieve audio and video objects from a computer hard disk or other means such as a scanner or a drawing tool. To reduce the video clip size, timeline editor 208 divides a long video clip into short pieces of video tracks, and when transferring the data to client 110, server 108 sends the entire audio tracks and these short pieces of video tracks. If not all video tracks have been received during display, client system 110, in accordance with instructions in the Elastic EDL, may replace the belated video tracks with cut-aways of camera-stand style still movies, animating graphics, or stock video clip footage.
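As a rough illustration of dividing a long clip into short pieces and covering late pieces with cut-aways, consider the sketch below. The frame-range representation, the piece length, and the "cut-away" label are assumptions made for the example; the text does not specify how pieces are sized or identified.

```python
def split_clip(total_frames: int, piece_frames: int) -> list:
    """Return (start, end) frame ranges, end exclusive, that cover the clip in order."""
    if total_frames < 0 or piece_frames <= 0:
        raise ValueError("total_frames must be >= 0 and piece_frames must be > 0")
    return [(start, min(start + piece_frames, total_frames))
            for start in range(0, total_frames, piece_frames)]


def plan_playback(pieces: list, received: set) -> list:
    """Play each piece that has arrived; stand in a cut-away for each piece that has not."""
    return [("video", piece) if piece in received else ("cut-away", piece)
            for piece in pieces]


pieces = split_clip(250, 100)
print(pieces)                                         # [(0, 100), (100, 200), (200, 250)]
print(plan_playback(pieces, {(0, 100), (200, 250)}))
# [('video', (0, 100)), ('cut-away', (100, 200)), ('video', (200, 250))]
```

Because the audio track is sent whole, a cut-away only has to fill the picture for the duration of the missing piece.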
  • Music sound track module 212 contains information associated with music.
  • Music sound track module 212 defines how long a piece of music will be played, how loud the musical piece will be played, what music key the piece will be performed in, and whether the music will be looped for continuous playing.
  • The music is preferably stored as MIDI or compressed .WAV files.
  • Audio effects module 216 produces various audio effects, such as control of music tracks, Foley effects, and voice-overs. Audio effects module 216 can also mix multiple audio tracks and add special effects, including reverberation and delays, in real time.
  • Video content module 220 contains the video objects from which an LBTV program is created. These video objects are stored in a variety of forms such as still pictures, short video clips, animated computer graphics, bitmaps, or other movie files suitable for QuickTime movies. Video content module 220 also contains information about how the video objects are to be used, e.g., whether a picture will be displayed in full size or half size, and whether the picture will be cropped or trimmed to exclude unwanted images.
  • Video effects module 224 enables application of a variety of visual effects to picture images in an LBTV program. Examples of video effects include cuts, dissolves, fades to and from black, wipes, organic animating wipes, and digital video effects such as push, pull, and flying video planes. Additionally, video effects module 224 can emulate various conventional camera techniques used in traditional video production, such as slow camera pans and zooms across still images. Video effects module 224 also provides transitional smoothing from one scene to another, from one image still to another, and from one video clip to another.
  • Text module 228 may be a conventional text processing program and is used to design and generate text to be displayed with the video. Preferably, text module 228 enables text to be created as an object and placed anywhere on a video page.
  • Authoring system 104 uses communication unit 236 to transmit and receive data from other systems, including server 108 and clients 110.
  • Communication unit 236 is a conventional network communication device such as a router or modem.
  • Timeline editor 208 includes a source element browser 308, an element library 312, a plurality of object property editors 320, a storyboard display 326, an LBTV script compiler 330, and a playback engine 334.
  • Source element browser 308 enables a user to find video objects and bring them into timeline editor 208.
  • These video objects may reside on an Internet web page, a CD-ROM, or a PC hard disk, but preferably are contained in the element library 312 connected to element browser 308. These video objects may also be provided from a scanner, a software video object drawing tool, or other means of providing video elements.
  • Each type of video object is edited using a specific object editor 320 suitable for the particular type of object.
  • Text objects are conventionally edited using a word processor, still pictures are generally edited using a "Paint" program, and so on.
  • Although object editors 320 are shown as residing within timeline editor 208, object editors 320 could reside anywhere in authoring system 104, so long as they can be accessed by timeline editor 208.
  • Video objects are displayed, preferably after having been edited, in a storyboard format using the storyboard display 304.
  • A storyboard representation of an LBTV program is preferably displayed as a sequence of icons or pictures. Each icon in the sequence represents a script object.
  • A still image in the sequence might be represented by a thumbnail of the image.
  • A music object (.WAV file) might be represented by a music symbol displayed with the name of the file.
  • An animated sequence might be represented by a thumbnail image of the first frame of the sequence.
  • LBTV compiler 330 generates an LBTV program script from the edited storyboard. Playback engine 334 allows a user to review a program script created by timeline editor 208.
  • FIG. 4 is a block diagram illustrating an exemplary storyboard 400 of an LBTV program created in accordance with the invention.
  • Storyboard 400 is event-driven and includes, for example, from time T0 to time T5, a "fade from black," a still image A, video effects for the still image A, a still image B, a virtual camera pan, and a still image C, respectively.
  • The program preferably starts by "fading from black" to still image A at T1.
  • Other colors and/or effects may substitute for the fade-from-black transition.
  • The fading time is preferably synchronized to a real-time clock, and is independent of the hardware and/or graphics card of the client 110.
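One way to make a fade depend on a real-time clock rather than on how fast the hardware draws frames is to compute the fade level from elapsed time on every frame. The sketch below assumes a linear fade and uses Python's monotonic clock; the function names and the linear ramp are illustrative choices, not details from the text.

```python
import time


def fade_level(elapsed_seconds: float, fade_seconds: float = 1.0) -> float:
    """Visibility of the incoming image: 0.0 is fully black, 1.0 is fully visible."""
    if fade_seconds <= 0:
        return 1.0
    return max(0.0, min(1.0, elapsed_seconds / fade_seconds))


def fade_pixel(rgb: tuple, level: float) -> tuple:
    """Scale one RGB pixel toward black by the current fade level."""
    return tuple(round(channel * level) for channel in rgb)


def level_now(start_time: float, fade_seconds: float = 1.0) -> float:
    """Fade level for the frame being drawn right now, whatever the frame rate."""
    return fade_level(time.monotonic() - start_time, fade_seconds)


print(fade_level(0.5))                     # 0.5
print(fade_pixel((200, 100, 50), 0.5))     # (100, 50, 25)
```

A slow machine simply draws fewer intermediate frames; the fade still finishes after the same wall-clock duration.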
  • Still image A is displayed.
  • Video effects are added to enhance image A. Examples of the video effects applied to still image A include wipes, organic animating wipes, and digital video effects such as push, pull, and flying video planes. Other audio and musical effects may also be added.
  • Still image B is displayed, and like still image A, image B may be accompanied by video and audio effects.
  • A virtual camera pan is used to pan across image B.
  • The virtual camera pan is a video effect in which the viewer is able to see only a portion of the still image being viewed. The portion of the image displayed to the viewer changes slowly, giving the viewer the perspective of either moving his eyes across a panoramic view or of having the image move relative to the viewer.
  • The virtual camera pan effect at T4 contains video data related to the panning paths so that the paths can be constructed in client 110.
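A virtual camera pan can be constructed on the client from a few path parameters by computing, for each moment, which portion of the still image is visible. The following sketch assumes a straight left-to-right path, a viewport no larger than the image, and a vertically centered window; these are simplifications chosen for the example.

```python
def pan_viewport(image_width: int, image_height: int,
                 view_width: int, view_height: int,
                 progress: float) -> tuple:
    """Crop rectangle (left, top, right, bottom) for a left-to-right pan.

    progress runs from 0.0 (start of the pan) to 1.0 (end of the pan).
    """
    progress = max(0.0, min(1.0, progress))
    max_left = max(0, image_width - view_width)      # furthest the window can slide
    left = round(max_left * progress)
    top = max(0, (image_height - view_height) // 2)  # keep the window vertically centered
    return (left, top, left + view_width, top + view_height)


for step in (0.0, 0.5, 1.0):
    print(pan_viewport(1600, 480, 640, 480, step))
# (0, 0, 640, 480)
# (480, 0, 1120, 480)
# (960, 0, 1600, 480)
```

Only the image and the path description need to be transferred; every intermediate view is derived locally.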
  • Image B is replaced by still image C. Since no transition video effect (such as a fade or a wipe) separates still images B and C, image C merely replaces image B.
  • Each subsequent image, sound, or video effect is loaded by playback engine 334 for execution.
  • The image or sound may not be immediately available for playback.
  • Playback engine 334 is programmed to continue executing the storyboard by substituting some alternative content for the missing image. For example, if at time T3 image B is not available to replace the still image A of time T1, the program may continue to display still image A. Alternatively, a line drawing or stock photograph may be substituted for the image A.
  • Playback engine 334 loops to groups of images that have been displayed earlier.
  • The invention continuously redisplays events from time T1 to time T4 that include still image A, video effects for image A, still image B, and virtual camera panning for image B.
  • Other options to keep the program running continuously include looping the music, looping the graphics animation, and replaying parts of the storyboard.
  • The invention can also substitute stand-in video content, change picture color to black and white, or substitute the transitions from one image to another image.
  • The programming must continue to play. This is in contrast to more traditional multimedia systems where the playback system generally stops executing and waits for the missing image, video, or audio clip to load before playback resumes.
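The loop-back behavior can be illustrated with the storyboard of FIG. 4: if the data for the next event has not arrived, playback returns to the group that starts at still image A instead of stalling. The event names, the availability set, and the fixed loop start index are hypothetical details chosen for this sketch.

```python
EVENTS = ["fade from black", "image A", "effects on A", "image B", "pan across B", "image C"]


def next_event_index(current: int, events: list, available: set, loop_start: int = 1):
    """Index of the event to play after `current`; None when the storyboard is finished."""
    following = current + 1
    if following >= len(events):
        return None                      # last element already played
    if events[following] in available:
        return following                 # normal case: move on
    return loop_start                    # data missing: replay the earlier group


everything_but_c = set(EVENTS) - {"image C"}
print(next_event_index(4, EVENTS, everything_but_c))   # 1, loop back to image A
print(next_event_index(4, EVENTS, set(EVENTS)))        # 5, image C has arrived
```

Calling the function again after each replayed event lets playback move on to image C as soon as it becomes available.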
  • FIG. 5 is a block diagram illustrating an LBTV program memory map 500 containing the program script created from the storyboard 400 of FIG. 4.
  • A set of parameters is associated with each FIG. 4 time event. These parameters are stored in memory sections 504, 508, 510, 512, 516, and 520 for events at times T0, T1, T2, T3, T4, and T5, respectively.
  • The fade from black event includes black as the initial color, and one second as the minimum duration time for fading.
  • Image A is stored as a compressed bitmap. Additional parameters appended to image A indicate that image A will be displayed for a minimum of five seconds and has a cropping value of (X1, Y1, X2, Y2). Image A also has a zoom factor Z.
  • the images including still image A
  • Memory section 508 in fact preferably points to an image library where image A is actually stored.
  • Section 510 contains the video effect occurring at time T2.
  • The video effect is described as having a video effects speed of one, with still image A traveling from left (L) to right (R), and image A having a red border.
  • Section 512 contains information for compressed image B.
  • Image B is described as being displayed for ten seconds.
  • A virtual camera pan is described in section 516 as occurring with respect to image B.
  • The virtual camera pan is a pre-programmed effect, which in this case preferably results in a left-to-right sweep of the image across the video screen.
  • The virtual camera pan of section 516 indicates that no zoom of the still image B will occur.
  • Memory section 520 stores the still image C described with reference to FIG. 4 as being displayed at time T5.
  • Still image C is the last element.
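The memory map described above could be represented as an ordered list of events, each carrying its own parameters. The sketch below is one possible encoding; the class name, field names, and parameter keys are hypothetical, and the crop and zoom values are kept as symbolic placeholders because the text gives no numbers for them.

```python
from dataclasses import dataclass, field


@dataclass
class ScriptEvent:
    section: int          # memory section number in the memory map
    time_label: str       # T0 .. T5
    kind: str
    params: dict = field(default_factory=dict)


MEMORY_MAP = [
    ScriptEvent(504, "T0", "fade_from_black", {"initial_color": "black", "min_duration_s": 1}),
    ScriptEvent(508, "T1", "still_image", {"image": "A", "min_duration_s": 5,
                                           "crop": ("X1", "Y1", "X2", "Y2"), "zoom": "Z"}),
    ScriptEvent(510, "T2", "video_effect", {"image": "A", "speed": 1,
                                            "direction": "L->R", "border": "red"}),
    ScriptEvent(512, "T3", "still_image", {"image": "B", "duration_s": 10}),
    ScriptEvent(516, "T4", "virtual_camera_pan", {"image": "B", "direction": "L->R", "zoom": None}),
    ScriptEvent(520, "T5", "still_image", {"image": "C"}),
]


def stated_seconds(events: list) -> int:
    """Sum of the durations that the script states explicitly."""
    return sum(e.params.get("min_duration_s", 0) + e.params.get("duration_s", 0) for e in events)


print(stated_seconds(MEMORY_MAP))   # 16 (1 s fade + 5 s image A + 10 s image B)
```

Storing a reference such as `"image": "A"` rather than the bitmap itself mirrors the idea that a section may point into an image library.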
  • FIG. 6 is a block diagram illustrating the preferred architecture of the FIG. 1 client 110, which can be a standard personal computer (PC), an LBTV-compatible set-top box, or other system capable of executing software and displaying multimedia content. Since LBTV programs are actually created in a high-level description language, the program's actual content can be generated in real time to fit the capabilities of the client 110 system.
  • Client 110 is preferably a Pentium computer with a PCI video display card.
  • The real-time playback of an LBTV program preferably does not require a graphics card with any special hardware capabilities.
  • Client 110 works well on a PC without compromising the broadcast television mandates of smooth, continuous, and well-produced content.
  • LBTV-compatible set-top boxes also preferably use standard PC display technologies, resulting in a low cost, stand-alone unit.
  • The LBTV architecture preferably combines the standard PC display chipsets, a high-speed, general purpose CPU, and flexible software application modules to maintain a professional quality television program, even over a low bandwidth connection.
  • Client 110 includes a client CPU 604 connected via data bus 601 to a plurality of memories which store data and application programs that run on the CPU 604, including a network browser 608, a playback engine 612, a script library 616, a video library 620, an algorithmic video and audio module 628, an audio library 634, and a keying and layering engine 638.
  • A display 640 and a communication unit 642 are also conventionally coupled to data bus 601.
  • Internet browser 608 allows a user to request a stored LBTV program, such as from a web site, and to download various media resources (images, effects, video clips, etc.) that might be required.
  • Playback engine 612 provides intelligence for an LBTV script to be executed for viewing.
  • Playback engine 612 of FIG. 6 is identical to playback engine 334 of FIG. 3 discussed with reference to timeline editor 208 of authoring system 104.
  • Alternatively, playback engine 334 has a different user interface or diagnostics than the client playback engine 612, as might be contemplated for an editor versus a run-time tool.
  • Script library 616 stores the program scripts that have been created and transferred to client 110. Each program script includes the Elastic EDL and some small preliminary content.
  • The program may be modified at any time, including during playback.
  • The invention, using instructions in the Elastic EDL, may, in real time, customize a program for the viewer, including removing segments of shows already seen by the viewer, expanding segments which include topics the viewer previously expressed interest in, and customizing a commercial to the viewer's tastes.
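Viewer-specific customization of this kind might look like the following sketch, which drops segments the viewer has already seen and lengthens segments on topics of interest. The segment dictionary layout and the `extended_duration_s` field are assumptions made for illustration; the text does not define how preferences are stored in the Elastic EDL.

```python
def customize(segments: list, seen: set, interests: set) -> list:
    """Drop segments already seen; use the longer cut for topics the viewer likes."""
    result = []
    for segment in segments:
        if segment["id"] in seen:
            continue                                   # remove what the viewer already saw
        duration = segment["duration_s"]
        if segment["topic"] in interests:
            duration = segment.get("extended_duration_s", duration)
        result.append({**segment, "duration_s": duration})
    return result


program = [
    {"id": "intro", "topic": "news", "duration_s": 10},
    {"id": "sports", "topic": "sports", "duration_s": 20, "extended_duration_s": 45},
    {"id": "weather", "topic": "weather", "duration_s": 15},
]
for segment in customize(program, seen={"intro"}, interests={"sports"}):
    print(segment["id"], segment["duration_s"])
# sports 45
# weather 15
```

The original segment list is left unchanged, so the same script can be customized differently on the next playback.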
  • the invention may also produce, in real-time, video edits, special effects, titles, graphics, and audio streams.
  • Audio and video objects are stored in audio library 634 and video library 620, respectively.
  • Pre-generated canned video elements are also stored in video library 620.
  • Algorithmic video and audio module 628 is used to generate video images based on a defined algorithm provided by playback engine 612. Keying and layering engine 638 allows a viewer to see one picture layered on another picture, or one video object superimposed over a background.
  • Client 110 uses communication unit 642 to transmit and receive data.
  • Playback engine 612 orchestrates the final production of an LBTV program.
  • Playback engine 612 includes a speed coordinator 704, a synchronization module 708, and a transitional smoothness module 712.
  • Speed coordinator 704 is responsible for ensuring that the LBTV program scripts run equivalently on both slow (PC 486) and faster (PC Pentium) machines.
  • A program that runs on a faster machine does not necessarily complete its tasks in a shorter time, but the program's audio and video objects are usually of better quality.
  • A program created for a faster machine can, using a mapping technique, run on a slower machine.
  • Speed coordinator 704 first determines the speed at which playback engine 612 is running, and, based on this speed, takes actions accordingly.
  • The speed of playback engine 612 varies depending on the speed of other system components, such as the graphics card, audio card, and system motherboard.
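One way to picture the speed coordinator is as a measurement step followed by a mapping step. The sketch below is an assumption-laden illustration: the patent does not describe how speed is measured or which thresholds are used, so `measure_fps`, `choose_quality`, and the values in `QUALITY_TIERS` are hypothetical.

```python
import time
from typing import Callable

# Hypothetical thresholds (frames per second) mapping measured speed to a quality tier.
QUALITY_TIERS = [(24.0, "high"), (12.0, "medium"), (0.0, "low")]


def measure_fps(render_frame: Callable[[], None], frames: int = 20) -> float:
    """Time a fixed number of test frames and return the achieved frame rate."""
    start = time.perf_counter()
    for _ in range(frames):
        render_frame()
    elapsed = time.perf_counter() - start
    return frames / elapsed if elapsed > 0 else float("inf")


def choose_quality(fps: float) -> str:
    """Map a measured frame rate to the richest tier the machine can sustain."""
    for threshold, tier in QUALITY_TIERS:
        if fps >= threshold:
            return tier
    return "low"
```

The program's running time stays the same on every machine; only the quality tier chosen for its audio and video objects changes.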
  • Synchronization module 708 establishes synchronization points to synchronize different audio and video tracks that form the LBTV program. In between each two synchronization points the tracks may not synchronize, but at each synchronization point the tracks re-synchronize. In the preferred embodiment, one synchronization point is close enough to an adjacent point so that a program viewer cannot perceive that the tracks do not synchronize. Additionally, at each synchronization point, module 708 waits to acquire all the data required for the next program segment before allowing the segment to be displayed.
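The gating behavior at a synchronization point can be sketched as a simple readiness check. The function name `next_action` and the representation of loaded data as sets of segment indices per track are assumptions made for this example only.

```python
from typing import Dict, List, Set


def next_action(loaded: Dict[str, Set[int]], tracks: List[str], segment: int) -> str:
    """Return "play" when every track has data for `segment`, otherwise "wait"."""
    missing = [t for t in tracks if segment not in loaded.get(t, set())]
    return "play" if not missing else "wait"


tracks = ["video", "audio"]
loaded = {"video": {0, 1}, "audio": {0}}  # audio for segment 1 has not arrived yet
first = next_action(loaded, tracks, 0)
second = next_action(loaded, tracks, 1)
```

While the result is "wait", the transitional smoothness module is what keeps the viewer from noticing the pause.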
  • Transitional smoothness module 712 provides the high-quality, smooth, and continuous appearance of a conventional television program. While the program is waiting to acquire all necessary data, module 712 uses several techniques to "mask out" the wait state so that a viewer perceives a smooth and continuous program. Module 712 can instruct the program to fade out, prolong playing on one video/audio track, loop on one track continuously, or replay part of a track. Module 712 can also use an animation technique that lets some video objects move around display 640 while the program is waiting. This is done using the capabilities provided by keying and layering engine 638. Substitution can also be used.
  • For example, transitional smoothness module 712 can replace a desired instrument with another similar instrument that is available on client 110.
  • Module 712 can also request that algorithmic video and audio module 628 create a video replacement based on algorithms defined by module 712.
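Instrument substitution can be sketched as a lookup in a similarity table. The table contents and the name `pick_instrument` are hypothetical; the patent says only that a similar available instrument may be used.

```python
from typing import Dict, List, Optional, Set

# Hypothetical similarity table: for each instrument, substitutes in order of preference.
SIMILAR: Dict[str, List[str]] = {
    "piano": ["pipe organ", "harpsichord"],
    "violin": ["viola", "cello"],
}


def pick_instrument(desired: str, available: Set[str]) -> Optional[str]:
    """Return the desired instrument if present, else the first similar one available."""
    if desired in available:
        return desired
    for candidate in SIMILAR.get(desired, []):
        if candidate in available:
            return candidate
    return None  # no acceptable substitute; another masking technique is needed
```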
  • In FIG. 8, a flowchart illustrates the operation of the invention from the step of creating an LBTV program script through the step of playing the program. The method begins with creating the LBTV scripts in step 804. An LBTV program producer uses the authoring system 104 to generate the program scripts.
  • Timeline editor 208 is used to edit and add musical, audio, and video effects.
  • A storyboard is created in which objects are sequentially arranged to produce the program script. Objects, represented by graphical icons within the storyboard, can be manipulated by dragging and rearranging the icons. The objects can be moved to and from object libraries (e.g., music soundtrack 212, video content 220, video effects 224 (FIG. 2)) and downloaded from remote servers 108.
  • A script compiler 330 converts the storyboard to an executable program file.
  • One or more LBTV scripts may be created and, in step 808, are transferred to server 108.
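As a rough sketch of the storyboard-to-script step, the example below turns an ordered list of storyboard objects into a serialized script with start times. The patent speaks of an executable program file without defining its format, so the JSON output, the dictionary keys, and the name `compile_storyboard` are assumptions used as a stand-in.

```python
import json
from typing import Dict, List


def compile_storyboard(storyboard: List[Dict]) -> str:
    """Convert an ordered storyboard into a serialized script with start times."""
    script = []
    clock = 0.0
    for obj in storyboard:
        script.append({
            "start_s": clock,              # objects play back to back, in storyboard order
            "library": obj["library"],
            "name": obj["name"],
            "duration_s": obj["duration_s"],
        })
        clock += obj["duration_s"]
    return json.dumps(script)


storyboard = [
    {"library": "video content", "name": "opening", "duration_s": 5.0},
    {"library": "music soundtrack", "name": "theme", "duration_s": 8.0},
]
compiled = compile_storyboard(storyboard)
```

Rearranging the icons in the storyboard corresponds to reordering the list before it is compiled.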
  • Server 108 preferably sets up a web page and makes the scripts available to a viewer having access to the Internet 114.
  • The web page of server 108 also preferably includes instructions for a viewer to request and download a desired LBTV program.
  • Each LBTV program script can also be represented as a picture icon on the web page.
  • A viewer uses Internet browser 608 to request a program for viewing.
  • The viewer contacts server 108, which then in step 816 transfers the Elastic EDL along with a small amount of preliminary content.
  • The EDL serves as the script by which client system 110 orchestrates the final production of the LBTV video on the hardware of client 110.
  • To use the transmission bandwidth effectively, the invention sends only a description of how to recreate a given scene, rather than sending every video frame.
  • In step 820, client system 110 starts executing the Elastic EDL so that the program can be viewed with as little delay as possible.
  • The Elastic EDL contains information on whether there are sufficient content segments to start playing the program. The delay depends on the system bandwidth and speed. Even if a given section of the program script occurs before all of the content has been transferred, client 110 continues and plays that section of the script, typically using "stand-in" content, such as line drawings, stock photographs, etc. While client 110 is running one program segment, the program in step 821 searches for data for subsequent segments. The data may be transferred from server 108 or be locally available on the storage disk of client 110, including video library 620 or audio library 634. Additionally, in step 822.
  • If, in step 824, the program finds the data available, the data is retrieved in step 826 and the program continues to step 828. If the data is not available in step 824, the program has several options. One option is to substitute another piece of data having a similar function. For example, if a section of audio requires a synthesized piano voice and the piano voice is not immediately available, a pipe organ or similar available instrument may be substituted. Accordingly, step 836 determines whether substitute data is acceptable or feasible. When substitution is feasible, the program in step 840 retrieves and provides the substitute data so that the program can continue in step 828.
  • If, in step 836, substitution is not feasible, the program in step 838 requests algorithmic video and audio module 628 to generate data suitable for the next program segment.
  • In step 844, the program determines whether data from module 628 is available. If so, the program in step 848 retrieves the data from module 628 and continues in step 828. If the program determines in step 844 that such data is not available, the program performs other functions in step 852. These functions include, for example, prolonging display of an image, redisplaying an image that has been displayed earlier, or playing a graphics animation until the desired data is completely transferred from server 108. The program in step 828 continues to fetch data and run until the program is complete.
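The decision flow of steps 824 through 852 can be sketched as a chain of fallbacks. The step numbers in the comments come from the description above; the function name `fetch_segment_data`, its parameters, and the returned labels are hypothetical and chosen only for this illustration.

```python
from typing import Callable, Dict, Optional, Tuple


def fetch_segment_data(
    name: str,
    available: Dict[str, str],
    substitutes: Dict[str, str],
    generate: Callable[[str], Optional[str]],
) -> Tuple[str, Optional[str]]:
    """Return (how the data was obtained, the data) for the next program segment."""
    # Steps 824/826: use the requested data when it is already available.
    if name in available:
        return "retrieved", available[name]
    # Steps 836/840: otherwise try a substitute with a similar function.
    sub = substitutes.get(name)
    if sub is not None and sub in available:
        return "substituted", available[sub]
    # Steps 838/844/848: otherwise ask the algorithmic module to generate data.
    generated = generate(name)
    if generated is not None:
        return "generated", generated
    # Step 852: nothing usable yet, so mask the wait (hold, redisplay, or animate).
    return "masked", None


available = {"pipe organ": "organ-samples"}
substitutes = {"piano": "pipe organ"}
```

In every branch the caller then continues at step 828, so playback never blocks outright on missing content.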

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Human Computer Interaction (AREA)
  • Television Signal Processing For Recording (AREA)

Abstract

The present invention relates to a system and method for client-side (110A, 110B, 110N) multimedia audiovisual authoring that delays the actual final production of the video until the user begins watching the audiovisual production, at which point the system or method creates the audiovisual production dynamically. To use transmission bandwidth efficiently, instead of sending fully finished video frames and audio tracks, the system sends various items of raw content together with instructions on how to create the final program from that raw content. Because the final audiovisual production is not produced until the viewer watches it, the system allows the production to be modified at any time, even during playback. By using an Elastic Edit Decision List (EDL) (816), the system supports customization of the audiovisual production, including removing segments of shows already viewed, expanding segments in which the viewer has previously expressed interest, and tailoring the content of commercials to the viewer's tastes. Because it allows high-quality television streams to be delivered over low-bandwidth channels, the invention offers advantages over existing techniques that transmit the video and audio tracks in their entirety and are therefore limited by the available bandwidth.
PCT/US1999/018292 1998-08-13 1999-08-11 Systeme cote client de creation de television numerique WO2000010329A1 (fr)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU54793/99A AU5479399A (en) 1998-08-13 1999-08-11 Client-side digital television authoring system

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US9666598P 1998-08-13 1998-08-13
US60/096,665 1998-08-13

Publications (1)

Publication Number Publication Date
WO2000010329A1 true WO2000010329A1 (fr) 2000-02-24

Family

ID=22258477

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US1999/018292 WO2000010329A1 (fr) 1998-08-13 1999-08-11 Systeme cote client de creation de television numerique

Country Status (2)

Country Link
AU (1) AU5479399A (fr)
WO (1) WO2000010329A1 (fr)

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2357890A (en) * 1999-10-05 2001-07-04 Sony Corp Image editing
GB2361097A (en) * 2000-04-05 2001-10-10 Sony Uk Ltd A system for generating audio/video productions
WO2005027516A1 (fr) * 2003-09-15 2005-03-24 Carlan Investments Ltd. Procede et systeme pour produire et presenter un tournoi de golf
WO2007081877A1 (fr) * 2006-01-06 2007-07-19 Google Inc. Infrastructure de service multimédia dynamique
WO2007131342A1 (fr) * 2006-05-12 2007-11-22 Gill Barjinderpal S Liste de points de montage permettant la distribution de produits multimédia
US7702219B2 (en) 2000-04-05 2010-04-20 Sony United Kingdom Limited Audio and/or video generation apparatus and method of generating audio and/or video signals

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5889514A (en) * 1996-03-29 1999-03-30 International Business Machines Corp. Method and system for a multimedia application development sequence editor using spacer tools
US5903262A (en) * 1995-07-31 1999-05-11 Kabushiki Kaisha Toshiba Interactive television system with script interpreter
US5931679A (en) * 1995-03-30 1999-08-03 Brother Kogyo Kabushiki Kaisha Information provision system
US5953044A (en) * 1996-01-11 1999-09-14 Matsushita Electric Industrial Co., Ltd. Picture transmission system

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5931679A (en) * 1995-03-30 1999-08-03 Brother Kogyo Kabushiki Kaisha Information provision system
US5903262A (en) * 1995-07-31 1999-05-11 Kabushiki Kaisha Toshiba Interactive television system with script interpreter
US5953044A (en) * 1996-01-11 1999-09-14 Matsushita Electric Industrial Co., Ltd. Picture transmission system
US5889514A (en) * 1996-03-29 1999-03-30 International Business Machines Corp. Method and system for a multimedia application development sequence editor using spacer tools

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
JIMMY CHI-MING LAI: "Authoring and Delivering Networked Multimedia: Integrating ScriptX with the Web", 1 July 1995 (1995-07-01), pages 1-12, XP002925374 *

Cited By (10)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
GB2357890A (en) * 1999-10-05 2001-07-04 Sony Corp Image editing
GB2357890B (en) * 1999-10-05 2004-03-10 Sony Corp Image editing
US6957008B1 (en) 1999-10-05 2005-10-18 Sony Corporation Image editing apparatus and recording medium
GB2361097A (en) * 2000-04-05 2001-10-10 Sony Uk Ltd A system for generating audio/video productions
US7702219B2 (en) 2000-04-05 2010-04-20 Sony United Kingdom Limited Audio and/or video generation apparatus and method of generating audio and/or video signals
US8214858B2 (en) 2000-04-05 2012-07-03 Sony United Kingdom Limited Audio and/or video generation apparatus and method of generating audio and/or video signals
US9311962B2 (en) 2000-04-05 2016-04-12 Sony United Kingdom Limited Audio and/or video generation apparatus and method of generating audio and/or video signals
WO2005027516A1 (fr) * 2003-09-15 2005-03-24 Carlan Investments Ltd. Procede et systeme pour produire et presenter un tournoi de golf
WO2007081877A1 (fr) * 2006-01-06 2007-07-19 Google Inc. Infrastructure de service multimédia dynamique
WO2007131342A1 (fr) * 2006-05-12 2007-11-22 Gill Barjinderpal S Liste de points de montage permettant la distribution de produits multimédia

Also Published As

Publication number Publication date
AU5479399A (en) 2000-03-06

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AL AM AT AU AZ BA BB BG BR BY CA CH CN CU CZ DE DK EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MD MG MK MN MW MX NO NZ PL PT RO RU SD SE SG SI SK SL TJ TM TR TT UA UG US UZ VN YU ZA ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW SD SL SZ UG ZW AM AZ BY KG KZ MD RU TJ TM AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE BF BJ CF CG CI CM GA GN GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
REG Reference to national code

Ref country code: DE

Ref legal event code: 8642

122 Ep: pct application non-entry in european phase