WO2005018234A1 - Procede et systeme de fourniture de donnees de media - Google Patents

Procede et systeme de fourniture de donnees de media Download PDF

Info

Publication number
WO2005018234A1
WO2005018234A1 PCT/US2003/021947 US0321947W WO2005018234A1 WO 2005018234 A1 WO2005018234 A1 WO 2005018234A1 US 0321947 W US0321947 W US 0321947W WO 2005018234 A1 WO2005018234 A1 WO 2005018234A1
Authority
WO
WIPO (PCT)
Prior art keywords
subset
data
representation
service
quality
Prior art date
Application number
PCT/US2003/021947
Other languages
English (en)
Inventor
Antonio Ortega
Scott M. Darden
Asha Vellaikal
Zhourong Miao
Joseph Caldarola
Original Assignee
Citipati Partners, Llc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Citipati Partners, Llc filed Critical Citipati Partners, Llc
Priority to PCT/US2003/021947 priority Critical patent/WO2005018234A1/fr
Priority to AU2003251896A priority patent/AU2003251896A1/en
Priority to EP03818145A priority patent/EP1661401A4/fr
Publication of WO2005018234A1 publication Critical patent/WO2005018234A1/fr

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/173Analogue secrecy systems; Analogue subscription systems with two-way working, e.g. subscriber sending a programme selection signal
    • H04N7/17309Transmission or handling of upstream communications
    • H04N7/17318Direct or substantially direct transmission and handling of requests
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/132Sampling, masking or truncation of coding units, e.g. adaptive resampling, frame skipping, frame interpolation or high-frequency transform coefficient masking
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/157Assigned coding mode, i.e. the coding mode being predefined or preselected to be further used for selection of another element or parameter
    • H04N19/159Prediction type, e.g. intra-frame, inter-frame or bidirectional frame prediction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/18Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a set of transform coefficients
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/30Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using hierarchical techniques, e.g. scalability
    • H04N19/36Scalability techniques involving formatting the layers as a function of picture distortion after decoding, e.g. signal-to-noise [SNR] scalability
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/513Processing of motion vectors
    • H04N19/517Processing of motion vectors by encoding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/50Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding
    • H04N19/503Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using predictive coding involving temporal prediction
    • H04N19/51Motion estimation or motion compensation
    • H04N19/577Motion compensation with bidirectional frame interpolation, i.e. using B-pictures
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/65Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience
    • H04N19/66Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using error resilience involving data partitioning, i.e. separation of data into packets or partitions according to importance
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2343Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements
    • H04N21/234327Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving reformatting operations of video signals for distribution or compliance with end-user requests or end-user device requirements by decomposing into layers, e.g. base layer and one or more enhancement layers
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/234Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs
    • H04N21/2347Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving video stream encryption
    • H04N21/23476Processing of video elementary streams, e.g. splicing of video streams or manipulating encoded video stream scene graphs involving video stream encryption by partially encrypting, e.g. encrypting the ending portion of a movie
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/25Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
    • H04N21/254Management at additional data server, e.g. shopping server, rights management server
    • H04N21/2541Rights Management
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/60Network structure or processes for video distribution between server and client or between remote clients; Control signalling between clients, server and network components; Transmission of management data between server and client, e.g. sending from server to client commands for recording incoming content stream; Communication details between server and client 
    • H04N21/63Control signaling related to video distribution between client, server and network components; Network processes for video distribution between server and clients or between remote clients, e.g. transmitting basic layer and enhancement layers over different transmission paths, setting up a peer-to-peer communication via Internet between remote STB's; Communication protocols; Addressing
    • H04N21/631Multimode Transmission, e.g. transmitting basic layers and enhancement layers of the content over different transmission paths or transmitting with different error corrections, different keys or with different transmission protocols
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/835Generation of protective data, e.g. certificates
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/835Generation of protective data, e.g. certificates
    • H04N21/8355Generation of protective data, e.g. certificates involving usage data, e.g. number of copies or viewings allowed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/83Generation or processing of protective or descriptive data associated with content; Content structuring
    • H04N21/845Structuring of content, e.g. decomposing content into time segments
    • H04N21/8456Structuring of content, e.g. decomposing content into time segments by decomposing the content in the time domain, e.g. in time segments
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/16Analogue secrecy systems; Analogue subscription systems
    • H04N7/167Systems rendering the television signal unintelligible and subsequently intelligible
    • H04N7/1675Providing digital key or authorisation information for generation or regeneration of the scrambling sequence

Definitions

  • the present invention is concerned with techniques for transmitting and delivering media data in a communications network.
  • Communications networks e.g. the Internet
  • Such data is in digital form, and may represent a still image, a video sequence, or an audio sequence of voice or music, for example.
  • Rendering of time-based digital media such as audio, speech or video involves producing a series of outputs over a period of time, e.g., a video frame every 1/30-th of a second in the case of video NTSC output.
  • security measures are required for preventing contents from becoming available at intermediate points and for restricting access to the data to those who have been properly authorized.
  • Such measures may include encryption as well as partitioning of data for transmission of the individual parts over distinct paths.
  • parts are chosen so that no one of them provides a good approximation to the original content, thus requiring complete assembly of all parts for content to become available at good quality to an authorized recipient.
  • Similar concerns arise with respect to security and privacy in communications over an open network, especially in real-time communication between users and in real-time broadcasting of potentially sensitive information.
  • Data representing media content tends to be voluminous, and its protection typically entails considerable complexity and transmission costs: in encryption, in transmission over a secure server, and in the use of streaming in preference to downloading to make copying more difficult.
  • Summary of the Invention Techniques of the invention can provide for protection of media data at reduced cost in that protective measures may be applied to one or several data subsets having a comparatively small size. The subset(s) can be selected such that protection of the complementary bulk of the data becomes less important.
  • time-based formats are MP3 audio and MPEG2 audio and video.
  • a data set/sequence representing the title is partitioned into smaller subsets/subsequences, each having a specific access control mechanism, e.g., download-only, streaming-only or download-or-stream, and each being associated with reconstruction logic information for reassembly of the original title.
  • a first of the subsets is chosen relatively large, yet to convey a degraded-quality representation of the title, and the others for complementing the first subset to obtain a superior-quality representation.
  • Fig. 1 is a schematic of operation of an authoring module.
  • Fig. 2 is a schematic of different partitioning of a media sequence.
  • Fig. 3 is a schematic of a relationship between authoring module, metadata server and content distribution system.
  • Fig. 4 is a schematic of interaction between a client and a content distribution cloud.
  • Fig. 5 is a schematic of operation of a media distribution service.
  • Fig. 6 is a schematic of processing at a client.
  • Fig. 7 is a schematic of an encoder for data compression.
  • content may be delivered in different forms, to different types of terminals.
  • one terminal may be a closed system, e.g. a TV set-top box, and another an open system such as a PC.
  • the distribution mechanism need not be uniform, as it may include means such as direct satellite broadcasting, Internet streaming, or distribution of content in a recorded physical medium such as a disk, for example.
  • a content owner or service provider will seek to secure the content in all cases and regardless of differences in delivery formats and means.
  • There are three aspects of delivery systems which are considered as particularly important.
  • decoder means should be inexpensive, as with set-top boxes or other hardware to be bought by the end user, and decoding should be fast, e.g.
  • the third aspect is with content protection desirably offering easy preview of content.
  • O(LAN) For encryption of N bits of data, the over-all complexity is O(LAN) which can be substantial.
  • O(LAN) For example, in the RSA cryptographic algorithm (named after its originators Rivest, Shamir and Adleman), encryption and decryption involve exponentiation by a large integer followed by a modulo operation.
  • input data is processed to extract content-dependent key information whose length in bits here is denoted by L'.
  • a compression system involves one encoder and at least one decoder.
  • the encoder generates a compressed representation of the signal that can be read by the decoder to produce an approximation to the input signal.
  • Open international standards developed to date including such broadly used standards as MPEG-2 or JPEG, specify the syntax of the bit stream exchanged by encoder and decoder. Their goal is to maintain inter-operability between encoders and decoders of different vendors.
  • the standards prescribe how the decoder should interpret a bit stream, and the operations to be performed on the data to reconstruct the signal.
  • the standards documents do not specify how the encoding process needs to be performed, only how the encoder can represent the information to be sent to the decoder.
  • many different encoding strategies can be employed while still preserving syntax compatibility with a standard decoder, e.g. as based on the MPEG-2 standard, in the sense of the decoder being able to decode the information.
  • a cryptographic algorithm customarily is applied directly to the compressed stream, which then is no longer syntax-compatible.
  • a preview of the media stream requires a separate, low-resolution file.
  • Known techniques for providing the latter result in inefficiency, e.g., when a so-called scalable video coding is used, particularly when the preview is distributed first and the rest of the signal is provided only to those end users that have been granted access. In this case, the preview containing a small portion of the data is pre-positioned, i.e., available to the end user, and a large amount of additional data has to be transmitted to view the signal at the higher resolution.
  • the additional data has the most value, and its encryption and decryption will be costly.
  • two data sets are derived from an original compressed video, namely a base layer with rate Rb and an enhancement layer with rate Re, where Rb « Re.
  • the enhancement layer has to be fully encrypted, or else a selective encryption would have to be performed to guard against decoding of the enhancement.
  • a low-resolution preview file in fact contains most of the data, typically by a wide margin, so that Rb » Re using the above notation.
  • the low-resolution file contains at least 80% or even 95% of the total data.
  • information can be removed at different stages of encoding, desirably while preserving syntax compatibility.
  • Privacy/Security in Network Communications Characteristics of a reverse-scalable system are advantageous also for security and privacy in communications over an open network such as the Internet, especially in real-time applications such as VoIP (Voice over Internet Protocol), video conferencing, and real-time broadcasting where potentially sensitive information may be transmitted.
  • VoIP Voice over Internet Protocol
  • video conferencing video conferencing
  • real-time broadcasting where potentially sensitive information may be transmitted.
  • a coiporation will have reservations against use of an open network for conducting remote business meetings.
  • VPN virtual private network
  • content can be separated into subsequences at the source, or at network gateways.
  • a relatively small subsequence can be formed to convey key information, while a much larger subsequence provides a low-quality approximation to the signal which by itself may be essentially useless.
  • Fig. 1 shows an authoring module 10 accepting media sequence input and additional parameter input, and generating N subsequences along with respective reconstruction logic.
  • the authoring module 10 serves for analyzing/splitting the input sequence into the subsequences, and it generates the reconstruction logic for recombining some or all of the subsequences after their delivery.
  • the reconstruction logic may be bundled together with the subsequences, or, in a different implementation, they can be made available independently.
  • the subsequences can be delivered in various ways, such as downloading or streaming, for example. In downloading, a data file is delivered in its entirety prior to its use, e.g. by direct server-to-client transfer, by transfer to the client from a third processor acting as caching proxy, or by a transfer from another user in possession of the file as in a so-called peer-to-peer network.
  • variants (a), (b) and (c) for analyzing/splitting a time-based media sequence into subsequences 1 and 2.
  • the sequence is to be split into two subsequences.
  • splitting is by a switch 21 for allocating bits to one or the other of the subsequences based on a chosen splitting scheme which does not rely on knowledge of any coding format adhered to in the sequence 20.
  • the analyzer utilizes certain l ⁇ iown coding information. Such information may enable splitting such that at least one of the subsequences may be playable to some extent, albeit abbreviated or at a lower quality, for example.
  • codec parser 22 the media sequence is first parsed based on the information, and the parsed sequence is then split by a switch 23.
  • Variant (c) provides for splitting by analyzer 24 without any further partitioning prior to delivery. In each of the cases, the original sequence and/or any of the subsequences may be enciypted, then requiring corresponding decryption after delivery.
  • Fig. 3 shows authoring module 31 generating media subsequences being delivered to a content distribution system 32 serving three destinations where content may be displayed or archived, for example. The authoring module 31 further generates metadata which are forwarded to a metadata server 33. Metadata provide information which is ancillary to the actual subsequence bit streams, e.g.
  • subsequence file identification unique naming is straightforward in a centrally controlled system, by following a chosen format assigned to a media title.
  • a name server can act as a mechanism for preventing subsequence identifier conflicts.
  • the authoring module then can query the name server for unique identifiers for each new media title to be introduced into the distributive system.
  • different sequences may be specified for downloading or streaming, for example, and for downloading there may be a specification of priority of one subsequence over another.
  • Fig. 4 illustrates a specific instantiation of interaction in a media distribution service, between a client 41 and a content distribution facility or Aclouds 42.
  • Such a service can utilize existing media distribution systems and infrastructures, e.g. the Internet.
  • certain supporting systems are provided.
  • a client is considered who requests audio playback of a media title A.
  • Aplay the client receives a reply from the media distribution service including instructions for deciphering the subsequence access information and for reconstruction of the title from the subsequences.
  • the reply designated as Media Title Locator (MTL)
  • MTL Media Title Locator
  • MSRL Media Subsequence Resource Locator
  • the MSRL of a subsequence includes media server information, access protocol information, and the subsequence file name. In case the reconstruction logic is not bundled with the subsequence, the MSRL also includes information for accessing the corresponding logic.
  • FIG. 5 more specifically illustrates operation of a media distribution , service.
  • a client computer 51 submits its request for a media title to the media distribution response server 52.
  • the response server 52 queries the metadata server 53 to retrieve one or several required subsequences, logic and any other relevant information.
  • the response server 52 queries the directory service 54 for the locations of the individual subsequences/logic, with the directory service 54 possibly having intelligence for choosing optimal locations in a case where a subsequence can be obtained from different locations.
  • the response server 52 combines the information to form the MTL and sends it to the client 51.
  • the client 51 parses the MTL and opens connection with the individual media servers 55 to obtain the subsequences/logic.
  • File distribution can involve an available mechanism, e.g. client-server distribution, edge-of-network/push caching, pull caching as in standard passive caching and peer-to-peer file sharing. Such mechanisms may be categorized as being of one of two types, media server versus directory service.
  • the role of a media server, central or distributed, includes storing the subsequences and serving or streaming them on demand to any requesting client.
  • a central server can be used in a service where a user can download or stream directly from a main service provider in a client-server mode.
  • a file sharing service replicates a file at multiple locations at Aedges ⁇ , such as mirrors and proxies of the network, but in a more controlled fashion as compared with peer-to-peer.
  • a further example for a server includes cache proxies in the network for storing recently requested information.
  • Subsequences can be made available as downloaded or streamed, and, in streaming, multicasting can be provided for multiple users to listen to the same song, for example.
  • subsequences When subsequences are not encrypted or otherwise constrained, they can be copied to disk, transferred, removed, backed up, and the like.
  • a directory service a listing/record is maintained of the locations of the individual subsequences, for supporting a querying interface.
  • the directory service can be a simple database on a single server.
  • a decentralized service e.g. peer-to-peer, multiple users can each have a copy of the same file, with availability varying as a function of who is logged on.
  • a media distribution system as illustrated above can be implemented for a variety of uses including an online media locker, online jukebox, chat room, or advertising means, for example.
  • an online media locker users can store digital media content they already own, e.g. from a CD, enabling the user to access the content from any computer connected to the network. If allowed by the content provider, storing may be of the fully reconstructed file, or of a lower-quality version. Also, an original file may be split into one portion which can be downloaded, and another to be obtained by streaming from a server whenever playback is desired.
  • subscribers have access to music in a database.
  • Users may obtain portions of music for downloading, of a quality which is inadequate for satisfactory playback and requiring a complementary portion which may be obtained by relatively low-rate streaming.
  • streams can be multicast such that the streams trigger playback. From users, comments may be overlayed on the music.
  • Other media e.g. images or video can be distributed correspondingly, with partitioning of original data based on the technique of the invention.
  • application of the technique is not limited to visual and auditory media, but can extend to tactile media, for example.
  • Compression as exemplified below is used in numerous media applications such as speech, audio and video, including DVDs, broadcasting by cable or satellite, and streaming video over the Internet.
  • a media sequence can be partitioned in the course of compression-encoding prior to transmission, under standards such as MP3, MPEG-2 or MPEG 4, into subsequences which can be decoded by a conesponding standard decoder.
  • a media sequence that is already in compressed format can be read and parsed for partitioning.
  • the partitioning can be effected during compression, or later on the compressed stream.
  • recombination can be effected in the course of decoding or before.
  • Fig. 7 shows a basic structure for a compression system, including a motion estimation module 71, a motion compensation module 72, a transform and quantization module 73, a reconstructed-frame memory 74, a DC differential pulse code modulator (DPCM) 75, an AC zig-zag run length module 76, and variable-length coders 77 and 78.
  • Functionality of modules involves the following: DCT Transform. From input blocks of pixels, typically blocks of size 8 by 8, the same number of samples of the frequency contents of each block are generated, typically called frequency coefficients.
  • This transform is invertible, so that, if no further processing is performed, the original pixel values can be recovered by applying an appropriate inverse transform.
  • the output coefficients represent the information content of each input signal in the frequency domain.
  • the transforms provides significant energy compaction, i.e., for typical images the number of frequency coefficients needed to provide a good representation of the signal tends to be small. While DCT (Discrete Cosine Transform) is prevalent in current data compression techniques, use of other transforms, e.g. wavelet transforms, is not precluded. Wavelet transforms are used in the JPEG 2000 standard and are likely to find use in video coders. Quantization.
  • the frequency coefficients which can take any real values within an interval that is known beforehand, are quantized to the closest among a discrete set of values. Thus, the quantized coefficients take only a discrete set of values. This operation is lossy, amounting to a many-to-one mapping, and cannot be inverted. Entropy Coding. Also l ⁇ iown as variable-length coding, the quantized values are represented by specific variable-length codes designed such that the most likely quantized values are assigned shorter code lengths. This process is lossless, i.e., the mapping from quantized value to code is one-to-one, and once the coded symbols have been concatenated for transmission there is a unique way of parsing the bit stream to obtain the individual codes. Motion Estimation and Compensation.
  • Motion estimation is only performed at the encoder, while motion compensation is performed at both encoder and decoder.
  • Motion estimation consists of dividing a current frame into blocks, typically 16 by 16 pixels in size, and for each block finding the block in a specified region in the previous frame that most closely matches it. If the match is sufficiently good then a motion vector is transmitted and the difference between the block in the current frame and the best-match block from the previous frame will be compressed and transmitted. Otherwise, if the match is not sufficiently good, the block in the current frame will be compressed directly, without subtraction of any information from the previous frame.
  • the encoder can be modified for removing information in the course of encoding. Then, after such partial removal of information, the output from the encoder can be taken as a first subsequence, with the removed portion as a complementary subsequence.
  • an analyzer filter can be formed for combining partial decoding of a compressed file with removal of information for partitioning. For removal of information it is desired to adhere to syntax-compatible strategies in the sense described above, for at least one of the subsequences, typically the one containing the bulk data. The following are examples of such strategies: AC Coefficients.
  • the AC coefficients are coded with a variable length code (VLC) where two dimensions are encoded, run-length and magnitude. Escape codes are used for all values outside the table of codes.
  • VLC variable length code
  • Escape codes are used for all values outside the table of codes.
  • selected AC coefficients are set to a specified value, e.g. zero, with the actual value placed in a supplementary stream. Because of the run lengths, if any coefficient is changed to zero, as the run of zeros now has become longer, this alters not only its code but also the code before it. Therefore, some form of re-encoding of the block may be needed. Zeroing the AC coefficients distorts the image texture, leaving an unappealing, though recognizable sequence. Another option is to set the sign of certain AC coefficients to be positive.
  • a third option is to remove the coefficient and replace it with a number indicating the distance to the next bits that will be changed.
  • a fourth option is to remove the coefficient without replacement, resulting in a shift, by one, of all the following coefficients in the zig-zag scan.
  • the AC coefficients are appealing for encoding a distance because their range is from -2048 to 2048 (but not zero). This is long enough in most cases, and, if need be, two coefficients can be used to code the jump, making for 24-bit distance.
  • Replacing coefficients has the advantage of a recombination filter of low complexity, as recombination can skip bits without parsing them, but provides less control over the quality of a resulting first subsequence.
  • Removing coefficients without replacement has complexity similar to zeroing, with a potential advantage in introducing significant quality degradation even if just a single coefficient is removed.
  • the following are among options: (i) the N largest coefficients of a block, (ii) the first N coefficients of a block, and (iii) the coefficients that provide the maximum average energy over a subset of the sequence, e.g. a frame or a group of pictures (GOP).
  • a further possibility lies in selection of blocks for coefficient removal, e.g.
  • DC coefficients are differentially coded, with a new sequence starting at the begimiing of each line of 8 by 8 blocks. The first value of a new line is between 0 and 255. The difference values range form -255 to 255. All coefficients are variable-length coded. Altering the DC vectors impairs the fidelity of the video and has a number of options. Zeroing the vectors produces marginally acceptable video in which some of the video contents can be made out.
  • altering the signs of the DC differentials is another choice. Because the range of differential vectors is from -255 to 255, if all signs are set in one direction, e.g. all positive, or even if a single sign is changed on a chosen line, the predicted DC value of a block can be outside the 0-255 range. Thus, an attacker might be able to determine that a particular sign should be changed when out-of-range coefficients are observed. Many practical media players simply truncate the DC values if they fall out of range.
  • a further option involves eliminating the MV temporal direction information for each block of a B-frame.
  • a code table can be used to represent each of the possible values of this information, along with other parameters.
  • the code that is used is not unique but depends on the values taken by the other parameters encoded.
  • one multi- bit code has to be replaced by another where the temporal direction has been modified.
  • Other options that are applicable to most of the above include coding only P-frames or only one of the MV directions. Combination of Techniques. It is feasible to change multiple elements of the bit stream. Not only will this reduce the quality, but may also make unauthorized decoding more difficult, thus enhancing security. Also, certain techniques as described above can be combined with known scrambling techniques.
  • the supplemental stream then contains only the coefficients packed together, and the filter can run through the file quickly, swapping and skipping.
  • a seek-and-swap method one concern arises from the fact that the number of bits allowable for skipping actually depends on what the MPEG format permits for a specific coefficient.
  • the DC coefficients are used then there is a range of 256 values for unpredicted blocks, and of 512 for predicted blocks. Therefore, skipping is constrained to 256 or 512 bits. While most block sizes are considerably less than 256 bits, larger blocks may have to be accommodated. AC coefficients may present a better choice, as they provide 4096 bits of precision.
  • a technique of the invention can provide for removal of either DC or AC information, selectively depending on the degree of image degradation desired for a scene in a video sequence. Such dependence can be selected to advantage for providing a file with a preview functionality, for example. When informed as to which scenes have been degraded lightly, a user can view these scenes as a preview of the complete sequence.
  • Optimization of Data Removal As described, the level of protection offered by our techniques depends on the number of values that can result in a valid bit stream in the position where data has been removed. For example, when a differentially encoded DC coefficient is removed, there are exactly 512 different values possible in that position. To reconstruct the original compressed signal, a hacker would have to guess which among the 512 possible values is the right one.
  • the DC sign when the DC sign is set to Aalways positives, Ri equals one bit per DC coefficient, Di depends on both the change in the coefficient itself and on the subsequent DC coefficients which will be affected in differential coding, and Ni equals 2.
  • the goal can be the maximizing of a weighted cost function, namely Di + k ⁇ Ni, where k is a Lagrange multiplier allowing emphasis on distortion or level of protection in optimization.
  • the number of "good" options can be approximated by the entropy of the random variable representing them. For example, when a sign is removed, the two options typically have the same likelihood.
  • the level of protection achieved is directly related to the number of bits required to represent the information that was removed, and an approximate formulation for optimization calls for maximizing Di + k ⁇ Ri, where a large value is desirable for Ri in that it represents a large number of options to be chosen in order to restore the data.
  • Techniques of the invention can be implemented in software, for a processor to follow program instructions stored in memory. It was with such an implementation that experimental results were obtained as presented below. For commercial use, in the interest of enhanced efficiency, use of firmware and/or dedicated hardware is indicated, e.g. at a network gateway. Whether by hardware, firmware or software, the techniques can be implemented as separate modules, or their functions can be integrated in encoder and decoder modules.
  • the size of the bulk file remains close to the size of the original file, and the supplemental streams are small by comparison. Even the supplemental file including the most information, namely the one conveying selected AC coefficients, remains less in size than 10% of the original file.
  • respective supplemental files of approximately 1% and 0.2% in size are noted. Techniques including removal of the DC coefficients or their sign result in highly degraded video sequences. Upon removal of the AC coefficients or their sign there results a moderate decrease in quality.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Security & Cryptography (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

Dans la communication de média, par exemple dans la fourniture de données vidéo et audio ou dans la téléconférence, un fichier ou titre de média divisé en vue d'une livraison sure et efficace. Un séquence de données représentant le fichier est divisée en sous-séquences plus petites, une première sous-séquence étant choisie pour acheminer une représentation à qualité dégradée des données et les autres séquences étant choisies pour compléter la première sous-séquence de manière à obtenir une représentation de qualité supérieure. Les domaines d'application dudit procédé sont les média à la demande, les communications et la diffusion en temps réel et les services en ligne de distribution de média par casier (locker), de jukebox et de messagerie en temps réel.
PCT/US2003/021947 2003-07-15 2003-07-15 Procede et systeme de fourniture de donnees de media WO2005018234A1 (fr)

Priority Applications (3)

Application Number Priority Date Filing Date Title
PCT/US2003/021947 WO2005018234A1 (fr) 2003-07-15 2003-07-15 Procede et systeme de fourniture de donnees de media
AU2003251896A AU2003251896A1 (en) 2003-07-15 2003-07-15 Method and system for delivering media data
EP03818145A EP1661401A4 (fr) 2003-07-15 2003-07-15 Procede et systeme de fourniture de donnees de media

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/US2003/021947 WO2005018234A1 (fr) 2003-07-15 2003-07-15 Procede et systeme de fourniture de donnees de media

Publications (1)

Publication Number Publication Date
WO2005018234A1 true WO2005018234A1 (fr) 2005-02-24

Family

ID=34192483

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2003/021947 WO2005018234A1 (fr) 2003-07-15 2003-07-15 Procede et systeme de fourniture de donnees de media

Country Status (3)

Country Link
EP (1) EP1661401A4 (fr)
AU (1) AU2003251896A1 (fr)
WO (1) WO2005018234A1 (fr)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113949848A (zh) * 2021-10-18 2022-01-18 重庆紫光华山智安科技有限公司 数据传输方法、装置、设备及介质

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20090135444A1 (en) * 2007-11-26 2009-05-28 Steven Francis Best Method to protect sensitive data fields stored in electronic documents

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5526131A (en) * 1992-12-01 1996-06-11 Samsung Electronics Co., Ltd Data coding for a digital video tape recorder suitable for high speed picture playback
US5991276A (en) * 1996-11-19 1999-11-23 Fujitsu Limited Videoconference system
US6233017B1 (en) * 1996-09-16 2001-05-15 Microsoft Corporation Multimedia compression system with adaptive block sizes
US6404738B1 (en) * 1998-01-21 2002-06-11 Nec Usa, Inc. Dynamic network bandwidth allocation for multimedia applications with soft quality-of-service requirements
US20030079222A1 (en) * 2000-10-06 2003-04-24 Boykin Patrick Oscar System and method for distributing perceptually encrypted encoded files of music and movies

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2660139B1 (fr) * 1990-03-23 1995-08-25 France Etat Procede de codage et de transmission a au moins deux niveaux de qualite d'images numeriques appartenant a une sequence d'images, et dispositifs correspondants.
US5235420A (en) * 1991-03-22 1993-08-10 Bell Communications Research, Inc. Multilayer universal video coder
JP3288898B2 (ja) * 1995-05-31 2002-06-04 株式会社東芝 ディジタルテレビジョン放送システム
US6275988B1 (en) * 1995-06-30 2001-08-14 Canon Kabushiki Kaisha Image transmission apparatus, image transmission system, and communication apparatus
JPH09205630A (ja) * 1996-01-26 1997-08-05 Matsushita Electric Ind Co Ltd テレビジョン信号送信・受信方法、及びテレビジョン信号送信・受信装置

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5526131A (en) * 1992-12-01 1996-06-11 Samsung Electronics Co., Ltd Data coding for a digital video tape recorder suitable for high speed picture playback
US6233017B1 (en) * 1996-09-16 2001-05-15 Microsoft Corporation Multimedia compression system with adaptive block sizes
US5991276A (en) * 1996-11-19 1999-11-23 Fujitsu Limited Videoconference system
US6404738B1 (en) * 1998-01-21 2002-06-11 Nec Usa, Inc. Dynamic network bandwidth allocation for multimedia applications with soft quality-of-service requirements
US20030079222A1 (en) * 2000-10-06 2003-04-24 Boykin Patrick Oscar System and method for distributing perceptually encrypted encoded files of music and movies

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP1661401A4 *

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113949848A (zh) * 2021-10-18 2022-01-18 重庆紫光华山智安科技有限公司 数据传输方法、装置、设备及介质

Also Published As

Publication number Publication date
EP1661401A1 (fr) 2006-05-31
AU2003251896A1 (en) 2005-03-07
EP1661401A4 (fr) 2006-08-02

Similar Documents

Publication Publication Date Title
US9350782B2 (en) Method and system for delivering media data
US10277656B2 (en) Method and system for delivering media data
US7006631B1 (en) Method and system for embedding binary data sequences into video bitstreams
KR100736080B1 (ko) 다 계층으로 구성된 멀티미디어 스트림의 저작권을 계층별로 관리하는 방법 및 장치
US8731193B2 (en) Partial encryption using variable block sizes
EP2596643B1 (fr) Agencement de fragments de piste secondaire pour une diffusion en continu de données vidéo
EP1529262B1 (fr) Filigranage numerique et dactyloscopie de contenu numerique a l'aide de blocs alternatifs pour incorporer des informations
EP2735166B1 (fr) Transmission de données de reconstruction dans une hiérarchie de qualité de signal échelonnée
US7057535B2 (en) Methods for scaling encoded data without requiring knowledge of the encoding scheme
US6989773B2 (en) Media data encoding device
US8055974B2 (en) Content distribution method, encoding method, reception/reproduction method and apparatus, and program
Spanos et al. Security for real-time MPEG compressed video in distributed multimedia applications
US20070038578A1 (en) Method and system for digital content distribution
FR2860122A1 (fr) Embrouillage, desembrouillage et distribution securisee de sequences audiovisuelles issues de codeurs videos bases dct
Thomas et al. A novel secure H. 264 transcoder using selective encryption
Iqbal et al. Compressed-domain video processing for adaptation, encryption, and authentication
Su et al. A practical design of content protection for H. 264/AVC compressed videos by selective encryption and fingerprinting
EP1661401A1 (fr) Procede et systeme de fourniture de donnees de media
US9038096B2 (en) System and method of adaptive and progressive descrambling of digital image content
Sharma et al. Commutative encryption based video encoding technique with high-efficiency & video adaptation capabilities
Iqbal et al. A framework for MPEG-21 DIA based adaptation and perceptual encryption of H. 264 video
US6345120B1 (en) Image processing system, image data transmission and reception apparatus, and image processing method
Momoh et al. A Comparative Analysis of Video Compression Standard and Algorithms: State of the Art
Challa et al. Towards video secure streaming-feasibility study of using an obscuring algorithm in conjunction of H. 264 encoding and compression
Iqbal et al. Secured MPEG-21 digital item adaptation for H. 264 video

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AE AG AL AM AT AU AZ BA BB BG BR BY BZ CA CH CN CO CR CU CZ DE DK DM DZ EC EE ES FI GB GD GE GH GM HR HU ID IL IN IS JP KE KG KP KR KZ LC LK LR LS LT LU LV MA MD MG MK MN MW MX MZ NI NO NZ OM PG PH PL PT RO RU SC SD SE SG SK SL SY TJ TM TN TR TT TZ UA UG UZ VC VN YU ZA ZM ZW

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): GH GM KE LS MW MZ SD SL SZ TZ UG ZM ZW AM AZ BY KG KZ MD RU TJ TM AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HU IE IT LU MC NL PT RO SE SI SK TR BF BJ CF CG CI CM GA GN GQ GW ML MR NE SN TD TG

121 Ep: the epo has been informed by wipo that ep was designated in this application
WWE Wipo information: entry into national phase

Ref document number: 2003818145

Country of ref document: EP

WWP Wipo information: published in national office

Ref document number: 2003818145

Country of ref document: EP

NENP Non-entry into the national phase

Ref country code: JP