WO2001063936A1 - Compressed video bitstreams processing - Google Patents

Compressed video bitstreams processing Download PDF

Info

Publication number
WO2001063936A1
WO2001063936A1 PCT/GB2001/000724 GB0100724W WO0163936A1 WO 2001063936 A1 WO2001063936 A1 WO 2001063936A1 GB 0100724 W GB0100724 W GB 0100724W WO 0163936 A1 WO0163936 A1 WO 0163936A1
Authority
WO
WIPO (PCT)
Prior art keywords
bitstream
bit rate
mode
storage
transcoded
Prior art date
Application number
PCT/GB2001/000724
Other languages
French (fr)
Inventor
Bruce Fairbairn Devlin
Michael James Knee
Original Assignee
Snell & Wilcox Limited
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Snell & Wilcox Limited filed Critical Snell & Wilcox Limited
Priority to AU37538/01A priority Critical patent/AU3753801A/en
Publication of WO2001063936A1 publication Critical patent/WO2001063936A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/107Selection of coding mode or of prediction mode between spatial and temporal predictive coding, e.g. picture refresh
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/147Data rate or code amount at the encoder output according to rate distortion criteria
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/146Data rate or code amount at the encoder output
    • H04N19/152Data rate or code amount at the encoder output by measuring the fullness of the transmission buffer
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/162User input
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/172Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a picture, frame or field
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/17Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object
    • H04N19/176Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being an image region, e.g. an object the region being a block, e.g. a macroblock
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/40Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using video transcoding, i.e. partial or full decoding of a coded input stream followed by re-encoding of the decoded output stream
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/60Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
    • H04N19/61Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440254Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering signal-to-noise parameters, e.g. requantization
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N9/00Details of colour television systems
    • H04N9/79Processing of colour television signals in connection with recording
    • H04N9/7921Processing of colour television signals in connection with recording for more than one processing mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/76Television signal recording
    • H04N5/78Television signal recording using magnetic recording
    • H04N5/782Television signal recording using magnetic recording on tape
    • H04N5/783Adaptations for reproducing at a rate different from the recording rate

Definitions

  • This invention relates to compressed bitstreams and in the most important example to the processing of compressed bitstreams prior to storage.
  • the television viewer has, through the video cassette recorder (VCR) become accustomed to time-shifting broadcast television programmes that is to say recording a television programme off air for viewing at a more convenient time.
  • the VCR has also created an alternative distribution channel for video content in the form of the sold or hired video tape.
  • the VCR has been developed to offer a range of additional features. Typically, a video tape can be played at an increased speed with a recognisable picture still being displayed, to enable particular material to be located. Advanced VCRs will often provide the capability of freeze-framing or of stepping frame-by-frame through selected scenes. Another feature that is frequently available is that of "long play", where the user elects to compromise the recording quality in return for a longer recording interval on a given length of tape.
  • the present invention consists in one aspect in a method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream; establishing a plurality of user-defined storage modes, comprising at least two of a long play mode, having a lower bit rate than the received bitstream, a compress-to-fit mode, having a bit rate related to available storage, and a rapid access mode, having an increased number of non-predicted frames; on user selection of a given one of said storage modes, transcoding the received bitstream to a transcoded bitstream having a different bit rate, reflecting the selected storage mode; and storing the transcoded bitstream.
  • the present invention consists in another aspect in a method of processing a compressed digital video bitstream to facilitate rapid access, comprising the steps of: receiving a bitstream; transcoding the received bitstream to a transcoded bitstream having a higher bit rate and less prediction; and storing the transcoded bitstream.
  • the present invention consists in yet another aspect in a method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream having a constant bit rate; establishing a plurality of user- defined storage modes; on user selection of a given storage mode, determining a fixed quality level reflecting that storage mode; transcoding the received bitstream, employing the fixed quality level, to a transcoded bitstream having a variable bit rate; and storing the transcoded bitstream.
  • Figure 1 is a block diagram illustrating an embodiment of the present invention
  • FIG. 2 is a block diagram of one form of the transcoder shown in Figure 1;
  • FIG. 3 is a block diagram of another form of the transcoder shown in Figure 1.
  • a compressed bitstream is received at terminal 100 and, instead of passing directly to the store 102 is received by a transcoder 104.
  • the user is provided the option of selecting storage modes through mode selection block 106.
  • One such storage mode is "long play" and upon selection of this mode, the transcoder receives from the mode selection block a control signal.
  • the transcoder operates to transcode the received bitstream to a transcoded bitstream having a selected lower bit rate. In this way, the programme material represented by the received bitstream can be accommodated by using a lesser portion of the store 102.
  • Another storage mode is "compress-to-fit" mode, for use where only a certain amount of storage remains.
  • a user interface is provided, enabling the user to instruct recording of a given television programme.
  • the arrangement according to this example of the invention can operate to determine first the amount of available storage capability in the store 102. If this is less than that required to record the entire programme at the broadcast bit rate, the mode selection block 106 can recalculate a bit rate at which the entire programme can be recorded. This recalculated bit rate is provided to the transcoder which performs the transcoding operation as before.
  • the bit rate calculation may be performed by estimating the total amount of information to be recorded. This estimate may be made by calculating an approximate average bit rate of the material being broadcast, typically over a short period of the broadcast, and using programme length information transmitted with the broadcast.
  • the user can select to store the received bitstream at the lowest bit rate consistent with a defined quality of video image.
  • the bitstream can be analysed and a quality measure determined. This measure can be produced without reference to the source picture, in contrast to conventional methods of quality measurement, such as PSNR.
  • the transcoder will in this storage mode reduce the bit rate of the transcoded bitstream until the quality measure reaches a defined threshold.
  • this measure is derived using the quantisation levels either detected in the bitstream, or passed from a previous encoding. The quality level may therefore also be defined by the use of a fixed quantisation level.
  • the input bitstream has a constant bit rate
  • this will produce a variable bit rate output stream.
  • variance of the output bit rate is not forbidden.
  • This permits several advantageous features. For example, in a long play mode, a user may select a certain quality level with which they are satisfied, in order to increase recording capacity. This quality level will equate to a particular quantisation level, rather than a simple "halving" of the quality, as is common with VCR long play modes.
  • several quality levels may be selected for different types of material.
  • a finer quantisation may be selected for recording a film than for recording a documentary, which may be finer than that selected for recording a game show.
  • a maximum quality may also be specified, by selecting a quantisation no finer than a certain level. This may save storage capacity, as the recording will not be of a higher quality (and hence, bit rate) than that perceivable by the viewer.
  • bit rate either calculated or selected may be varied during recording, should the viewer recognise that a lower quality recording may be sufficient for the particular programme.
  • the transcoder 104 operates in accordance with the disclosure of EP 0 765 576 and EP 0 913 058.
  • an incoming bitstream can be decoded to an uncompressed or less compressed bitstream, with the coding decisions being extracted.
  • the uncompressed or less compressed signal can then be re-encoded at the required bit rate utilising the same coding decisions, so as to minimise or remove losses in re-encoding.
  • These coding decisions may accompany the bitstream in an information bus signal, or the information may be embedded in the bitstream itself.
  • the transcoding operation can also usefully take advantage of the disclosure of EP 0 963 657.
  • the mode selector 106 permits selection of a mode in which the transcoder operates to increase bit rate.
  • the received bitstream will typically take the form of a long GOP MPEG bitstream, that is to say a bitstream in which there are relatively few intra- coded or I frames and a relatively large number of predicted frames which may take the form of forward predicted frames (P frames) or both forward and backward predicted frames (B frames). If such a long GOP bitstream is written directly to store, rapid access operations, such as a high-speed search for picture material, freeze frame and so on will be complex and time consuming. Unless the available functionality is to be confined to the relatively infrequent I frames, some form of decoding operation will be required within each trick mode function.
  • this difficulty is overcome, where storage capacity permits, by modifying the received bitstream before it is written into the store.
  • the received bitstream is modified by reducing the amount of prediction coding.
  • identified P frames in the received bitstream will be decoded and re-encoded as I frames.
  • the incoming bitstream is received by a controller 200. This is under the control of the mode selection block 106. In dependence upon user preference, aspects of the bitstream and the amount of available storage, it is elected to convert all or a given proportion of P frames into I frames.
  • the controller passes some or all P frames to a decoder 202 where, with information of course from the associated I frame, a decompressed picture signal is derived.
  • I frame encoder 204 This is re-encoded in I frame encoder 204 to provide a sequence of I frames which are multiplexed in multiplexer 206 with the remaining, undisturbed frames, to form a transcoded bitstream output.
  • coding decisions are forwarded from the decoder 202 to the I frame encoder 204 (which may select or use only those required), and to the multiplexer 206.
  • the coding decisions may also be stored, in order to enable the lossless reversal of the re-encoding if required at a later stage.
  • the controller passes to the decoder those P frames which are in defined positions in the GOP structure. For example, in a typical structure, between each I frame, there are interposed three P frames, with intervening B frames. The controller might therefore pass every P frame falling in the middle of such a structure.
  • the controller passes all P frames to the decoder, in order to produce an "I frame only" bitstream.
  • the storage space available may not accommodate such a large amount of information, and thus the transcoder may be instructed to re-encode the l-frames at a lower resolution, in order to save bits, but nevertheless produce a fully intra-coded bitstream.
  • the P frames which are passed to the decoder are those falling in areas of the original material where a substantial amount of motion is present.
  • the bitstream is passed as usual to the controller 200, and is also passed to a monitor of the motion vector bit rate 300. Where there is a high motion vector bit rate, the material will contain significant motion.
  • the output of block 300 is passed to the controller 200, and the related P frames having the high motion vector bit rate are passed to the decoder 202.
  • the "cost" in bit rate terms of converting such P frames to I frames will be less than that of converting P frames where there is little or no motion.
  • the number of intra-coded macroblocks within a picture is increased by a defined proportion.
  • a given number of P type macroblocks will be converted to I type macroblocks.
  • This feature has particular advantage in those trick modes which utilise a portion only of any one picture.
  • a recognisable picture is provided in a search mode by combining respective different picture regions from different incoming frames.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Television Signal Processing For Recording (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

In compression of bitstreams prior to storage, typically on a PDR, a user selects one of a number of storage modes, including long play (having lower bit rate than the received bitstream), compress-to-fit (having a bit rate related to available storage space), and rapid access (having an increased number of non-predicted frames) modes. The received bitstream is transcoded to a bitstream having a bit rate reflecting the selected storage mode, and the transcoded bitstream is then stored.

Description

COMPRESSED VIDEO BITSTREAMS PROCESSING
This invention relates to compressed bitstreams and in the most important example to the processing of compressed bitstreams prior to storage. The television viewer has, through the video cassette recorder (VCR) become accustomed to time-shifting broadcast television programmes that is to say recording a television programme off air for viewing at a more convenient time. The VCR has also created an alternative distribution channel for video content in the form of the sold or hired video tape. The VCR has been developed to offer a range of additional features. Typically, a video tape can be played at an increased speed with a recognisable picture still being displayed, to enable particular material to be located. Advanced VCRs will often provide the capability of freeze-framing or of stepping frame-by-frame through selected scenes. Another feature that is frequently available is that of "long play", where the user elects to compromise the recording quality in return for a longer recording interval on a given length of tape.
In the move to digital television, proposals are already being made to imitate the functionality of a VCR through other storage media such as hard disks or re-writable CDS or DVDs. It has already been recognised that new storage media, and new storage facilities, such as personal digital recorders (PDRs) offer capabilities considerably beyond those of the VCR, such as the ability to record and play back simultaneously to give a "pause" effect in essentially real time viewing.
The compressed nature of broadcast digital video does, however, present challenges in providing functionality analogous to that of an advanced VCR. Generally, a bitstream delivered to the viewer (whatever the method of distribution) will be a relatively low bit rate and in the MPEG case will contain relatively few intra-coded frames.
It is an object of one aspect of the present invention to address such difficulties.
Accordingly, the present invention consists in one aspect in a method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream; establishing a plurality of user-defined storage modes, comprising at least two of a long play mode, having a lower bit rate than the received bitstream, a compress-to-fit mode, having a bit rate related to available storage, and a rapid access mode, having an increased number of non-predicted frames; on user selection of a given one of said storage modes, transcoding the received bitstream to a transcoded bitstream having a different bit rate, reflecting the selected storage mode; and storing the transcoded bitstream.
It is an object of another aspect of the invention to address in particular the difficulties arising in providing the rapid access functionality analogous to that of an advanced VCR.
Accordingly, the present invention consists in another aspect in a method of processing a compressed digital video bitstream to facilitate rapid access, comprising the steps of: receiving a bitstream; transcoding the received bitstream to a transcoded bitstream having a higher bit rate and less prediction; and storing the transcoded bitstream.
The present invention consists in yet another aspect in a method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream having a constant bit rate; establishing a plurality of user- defined storage modes; on user selection of a given storage mode, determining a fixed quality level reflecting that storage mode; transcoding the received bitstream, employing the fixed quality level, to a transcoded bitstream having a variable bit rate; and storing the transcoded bitstream.
The invention will now be described by way of example with reference to the accompanying drawings in which:
Figure 1 is a block diagram illustrating an embodiment of the present invention;
Figure 2 is a block diagram of one form of the transcoder shown in Figure 1; and
Figure 3 is a block diagram of another form of the transcoder shown in Figure 1.
Referring initially to Figure 1 , a compressed bitstream is received at terminal 100 and, instead of passing directly to the store 102 is received by a transcoder 104. In accordance with the invention, the user is provided the option of selecting storage modes through mode selection block 106. One such storage mode is "long play" and upon selection of this mode, the transcoder receives from the mode selection block a control signal. In accordance with this control signal, the transcoder operates to transcode the received bitstream to a transcoded bitstream having a selected lower bit rate. In this way, the programme material represented by the received bitstream can be accommodated by using a lesser portion of the store 102.
Another storage mode is "compress-to-fit" mode, for use where only a certain amount of storage remains. For example, in certain known arrangements, a user interface is provided, enabling the user to instruct recording of a given television programme. In such a circumstance, the arrangement according to this example of the invention can operate to determine first the amount of available storage capability in the store 102. If this is less than that required to record the entire programme at the broadcast bit rate, the mode selection block 106 can recalculate a bit rate at which the entire programme can be recorded. This recalculated bit rate is provided to the transcoder which performs the transcoding operation as before.
If, as may be in the above example, the transcoding is being performed "live" as a programme is broadcast, the bit rate calculation may be performed by estimating the total amount of information to be recorded. This estimate may be made by calculating an approximate average bit rate of the material being broadcast, typically over a short period of the broadcast, and using programme length information transmitted with the broadcast.
In another storage mode, the user can select to store the received bitstream at the lowest bit rate consistent with a defined quality of video image. Using, for example, the quality measure disclosed in PCT/GB99/03356, the bitstream can be analysed and a quality measure determined. This measure can be produced without reference to the source picture, in contrast to conventional methods of quality measurement, such as PSNR. Using this measure in feedback, the transcoder will in this storage mode reduce the bit rate of the transcoded bitstream until the quality measure reaches a defined threshold. In one embodiment, this measure is derived using the quantisation levels either detected in the bitstream, or passed from a previous encoding. The quality level may therefore also be defined by the use of a fixed quantisation level. Where the input bitstream has a constant bit rate, this will produce a variable bit rate output stream. However, as the bitstream is to be written to a store, rather than passed along a transmission chain requiring a constant rate bitstream, variance of the output bit rate is not forbidden. This permits several advantageous features. For example, in a long play mode, a user may select a certain quality level with which they are satisfied, in order to increase recording capacity. This quality level will equate to a particular quantisation level, rather than a simple "halving" of the quality, as is common with VCR long play modes. Moreover, several quality levels may be selected for different types of material. For example, a finer quantisation may be selected for recording a film than for recording a documentary, which may be finer than that selected for recording a game show. A maximum quality may also be specified, by selecting a quantisation no finer than a certain level. This may save storage capacity, as the recording will not be of a higher quality (and hence, bit rate) than that perceivable by the viewer.
In another embodiment, where compress-to-fit or long play modes are employed, the bit rate either calculated or selected may be varied during recording, should the viewer recognise that a lower quality recording may be sufficient for the particular programme.
In one form of this invention, the transcoder 104 operates in accordance with the disclosure of EP 0 765 576 and EP 0 913 058. As disclosed in these references, an incoming bitstream can be decoded to an uncompressed or less compressed bitstream, with the coding decisions being extracted. The uncompressed or less compressed signal can then be re-encoded at the required bit rate utilising the same coding decisions, so as to minimise or remove losses in re-encoding. These coding decisions may accompany the bitstream in an information bus signal, or the information may be embedded in the bitstream itself. The transcoding operation can also usefully take advantage of the disclosure of EP 0 963 657. Here, the play-off in the quantisation step between decreasing the error and decreasing the bit rate of the coding can be controlled advantageously by dynamic variation of λ, a parameter defining the bounds of the quantisation levels selected. For example, if λ=0.75, a signal between two quantisation levels will be "rounded down" to the lower level if it is less than 75% of the interval between the two levels. This uses fewer bits, as the higher quantisation level is only used for signals occupying the highest 25% of the interval.
In another form of the present invention, the mode selector 106 permits selection of a mode in which the transcoder operates to increase bit rate. As has been explained, the received bitstream will typically take the form of a long GOP MPEG bitstream, that is to say a bitstream in which there are relatively few intra- coded or I frames and a relatively large number of predicted frames which may take the form of forward predicted frames (P frames) or both forward and backward predicted frames (B frames). If such a long GOP bitstream is written directly to store, rapid access operations, such as a high-speed search for picture material, freeze frame and so on will be complex and time consuming. Unless the available functionality is to be confined to the relatively infrequent I frames, some form of decoding operation will be required within each trick mode function. In accordance with the present invention, this difficulty is overcome, where storage capacity permits, by modifying the received bitstream before it is written into the store. Thus the received bitstream is modified by reducing the amount of prediction coding. In one example, identified P frames in the received bitstream will be decoded and re-encoded as I frames. Thus as shown in Figure 2, the incoming bitstream is received by a controller 200. This is under the control of the mode selection block 106. In dependence upon user preference, aspects of the bitstream and the amount of available storage, it is elected to convert all or a given proportion of P frames into I frames. Thus the controller passes some or all P frames to a decoder 202 where, with information of course from the associated I frame, a decompressed picture signal is derived. This is re-encoded in I frame encoder 204 to provide a sequence of I frames which are multiplexed in multiplexer 206 with the remaining, undisturbed frames, to form a transcoded bitstream output. Advantageously, coding decisions are forwarded from the decoder 202 to the I frame encoder 204 (which may select or use only those required), and to the multiplexer 206. The coding decisions may also be stored, in order to enable the lossless reversal of the re-encoding if required at a later stage.
In one embodiment, the controller passes to the decoder those P frames which are in defined positions in the GOP structure. For example, in a typical structure, between each I frame, there are interposed three P frames, with intervening B frames. The controller might therefore pass every P frame falling in the middle of such a structure.
In another embodiment, the controller passes all P frames to the decoder, in order to produce an "I frame only" bitstream. In certain cases, the storage space available may not accommodate such a large amount of information, and thus the transcoder may be instructed to re-encode the l-frames at a lower resolution, in order to save bits, but nevertheless produce a fully intra-coded bitstream.
Shuttle, still, or slow motion modes are likely to be particularly useful in scenes involving large amounts of movement. Thus, in a further embodiment, illustrated in Figure 3, the P frames which are passed to the decoder are those falling in areas of the original material where a substantial amount of motion is present. The bitstream is passed as usual to the controller 200, and is also passed to a monitor of the motion vector bit rate 300. Where there is a high motion vector bit rate, the material will contain significant motion. Thus the output of block 300 is passed to the controller 200, and the related P frames having the high motion vector bit rate are passed to the decoder 202. The "cost" in bit rate terms of converting such P frames to I frames will be less than that of converting P frames where there is little or no motion. In another form of the invention, the number of intra-coded macroblocks within a picture is increased by a defined proportion. Typically, a given number of P type macroblocks will be converted to I type macroblocks. This feature has particular advantage in those trick modes which utilise a portion only of any one picture. Thus, by analogy with a VCR shuttle mode, it can be arranged that a recognisable picture is provided in a search mode by combining respective different picture regions from different incoming frames. By generally increasing the number of intra-coded macroblocks and by specifically ensuring that those picture regions which are to be used to produce a combined picture are coded as I type macroblocks, the present invention can considerably facilitate such an arrangement.
It should be understood that this invention has been described by way of example only and that a wide variety of further modifications are possible without departing from the scope of the invention.

Claims

1. A method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream; establishing a plurality of user-defined storage modes, comprising at least two of a long play mode, having a lower bit rate than the received bitstream, a compress-to-fit mode, having a bit rate related to available storage, and a rapid access mode, having an increased number of non-predicted frames; on user selection of a given one of said storage modes, transcoding the received bitstream to a transcoded bitstream having a different bit rate, reflecting the selected storage mode; and storing the transcoded bitstream.
2. A method according to Claim 1 , wherein the transcoding step comprises the steps of decoding to an uncompressed or less compressed signal, extracting coding decisions and re-encoding the uncompressed or less compressed signal utilising some or all of said extracted coding decisions.
3. A method according to Claim 2, wherein the step of re-encoding utilises the same arrangement of predicted and non-predicted frames, the same coding modes and the same motion vectors as the received bitstream, and selects different quantisation levels in accordance with the user selected mode.
4. A method according to any one of the preceding claims, comprising the steps, on user selection of said compress-to-fit storage mode, of determining an amount of available storage; estimating an aggregate bit count of the received bitstream; and transcoding the received bitstream to a bit rate calculated with reference to said available storage and said bit count.
5. A method according to Claim 4, wherein the step of estimating the aggregate bit count comprises measuring bit rate over an interval of the received bitstream, and receiving information regarding the temporal extent of the bitstream being received.
6. A method according to Claim 1 , wherein the step of transcoding in the rapid access mode comprises converting at least a portion of the prediction coded matter into intra-coded matter.
7. A method according to Claim 6. wherein said portion is a macroblock.
8. A method according to Claim 6, wherein said portion is a frame.
9. A method according to any of claims 6 to 8, further comprising producing a fully intra-coded transcoded bitstream, wherein the resolution of the frames may be varied in dependence upon the available storage capacity.
10. A method according to any of claims 6 to 8, further comprising selecting for conversion those prediction coded frames furthest from the nearest intra-coded frame.
11. A method according to any of claims 6 to 8, further comprising selecting for conversion those prediction coded frames having the highest motion vector bit rate.
12. A method of processing a compressed digital video bitstream to facilitate rapid access, comprising the steps of: receiving a bitstream; transcoding the received bitstream to a transcoded bitstream having a higher bit rate and less prediction; and storing the transcoded bitstream.
13. A method according to Claim 12, wherein the step of transcoding in the rapid access mode comprises converting at least a portion of the prediction coded matter into intra-coded matter.
14. A method according to Claim 13. wherein said portion is a macroblock.
15. A method according to Claim 13, wherein said portion is a frame.
16. A method according to any of claims 13 to 15, further comprising producing a fully intra-coded transcoded bitstream, wherein the resolution of the frames may be varied in dependence upon the available storage capacity.
17. A method according to any of claims 13 to 15, further comprising selecting for conversion those prediction coded frames furthest from the nearest intra-coded frame.
18. A method according to any of claims 13 to 15, further comprising selecting for conversion those prediction coded frames having the highest motion vector bit rate.
19. A method according to any of the claims 6 to 18, further comprising storing sufficient coding decisions to enable lossless recreation of the received bitstream.
20. A method according to any of claims 1 to 4, wherein the step of transcoding in the long play mode employs a fixed quality level.
21. A method according to Claim 20 where the received bitstream has a constant bit rate, wherein the step of transcoding employs a fixed quantisation level, to produce a stored bitstream having a variable bit rate.
22. A method according to Claim 21, wherein the fixed quantisation level may be selected according to user-defined quality requirements.
23. A method according to Claim 21, wherein the fixed quantisation level is selected to reflect available storage.
24. A method according to Claim 21 , wherein the fixed quantisation level is selected to produce the maximum user-perceivable picture quality.
25. A method according to Claim 21, wherein the fixed quantisation level is constrained to never be finer than that reflecting a maximum user-perceivable picture quality.
26. A method of processing a compressed digital video bitstream, comprising the steps of: receiving a bitstream having a constant bit rate; establishing a plurality of user-defined storage modes; on user selection of a given storage mode, determining a fixed quality level reflecting that storage mode; transcoding the received bitstream, employing the fixed quality level, to a transcoded bitstream having a variable bit rate; and storing the transcoded bitstream.
27. A method according to any of the above claims, wherein the received bitstream is compliant with a given compression standard and the transcoded bitstream is compliant with the same standard.
28. A digital video recorder, comprising a bitstream receiver, a bitstream store and a transcoder, the transcoder comprising a controller and a user interface, wherein the controller is adapted to establish, in response to user interface, a plurality of user-defined storage modes, comprising at least two of a long play mode, having a lower bit rate than the received bitstream, a compress- to-fit mode, having a bit rate related to available storage, and a rapid access mode, having an increased number of non-predicted frames, and wherein the transcoder is adapted to, on user selection of a given storage mode, transcode the received bitstream to a transcoded bitstream having a different bit rate, reflecting the selected storage mode.
29. A digital video recorder, comprising a bitstream receiver, a bitstream store and a transcoder, the transcoder comprising a controller and a user interface, wherein the transcoder is adapted to transcode a received bitstream to a transcoded bitstream having a higher bit rate and less prediction.
30. A digital video recorder, comprising a bitstream receiver, a bitstream store and a transcoder, the transcoder comprising a controller and a user interface, wherein the controller is adapted to establish, in response to user interface, a plurality of user-defined storage modes, wherein the controller is adapted, on user selection of a given storage mode, to determine a fixed quality level, and wherein the transcoder is adapted to employ the fixed quality level in transcoding the received bitstream to a transcoded bitstream having a variable bit rate.
PCT/GB2001/000724 2000-02-21 2001-02-21 Compressed video bitstreams processing WO2001063936A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
AU37538/01A AU3753801A (en) 2000-02-21 2001-02-21 Compressed video bitstreams processing

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
GB0004061A GB2359438A (en) 2000-02-21 2000-02-21 Method of changing bit rate in accordance with user selected storage mode
GB0004061.8 2000-02-21

Publications (1)

Publication Number Publication Date
WO2001063936A1 true WO2001063936A1 (en) 2001-08-30

Family

ID=9886107

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/GB2001/000724 WO2001063936A1 (en) 2000-02-21 2001-02-21 Compressed video bitstreams processing

Country Status (3)

Country Link
AU (1) AU3753801A (en)
GB (1) GB2359438A (en)
WO (1) WO2001063936A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI411309B (en) * 2010-04-05 2013-10-01 Mediatek Inc Image processing apparatus and image processing method

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
FR2846506B1 (en) * 2002-10-28 2005-02-11 Philippe Marie Alber Guenebaud DEVICE FOR TRANS-COMPRESSION OF AN AUDIOVISUAL DIGITAL STREAM FOR ADAPTING IT TO EXISTING TELEVISION EQUIPMENT
FR2915047B1 (en) * 2007-04-13 2009-10-30 Streamwide Soc Par Actions Sim ARCHITECTURE FOR TRANSCODING MULTIMEDIA STREAMS.
KR20100060416A (en) * 2008-11-27 2010-06-07 삼성전자주식회사 Image recording apparatus and method of recording image

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
EP0627854A2 (en) * 1993-06-04 1994-12-07 Daewoo Electronics Co., Ltd Digital video signal recording/reproducing apparatus
EP0711079A2 (en) * 1994-11-04 1996-05-08 RCA Thomson Licensing Corporation Method and apparatus for scaling a compressed video bitstream
WO1997039584A1 (en) * 1996-04-12 1997-10-23 Imedia Corporation Video transcoder
WO1998003017A1 (en) * 1996-07-15 1998-01-22 Snell & Wilcox Limited Video signal compression
US5923814A (en) * 1993-01-13 1999-07-13 Hitachi America, Ltd. Methods and apparatus for performing video data reduction operations and for concealing the visual effects of data reduction operations

Family Cites Families (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JPH08214058A (en) * 1995-02-07 1996-08-20 Victor Co Of Japan Ltd Recorder for telephone set
US5982723A (en) * 1996-09-30 1999-11-09 Laser Dynamics, Inc. Data recording and reproducing method for multi-layered optical disk system

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US5923814A (en) * 1993-01-13 1999-07-13 Hitachi America, Ltd. Methods and apparatus for performing video data reduction operations and for concealing the visual effects of data reduction operations
EP0627854A2 (en) * 1993-06-04 1994-12-07 Daewoo Electronics Co., Ltd Digital video signal recording/reproducing apparatus
EP0711079A2 (en) * 1994-11-04 1996-05-08 RCA Thomson Licensing Corporation Method and apparatus for scaling a compressed video bitstream
WO1997039584A1 (en) * 1996-04-12 1997-10-23 Imedia Corporation Video transcoder
WO1998003017A1 (en) * 1996-07-15 1998-01-22 Snell & Wilcox Limited Video signal compression

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
TWI411309B (en) * 2010-04-05 2013-10-01 Mediatek Inc Image processing apparatus and image processing method

Also Published As

Publication number Publication date
GB2359438A (en) 2001-08-22
AU3753801A (en) 2001-09-03
GB0004061D0 (en) 2000-04-12

Similar Documents

Publication Publication Date Title
US7343083B2 (en) Digital video signal record and playback device and method for selectively reproducing desired video information from an optical disk
US6724977B1 (en) Compressed video editor with transition buffer matcher
USRE37222E1 (en) Video signal transmitting system
JP4351746B2 (en) Encoding device, encoding method, decoding device, and decoding method
US6574274B2 (en) Picture signal processing system, decoder, picture signal processing method, and decoding method
US5991502A (en) Optical recording device which calculates distances between I-frames and records I-frame addresses in a sector
KR100766740B1 (en) Apparatus and method for converting data using encoding history information
KR100355204B1 (en) Hdtv trick play stream derivation for pre-recorded digital video recording
KR100599017B1 (en) Image data compression device and method
JP3840895B2 (en) Video encoded data conversion apparatus and method
JP3825719B2 (en) Image reproduction method, image reproduction apparatus, and image recording apparatus
US20030169817A1 (en) Method to encode moving picture data and apparatus therefor
US20050105624A1 (en) Image processing apparatus and method, program, and recording medium
WO2001063936A1 (en) Compressed video bitstreams processing
JP3487205B2 (en) Image data editing device
JP4462559B2 (en) Trick playback content creation method and device, trick playback compressed video data transmission method and device, and trick playback content creation program
JP3951152B2 (en) Image coding apparatus and method
KR100543453B1 (en) Apparatus and method for controlling bit rate of digital video data in reverse play
GB2293516A (en) Digital video signal recording arrangement with enhanced skip search playback
KR100195074B1 (en) Method for recording moving picture data
JP3750846B2 (en) Video signal recording method and video signal recording apparatus
JP3964563B2 (en) Video server device
US20050084008A1 (en) Image processing apparatus and method, information processing apparatus and method, program, recording medium, and information processing system
JP2006187037A (en) Image reproducing method
JP3384563B2 (en) Optical disc, reproducing apparatus and reproducing method

Legal Events

Date Code Title Description
AK Designated states

Kind code of ref document: A1

Designated state(s): AU CA JP US

AL Designated countries for regional patents

Kind code of ref document: A1

Designated state(s): AT BE CH CY DE DK ES FI FR GB GR IE IT LU MC NL PT SE TR

121 Ep: the epo has been informed by wipo that ep was designated in this application
DFPE Request for preliminary examination filed prior to expiration of 19th month from priority date (pct application filed before 20040101)
122 Ep: pct application non-entry in european phase
NENP Non-entry into the national phase

Ref country code: JP