WO2015196462A1 - Method and device for displaying a video sequence - Google Patents

Method and device for displaying a video sequence Download PDF

Info

Publication number
WO2015196462A1
WO2015196462A1 PCT/CN2014/080979 CN2014080979W WO2015196462A1 WO 2015196462 A1 WO2015196462 A1 WO 2015196462A1 CN 2014080979 W CN2014080979 W CN 2014080979W WO 2015196462 A1 WO2015196462 A1 WO 2015196462A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
video frame
determining
aspect ratio
output
Prior art date
Application number
PCT/CN2014/080979
Other languages
French (fr)
Inventor
Zhengdong Wei
Qinglei YANG
Qianlong YU
Original Assignee
Thomson Licensing
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing filed Critical Thomson Licensing
Priority to PCT/CN2014/080979 priority Critical patent/WO2015196462A1/en
Publication of WO2015196462A1 publication Critical patent/WO2015196462A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/44Receiver circuitry for the reception of television signals according to analogue transmission standards
    • H04N5/46Receiver circuitry for the reception of television signals according to analogue transmission standards for receiving on more than one standard at will
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • H04N21/440263Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA
    • H04N21/440272Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display by altering the spatial resolution, e.g. for displaying on a connected PDA for performing aspect ratio conversion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N7/00Television systems
    • H04N7/01Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level
    • H04N7/0117Conversion of standards, e.g. involving analogue television standards or digital television standards processed at pixel level involving conversion of the spatial resolution of the incoming video signal

Definitions

  • the present disclosure relates to image processing, and more particularly relates to a method and a device for displaying a video sequence.
  • an image has an aspect ratio.
  • the aspect ratio describes the proportional relationship between the image's width and the image's height, i.e. it is the ratio of width to height.
  • the most common aspect ratios used today in the presentation of films in cinemas are 1.85: 1 and 2.39: 1.
  • Two common videographic aspect ratios are 4:3 (1.33:1 ), which is the universal video format of the 20th century, and 16:9 (1.77:1 ), which is universal for high-definition television and European digital television.
  • Other cinema and video aspect ratios exist, but are used infrequently.
  • the most common aspect ratios are 4:3, 3:2, and more recently being found in consumer cameras 16:9.
  • Other aspect ratios, such as 5:3, 5:4, and 1 : 1 (square format) are used in photography as well, particularly in medium format and large format.
  • a method for displaying a video sequence comprising the steps of receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
  • a device for displaying a video sequence comprising a storage; and a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
  • non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
  • Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art
  • Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art
  • Fig. 3 is a diagram showing an image displayed on TV when the aspect ratio of a video frame having masking is different from aspect ratio of a display device according to prior art;
  • Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention
  • Fig. 5 is a diagram showing an image displayed on TV according to the embodiment of the present invention.
  • Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
  • the aspect ratio of image format for an original image is different from an aspect ratio of a targeting format (or called receiving format), e.g. the aspect ratio of the original image is 4:3 and aspect ratio of the targeting format is 16:9.
  • the targeting format can be used for storage e.g. in a videotape or hard disk or for displaying on e.g. a TV). In this case, format conversion is necessary.
  • converting formats of unequal ratios is achieved a) by enlarging the original image to fill the targeting format's display area and cutting off any excess picture information (zooming and cropping), b) by adding horizontal mattes (letterboxing) or vertical mattes (pillarboxing) to retain the original format's aspect ratio for the original image, c) by stretching (hence distorting) the original image to fill the targeting format's aspect ratio, and d) by scaling by different factors in both directions, possibly scaling by a different factor in the center and at the edges (as in Wide Zoom mode).
  • Letterboxing is the practice of transferring a video sequence, e.g. a film shot, with a widescreen aspect ratio to standard-width video formats while preserving the video sequence's original aspect ratio.
  • the resulting image in the resulting video sequence has mattes (also called black bars or masking) above and below the original image in the standard-width video formats, i.e. the mattes are on the top and bottom of the resulting image. These mattes are part of the resulting image.
  • Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art.
  • the rectangle in grid pattern is the original widescreen image and the rectangle including both the rectangle in grid pattern and mattes is the targeting format.
  • Pillarbox occurs in widescreen video displays when mattes are placed on the sides of the original image. It becomes necessary when video or film that was not originally designed for widescreen is shown on a widescreen displaying device or stored in a widescreen format, or when the original image is displayed or stored in a wider aspect ratio. The original image is placed in the middle of the targeting format.
  • Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art.
  • a set top box (STB) has been processed with lettebox, pillarbox and the like. It means that there is masking around the original image, e.g. as shown in the Fig. 1 and Fig. 2.
  • the STB can't display the original image (which doesn't have masking) in an appropriate aspect ratio on TV.
  • a Standard Definition (SD) camera shots a video with aspect ratio of 4:3.
  • the broadcast operator delivers the video in a High Definition channel with aspect ratio of 16:9.
  • pillarbox is used and the format of video sequence is like what is shown in the Fig. 2.
  • the video sequence is displayed in 16:9, the displayed video is like what is shown in the Fig. 2.
  • the present disclosure provides a method and a device for display a video sequence.
  • Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention.
  • the STB receives the video sequence. It is implemented by tuning to a channel, starting to receive video signal that conveys the video sequence and decoding the video signal to obtain the video sequence.
  • a video frame in the video sequence is made up of the original image and masking around the original image, e.g. as shown in the Fig. 5.
  • the STB determines image region of the original image in the video frame.
  • the video frame is YUV picture in the buffer of the STB.
  • the STB can obtain the size of the video frame, e.g. its width is a and height is b, i.e. it has b rows of pixels and a columns of pixels.
  • the STB can obtain data of every pixel in the video frame.
  • the value for masking in the video frame is set to a fixed value. The fixed value may vary from implementations, e.g. different chipsets for image processing may use different values.
  • n1 the STB scans horizontally from top to bottom. For n2, it scans horizontally from bottom to top. For ml , it scans vertically from left to right. And for m2, it scans horizontally from right to left. For any scanning for n1 , n2, ml and m2, if pixels in the scanning row have the same value, then it's determined that the scanning row belongs to masking and the scanning continues to next row or column. If they don't have same value, the scanning process stops.
  • the STB determines the masking and determines that the real image region starts at n1 row and ml column and ends at (b-n2) row and (a-m2) column, and the size of the image region is (a-m1 -m2, b-n1 -n2).
  • the STB repeats the scanning process for a predetermined number of video frames, e.g. 30 frames to obtain an average values for ml , m2, n1 , n2.
  • the STB determines an output aspect ratio.
  • the output aspect ratio can be an input from the user, a predefined value by the user or automatically detected by the STB based on communication protocol between the STB and a displaying device.
  • the STB In the step 404, the STB generates an output video frame based on the original image, the output aspect ratio and a predefined filling parameter value.
  • the original image is determined by the video frame and the determined image region.
  • the predefined filling parameter indicates a filling manner in which the original image is fitted into the output video frame.
  • Values of filling parameter comprises a) scaling by a same factor (i.e. enlarging or diminishing in a linear manner) the original image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the original image so as to make one pair of edges of the original image (e.g.
  • the value of the predefined filling parameter is selected by the user from above 4 values and stored in the storage of the STB.
  • the user can use remote to set the predefined filling parameter value in a STB menu.
  • the STB converts the original image from its original aspect ratio to the output aspect ratio by using a filling manner indicated by the predefined filling parameter value.
  • steps 401 to 404 are performed before the STB starts to display the output video frame to the users. In a variant of the embodiment, it is performed in a real time manner, i.e. before the STB determines the image region of the original image, the STB displays the video frame of the video sequence to the user, and after the STB determines the image region, the STB displays the output video frame with original image to the user.
  • the STB selects one value close to aspect ratio of the image region from 4:3, 16:9 and 14:9, and uses the selected one as a verified aspect ratio for the image region. So it provides compatibility to existing filling functions among different aspect ratios.
  • Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
  • the STB comprises a tuner, a decoder, a processor and a storage.
  • the tuner receives a digital broadcasting signal to generate a digital data stream conveying a video sequence, and transmits the digital data stream to the decoder.
  • the decoder e.g. MPEG-2 decoder, decodes the digital data stream to obtain the video sequence.
  • the obtained video sequence can be either digital signal or analog signal depending on the acceptability of signal to the displaying device, e.g. TV or monitor. If TV only accepts analog signal, then a digital to analog converter shall be added and used.
  • the storage is used to store the predefined filling parameter value.
  • the buffer (not shown in the Fig. 5) is used to store temporary data, such as video frames of the video sequence, output video frames etc.
  • the processor is used for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined filling parameter value indicating a filling manner for filling the image into the output video frame, wherein the predefined filling parameter value is stored in the storage.
  • the processor is further used for receiving a predefined number of video frames; determining image regions for each video frames; and determining average value of the image regions of the predefined number of video frames as a common image region.
  • the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
  • the processor is further used for performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left; during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and determining the image region.
  • the processor is further used for receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and storing the input as the predefined filling parameter value in the storage.
  • STB can be used for a player to play a video sequence stored on a storage device, e.g. local hard disk or remote hard disk or to play a VOD (Video on Demand) video as long as the video frame consists of an original image (e.g. shot image by a camera) and masking.
  • a storage device e.g. local hard disk or remote hard disk
  • VOD Video on Demand
  • the tuner is not necessary.
  • non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

It is provided a method for displaying a video sequence,comprising the steps of receiving (401) a video frame of the video sequence,wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame,the image region,the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.

Description

METHOD AND DEVICE FOR DISPLAYING A VIDEO SEQUENCE
TECHNICAL FIELD
The present disclosure relates to image processing, and more particularly relates to a method and a device for displaying a video sequence.
BACKGROUND
In technical field of image processing, an image has an aspect ratio. The aspect ratio describes the proportional relationship between the image's width and the image's height, i.e. it is the ratio of width to height.
The most common aspect ratios used today in the presentation of films in cinemas are 1.85: 1 and 2.39: 1. Two common videographic aspect ratios are 4:3 (1.33:1 ), which is the universal video format of the 20th century, and 16:9 (1.77:1 ), which is universal for high-definition television and European digital television. Other cinema and video aspect ratios exist, but are used infrequently. In still camera photography, the most common aspect ratios are 4:3, 3:2, and more recently being found in consumer cameras 16:9. Other aspect ratios, such as 5:3, 5:4, and 1 : 1 (square format), are used in photography as well, particularly in medium format and large format.
SUMMARY
According to an aspect of the present disclosure, it is provided a method for displaying a video sequence, comprising the steps of receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking; determining (402) an image region of the image within the video frame of the video sequence; determining (403) an output aspect ratio; and generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame. According to an aspect of the present disclosure, it is provided a device for displaying a video sequence, comprising a storage; and a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
According to an aspect of the present disclosure, it is provided computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of above method.
According to an aspect of the present disclosure, it is provided non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
It is to be understood that more aspects and advantages of the disclosure will be found in the following detailed description of the present invention.
BRIEF DESCRIPTION OF THE DRAWINGS
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this application, will be used to illustrate an embodiment of the invention, as explained by the description. The invention is not limited to the embodiment.
In the drawings:
Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art; Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art;
Fig. 3 is a diagram showing an image displayed on TV when the aspect ratio of a video frame having masking is different from aspect ratio of a display device according to prior art;
Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention;
Fig. 5 is a diagram showing an image displayed on TV according to the embodiment of the present invention; and
Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
DETAILED DESCRIPTION
Sometimes, the aspect ratio of image format for an original image is different from an aspect ratio of a targeting format (or called receiving format), e.g. the aspect ratio of the original image is 4:3 and aspect ratio of the targeting format is 16:9. (the targeting format can be used for storage e.g. in a videotape or hard disk or for displaying on e.g. a TV). In this case, format conversion is necessary. In conventional methods, converting formats of unequal ratios is achieved a) by enlarging the original image to fill the targeting format's display area and cutting off any excess picture information (zooming and cropping), b) by adding horizontal mattes (letterboxing) or vertical mattes (pillarboxing) to retain the original format's aspect ratio for the original image, c) by stretching (hence distorting) the original image to fill the targeting format's aspect ratio, and d) by scaling by different factors in both directions, possibly scaling by a different factor in the center and at the edges (as in Wide Zoom mode).
As to above conventional method b), below gives some description about letterboxing and pillarboxing.
Letterboxing is the practice of transferring a video sequence, e.g. a film shot, with a widescreen aspect ratio to standard-width video formats while preserving the video sequence's original aspect ratio. The resulting image in the resulting video sequence has mattes (also called black bars or masking) above and below the original image in the standard-width video formats, i.e. the mattes are on the top and bottom of the resulting image. These mattes are part of the resulting image. Fig. 1 is a diagram showing a 2.35:1 widescreen image letterboxed in a 1.33:1 format according to prior art. The rectangle in grid pattern is the original widescreen image and the rectangle including both the rectangle in grid pattern and mattes is the targeting format.
Pillarbox occurs in widescreen video displays when mattes are placed on the sides of the original image. It becomes necessary when video or film that was not originally designed for widescreen is shown on a widescreen displaying device or stored in a widescreen format, or when the original image is displayed or stored in a wider aspect ratio. The original image is placed in the middle of the targeting format. Fig. 2 is a diagram showing an image is placed in a wider aspect ratio format according to prior art.
Sometimes, video sequences received by a set top box (STB) have been processed with lettebox, pillarbox and the like. It means that there is masking around the original image, e.g. as shown in the Fig. 1 and Fig. 2. In this case, the STB can't display the original image (which doesn't have masking) in an appropriate aspect ratio on TV. For example, a Standard Definition (SD) camera shots a video with aspect ratio of 4:3. The broadcast operator delivers the video in a High Definition channel with aspect ratio of 16:9. In this case, pillarbox is used and the format of video sequence is like what is shown in the Fig. 2. At the user end, if the video sequence is displayed in 16:9, the displayed video is like what is shown in the Fig. 2. But if the video sequence is displayed in 4:3 on TV, it is like what is shown in the Fig. 3. It affects user experience when there is masking around the original image. Similarly, if a HD video is delivered in a SD channel by broadcast operator and the user device displays the received video with aspect ratio of 16:9, the video displayed on TV is like what is shown in the Fig. 3. In order to solve it, the present disclosure provides a method and a device for display a video sequence.
The embodiment of the present invention will now be described in detail in conjunction with the drawings. In the following description, some detailed descriptions of known functions and configurations may be omitted for clarity and conciseness.
Fig. 4 is a flowchart showing a method for displaying a video sequence according to an embodiment of the present invention.
In the step 401 , the STB receives the video sequence. It is implemented by tuning to a channel, starting to receive video signal that conveys the video sequence and decoding the video signal to obtain the video sequence. A video frame in the video sequence is made up of the original image and masking around the original image, e.g. as shown in the Fig. 5.
In the step 402, the STB determines image region of the original image in the video frame. Below gives an example for obtaining image region of the original image. In the example, the video frame is YUV picture in the buffer of the STB. The STB can obtain the size of the video frame, e.g. its width is a and height is b, i.e. it has b rows of pixels and a columns of pixels. Besides, the STB can obtain data of every pixel in the video frame. Generally, the value for masking in the video frame is set to a fixed value. The fixed value may vary from implementations, e.g. different chipsets for image processing may use different values. If pixels in a row (or column) have the same value, then it is very likely that the row (or column) belongs to masking. In order to obtain n1 , the STB scans horizontally from top to bottom. For n2, it scans horizontally from bottom to top. For ml , it scans vertically from left to right. And for m2, it scans horizontally from right to left. For any scanning for n1 , n2, ml and m2, if pixels in the scanning row have the same value, then it's determined that the scanning row belongs to masking and the scanning continues to next row or column. If they don't have same value, the scanning process stops. In this way, the STB determines the masking and determines that the real image region starts at n1 row and ml column and ends at (b-n2) row and (a-m2) column, and the size of the image region is (a-m1 -m2, b-n1 -n2).
According to a variant, the STB repeats the scanning process for a predetermined number of video frames, e.g. 30 frames to obtain an average values for ml , m2, n1 , n2.
In the step 403, the STB determines an output aspect ratio. The output aspect ratio can be an input from the user, a predefined value by the user or automatically detected by the STB based on communication protocol between the STB and a displaying device.
In the step 404, the STB generates an output video frame based on the original image, the output aspect ratio and a predefined filling parameter value. The original image is determined by the video frame and the determined image region. The predefined filling parameter indicates a filling manner in which the original image is fitted into the output video frame. Values of filling parameter comprises a) scaling by a same factor (i.e. enlarging or diminishing in a linear manner) the original image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the original image so as to make one pair of edges of the original image (e.g. left and right edges, or top and bottom edges) overlap with one pair of edges of the output video frame while keeping the other pair of edges of the original image within the output video frame; c) stretching the original image to fill the entire output video frame; and d) scaling by different factors in the center and at the edges of the original image to fill entire output video frame. The value of the predefined filling parameter is selected by the user from above 4 values and stored in the storage of the STB. The user can use remote to set the predefined filling parameter value in a STB menu. In other words, in this step the STB converts the original image from its original aspect ratio to the output aspect ratio by using a filling manner indicated by the predefined filling parameter value.
It shall note that steps 401 to 404 are performed before the STB starts to display the output video frame to the users. In a variant of the embodiment, it is performed in a real time manner, i.e. before the STB determines the image region of the original image, the STB displays the video frame of the video sequence to the user, and after the STB determines the image region, the STB displays the output video frame with original image to the user.
According to a variant of the embodiment, the STB selects one value close to aspect ratio of the image region from 4:3, 16:9 and 14:9, and uses the selected one as a verified aspect ratio for the image region. So it provides compatibility to existing filling functions among different aspect ratios.
Fig. 6 is a block diagram showing a STB according to the embodiment of the present invention.
The STB comprises a tuner, a decoder, a processor and a storage.
The tuner receives a digital broadcasting signal to generate a digital data stream conveying a video sequence, and transmits the digital data stream to the decoder.
The decoder, e.g. MPEG-2 decoder, decodes the digital data stream to obtain the video sequence. The obtained video sequence can be either digital signal or analog signal depending on the acceptability of signal to the displaying device, e.g. TV or monitor. If TV only accepts analog signal, then a digital to analog converter shall be added and used.
The storage is used to store the predefined filling parameter value.
The buffer (not shown in the Fig. 5) is used to store temporary data, such as video frames of the video sequence, output video frames etc.
The processor is used for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence; determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined filling parameter value indicating a filling manner for filling the image into the output video frame, wherein the predefined filling parameter value is stored in the storage. According to a variant, the processor is further used for receiving a predefined number of video frames; determining image regions for each video frames; and determining average value of the image regions of the predefined number of video frames as a common image region.
According to a variant, the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
According to a variant, the processor is further used for performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left; during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and determining the image region.
According to a variant, the processor is further used for receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and storing the input as the predefined filling parameter value in the storage.
According to a variant of the embodiment, it is not limited to STB, can be used for a player to play a video sequence stored on a storage device, e.g. local hard disk or remote hard disk or to play a VOD (Video on Demand) video as long as the video frame consists of an original image (e.g. shot image by a camera) and masking. In this case, the tuner is not necessary.
According to the embodiment of the present invention, it is provided computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of above method.
According to the embodiment of the present invention, it is provided non- transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of above method.
A number of implementations have been described. Nevertheless, it will be understood that various modifications may be made. For example, elements of different implementations may be combined, supplemented, modified, or removed to produce other implementations. Additionally, one of ordinary skill will understand that other structures and processes may be substituted for those disclosed and the resulting implementations will perform at least substantially the same function(s), in at least substantially the same way(s), to achieve at least substantially the same result(s) as the implementations disclosed. Accordingly, these and other implementations are contemplated by this application and are within the scope of the invention as defined by the appended claims.

Claims

1. A method for displaying a video sequence, comprising the steps of:
receiving (401 ) a video frame of the video sequence, wherein the video frame consists of an image and masking;
determining (402) an image region of the image within the video frame of the video sequence;
determining (403) an output aspect ratio; and
generating (404) an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame.
2. The method of the claim 1 , wherein
receiving a predefined number of video frames;
determining image regions for each video frames; and
determining average value of the image regions of the predefined number of video frames as a common image region.
3. The method of the claim 1 , wherein
determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
4. The method of the claim 1 , wherein the step of determining image region further comprises:
performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left;
during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and
determining the image region.
5. The method of the claim 1 , wherein
receiving an input for the predefine value selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame.
6. A device for displaying a video sequence, comprising
a storage; and
a processor for receiving a video frame of the video sequence, wherein the video frame consists of an image and masking; determining an image region of the image within the video frame of the video sequence;
determining an output aspect ratio; generating an output video frame based on the video frame, the image region, the output aspect ratio and a predefined value indicating a filling manner for filling the image into the output video frame, wherein the predefined value is stored in the storage.
7. The device of the claim 6, wherein the processor is further used for
receiving a predefined number of video frames;
determining image regions for each video frames; and
determining average value of the image regions of the predefined number of video frames as a common image region.
8. The device of the claim 6, wherein the processor is further used for determining a value closed to aspect ratio of the image region from 4:3, 16:9 and 14:9 as a verified aspect ratio of the image region.
9. The device of the claim 6, wherein the processor is further used for
performing in the video frame scanning horizontally from top to bottom, scanning horizontally from bottom to top, scanning vertically from left to right and scanning vertically from right to left;
during each scanning, if pixels in a row or column have a same value determining the row or the column belongs to masking; and
determining the image region.
10. The device of the claim 6, wherein the processor is further used for
receiving an input selectable from a) scaling by a same factor the image to fill entire output video frame and cutting off excess portion of the enlarged image; b) scaling by a same factor the image so as to make one pair of edges of the image overlap with one pair of edges of the output video frame while keeping the other pair of edges of the image within the output video frame; c) stretching the image to fill entire output video frame; and d) scaling by different factors in the center and at the edges of the image to fill entire output video frame; and
storing the input as the predefined value in the storage.
1 1. Computer program product downloadable from a communication network and/or recorded on a medium readable by computer and/or executable by a processor, comprising program code instructions for implementing the steps of a method according to at least one of claims 1 to 5.
12. Non-transitory computer-readable medium comprising a computer program product recorded thereon and capable of being run by a processor, including program code instructions for implementing the steps of a method according to at least one of claims 1 to 5.
PCT/CN2014/080979 2014-06-27 2014-06-27 Method and device for displaying a video sequence WO2015196462A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/080979 WO2015196462A1 (en) 2014-06-27 2014-06-27 Method and device for displaying a video sequence

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2014/080979 WO2015196462A1 (en) 2014-06-27 2014-06-27 Method and device for displaying a video sequence

Publications (1)

Publication Number Publication Date
WO2015196462A1 true WO2015196462A1 (en) 2015-12-30

Family

ID=54936523

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/080979 WO2015196462A1 (en) 2014-06-27 2014-06-27 Method and device for displaying a video sequence

Country Status (1)

Country Link
WO (1) WO2015196462A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4962521A (en) * 1987-12-17 1990-10-09 Mitsubishi Denki Kabushiki Kaisha Apparatus for still picture video telephone apparatus and methods for transmitting and displaying still picture image for use therein
US5208669A (en) * 1991-01-08 1993-05-04 Sony Broadcast & Communications Limited Up-conversion of a video signal format with expansion of active portion of video field
US20080084503A1 (en) * 2006-10-10 2008-04-10 Sony Corporation Apparatus, method, and computer program for processing image
CN102332096A (en) * 2011-10-17 2012-01-25 中国科学院自动化研究所 Video caption text extraction and identification method
CN102523390A (en) * 2011-12-08 2012-06-27 成都东方盛行电子有限责任公司 Automatic aspect ratio control method based on AFD (automatic feeding device)

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4962521A (en) * 1987-12-17 1990-10-09 Mitsubishi Denki Kabushiki Kaisha Apparatus for still picture video telephone apparatus and methods for transmitting and displaying still picture image for use therein
US5208669A (en) * 1991-01-08 1993-05-04 Sony Broadcast & Communications Limited Up-conversion of a video signal format with expansion of active portion of video field
US20080084503A1 (en) * 2006-10-10 2008-04-10 Sony Corporation Apparatus, method, and computer program for processing image
CN102332096A (en) * 2011-10-17 2012-01-25 中国科学院自动化研究所 Video caption text extraction and identification method
CN102523390A (en) * 2011-12-08 2012-06-27 成都东方盛行电子有限责任公司 Automatic aspect ratio control method based on AFD (automatic feeding device)

Similar Documents

Publication Publication Date Title
TWI544450B (en) Scale-independent maps
US8139081B1 (en) Method for conversion between YUV 4:4:4 and YUV 4:2:0
CN108063976B (en) Video processing method and device
US10504263B2 (en) Adaptive high dynamic range (HDR) tone mapping with overlay indication
US9349188B2 (en) Creating details in an image with adaptive frequency strength controlled transform
US11509858B2 (en) Automatic program formatting for TV displays
US20110262055A1 (en) Image scaling curve generation
US20160316274A1 (en) Method and system for switching video playback resolution
KR102197579B1 (en) Creating details in an image with frequency lifting
US9066025B2 (en) Control of frequency lifting super-resolution with image features
US20110175988A1 (en) 3d video graphics overlay
US9471958B2 (en) Image processing method and apparatus
US9161030B1 (en) Graphics overlay system for multiple displays using compressed video
US20040119891A1 (en) Method and system for video display with automatic reframing
US9053752B1 (en) Architecture for multiple graphics planes
US8483389B1 (en) Graphics overlay system for multiple displays using compressed video
US8873642B2 (en) Video content analysis methods and systems
US8855444B2 (en) Method for partitioning and processing a digital image
WO2015196462A1 (en) Method and device for displaying a video sequence
EP1848203B2 (en) Method and system for video image aspect ratio conversion
EP2239934A1 (en) Image apparatus for determining type of image data and method for processing image data applicable thereto
JP7468391B2 (en) Image capture device and image capture processing method
US20230039717A1 (en) Automatic program formatting for tv displays
JP2022097021A (en) Imaging device and imaging processing method
JP2010093856A (en) Image processing apparatus and method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14895836

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14895836

Country of ref document: EP

Kind code of ref document: A1