WO2014121571A1 - Method and apparatus for context-based video quality assessment - Google Patents

Method and apparatus for context-based video quality assessment Download PDF

Info

Publication number
WO2014121571A1
WO2014121571A1 PCT/CN2013/077082 CN2013077082W WO2014121571A1 WO 2014121571 A1 WO2014121571 A1 WO 2014121571A1 CN 2013077082 W CN2013077082 W CN 2013077082W WO 2014121571 A1 WO2014121571 A1 WO 2014121571A1
Authority
WO
WIPO (PCT)
Prior art keywords
distortion
temporal
current frame
spatial
frames
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Ceased
Application number
PCT/CN2013/077082
Other languages
English (en)
French (fr)
Inventor
Ning Liao
Zhibo Chen
Fan Zhang
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Thomson Licensing SAS
Original Assignee
Thomson Licensing SAS
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Thomson Licensing SAS filed Critical Thomson Licensing SAS
Priority to KR1020157021123A priority Critical patent/KR20150115771A/ko
Priority to CN201380072550.8A priority patent/CN104995914A/zh
Priority to BR112015018465A priority patent/BR112015018465A2/pt
Priority to US14/763,940 priority patent/US9716881B2/en
Priority to HK16106409.4A priority patent/HK1218482A1/zh
Priority to EP13874735.7A priority patent/EP2954677B1/en
Priority to AU2013377642A priority patent/AU2013377642A1/en
Priority to JP2015556366A priority patent/JP2016510567A/ja
Publication of WO2014121571A1 publication Critical patent/WO2014121571A1/en
Anticipated expiration legal-status Critical
Ceased legal-status Critical Current

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N17/00Diagnosis, testing or measuring for television systems or their details
    • H04N17/004Diagnosis, testing or measuring for television systems or their details for digital television systems
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/0002Inspection of images, e.g. flaw detection
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/40Scenes; Scene-specific elements in video content
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/102Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
    • H04N19/103Selection of coding mode or of prediction mode
    • H04N19/114Adapting the group of pictures [GOP] structure, e.g. number of B-frames between two anchor frames
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/134Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N19/154Measured or subjectively estimated visual quality after decoding, e.g. measurement of distortion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/10Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N19/169Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N19/177Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding the unit being a group of pictures [GOP]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/48Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using compressed domain processing techniques other than decoding, e.g. modification of transform coefficients, variable length coding [VLC] data or run-length data
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N19/00Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N19/85Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N19/89Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving methods or arrangements for detection of transmission errors at the decoder
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F2218/00Aspects of pattern recognition specially adapted for signal processing
    • G06F2218/12Classification; Matching
    • GPHYSICS
    • G06COMPUTING OR CALCULATING; COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00Indexing scheme for image analysis or image enhancement
    • G06T2207/10Image acquisition modality
    • G06T2207/10016Video; Image sequence

Definitions

  • This invention relates to video quality measurement, and more particularly, to a method and apparatus for determining an objective video quality metric.
  • the present principles provide a method for estimating visual quality of a video sequence, comprising: accessing spatial distortion for frames in a plurality of sliding windows that include a current frame in the video sequence; determining a factor indicating at least one of a large distortion density and a representative artifact level for each sliding window responsive to respective spatial distortion for frames in the each sliding window; determining temporal distortion for the current frame responsive to the determined factor; and determining the visual quality of the video sequence responsive to the temporal distortion of the current frame as described below.
  • the present principles also provide an apparatus for performing these steps.
  • the present principles provide a method for estimating visual quality of a video sequence, comprising: accessing spatial distortion for frames in a plurality of sliding windows that include a current frame in the video sequence; determining a factor indicating at least one of a large distortion density and a representative artifact level for each sliding window responsive to respective spatial distortion for frames in the each sliding window; determining a maximum of the factors for the plurality of sliding windows; determining temporal distortion for the current frame responsive to the maximum factor; and determining the visual quality of the video sequence responsive to the temporal distortion of the current frame as described below.
  • the present principles also provide an apparatus for performing these steps.
  • the present principles also provide a computer readable storage medium having stored thereon instructions for estimating visual quality of a video sequence according to the methods described above.
  • FIG. 1A is a pictorial example depicting spatial artifact levels for individual frames in an exemplary video sequence
  • FIG. 1 B is a pictorial example depicting perceived temporal quality for individual frames in the exemplary video sequence.
  • FIGs. 2A, 2B, and 2C are pictorial examples depicting sliding windows used in video quality modeling, in accordance with an embodiment of the present principles.
  • FIG. 3A is a pictorial example depicting spatial artifact levels for individual frames in another exemplary video sequence
  • FIGs. 3B and 3C are pictorial examples depicting dominant distortion in a frame's neighborhood and estimated temporal distortion, respectively, in accordance with an embodiment of the present principles.
  • FIG. 4A is a pictorial example depicting spatial artifact levels for individual frames in another exemplary video sequence
  • FIGs. 4B and 4C are pictorial examples depicting the highest large distortion density in a frame's neighborhood and estimated temporal distortion, respectively, in accordance with an embodiment of the present principles.
  • FIG. 5A is a pictorial example depicting spatial artifact levels for individual frames in another exemplary video sequence
  • FIGs. 5B and 5C are pictorial examples depicting the highest large distortion density in a frame's neighborhood and estimated temporal distortion, respectively, in accordance with an embodiment of the present principles.
  • FIG. 6 is a flow diagram depicting an exemplary method for modeling temporal distortion at frame n, in accordance with an embodiment of the present principles.
  • FIG. 7 is a block diagram depicting an exemplary video quality monitor, in accordance with an embodiment of the present principles.
  • FIG. 8 is a block diagram depicting an exemplary video processing system that may be used with one or more implementations.
  • spatial artifact to denote artifact perceived in a picture in a video sequence when the picture is viewed independently of other pictures in the video sequence
  • temporal artifact to denote artifact that is perceived in a picture of a video sequence when pictures in the video sequence are continuously displayed
  • spatial distortion or “spatial quality” to denote distortion or quality perceived in a picture when the picture is viewed independently of other pictures in a video sequence
  • temporal distortion or “temporal quality” to denote distortion or quality that is perceived in a picture of a video sequence when pictures in the video sequence are continuously displayed.
  • a picture When assessing spatial distortion, a picture is viewed independently of other pictures in the video sequence, for a period of time that is long enough for a viewer to recognize image content and distortion. This is different from assessing temporal distortion, wherein pictures are continuously displayed.
  • ds(n) can be obtained by various image quality assessment methods, for example, but not limited to, a full-reference or no-reference method, and a method in a pixel domain or at a bitstream level.
  • spatial distortion can be estimated as the overall visible artifact level, caused by initial and/or propagated visible artifact, as disclosed in a commonly owned PCT application, entitled “Video quality assessment at a bitstream level” by N. Liao, Z. Chen, and K. Xie (PCT/CN2012/078766, Attorney Docket No.
  • Spatial artifact in pictures needs to last for a period of time so that eyes can fix on and recognize it as artifact.
  • the pictures are part of a video sequence and each is displayed only for a very short period of time (for example, a period of 1/frame_rate when the video is played in real time)
  • the perceived video distortion at the time instant of frame n i.e., temporal distortion at frame n, dt(n)
  • FIG. 1A shows spatial artifact levels of individual frames in the video sequence
  • FIG. 1 B shows temporal quality of individual frames in the video sequence. More specifically, FIG. 1A shows spatial artifact levels of the frames when the exemplary video sequence suffers from packet losses.
  • the spatial artifact may be sporadic in the video sequence, for example, the spatial artifact seen at frames 74, 77, 215, 261 , and 262.
  • the spatial artifact may also occur in a burst, such as the artifact seen around frames 106-1 1 1 .
  • FIG. 1 B shows temporal quality when the frames of the video sequence are displayed continuously, wherein score 100 corresponds to the best quality.
  • the quality score may be at a different scale.
  • the accurate curve of temporal quality may be obtained using a subjective test method, for example, but not limited to, Single Stimulus Continuous Quality Evaluation (SSCQE), as defined in ITU-R BT 500.
  • SSCQE Single Stimulus Continuous Quality Evaluation
  • FIG. 1 A frames 74 and 77 have strong spatial artifact when these two frames are viewed independently.
  • the artifact at these two frames becomes invisible when the video is displayed continuously, and thus, frames 74 and 77 are at the best quality level when viewed continuously as shown in FIG. 1 B.
  • strong spatial artifact may not always correspond to high temporal distortion.
  • one frame of a video sequence may appear to have good quality when viewed independently of other frames, but may present very strong temporal distortion (for example, motion jitter) when the video sequence is displayed continuously. That is, small spatial distortion may not always correspond to small perceived temporal distortion (i.e., higher temporal quality).
  • very strong temporal distortion for example, motion jitter
  • the present principles provide a method and apparatus for accurately modeling temporal quality from spatial distortion for individual frames.
  • the present principles consider the context that affects how a viewer identifies temporal distortion, wherein the context includes, for example, but not limited to, the duration and the pattern of the distortion, and texture and object's motion that are recognized by a viewer via watching the neighboring frames.
  • the perceived temporal distortion can be modeled using a sliding window approach. As shown in FIGs.
  • a sliding window of L 0 frames that includes frame n (denoted asS i n ) starts at frame (n - i)and ends at frame (n - i + L 0 - 1), 0 ⁇ i ⁇ L 0 .
  • Artifact existing outside the sliding windows is regarded as having little contribution to the visibility of the temporal artifact of a current frame.
  • the perceived temporal distortion of a current frame is mainly affected by frames with large distortion (i.e., distortion level exceeds a certain threshold) that are close by.
  • MD i n median ⁇ ds(j), frame j E S i n ). (1 ) A median function examines neighboring frames to decide whether or not the spatial distortion of the current frame is representative of its surroundings and rejects extreme distortion levels (outliers). That is, we may consider MD ⁇ as a
  • a maximum function can be used to identify the dominant distortion as the maximum of median distortion values among the sliding windows ⁇ S i n , 0 ⁇ i ⁇ L 0 ):
  • FIG. 3A shows spatial artifact levels for frames in an exemplary video sequence
  • FIGs. 3B and 3C show results after applying Eqs. (2) and (3), respectively.
  • Eqs. (2) and (3) show results after applying Eqs. (2) and (3), respectively.
  • the maximum of median distortion values avoid extreme values as shown in FIG. 3B. Consequently, as can be seen from FIG. 3C, spatial artifact levels that are much larger than neighboring ones, for example, at frames 86, 125, and 166, are not present in the estimated temporal distortion.
  • the estimated temporal distortion levels have smaller variations from frame to frame than the spatial distortion levels.
  • dist(n) w n x ds(n)/f(dist(n)), (7) wherein dist(n) is the distance between frame n and the closest frame with large distortion in a sliding window corresponding to the highest large distortion density. If there is no other frame with large distortion in the corresponding sliding window, we set dist(n) to a very big value, for example, 1000. That is, when there is only one frame in the sliding window with large distortion, we consider the distortion as less visible and set dt(n) to a very small value.
  • FIG. 4A shows spatial artifact levels for frames in an exemplary video sequence
  • U (number of macro blocks per frame)/100.
  • the values of L 0 and U may vary with configurations, for example, with the GOP length, video resolution, and the frame rate.
  • FIG. 5A shows spatial artifact levels for frames in another exemplary video sequence
  • FIGs. 5B and 5C show results after applying Eqs. (5) and (7), respectively.
  • the median distortion value and the large distortion density approaches can be combined to estimate the temporal distortion for frame n as:
  • temporal distortion based on the human vision property that eyes need a period of time that is long enough to recognize artifact.
  • the temporal distortion may also be affected by other factors, for example, but not limited to, motion jerkiness. Consequently, the temporal distortion estimated as above may need to be adjusted to consider other factors.
  • dt'(n) dt(n) + c x dt 2 (n), where dt 2 (n)is the distortion caused by motion jerkness.
  • FIG. 6 illustrates an exemplary method 600 for modeling temporal distortion at frame n, according to the present principles.
  • Method 600 starts at step 605.
  • For sliding window Si ,n it calculates the median distortion value at step 610, for example, using Eq. (1 ), and calculates the large distortion density at step 620, for example, using Eq. (4). It checks whether more sliding window needs to be processed at step 630. If yes, it returns the control to step 610. Otherwise, at step 640, it calculates the maximum of median distortion values in all sliding windows for frame n, for example, using Eq. (2). At step 650, it calculates the highest large distortion density in all sliding windows for frame n, for example, using Eq. (5).
  • step 660 it estimates the temporal distortion for frame n, for example, using Eq. (3), (6), or (8).
  • the distance between frame n and the closest frame with large distortion may be considered at step 660, for example, using Eq. (7).
  • Method 600 ends at step 699.
  • both the maximum of median distortion values and the highest large distortion density are used to estimate temporal distortion.
  • only the maximum of median distortion values is used to estimate the temporal distortion. That is, steps 620 and 650 are not needed, and step 660 estimates the temporal distortion based on the maximum of median distortion values, for example, using Eq. (3).
  • only the highest large distortion density is used to estimate the temporal distortion. That is, steps 610 and 640 are not needed, and step 660 estimates the temporal distortion based on the highest large distortion density, for example, using Eq. (6) or Eq. (7).
  • Method 600 or its variations may proceed in a different order of steps, for example, step 620 may be performed before step 610, step 650 may be performed before step 640.
  • the video quality modeling methods according to the present principles can be applied to measure video quality when a video sequence suffers from
  • FIG. 7 depicts a block diagram of an exemplary video quality monitor 700.
  • the input of apparatus 700 may include a transport stream that contains the bitstream.
  • the input may be in other formats that contains the bitstream.
  • Demultiplexer 710 obtains packet layer information from the bitstream.
  • Decoder 720 parses the input stream to obtain more information. Decoder 720 may or may not reconstruct the pictures. In other embodiments, the decoder may perform the functions of the demultiplexer. Using the decoded information, the spatial artifact levels are estimated in spatial artifact level estimator730. Based on the estimated parameters, temporal distortion levels are estimated at temporal distortion estimator 740, for example, using method 600. A quality predictor 750 then pools temporal distortion levels for individual frames into a quality score for the video sequence. The quality predictor 750 may consider other types of artifacts and the property of human visual property.
  • the video quality monitor700 may be used, for example, in ITU-T
  • IPTV and mobile video streaming also called HR (High Resolution)scenario and LR (Low Resolution)scenario respectively.
  • HR High Resolution
  • LR Low Resolution
  • the input to the P.N BAMS VQM Video Quality Model
  • the output is an objective MOS score (Mean Opinion Score).
  • MOS score Mean Opinion Score
  • a major target application of P.NBAMS work is to monitor video quality in a set-top box (STB) or gateway.
  • P.NBAMS mode 1 model only uses bitstream information, and mode 2 model may decode part or all of the video sequence, and the pixel information is used for visual quality prediction in addition to parsing the bitstream information in order to improve the prediction accuracy.
  • a video transmission system or apparatus 800 is shown, to which the features and principles described above may be applied.
  • a processor 805 processes the video and the encoder 810 encodes the video.
  • the bitstream generated from the encoder is transmitted to a decoder 830 through a distribution network 820.
  • a video quality monitor may be used at different stages.
  • a video quality monitor 840 may be used by a content creator.
  • the estimated video quality may be used by an encoder in deciding encoding parameters, such as mode decision or bit rate allocation.
  • the content creator uses the video quality monitor to monitor the quality of encoded video. If the quality metric does not meet a pre-defined quality level, the content creator may choose to re-encode the video to improve the video quality. The content creator may also rank the encoded video based on the quality and charges the content accordingly.
  • a video quality monitor850 may be used by a content distributor.
  • a video quality monitor may be placed in the distribution network. The video quality monitor calculates the quality metrics and reports them to the content distributor. Based on the feedback from the video quality monitor, a content distributor may improve its service by adjusting bandwidth allocation and access control.
  • a video quality monitor 860 may be used by a user device. For example, when a user device searches videos in Internet, a search result may return many videos or many links to videos corresponding to the requested video content. The videos in the search results may have different quality levels. A video quality monitor can calculate quality metrics for these videos and decide to select which video to store. In another example, the user device may have access to several error concealment techniques. A video quality monitor can calculate quality metrics for different error concealment techniques and automatically choose which concealment technique to use based on the calculated quality metrics.
  • the implementations described herein may be implemented in, for example, a method or a process, an apparatus, a software program, a data stream, or a signal. Even if only discussed in the context of a single form of implementation (for example, discussed only as a method), the implementation of features discussed may also be implemented in other forms (for example, an apparatus or program).
  • An apparatus may be implemented in, for example, appropriate hardware, software, and firmware.
  • the methods may be implemented in, for example, an apparatus such as, for example, a processor, which refers to processing devices in general, including, for example, a computer, a microprocessor, an integrated circuit, or a programmable logic device. Processors also include communication devices, such as, for example, computers, cell phones, portable/personal digital assistants ("PDAs”), and other devices that facilitate communication of information between end-users.
  • PDAs portable/personal digital assistants
  • the appearances of the phrase “in one embodiment” or “in an embodiment” or “in one implementation” or “in an implementation”, as well any other variations, appearing in various places throughout the specification are not necessarily all referring to the same embodiment.
  • this application or its claims may refer to "determining" various pieces of information. Determining the information may include one or more of, for example, estimating the information, calculating the information, predicting the information, or retrieving the information from memory. Further, this application or its claims may refer to "accessing" various pieces of information. Accessing the information may include one or more of, for example, receiving the information, retrieving the information (for example, from memory), storing the information, processing the information, transmitting the information, moving the information, copying the information, erasing the information, calculating the information, determining the information, predicting the information, or estimating the information.
  • Receiving is, as with “accessing”, intended to be a broad term.
  • Receiving the infornnation may include one or more of, for example, accessing the information, or retrieving the information (for example, from memory).
  • “receiving” is typically involved, in one way or another, during operations such as, for example, storing the information, processing the information, transmitting the information, moving the information, copying the information, erasing the information, calculating the information, determining the information, predicting the information, or estimating the information.
  • implementations may produce a variety of signals formatted to carry information that may be, for example, stored or transmitted.
  • the information may include, for example, instructions for performing a method, or data produced by one of the described implementations.
  • a signal may be formatted to carry the bitstream of a described embodiment.
  • Such a signal may be formatted, for example, as an electromagnetic wave (for example, using a radio frequency portion of spectrum) or as a baseband signal.
  • formatting may include, for example, encoding a data stream and modulating a carrier with the encoded data stream.
  • the information that the signal carries may be, for example, analog or digital information.
  • the signal may be transmitted over a variety of different wired or wireless links, as is known.
  • the signal may be stored on a processor-readable medium.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Biomedical Technology (AREA)
  • General Health & Medical Sciences (AREA)
  • General Physics & Mathematics (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Quality & Reliability (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)
  • Testing, Inspecting, Measuring Of Stereoscopic Televisions And Televisions (AREA)
PCT/CN2013/077082 2013-02-07 2013-06-09 Method and apparatus for context-based video quality assessment Ceased WO2014121571A1 (en)

Priority Applications (8)

Application Number Priority Date Filing Date Title
KR1020157021123A KR20150115771A (ko) 2013-02-07 2013-06-09 컨텍스트-기반 비디오 품질 평가를 위한 방법 및 장치
CN201380072550.8A CN104995914A (zh) 2013-02-07 2013-06-09 用于基于上下文的视频质量评估的方法和装置
BR112015018465A BR112015018465A2 (pt) 2013-02-07 2013-06-09 método e aparelho para avaliação de qualidade de vídeo à base de contexto
US14/763,940 US9716881B2 (en) 2013-02-07 2013-06-09 Method and apparatus for context-based video quality assessment
HK16106409.4A HK1218482A1 (zh) 2013-02-07 2013-06-09 用於基於上下文的视频质量评估的方法和装置
EP13874735.7A EP2954677B1 (en) 2013-02-07 2013-06-09 Method and apparatus for context-based video quality assessment
AU2013377642A AU2013377642A1 (en) 2013-02-07 2013-06-09 Method and apparatus for context-based video quality assessment
JP2015556366A JP2016510567A (ja) 2013-02-07 2013-06-09 コンテキスト・ベースのビデオ品質評価のための方法および装置

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CNPCT/CN2013/071538 2013-02-07
CN2013071538 2013-02-07

Publications (1)

Publication Number Publication Date
WO2014121571A1 true WO2014121571A1 (en) 2014-08-14

Family

ID=51299229

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2013/077082 Ceased WO2014121571A1 (en) 2013-02-07 2013-06-09 Method and apparatus for context-based video quality assessment

Country Status (8)

Country Link
US (1) US9716881B2 (enExample)
EP (1) EP2954677B1 (enExample)
JP (1) JP2016510567A (enExample)
KR (1) KR20150115771A (enExample)
AU (1) AU2013377642A1 (enExample)
BR (1) BR112015018465A2 (enExample)
HK (1) HK1218482A1 (enExample)
WO (1) WO2014121571A1 (enExample)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104396238B (zh) * 2012-07-17 2016-12-28 汤姆逊许可公司 比特流级别的视频质量评估
KR102452313B1 (ko) * 2017-11-29 2022-10-07 한국전자통신연구원 인지 민감도를 고려하는 영상 화질 측정 방법 및 장치
US10887602B2 (en) * 2018-02-07 2021-01-05 Netflix, Inc. Techniques for modeling temporal distortions when predicting perceptual video quality

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070127819A1 (en) * 2005-12-05 2007-06-07 Samsung Electronics Co., Ltd. Method and apparatus for object detection in sequences
CN101572830A (zh) * 2008-04-30 2009-11-04 汤姆森许可贸易公司 用于对帧序列的失真版本的质量进行评估的方法
CN101635846A (zh) * 2008-07-21 2010-01-27 华为技术有限公司 一种视频质量评估方法、系统及装置
CN101695141A (zh) * 2009-10-20 2010-04-14 浙江大学 一种视频质量评估方法和装置

Family Cites Families (16)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6331859B1 (en) 1999-04-06 2001-12-18 Sharp Laboratories Of America, Inc. Video skimming system utilizing the vector rank filter
US6671324B2 (en) 2001-04-16 2003-12-30 Mitsubishi Electric Research Laboratories, Inc. Estimating total average distortion in a video with variable frameskip
US7209519B2 (en) 2001-04-16 2007-04-24 Mitsubishi Electric Research Laboratories, Inc. Encoding a video with a variable frame-rate while minimizing total average distortion
US7298865B2 (en) * 2001-07-30 2007-11-20 Sarnoff Corporation Secure robust high-fidelity watermarking
US6885757B2 (en) * 2002-04-18 2005-04-26 Sarnoff Corporation Method and apparatus for providing an asymmetric watermark carrier
US20050157791A1 (en) * 2004-01-20 2005-07-21 Eastman Kodak Company System and method for video tone scale reduction
US9025673B2 (en) 2006-04-05 2015-05-05 Qualcomm Incorporated Temporal quality metric for video coding
US20080174694A1 (en) 2007-01-22 2008-07-24 Horizon Semiconductors Ltd. Method and apparatus for video pixel interpolation
US8401331B2 (en) * 2007-12-06 2013-03-19 Alcatel Lucent Video quality analysis using a linear approximation technique
CN102317974B (zh) 2009-02-12 2014-08-06 杜比实验室特许公司 图像序列的质量评估
WO2011053255A1 (en) * 2009-10-30 2011-05-05 Agency For Science, Technology And Research Methods, devices, and computer readable mediums for processing a digital picture
WO2011082719A1 (en) 2010-01-11 2011-07-14 Telefonaktiebolaget L M Ericsson (Publ) Technique for video quality estimation
BR112013014367B1 (pt) * 2010-12-10 2022-07-12 Deutsche Telekom Ag Método e aparelho para avaliar a qualidade de uma sequência de sinal de vídeo transmitida no lado de receptor
CN102685548B (zh) 2012-05-29 2015-09-30 公安部第三研究所 视频质量的无参评估方法
CN104396238B (zh) 2012-07-17 2016-12-28 汤姆逊许可公司 比特流级别的视频质量评估
CA2958720C (en) * 2013-09-06 2020-03-24 Zhou Wang Method and system for objective perceptual video quality assessment

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070127819A1 (en) * 2005-12-05 2007-06-07 Samsung Electronics Co., Ltd. Method and apparatus for object detection in sequences
CN101572830A (zh) * 2008-04-30 2009-11-04 汤姆森许可贸易公司 用于对帧序列的失真版本的质量进行评估的方法
CN101635846A (zh) * 2008-07-21 2010-01-27 华为技术有限公司 一种视频质量评估方法、系统及装置
CN101695141A (zh) * 2009-10-20 2010-04-14 浙江大学 一种视频质量评估方法和装置

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
See also references of EP2954677A4 *

Also Published As

Publication number Publication date
EP2954677A1 (en) 2015-12-16
US9716881B2 (en) 2017-07-25
EP2954677B1 (en) 2019-07-31
KR20150115771A (ko) 2015-10-14
EP2954677A4 (en) 2016-07-13
JP2016510567A (ja) 2016-04-07
US20150373324A1 (en) 2015-12-24
BR112015018465A2 (pt) 2017-07-18
AU2013377642A1 (en) 2015-07-23
HK1218482A1 (zh) 2017-02-17

Similar Documents

Publication Publication Date Title
JP5670551B2 (ja) ビデオストリームの品質を評価する方法と装置
KR101641994B1 (ko) 비디오 인코딩 압축 품질을 획득하는 방법 및 장치
JP5215288B2 (ja) 映像符号化のための時間的品質メトリック
EP2783513A1 (en) Video quality assessment considering scene cut artifacts
EP2875640B1 (en) Video quality assessment at a bitstream level
Kumar et al. Quality of experience driven rate adaptation for adaptive HTTP streaming
EP2908522B1 (en) Method and device for evaluating video freeze distortion degree
US9723301B2 (en) Method and apparatus for context-based video quality assessment
US9716881B2 (en) Method and apparatus for context-based video quality assessment
CN104995914A (zh) 用于基于上下文的视频质量评估的方法和装置
CN104969548A (zh) 用于基于上下文的视频质量评估的方法和装置
WO2014198062A1 (en) Method and apparatus for video quality measurement
WO2014032451A1 (en) Method and apparatus for estimating motion homogeneity for video quality assessment
HK1194577B (en) Method and device for obtaining video encoding compression quality

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 13874735

Country of ref document: EP

Kind code of ref document: A1

WWE Wipo information: entry into national phase

Ref document number: 2013874735

Country of ref document: EP

ENP Entry into the national phase

Ref document number: 2013377642

Country of ref document: AU

Date of ref document: 20130609

Kind code of ref document: A

WWE Wipo information: entry into national phase

Ref document number: 14763940

Country of ref document: US

ENP Entry into the national phase

Ref document number: 20157021123

Country of ref document: KR

Kind code of ref document: A

ENP Entry into the national phase

Ref document number: 2015556366

Country of ref document: JP

Kind code of ref document: A

NENP Non-entry into the national phase

Ref country code: DE

REG Reference to national code

Ref country code: BR

Ref legal event code: B01A

Ref document number: 112015018465

Country of ref document: BR

ENP Entry into the national phase

Ref document number: 112015018465

Country of ref document: BR

Kind code of ref document: A2

Effective date: 20150731