WO2012103326A2 - Adaptive bit rate control based on scenes - Google Patents
Adaptive bit rate control based on scenes Download PDFInfo
- Publication number
- WO2012103326A2 WO2012103326A2 PCT/US2012/022710 US2012022710W WO2012103326A2 WO 2012103326 A2 WO2012103326 A2 WO 2012103326A2 US 2012022710 W US2012022710 W US 2012022710W WO 2012103326 A2 WO2012103326 A2 WO 2012103326A2
- Authority
- WO
- WIPO (PCT)
- Prior art keywords
- scene
- video
- encoding
- video stream
- recited
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Ceased
Links
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/102—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or selection affected or controlled by the adaptive coding
- H04N19/115—Selection of the code volume for a coding unit prior to coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/136—Incoming video signal characteristics or properties
- H04N19/137—Motion inside a coding unit, e.g. average field, frame or block difference
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/142—Detection of scene cut or scene change
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/10—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
- H04N19/134—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
- H04N19/146—Data rate or code amount at the encoder output
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/46—Embedding additional information in the video signal during the compression process
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N19/00—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
- H04N19/60—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding
- H04N19/61—Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using transform coding in combination with predictive coding
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04N—PICTORIAL COMMUNICATION, e.g. TELEVISION
- H04N21/00—Selective content distribution, e.g. interactive television or video on demand [VOD]
- H04N21/20—Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
- H04N21/25—Management operations performed by the server for facilitating the content distribution or administrating data related to end-users or client devices, e.g. end-user or client device authentication, learning user preferences for recommending movies
- H04N21/266—Channel or content management, e.g. generation and management of keys and entitlement messages in a conditional access system, merging a VOD unicast channel into a multicast channel
- H04N21/2662—Controlling the complexity of the video stream, e.g. by scaling the resolution or bitrate of the video stream based on the client capabilities
Definitions
- the present invention relates to a video and image compression technique and more particularly, to a video and image compression technique using adaptive bit rate control based on scenes.
- a video stream (typically containing an image portion and an audio portion) can require considerable bandwidth, especially at high resolution (e.g., HD videos). Audio typically requires much less bandwidth, but still sometimes needs to be taken into account.
- One streaming video approach is to heavily compress the video stream enabling rapid video delivery to allow a user to view content in run-time or substantially instantaneously (i.e., without experiencing substantial spooling delays).
- lossy compression i.e., compression that is not entirely reversible
- provides more compression than lossless compression but heavy lossy compression provides an undesirable user experience.
- the H.264 standard employs the same principles of block-based motion compensated hybrid transform coding that are known from the established standards such as MPEG-2.
- the H.264 syntax is, therefore, organized as the usual hierarchy of headers, such as picture-, slice- and macro-block headers, and data, such as motion-vectors, block-transform coefficients, quantizer scale, etc.
- the H.264 standard separates the Video Coding Layer (VCL), which represents the content of the video data, and the Network Adaptation Layer (NAL), which formats data and provides header information.
- VCL Video Coding Layer
- NAL Network Adaptation Layer
- H.264 allows for a much increased choice of encoding parameters. For example, it allows for a more elaborate partitioning and manipulation of 16x16 macro-blocks whereby e.g. motion compensation process can be performed on segmentations of a macro-block as small as 4x4 in size.
- the selection process for motion compensated prediction of a sample block may involve a number of stored previously-decoded pictures, instead of only the adjacent pictures. Even with intra coding within a single frame, it is possible to form a prediction of a block using previously-decoded samples from the same frame.
- the resulting prediction error following motion compensation may be transformed and quantized based on a 4x4 block size, instead of the traditional 8x8 size.
- an in-loop deblocking filter is now mandatory.
- the H.264 standard may be considered a superset of the H.262 / MPEG-2 video encoding syntax in that it uses the same global structuring of video data while extending the number of possible coding decisions and parameters.
- a consequence of having a variety of coding decisions is that a good trade-off between the bit rate and picture quality may be achieved.
- the H.264 standard may significantly reduce typical artifacts of block-based coding, it can also accentuate other artifacts.
- the fact that H.264 allows for an increased number of possible values for various coding parameters thus results in an increased potential for improving the encoding process, but also results in increased sensitivity to the choice of video encoding parameters.
- H.264 does not specify a normative procedure for selecting video encoding parameters, but describes through a reference implementation, a number of criteria that may be used to select video encoding parameters such as to achieve a suitable trade-off between coding efficiency, video quality and practicality of implementation.
- the described criteria may not always result in an optimal or suitable selection of coding parameters suitable for all kind of contents and applications.
- the criteria may not result in selection of video encoding parameters optimal or desirable for the characteristics of the video signal or the criteria may be based on attaining characteristics of the encoded signal which are not appropriate for the current application.
- TCP/IP network such as the Internet
- IP network is not a "bit stream" pipe, but a best effort network which the transmission capacity varies at any time.
- Encoding and transmitting videos using a CBR or VBR approach is not ideal in the best effort network.
- Some protocols have been designed to deliver video over the Internet.
- a good example is HTTP Adaptive Bit Rate Video Streaming, wherein a video stream is segmented into files, which are delivered as files over HTTP connections. Each of those files contains a video sequence having a predetermined play time; and the bit rates may vary and the file size may vary. Thus, some files may be shorter than others.
- FIG. 2 illustrates steps of a sample method for encoding an input video stream.
- FIG. 3 is a block diagram of a processing system that can be used to implement an encoder implementing certain techniques described herein.
- FIG. 1 illustrates an example of an encoder 100, according to one embodiment of the present invention.
- the encoder 100 receives an input video stream 1 10 and outputs an encoded video stream 120 that can be decoded at a decoder to recover, at least approximately, an instance of the input video stream 1 10.
- the encoder 100 comprises an input module 102, a video processing module 104, and a video encoding module 106.
- the encoder 100 may be implemented in hardware, software, or any suitable combination.
- the encoder 100 may include other components such as a video transmitting module, a parameter input module, memory for storing parameters, etc.
- the encoder 100 may perform other video processing functions not specifically described herein.
- the input module 102 receives the input video stream 1 10.
- the input video stream 1 10 may take any suitable form, and may originate from any of a variety of suitable sources such as memory, or even from a live feed.
- the input module 102 further receives scene boundary information and target bit rate for each scene.
- the scene boundary information indicates positions in the input video stream where scene transitions occur.
- the video processing module 104 analyzes an input video stream 1 10 and divides the video stream 1 10 into a plurality of sections for each of the plurality of scenes based on the scene boundary information. Each section comprises a plurality of temporally continuous image irarnes. In one embodiment, the video processing module further segments the input video stream into a plurality of files. Each file contains one or more sections. In another embodiment the position, resolution and time stamp or start frame number of each section of a video file is recorded into a file or database. A video encoding module encodes each section using the associated target bit rate or video quality with a bit-rate constrain. In one embodiment, the encoder further comprises a video transmitting module for transmitting the files over a network connection such as an HTTP connection.
- a network connection such as an HTTP connection.
- optical resolution of the video image frames are detected and utilized to determining the true or optimal scene video dimensions and the scene division.
- the optical resolution describes a resolution at which one or more video image frames can continuously resolve details. Due to the limitations of the capturing optics, recording media, original format, the optical resolution of a video image frame may be much less than the technical resolution of the video image frame.
- the video processing module may detect an optical resolution of the image frames within each section. A scene type may be determined based on the optical resolution of the image frames within the section.
- the target bit rate of a section may be determined based on an optical resolution of the image frames within the section. For a certain section with a low optical resolution, the target bit rate can be lower because high bit rate does not help retaining the fidelity of the section.
- those up-scalers that convert a low resolution image to fit into a higher resolution video frame may also produce unwanted artifacts. This is especially true in old scaling technologies. By recovering the original resolution we will allow modern video processors to upscale the image in a more efficient way and avoid encoding unwanted artifacts that are not part of the original image.
- the video encode module may encode each section using any encoding standards such as H.264/MPEG-4 AVC standard.
- Each section may be encoded at a different level of perceptual qualities conveying different bit rates (i.e. 500Kbps, 1 Mbps, 2Mbps).
- bit rates i.e. 500Kbps, 1 Mbps, 2Mbps.
- an optical or video quality bar is met at a certain low bit-rate, i.e. 500 Kbps
- the encoding process may not be needed for higher bit-rates, avoiding the need to encode that scene at a higher bit-rate, i.e. 1 Mbps or 2Mbps. See table 1.
- the single file will only store the scenes needed to be encoded at a higher bit-rate.
- it may be necessary to storage in the high-bit-rate file i.e.
- the section or segments to be stored will be the low-bit-rate ones, i.e. 500Kbps instead of the high-bit rate ones. Therefore, storage space is saved. (But not as significant as not storing the scenes). See Table 2. In other case such for systems that doesn't support multiple resolutions in a single video file, the storage of the sections will occur in files with a determined frame size. To minimize the number of files at each resolution, some systems will limit the number of frames sizes such as SDTV, HD720p, HD 1080p. See Table 3.
- Each section, based on a different scene, may be encoded at a different level of perceptual quality and a different bit rate.
- the encoder reads an input video stream and a database or other listing of scenes, and then partitions the video stream into sections based on the information of scenes.
- An example data structure for a listing of scenes in a video is shown in Table 4.
- the data structure may be stored in a computer readable memory or a database and be accessible by the encoder.
- scenes may be utilized for the listing of scenes, such as “fast motion”, “static”, “talking head”, “text”, “mostly black images”, “short scene of five frames or less”, “black screen”, “low interest”, “file”, “water”, “smoke”, “credits”, “blur”, “out of focus”, “image having a lower resolution than the image container size”, etc.
- scene sequences might be “miscellaneous”, “unknown” or “default” scene types assigned to such scenes.
- FIG. 2 illustrates steps of a method 200 for encoding an input video stream.
- the method 200 encodes the input video stream to an encoded video bit stream that can be decoded at a decoder to recover, at least approximately, an instance of the input video stream.
- the method receives an input video stream to be encoded.
- the method receives scene boundary information that indicates positions in the input video stream where scene transitions occur and target bit rate for each scene.
- the input video stream is divided into a plurality of sections based on the scene boundary information, each section comprising a plurality of temporally contiguous image frames.
- the method detects optical resolution of the image frames within each section.
- the method segments the input video stream into a plurality of files, each file containing one or more sections.
- each of the plurality of sections is encoded according to the target bit rate.
- the method transmits the plurality of files over an HTTP connection.
- the input video stream typically includes multiple image frames. Each image frame can typically be identified based on a distinct "time position" in the input video stream.
- the input video stream can be a stream that is made available to the encoder in parts or discrete segments.
- the encoder outputs the encoded video bit stream (for example, to a final consumer device such as a HDTV) as a stream on a rolling basis before even receiving the entire input video stream.
- the input video stream and the encoded video bit stream are stored as a sequence of streams.
- the encoding may be performed ahead of time and the encoded video streams may then be streamed to a consumer device at a later time.
- the encoding is completely performed on the entire video stream prior to being streamed over to the consumer device. It is understood that other examples of pre, post, or "in-line" encoding of video streams, or a combination thereof, as may be contemplated by a person of ordinary ski ll in the art. are also contemplated in conj unction with the techniques introduced herein.
- FIG. 3 is a block diagram of a processing system that can be used to implement any of the techniques described above, such as an encoder. Note that in certain embodiments, at least some of the components illustrated in FIG. 3 may be distributed between two or more physically separate but connected computing platforms or boxes.
- the processing can represent a conventional server-class computer, PC, mobile communication device (e.g., smartphone), or any other known or conventional processing/communication device.
- the processing system 301 shown in FIG. 3 includes one or more processors 310, i.e. a central processing unit (CPU), memory 320, at least one communication device 340 such as an Ethernet adapter and/or wireless communication subsystem (e.g., cellular, WiFi, Bluetooth or the like), and one or more I/O devices 370, 380, all coupled to each other through an interconnect 390.
- processors 310 i.e. a central processing unit (CPU), memory 320, at least one communication device 340 such as an Ethernet adapter and/or wireless communication subsystem (e.g., cellular, WiFi, Bluetooth or the like), and one or more I/O devices 370, 380, all coupled to each other through an interconnect 390.
- processors 310 i.e. a central processing unit (CPU), memory 320, at least one communication device 340 such as an Ethernet adapter and/or wireless communication subsystem (e.g., cellular, WiFi, Bluetooth or the like), and one or more I/O devices
- the processor(s) 310 control(s) the operation of the computer system 301 and may be or include one or more programmable general-purpose or special-purpose
- the interconnect 390 can include one or more buses, direct connections and/or other types of physical connections, and may include various bridges, controllers and/or adapters such as are well-known in the art.
- the interconnect 390 further may include a "system bus”, which may be connected through one or more adapters to one or more expansion buses, such as a form of Peripheral Component Interconnect (PCI) bus, HyperTransport or industry standard architecture (ISA) bus, small computer system interface (SCSI) bus, universal serial bus (USB), or Institute of Electrical and Electronics Engineers (IEEE) standard 1394 bus (sometimes referred to as "Firewire").
- PCI Peripheral Component Interconnect
- ISA HyperTransport or industry standard architecture
- SCSI small computer system interface
- USB universal serial bus
- IEEE Institute of Electrical and Electronics Engineers
- the memory 320 may be or include one or more memory devices of one or more types, such as read-only memory (ROM), random access memory (RAM), flash memory, disk drives, etc.
- the network adapter 340 is a device suitable for enabling the processing system 301 to communicate data with a remote processing system over a communication link, and may be, for example, a conventional telephone modem, a wireless modem, a Digital Subscriber Line (DSL) modem, a cable modem, a radio transceiver, a satellite transceiver, an Ethernet adapter, or the like.
- DSL Digital Subscriber Line
- programmable circuitry e.g., one or more microprocessors
- Special-purpose hardwired circuitry may be in the form of, for example, one or more application-specific integrated circuits (ASICs), programmable logic devices (PLDs), field-programmable gate arrays (FPGAs), etc.
- ASICs application-specific integrated circuits
- PLDs programmable logic devices
- FPGAs field-programmable gate arrays
- logic can include, for example, programmable circuitry programmed with specific software and/or firmware, special-purpose hardwired circuitry, or a combination thereof.
Landscapes
- Engineering & Computer Science (AREA)
- Multimedia (AREA)
- Signal Processing (AREA)
- Databases & Information Systems (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
Priority Applications (9)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| MX2013008757A MX2013008757A (es) | 2011-01-28 | 2012-01-26 | Control de velocidad de bits adaptativa basado en escenas. |
| KR1020137022649A KR20140034149A (ko) | 2011-01-28 | 2012-01-26 | 장면에 기초한 적응적 비트 레이트 제어 |
| JP2013551331A JP6134650B2 (ja) | 2011-01-28 | 2012-01-26 | シーンに基づく適用性のあるビットレート制御 |
| EP12738976.5A EP2668779A4 (en) | 2011-01-28 | 2012-01-26 | ADAPTIVE BITRATE CONTROL BASED ON SCENES |
| CA2825929A CA2825929A1 (en) | 2011-01-28 | 2012-01-26 | Adaptive bit rate control based on scenes |
| AU2012211243A AU2012211243A1 (en) | 2011-01-28 | 2012-01-26 | Adaptive bit rate control based on scenes |
| CN201280015700.7A CN103493481A (zh) | 2011-01-28 | 2012-01-26 | 基于场景的适应性比特率控制 |
| BR112013020068A BR112013020068A2 (pt) | 2011-01-28 | 2012-01-26 | métodos e parelhos para controle adaptativo da taxa de bits baseado em cenas |
| IL227673A IL227673A (en) | 2011-01-28 | 2013-07-28 | Scene-adjusted bit rate control |
Applications Claiming Priority (4)
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US201161437223P | 2011-01-28 | 2011-01-28 | |
| US201161437193P | 2011-01-28 | 2011-01-28 | |
| US61/437,193 | 2011-01-28 | ||
| US61/437,223 | 2011-01-28 |
Publications (2)
| Publication Number | Publication Date |
|---|---|
| WO2012103326A2 true WO2012103326A2 (en) | 2012-08-02 |
| WO2012103326A3 WO2012103326A3 (en) | 2012-11-01 |
Family
ID=46577355
Family Applications (1)
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| PCT/US2012/022710 Ceased WO2012103326A2 (en) | 2011-01-28 | 2012-01-26 | Adaptive bit rate control based on scenes |
Country Status (12)
| Country | Link |
|---|---|
| US (1) | US20120195369A1 (enExample) |
| EP (1) | EP2668779A4 (enExample) |
| JP (1) | JP6134650B2 (enExample) |
| KR (1) | KR20140034149A (enExample) |
| CN (1) | CN103493481A (enExample) |
| AU (2) | AU2012211243A1 (enExample) |
| BR (1) | BR112013020068A2 (enExample) |
| CA (1) | CA2825929A1 (enExample) |
| IL (1) | IL227673A (enExample) |
| MX (1) | MX2013008757A (enExample) |
| TW (1) | TWI586177B (enExample) |
| WO (1) | WO2012103326A2 (enExample) |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR101840685B1 (ko) * | 2014-03-12 | 2018-03-21 | 경희대학교 산학협력단 | 가변 비트율 미디어 전송을 위한 보장 비트율 반환 방법 및 장치 |
| CN119031135A (zh) * | 2024-10-18 | 2024-11-26 | 每日互动股份有限公司 | 一种基于采样的视频解码方法、装置、介质及设备 |
Families Citing this family (26)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US10165274B2 (en) * | 2011-01-28 | 2018-12-25 | Eye IO, LLC | Encoding of video stream based on scene type |
| US9554142B2 (en) * | 2011-01-28 | 2017-01-24 | Eye IO, LLC | Encoding of video stream based on scene type |
| US9042441B2 (en) | 2012-04-25 | 2015-05-26 | At&T Intellectual Property I, Lp | Apparatus and method for media streaming |
| US8949440B2 (en) * | 2012-07-19 | 2015-02-03 | Alcatel Lucent | System and method for adaptive rate determination in mobile video streaming |
| US9185437B2 (en) | 2012-11-01 | 2015-11-10 | Microsoft Technology Licensing, Llc | Video data |
| US10708335B2 (en) | 2012-11-16 | 2020-07-07 | Time Warner Cable Enterprises Llc | Situation-dependent dynamic bit rate encoding and distribution of content |
| US9967300B2 (en) * | 2012-12-10 | 2018-05-08 | Alcatel Lucent | Method and apparatus for scheduling adaptive bit rate streams |
| KR101415429B1 (ko) * | 2014-03-20 | 2014-07-09 | 인하대학교 산학협력단 | 블록 아티팩트 기반의 동영상 화질 최적화를 위한 비트레이트 결정 방법 |
| US9811882B2 (en) | 2014-09-30 | 2017-11-07 | Electronics And Telecommunications Research Institute | Method and apparatus for processing super resolution image using adaptive preprocessing filtering and/or postprocessing filtering |
| CN105245813B (zh) * | 2015-10-29 | 2018-05-22 | 北京易视云科技有限公司 | 一种视频优化存储的处理器 |
| CN105307053B (zh) * | 2015-10-29 | 2018-05-22 | 北京易视云科技有限公司 | 一种基于视频内容的视频优化存储的方法 |
| CN105323591B (zh) * | 2015-10-29 | 2018-06-19 | 四川奇迹云科技有限公司 | 一种基于psnr阈值的视频分段存储的方法 |
| US10742708B2 (en) | 2017-02-23 | 2020-08-11 | Netflix, Inc. | Iterative techniques for generating multiple encoded versions of a media title |
| US10715814B2 (en) | 2017-02-23 | 2020-07-14 | Netflix, Inc. | Techniques for optimizing encoding parameters for different shot sequences |
| US11166034B2 (en) | 2017-02-23 | 2021-11-02 | Netflix, Inc. | Comparing video encoders/decoders using shot-based encoding and a perceptual visual quality metric |
| US11153585B2 (en) | 2017-02-23 | 2021-10-19 | Netflix, Inc. | Optimizing encoding operations when generating encoded versions of a media title |
| US10666992B2 (en) | 2017-07-18 | 2020-05-26 | Netflix, Inc. | Encoding techniques for optimizing distortion and bitrate |
| US12255940B2 (en) | 2017-07-18 | 2025-03-18 | Netflix, Inc. | Encoding techniques for optimizing distortion and bitrate |
| US10623744B2 (en) | 2017-10-04 | 2020-04-14 | Apple Inc. | Scene based rate control for video compression and video streaming |
| US11871052B1 (en) * | 2018-09-27 | 2024-01-09 | Apple Inc. | Multi-band rate control |
| EP3869801A4 (en) * | 2018-10-18 | 2021-12-08 | Sony Group Corporation | CODING DEVICE, ENCODING PROCESS, AND DECODING DEVICE |
| US11470327B2 (en) * | 2020-03-30 | 2022-10-11 | Alibaba Group Holding Limited | Scene aware video content encoding |
| US11616993B1 (en) * | 2021-10-22 | 2023-03-28 | Hulu, LLC | Dyanamic parameter adjustment for adaptive bitrate algorithm |
| CN114511535B (zh) * | 2022-01-28 | 2025-11-11 | 北京百度网讯科技有限公司 | 一种白屏检测方法、装置、电子设备、介质及产品 |
| CN116170581B (zh) * | 2023-02-17 | 2024-01-23 | 厦门瑞为信息技术有限公司 | 一种基于目标感知的视频信息编解码方法和电子设备 |
| CN120547336A (zh) * | 2024-10-15 | 2025-08-26 | 支付宝(杭州)信息技术有限公司 | 一种视频码率控制方法、装置和设备 |
Family Cites Families (37)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP3265818B2 (ja) * | 1994-04-14 | 2002-03-18 | 松下電器産業株式会社 | 動画符号化方法 |
| JP4416845B2 (ja) * | 1996-09-30 | 2010-02-17 | ソニー株式会社 | 符号化装置及びその方法、および、記録装置及びその方法 |
| JP2001245303A (ja) * | 2000-02-29 | 2001-09-07 | Toshiba Corp | 動画像符号化装置および動画像符号化方法 |
| JP4428680B2 (ja) * | 2000-11-06 | 2010-03-10 | パナソニック株式会社 | 映像信号符号化方法および映像信号符号化装置 |
| US6909745B1 (en) * | 2001-06-05 | 2005-06-21 | At&T Corp. | Content adaptive video encoder |
| WO2003060823A2 (en) * | 2001-12-26 | 2003-07-24 | Yeda Research And Development Co.Ltd. | A system and method for increasing space or time resolution in video |
| US7099389B1 (en) * | 2002-12-10 | 2006-08-29 | Tut Systems, Inc. | Rate control with picture-based lookahead window |
| US7260251B2 (en) * | 2003-03-31 | 2007-08-21 | Cdm Optics, Inc. | Systems and methods for minimizing aberrating effects in imaging systems |
| US7558320B2 (en) * | 2003-06-13 | 2009-07-07 | Microsoft Corporation | Quality control in frame interpolation with motion analysis |
| TWI264192B (en) * | 2003-09-29 | 2006-10-11 | Intel Corp | Apparatus and methods for communicating using symbol-modulated subcarriers |
| JP4180497B2 (ja) * | 2003-12-05 | 2008-11-12 | 富士通株式会社 | コード種類判別方法、およびコード境界検出方法 |
| US7280804B2 (en) * | 2004-01-30 | 2007-10-09 | Intel Corporation | Channel adaptation using variable sounding signal rates |
| US7869500B2 (en) * | 2004-04-27 | 2011-01-11 | Broadcom Corporation | Video encoder and method for detecting and encoding noise |
| DE102004034973A1 (de) * | 2004-07-16 | 2006-02-16 | Carl Zeiss Jena Gmbh | Verfahren zur Erfassung von Bildern einer Probe mit einem Lichtrastermikroskop |
| TWI279693B (en) * | 2005-01-27 | 2007-04-21 | Etoms Electronics Corp | Method and device of audio compression |
| KR20070117660A (ko) * | 2005-03-10 | 2007-12-12 | 콸콤 인코포레이티드 | 컨텐트 적응적 멀티미디어 처리 |
| JP2006340066A (ja) * | 2005-06-02 | 2006-12-14 | Mitsubishi Electric Corp | 動画像符号化装置、動画像符号化方法及び記録再生方法 |
| US20070024706A1 (en) * | 2005-08-01 | 2007-02-01 | Brannon Robert H Jr | Systems and methods for providing high-resolution regions-of-interest |
| US8879857B2 (en) * | 2005-09-27 | 2014-11-04 | Qualcomm Incorporated | Redundant data encoding methods and device |
| US20070074251A1 (en) * | 2005-09-27 | 2007-03-29 | Oguz Seyfullah H | Method and apparatus for using random field models to improve picture and video compression and frame rate up conversion |
| US7912123B2 (en) * | 2006-03-01 | 2011-03-22 | Streaming Networks (Pvt.) Ltd | Method and system for providing low cost robust operational control of video encoders |
| US8155454B2 (en) * | 2006-07-20 | 2012-04-10 | Qualcomm Incorporated | Method and apparatus for encoder assisted post-processing |
| TW200814785A (en) * | 2006-09-13 | 2008-03-16 | Sunplus Technology Co Ltd | Coding method and system with an adaptive bitplane coding mode |
| WO2008127319A2 (en) * | 2007-01-31 | 2008-10-23 | Thomson Licensing | Method and apparatus for automatically categorizing potential shot and scene detection information |
| US8743963B2 (en) * | 2007-08-13 | 2014-06-03 | Ntt Docomo, Inc. | Image/video quality enhancement and super-resolution using sparse transformations |
| JP2009049474A (ja) * | 2007-08-13 | 2009-03-05 | Toshiba Corp | 情報処理装置および再符号化方法 |
| US9628811B2 (en) * | 2007-12-17 | 2017-04-18 | Qualcomm Incorporated | Adaptive group of pictures (AGOP) structure determination |
| WO2009087641A2 (en) * | 2008-01-10 | 2009-07-16 | Ramot At Tel-Aviv University Ltd. | System and method for real-time super-resolution |
| JP4539754B2 (ja) * | 2008-04-11 | 2010-09-08 | ソニー株式会社 | 情報処理装置及び情報処理方法 |
| US8325800B2 (en) * | 2008-05-07 | 2012-12-04 | Microsoft Corporation | Encoding streaming media as a high bit rate layer, a low bit rate layer, and one or more intermediate bit rate layers |
| JP5659154B2 (ja) * | 2008-06-06 | 2015-01-28 | アマゾン テクノロジーズ インコーポレイテッド | クライアント側ストリームスイッチング |
| ES2546091T3 (es) * | 2008-11-12 | 2015-09-18 | Cisco Technology, Inc. | Procesamiento de un programa de vídeo que tiene representaciones procesadas plurales de una señal de vídeo única para reconstrucción y emisión |
| US8396114B2 (en) * | 2009-01-29 | 2013-03-12 | Microsoft Corporation | Multiple bit rate video encoding using variable bit rate and dynamic resolution for adaptive video streaming |
| US8270473B2 (en) * | 2009-06-12 | 2012-09-18 | Microsoft Corporation | Motion based dynamic resolution multiple bit rate video encoding |
| JP4746691B2 (ja) * | 2009-07-02 | 2011-08-10 | 株式会社東芝 | 動画像符号化装置および動画像符号化方法 |
| US8837576B2 (en) * | 2009-11-06 | 2014-09-16 | Qualcomm Incorporated | Camera parameter-assisted video encoding |
| CN102907092B (zh) * | 2010-05-26 | 2017-02-15 | 高通股份有限公司 | 相机参数辅助式视频帧速率上转换 |
-
2012
- 2012-01-26 US US13/358,877 patent/US20120195369A1/en not_active Abandoned
- 2012-01-26 CA CA2825929A patent/CA2825929A1/en not_active Abandoned
- 2012-01-26 JP JP2013551331A patent/JP6134650B2/ja not_active Expired - Fee Related
- 2012-01-26 CN CN201280015700.7A patent/CN103493481A/zh active Pending
- 2012-01-26 WO PCT/US2012/022710 patent/WO2012103326A2/en not_active Ceased
- 2012-01-26 BR BR112013020068A patent/BR112013020068A2/pt not_active IP Right Cessation
- 2012-01-26 EP EP12738976.5A patent/EP2668779A4/en not_active Ceased
- 2012-01-26 AU AU2012211243A patent/AU2012211243A1/en not_active Abandoned
- 2012-01-26 MX MX2013008757A patent/MX2013008757A/es not_active Application Discontinuation
- 2012-01-26 KR KR1020137022649A patent/KR20140034149A/ko not_active Ceased
- 2012-01-30 TW TW101102947A patent/TWI586177B/zh not_active IP Right Cessation
-
2013
- 2013-07-28 IL IL227673A patent/IL227673A/en not_active IP Right Cessation
-
2016
- 2016-10-28 AU AU2016250476A patent/AU2016250476A1/en not_active Abandoned
Non-Patent Citations (1)
| Title |
|---|
| See references of EP2668779A4 * |
Cited By (2)
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| KR101840685B1 (ko) * | 2014-03-12 | 2018-03-21 | 경희대학교 산학협력단 | 가변 비트율 미디어 전송을 위한 보장 비트율 반환 방법 및 장치 |
| CN119031135A (zh) * | 2024-10-18 | 2024-11-26 | 每日互动股份有限公司 | 一种基于采样的视频解码方法、装置、介质及设备 |
Also Published As
| Publication number | Publication date |
|---|---|
| JP2014511137A (ja) | 2014-05-08 |
| MX2013008757A (es) | 2014-02-28 |
| CN103493481A (zh) | 2014-01-01 |
| WO2012103326A3 (en) | 2012-11-01 |
| BR112013020068A2 (pt) | 2018-03-06 |
| TWI586177B (zh) | 2017-06-01 |
| CA2825929A1 (en) | 2012-08-02 |
| US20120195369A1 (en) | 2012-08-02 |
| AU2012211243A1 (en) | 2013-08-22 |
| KR20140034149A (ko) | 2014-03-19 |
| IL227673A0 (en) | 2013-09-30 |
| JP6134650B2 (ja) | 2017-05-24 |
| EP2668779A4 (en) | 2015-07-22 |
| TW201238356A (en) | 2012-09-16 |
| IL227673A (en) | 2017-09-28 |
| EP2668779A2 (en) | 2013-12-04 |
| AU2016250476A1 (en) | 2016-11-17 |
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| US20120195369A1 (en) | Adaptive bit rate control based on scenes | |
| US9071841B2 (en) | Video transcoding with dynamically modifiable spatial resolution | |
| US9554142B2 (en) | Encoding of video stream based on scene type | |
| CN108769693B (zh) | 质量感知视频优化中的宏块级自适应量化 | |
| US11743475B2 (en) | Advanced video coding method, system, apparatus, and storage medium | |
| CN109788316B (zh) | 码率控制、视频转码方法与装置、计算机设备及存储介质 | |
| US20150350726A1 (en) | Method and apparatus of content-based self-adaptive video transcoding | |
| US20150312575A1 (en) | Advanced video coding method, system, apparatus, and storage medium | |
| US10205763B2 (en) | Method and apparatus for the single input multiple output (SIMO) media adaptation | |
| JP2014511138A5 (enExample) | ||
| US10165274B2 (en) | Encoding of video stream based on scene type | |
| US12262024B2 (en) | Method and systems for optimized content encoding | |
| EP3038370A1 (en) | Devices and method for video compression and reconstruction | |
| Jenab et al. | Content-adaptive resolution control to improve video coding efficiency | |
| WO2016193949A1 (en) | Advanced video coding method, system, apparatus and storage medium | |
| WO2024082971A1 (zh) | 一种视频处理方法及相关装置 | |
| CN117676266A (zh) | 视频流的处理方法及装置、存储介质、电子设备 | |
| Richardson | Video compression codecs: a survival guide |
Legal Events
| Date | Code | Title | Description |
|---|---|---|---|
| 121 | Ep: the epo has been informed by wipo that ep was designated in this application |
Ref document number: 12738976 Country of ref document: EP Kind code of ref document: A2 |
|
| ENP | Entry into the national phase |
Ref document number: 2013551331 Country of ref document: JP Kind code of ref document: A Ref document number: 2825929 Country of ref document: CA |
|
| NENP | Non-entry into the national phase |
Ref country code: DE |
|
| WWE | Wipo information: entry into national phase |
Ref document number: MX/A/2013/008757 Country of ref document: MX |
|
| WWE | Wipo information: entry into national phase |
Ref document number: 2012738976 Country of ref document: EP |
|
| ENP | Entry into the national phase |
Ref document number: 2012211243 Country of ref document: AU Date of ref document: 20120126 Kind code of ref document: A |
|
| ENP | Entry into the national phase |
Ref document number: 20137022649 Country of ref document: KR Kind code of ref document: A |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01A Ref document number: 112013020068 Country of ref document: BR |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01E Ref document number: 112013020068 Country of ref document: BR |
|
| REG | Reference to national code |
Ref country code: BR Ref legal event code: B01E Ref document number: 112013020068 Country of ref document: BR Free format text: EM ADITAMENTO A EXIGENCIA PUBLICADA NA RPI 2431 DE 08/08/2017 , SOLICITA-SE QUE A EXIGENCIA SEJA RESPONDIDA CORRETAMENTE, NUM PRAZO DE 60 DIAS, E QUE SEJA EFETUADO O PAGAMENTO DAS GRUS DE CODIGO 207, ESPECIFICO PARA ESSE TIPO DE SERVICO, CORRESPONDENTES AS DUAS EXIGENCIAS FORMULADAS. Ref country code: BR Ref legal event code: B01E Ref document number: 112013020068 Country of ref document: BR |
|
| ENP | Entry into the national phase |
Ref document number: 112013020068 Country of ref document: BR Kind code of ref document: A2 Effective date: 20130729 |