KR100602803B1

KR100602803B1 - Encoding block-organized data

Info

Publication number: KR100602803B1
Application number: KR1020007000588A
Authority: KR
Inventors: 윌헬머스 에이치. 에이. 브룰스; 레이니얼 비. 엠. 클레인거니위크
Original assignee: 코닌클리케 필립스 일렉트로닉스 엔.브이.
Priority date: 1998-05-20
Filing date: 1999-05-06
Publication date: 2006-07-20
Also published as: JP2002516528A; KR20010022020A; US6498812B1; WO1999060772A3; WO1999060772A2; US20020136300A1; US6999515B2; EP0998810A1; CN1301461A; CN1201587C

Abstract

블록들의 시퀀스로서 구성된 정보 스트림은 점진적인 성능 저하 원리에 따라 인코딩된다. 상기 스트림은 시간적인 비-균등 데이터 처리 요구들을 포함한다. 특히, 수신된 블록에 관하여, 하나 이상의 블록형으로 정의된 제어 파라미터들 및 연관된 블록형 처리 부하들이 검출된다(44). 하나 이상의 이전의 블록들에 대한 처리 부하의 제어(46) 하에, 처리 이전의 하나 이상의 나중 블록들에 대하여 하나 이상의 제어 파라미터들을 조절한다(48). 이것은 과도한 부하의 경우에 나중 블록에 대해 기대된 부하를 낮추고, 역으로 서브-표준 부하를 검출할 때도 성립한다.The information stream configured as a sequence of blocks is encoded according to a gradual degradation principle. The stream contains temporal non-uniform data processing requests. In particular, with respect to the received block, one or more blocky defined control parameters and associated blocky processing loads are detected (44). Under control of the processing load 46 for one or more previous blocks, one or more control parameters are adjusted 48 for one or more later blocks prior to processing. This lowers the expected load for later blocks in the case of excessive load, and vice versa when detecting sub-standard loads.

정보 스트림, 인코더, 블록, 프로세서, 부하Information stream, encoder, block, processor, load

Description

Encoding block-organized data}

본 발명은 블록-구조 오디오/비디오 데이터 인코딩에 관한 것이다.The present invention relates to block-structured audio / video data encoding.

전송 또는 저장을 위한 디지털 오디오 및 비디오의 처리는 다양한 데이터 압축 기술들의 사용을 필요로 한다. 제한적이지 않은 실시 예로는 오디오 및 비디오에 대한 다양한 버전을 가진 MPEG 표준이 있다. 다른 표준은 H.261가 있다. 그러한 압축을 소프트웨어로 실현하는 것은 Ho-Chao Huang 등에 의해, New Generation of Real-Time Software-Based Video Codec: Popular Video Coder Ⅱ, IEEE TR. Cons.EL. Vol.42, No.4, P.963-973에 공개되어 있다. 소프트웨어와 하드웨어가 혼합된 환경에서 실행되는 압축 및 유사한 동작들이 실현 가능하다. 소프트웨어 인코딩을 위해 필요한 동작들의 수는 예견하기 어렵다. 본 명세서에서의 실시예는 주로 비디오를 참조하여 설명될 것이다. 현재, 일반적으로, 압축은 화상들의 그룹들(GOPs)을 기초로 하여 실행된다. 본 명세서에서, "화상"이라는 용어가 일관되게 사용될 것이다. 실제 비디오 표준에 따라, "화상"이라는 용어는 "프레임" 뿐만 아니라 "필드"를 의미할 수 있다. 현재, 프레임형 구조의 오디오 또는 혼합된 오디오/비디오 정보 스트림들의 압축은 유사한 방법으로 실행될 수 있다. 이러한 처리는 실시간으로 수행되어야 하고, 프로세서가 과부하의 경우에, 화상들이나 그 부분들을 루징(loosing)함으로써 높은 대가를 감수해야 한다. 기본 시스템 설비들을 보호하기 위해, 어느 정도의 품질을 포기하는 점진적인 성능 저하(graceful degradation)가 데이터 처리에 있어 폭넓게 사용되어 왔다.Processing of digital audio and video for transmission or storage requires the use of various data compression techniques. Non-limiting examples include the MPEG standard with various versions of audio and video. Another standard is H.261. Such compression is realized in software by Ho-Chao Huang et al., New Generation of Real-Time Software-Based Video Codec: Popular Video Coder II, IEEE TR. Cons.EL. Vol. 42, No. 4, and P. 963-973. Compression and similar operations that run in a mixed software and hardware environment are feasible. The number of operations required for software encoding is difficult to predict. Embodiments herein will be primarily described with reference to video. Currently, compression is generally performed based on groups of pictures (GOPs). In this specification, the term "picture" will be used consistently. According to the actual video standard, the term "picture" may mean "field" as well as "frame". Currently, compression of framed audio or mixed audio / video information streams can be performed in a similar manner. This process must be performed in real time and the processor has to pay a high price by loosing images or parts thereof in case of overload. In order to protect basic system facilities, a gradual degradation that gives up some quality has been widely used in data processing.

결과적으로, 본 발명의 주목적은 향상된 인코딩 방법을 제공하는 것이다. 따라서, 본 발명의 한 관점에 따라, 본 발명은 청구항 1의 특징부에 기재된 내용을 특징으로 한다. 또한, 본 발명은 상기 방법을 실현하도록 배열된 인코더 장치에 관한 것이며, 본 발명의 유리한 관점들은 종속항들에 기재되어 있다.As a result, the main object of the present invention is to provide an improved encoding method. Thus, according to one aspect of the invention, the invention features the subject matter described in the characterizing part of claim 1. The invention also relates to an encoder arrangement arranged to realize the method, advantageous aspects of the invention being described in the dependent claims.

본 발명의 발명자들은 실제의 처리 부하를 제어하기 위해 화상 또는 슬라이스와 같은 화상의 일부에 대해 정의된 다양한 제어 파라미터들의 잠재적인 값을 인지했다. 비디오를 위한 적절한 파라미터는 블록 또는 전체 화상 내에서 중요하지 않게 취급되는 정보의 양을 나타내는 중복량(Q)이다. 본 발명의 제 1 관점에 따라, 실제의 처리 부하는 미래의 처리 부하를 예측하고, 결과적으로, 정보의 과부하로 인한 손실(overload-inflicted losses)을 피하기 위해 하나 이상의 제어 파라미터들을 조절하기 위해 사용된다.The inventors of the present invention have recognized potential values of various control parameters defined for a portion of an image, such as an image or slice, to control the actual processing load. A suitable parameter for video is the amount of redundancy (Q), which indicates the amount of information that is treated unimportant within a block or the entire picture. According to the first aspect of the present invention, the actual processing load is used to predict future processing loads and consequently to adjust one or more control parameters to avoid overload-inflicted losses. .

본 발명의 이러한 관점들 및 장점들은 양호한 실시예들의 개시 내용을 참고로 이하에서 자세히 논의될 것이다.These aspects and advantages of the present invention will be discussed in detail below with reference to the disclosure of the preferred embodiments.

도 1은 본 발명에 따른 장치의 블록도.1 is a block diagram of an apparatus according to the invention.

도 2는 제 1 향상된 프로파일 차트.2 is a first enhanced profile chart.

도 3은 제 2 향상된 프로파일 차트.3 is a second enhanced profile chart.

도 4는 방법 흐름도(policy flowchart).4 is a policy flowchart.

도 5는 전형적인 MPEG 구성도.5 is a typical MPEG configuration diagram.

도 1은 본 발명에 따른 장치들의 블록도이며, 특히, 비디오와 함께 사용하기 위한 것이다. 먼저, 단순화된 버전이 논의되며, 여기서, 소자들(62, 64, 68, 70, 72, 74, 76, 78, 80)은 논의되지 않는다. 입력(20)에 수신된 비디오는 균일한 수의 픽셀들을 갖는 화상들로 구성된다. DCT 소자(22)에서, 각각의 화상은 8 x 8 픽셀들의 어레이로 각각 구성된 일련의 비디오 블록들로 분할된다. 이미지의 중간 분할은 블록들의 두 개의 근접한 수평 열들로 구성된 슬라이스들이 된다. 각각의 블록은 8 x 8 디지털 주파수 계수들의 어레이를 생성하기 위해 불연속 코사인 변환(DCT)된다.1 is a block diagram of devices according to the invention, in particular for use with video. First, a simplified version is discussed, where elements 62, 64, 68, 70, 72, 74, 76, 78, 80 are not discussed. The video received at the input 20 consists of pictures with a uniform number of pixels. In the DCT element 22, each picture is divided into a series of video blocks each composed of an array of 8 x 8 pixels. The middle segment of the image is slices consisting of two adjacent horizontal columns of blocks. Each block is discrete cosine transformed (DCT) to produce an array of 8 x 8 digital frequency coefficients.

2차원적 DCT 결과 블록에서, 각각의 계수는 웨이브 주파수(wave frequency)와 관련된다. 상부-좌측 계수 "00"은 두 좌표들에 따라 0 공간 주파수에 대응하는 평균값과 관련된다. 이 위치(상부-좌측)의 오른쪽으로, 웨이브는 수평이다. 제 1 위치(상부-좌측)의 아래로, 웨이브는 수직이다. 경사 방향으로는, 웨이브가 좌표 방향들에 관해 대응하는 방식으로 향한다. 역 DCT에 의한 뒤따르는 디코딩은 원래의 이미지의 무손실 재구성(lossless reconstruction)을 제공할 것이다. In a two-dimensional DCT result block, each coefficient is associated with a wave frequency. The upper-left coefficient "00" is associated with an average value corresponding to zero spatial frequency in accordance with the two coordinates. To the right of this position (top-left), the wave is horizontal. Below the first position (top-left), the wave is vertical. In the oblique direction, the wave is directed in a corresponding way with respect to the coordinate directions. Subsequent decoding by inverse DCT will provide lossless reconstruction of the original image.

도 1에 있어서, 가중 소자(weighting element)(24)는 미세한 항목들(smaller details) 또는 높은 공간 주파수들에 대한 상대적으로 낮은 감지 능력을 고려하여, 각각의 계수들에 대해 가중 인자들을 도입한다. 이러한 가중의 목적은 데이터를 감소시키기 위한 것이다. 계수 "00"을 위한 가중치 인자는 1 이다. 가중치 인자들은 계수 "00"으로부터 떨어진 모든 방향에서 감소한다. 가중 인자들은 일반적으로 고정된다. 디코더에 있어서, 이 기술은 심지어 어떤 양호한 조건에서도, 인간에게 실질적으로 보이지 않는 정보의 일부 손실을 야기할 것이다.In Fig. 1, a weighting element 24 introduces weighting factors for each coefficient, taking into account the relatively low sensing capability for small details or high spatial frequencies. The purpose of this weighting is to reduce the data. The weighting factor for the coefficient "00" is one. The weighting factors decrease in all directions away from the coefficient "00". Weighting factors are generally fixed. For decoders, this technique will cause some loss of information that is virtually invisible to humans, even under any good conditions.

양자화기(26)(quantifier)에 있어서, 더욱 데이터를 감소하기 위해, 계수 00과는 별개로, 계수들은 해당 비디오 블록에 대해 균등하고, 심지어 슬라이스와 같은 일련의 비디오 블록들 또는, 전체 비디오 화상에 대해 균등한 중복 인자 Q로 나누어진다. 몫들(quotients)은 균등한 임계값에 대해 실질적으로 클리핑되는데, 이는 임계값보다 크지 않은 계수들을 줄이게 될 것이다(dropping). 소프트웨어를 사용하는 인코딩을 위한 프로세서 부하는 도 1의 소자들(26 및 28)에 적용되고, INTEL 펜티엄과 같은 단일 고성능 마이크로프로세서에 맵핑될 수 있다. Q-값과 프로세서 부하는 대체로 서로 반비례한다. In quantizer 26, in order to further reduce the data, apart from coefficient 00, the coefficients are equal for that video block, even in a series of video blocks such as slices, or the entire video picture. Is divided by the equal overlap factor Q. Quotations are substantially clipped to an even threshold, which will drop coefficients that are not greater than the threshold. The processor load for encoding using software is applied to the elements 26 and 28 of FIG. 1 and can be mapped to a single high performance microprocessor such as INTEL Pentium. The Q-value and processor load are usually inversely proportional to each other.

최종적으로, 코더(28)에서는, 결과로서 얻은 계수들이 연속화되고, 허프만 코드 또는 유사한 형태의 코드에 따라 가변 길이 인코딩(VLE) 처리된다. 결과로서 얻은 비트스트림은 출력(32)에 출력된다. 계산 소자(34)에 있어서, 실제의 처리 부하가 계산되고, 라인(30)을 따라 양자화기(26)에 재결합된다. 후자는 허용 범위 내에서 블록이나 화상 단위의 처리 부하를 보유하도록 Q의 값을 조절할 수 있다.Finally, in coder 28, the resulting coefficients are serialized and subjected to variable length encoding (VLE) in accordance with a Huffman code or similar code. The resulting bitstream is output to output 32. In the computing element 34, the actual processing load is calculated and recombined to the quantizer 26 along the line 30. The latter may adjust the value of Q to keep the processing load in blocks or pictures in the acceptable range.

상기에 있어서, 클럭 사이클의 수는 이미지의 내용에 의존한다. 단일 화상 내에서의 블록들간 또는 슬라이스들간 뿐만 아니라, 다양한 화상들간에 있어 차이가 발생할 수 있다. 따라서, 최악의 경우의 조건들을 처리하기 위한 필요성으로 인하여 하드웨어 설비들을 크게 증가시킨다. 본 발명은 처리량이 인자 Q 및 다른 제어 파라미터들에 더 의존할 수 있다는 것을 인지했다. 채널 비트레이트 및 버퍼 점유를 제어하기 위해 Q를 조절하는 것이 제안되었다. 본 발명은 Q와 가능한 다른 제어 파라미터들을 조절하여 프로세서 부하를 조절하도록 한다.In the above, the number of clock cycles depends on the content of the image. Differences may occur between various pictures as well as between blocks or slices within a single picture. Thus, the hardware facilities are greatly increased due to the need for handling worst case conditions. The present invention has recognized that throughput may be more dependent on factor Q and other control parameters. It is proposed to adjust Q to control channel bitrate and buffer occupancy. The present invention adjusts the processor load by adjusting Q and possibly other control parameters.

다음의 방법은 도 4의 흐름도에 도시되어 있다. 블록(40)에서는, 처리가 시작되어 필요한 하드웨어 및 소프트웨어 설비들이 구성 및 초기화된다. 블록(42)에서는, 비디오 블록들의 시퀀스를 포함하는 새로운 화상에 대한 처리를 시작한다. 블록(44)에서는, 실제의 진행량이 갱신된다. 예를 들어, 비디오 블록 카운트 또는 비디오 라인 카운트가 해당 화상 내의 다양한 비디오 블록들의 구성에 따라, 또한 해당 화상에 할당된 시간 간격 내의 시간의 진행에 따라 갱신된다. 블록(46)에서는 진행이 규정된 범위 내에 있는지의 여부를 체크한다. 초기에는, 진행이 시간적으로 선형인 것으로 가정하지만, 화상들의 시퀀스에 따라, 각각의 화상들 내의 진행은 비선형일 수 있다. 또한, 진행 프로파일은 시퀀스에 따라 비-균등일 수 있다. 진행이 규정된 범위 내에 있다면, 블록(50)에서는 다음의 비디오 블록을 처리한다. 규정된 범위는 비디오 블록들 및/또는 화상들의 시퀀스에 따라 계속적인 갱신이 필요 할 수 있다. 블록(52)에서는, 실제로 해당 화상이 최종 비디오 블록인지의 여부를 체크한다. 해당 화상이 최종 블록이 아닌 경우에, 시스템은 블록(44)으로 복귀한다. 해당 화상이 최종 블록인 경우, 블록(42)에서는 다음의 화상이 시작된다. 그러나, 블록(46)에서의 결과가 부정인 경우에, 블록(48)에서는 중복량(Q)을 갱신하는데, 이는 처리 부하에 바로 영향을 주고, 그로 인해 블록(46)에서 체크되어야 할 다음의 진행에도 영향을 준다. The following method is shown in the flowchart of FIG. At block 40, processing begins to configure and initialize the necessary hardware and software facilities. In block 42, processing begins for a new picture comprising a sequence of video blocks. In block 44, the actual amount of progress is updated. For example, the video block count or video line count is updated in accordance with the configuration of the various video blocks in the picture and also with the progress of time in the time interval assigned to that picture. In block 46 it is checked whether the progress is within the defined range. Initially, the progression is assumed to be linear in time, but depending on the sequence of images, the progression in each of the images may be nonlinear. Further, the progression profile may be non-uniform in sequence. If the progress is within the defined range, block 50 processes the next video block. The defined range may require continuous updating depending on the sequence of video blocks and / or pictures. In block 52, it is actually checked whether the picture is the last video block. If the picture is not the last block, the system returns to block 44. If the picture is the last block, the next picture starts at block 42. However, if the result at block 46 is negative, block 48 updates the amount of redundancy Q, which directly affects the processing load and thereby the next to be checked at block 46. It also affects progress.

이러한 관점에서, 도 2는 특히 선형인 경우의 제 1 향상된 프로파일 차트를 도시한다. 도시된 라인은 명목상의 진행을 나타내고, 모든 블록들이 처리되기 위해 실질적으로 동일한 시간 간격들을 취한다는 것을 나타낸다. 라인 하이(hi) 및 로우(lo)는 Q의 실제의 값을 갱신하기 위하여 고려된 상한 및 하한 임계값들을 각각 나타낸다. 처리가 너무 느리다면, 진행은 실선 아래로 유지되어, Q는 상승되어야 하고, 또한, 그 역으로도 적용된다. 이러한 결정이 전체로서 한 화상에만 기초하여 취해지는 경우에는, 도면의 상부 가장자리를 잘라내는 것(intercept)만이 고려된다. 라인들(hi 및 lo)은, Q의 너무 빈번한 수정을 피하기 위해, 명목상의 라인으로부터 상당한 차이(appreciable divergences)가 있다. 본 시스템은, 예견할 수 없는 문제에 대처하기 위해, 20% 정도로 일정한 나머지 용량(surplus capacity)을 유지하는 방식으로 항상 설계 및 제어되어야 한다.In this regard, FIG. 2 shows a first enhanced profile chart, especially when linear. The lines shown represent nominal progression, indicating that all blocks take substantially the same time intervals to be processed. Lines hi and lo represent the upper and lower thresholds, respectively, considered in order to update the actual value of Q. If the process is too slow, the progress remains below the solid line so Q must rise and vice versa. If such a determination is taken solely on the basis of one image, only intercept of the upper edge of the figure is considered. The lines hi and lo have appreciable divergences from the nominal line to avoid too frequent modification of Q. The system must always be designed and controlled in such a way as to maintain a surplus capacity of 20%, in order to cope with unforeseen problems.

도 3은 비-선형인 제 2 향상된 프로파일 차트를 도시하며, 이는 화상의 앞선 블록들이 뒤의 블록들보다 더 많은 처리 시간을 취하는 것을 설명한다. 이것은 앞선 화상들의 시퀀스에 의해 결정되었으며, 앞선 블록들의 비-균등 화상 내용에 의해 야기될 수 있는데, 균등한 이미지 부분은 많은 항목들을 갖는 장면 부분보다 적은 처리를 취할 것이다. 연속적인 화상들 사이에서는, 프로파일의 경사들 모두와 프로파일내의 가장자리의 위치가 유지된다. 한 화상 내의 그러한 비-균등성은 Q 및/또는 다른 제어 파라미터들에서의 변화들을 양자화하는데 다른 방법을 야기할 수 있다. 일반적으로, Q를 변경하는 양자화는 인공 지능의 문제가 있는데, 불안정은 피해야한다. 때로, 이러한 조절은 수정 단계들이 보다 작은 측면에 있을 수 있는 점에서, 완전히 안정되기 위해 하나 이상의 화상을 취한다. 또한, 전체 화상에 취해진 측정들에 대한 시간-응답은 블록 대 블록 원리(block-to-block basis)로 취해진 측정들과 다를 수 있다.3 shows a second enhanced profile chart that is non-linear, which illustrates that the preceding blocks of the picture take more processing time than the later blocks. This has been determined by the sequence of preceding pictures, which can be caused by the non-uniform picture content of the preceding blocks, where an even image part will take less processing than a scene part with many items. Between successive images, both the slopes of the profile and the position of the edges in the profile are maintained. Such non-uniformity within one picture can lead to another way to quantize changes in Q and / or other control parameters. In general, quantization to change Q is a problem for artificial intelligence, and instability should be avoided. Sometimes this adjustment takes one or more images to be completely stable, in that correction steps may be on the smaller side. Also, the time-response to the measurements taken on the whole picture may be different from the measurements taken on a block-to-block basis.

본 발명은 예를 들어, MPEG 환경에서 사용될 수 있다. 도 5는 화상들의 시퀀스로 표현된 전형적인 MPEG 구조 실시예를 도시한다. MPEG는 3개의 카테고리들, 즉 I, B 및 P 화상들을 갖는다. I 화상들은 해당 이미지를 재생하기 위한 모든 정보를 포함한다. P 화상들은, 모든 필요한 정보들보다 적은 정보를 포함하지만, 보다 늦게 처리된 화상에 대해 앞선 것(predecessor)으로서 자체적으로 작용할 수 있는 방식으로 재생될 이미 처리된 화상을 필요로 한다. 마지막으로, B 화상들은, 모든 필요한 정보보다 적은 정보를 포함하지만, 해당 이미지를 재생하기 위해 하나 이상의 이미 처리된 화상들을 필요로 한다. 그러나, 이는 여전히 처리되어야할 이미지에 대해 앞선 것으로서 자체적으로 또한 작용할 수 있는 화상을 얻지 못할 것이다. 그러므로, P 화상들은 연결될 수 있으나, B 화상들은 항상 마지막 화상들이 될 것이다. 결합성(coherence)은 화살표에 의해 표시된다.The invention can be used, for example, in an MPEG environment. 5 illustrates an exemplary MPEG structure embodiment represented by a sequence of pictures. MPEG has three categories, namely I, B and P pictures. I pictures contain all the information for reproducing the image. P pictures contain less information than all the necessary information, but require an already processed picture to be reproduced in such a way that it can act on its own as a predecessor to later processed pictures. Finally, B pictures contain less information than all the necessary information, but require one or more already processed pictures to reproduce that image. However, it will still not get an image that can also act on its own as it is ahead of the image to be processed. Therefore, P pictures can be concatenated, but B pictures will always be the last pictures. Coherence is indicated by the arrow.

MPEG 설계를 위해, 도 1은 확장되었다. 특히, 역 결합 루프는 양자화기(26)의 반대인 역 양자화기(68)를 포함한다. 또한, 소자(70)는 DCT 소자(22)의 반대인 역 불연속 코사인 변환(IDCT)을 수행한다. 이 결과는 가산기(72)로 진행하고, 이어서 메모리(74)에 저장된다. B 및 P 화상들에 대해서, 소자(76)의 입력(77)은 하나 이상의 움직임 보상 벡터들을 수신하므로, 이미지의 움직임은 효과적으로 보상될 수 있다. 결과로서 얻은 화상 내용은 B 및 P 화상들에 대해 배타적으로 동작하는 감산기(60)에 보내진다. I 화상들에 대해서, 스위치(62)는 닫히고, 가산기(60)는 효과적으로 단락된다. B 및 P 화상들에 대해서, 움직임 보상된 화상은 스위치(64)에 의해 IDCT(70)로부터 수신된 화상들에 부가하는 가산기(72)에 전달된다.For MPEG design, Figure 1 is expanded. In particular, the inversely coupled loop includes an inverse quantizer 68 that is opposite of the quantizer 26. In addition, device 70 performs an inverse discontinuous cosine transform (IDCT) that is the opposite of DCT device 22. This result proceeds to adder 72 and is then stored in memory 74. For B and P pictures, the input 77 of element 76 receives one or more motion compensation vectors, so that the motion of the image can be effectively compensated. The resulting picture content is sent to subtractor 60, which operates exclusively for B and P pictures. For I pictures, the switch 62 is closed and the adder 60 is effectively shorted. For B and P pictures, the motion compensated picture is passed by the switch 64 to an adder 72 that adds to the pictures received from the IDCT 70.

가변 길이 코더(28)는 저장 및 전송을 위해 출력(32)에 코딩된 정보 스트림을 출력한다. 또한, 신호들은 계산 소자(34)로 진행하면서, 출력 비트 레이트의 출력(32)에 관한 정보를 비트 레이트 제어 블록(80)에 보낼 수도 있다. 후자는 적용 가능한 시간 간격에 대해 평균화된 비트 레이트가 출력(32)으로부터 다운스트림되는 소자들의 처리 및/또는 버퍼링 용량을 초과하지 않는지를 체크할 것이다. 이 결과는 다운스트림 방향으로 출력(32)과 나란히 출력될 수 있는 제어 신호가 되고, 또한, 계산 소자(34)부터 논리 조합 소자(78)까지 그 제어 신호와 함께 역으로 결합될 수 있다. 비트-레이트 부하(bit-rate load)가 과도하지 않다면, 계산 소자(34)가 결정한다. 비트 레이트가 너무 높다면, 조합 소자(78)가 계산 소자(34)에 의한 제어를 무효로 한다.Variable length coder 28 outputs a coded information stream to output 32 for storage and transmission. The signals may also go to the computing element 34 and send information about the output 32 of the output bit rate to the bit rate control block 80. The latter will check that the averaged bit rate for the applicable time interval does not exceed the processing and / or buffering capacity of the elements downstream from the output 32. The result is a control signal that can be output side by side with the output 32 in the downstream direction, and can also be combined inversely with the control signal from the calculation element 34 to the logic combination element 78. If the bit-rate load is not excessive, the computing element 34 determines. If the bit rate is too high, the combination element 78 invalidates the control by the calculation element 34.

또한, 제어 파라미터들의 수정은 다음을 포함할 수 있다. MPEG 스트림에 있어서, B-화상들은 선택적으로 제외될 수 있다. 또한, 도 1의 가중 블록(24)에서의 계수 클리핑은 더욱 정밀하게 또는 더욱 관대한 방식으로 실행될 수 있다. 또한, 움직임 추정은 예를 들어, 5 대신에 3개를 취하는 보다 적은 움직임 벡터 후보들을 취하는 덜 소모적인 방식으로 수행될 수 있다.In addition, modification of the control parameters may include the following. For MPEG streams, B-pictures can optionally be excluded. In addition, coefficient clipping in the weighting block 24 of FIG. 1 may be performed in a more precise or more generous manner. In addition, motion estimation may be performed in a less consuming manner, for example, taking fewer motion vector candidates that take three instead of five.

또한, 본 발명에 따른 다른 측정은 다음과 같다. 부하 복잡성(load complexity)은 모두가 대략 Q와 같은 값을 가진 연속적인 화상들의 그룹(GOP)에 기초하여 모니터되고, MPEG에서 I, P 및 B 화상들과 같은 화상의 다양한 형식들의 성질은 개별적인 적절한 Q의 레벨들을 할당하기 위해 고려된다.Further measurements according to the invention are as follows. Load complexity is monitored based on a group of consecutive pictures (GOP), all of which have a value approximately equal to Q, and in MPEG the nature of the various formats of the picture, such as I, P and B pictures, is individually appropriate. It is considered to assign the levels of Q.

입력과 보상된 입력 사이의 차 신호는, 어떤 경우든 관련이 0에 가까운 차이를 가진 경우에, DCT 소자(22)의 입력에, 계수 클리핑을 통하여 0으로 강제될 수 있다. 이는 움직임 추정 장치의 출력 신호를 통해 신호화될 수 있다. 이때, 블록들(DCT, Q, IQ 및 IDCT)에서의 계산들이 선행될 수 있다.
또 다른 측정은, 특정한 B 또는 P 화상이 복구될 수 없을 때도, 바로 앞의 I 화상을 그 다음의 비-I 화상과 함께 취하고 두 개의 화상들 사이의 차 벡터를 2로 나누어, 사실상 평균 화상을 얻는다.The difference signal between the input and the compensated input may be forced to zero through coefficient clipping at the input of the DCT element 22, in any case where the association has a near zero difference. This may be signaled through the output signal of the motion estimation device. In this case, calculations in blocks DCT, Q, IQ, and IDCT may be preceded.
Another measure is to take the immediately preceding I picture along with the next non-I picture and divide the difference vector between the two pictures by two, even when a particular B or P picture cannot be recovered. Get

위에 언급된 실시예들은 다소 본 발명을 제한적으로 설명하고 있으며, 당업자는 부가된 청구항들의 범위에 벗어나지 않고 수많은 변경 실시예들을 설계할 수 있다는 것을 유의해야 한다. 청구항에서, 괄호 사이의 어떤 참조 표시들도 청구항을 제한하는 것으로 해석되어서는 안 된다. "포함"은 청구항에 나열된 것보다 다른 소자들 또는 단계들의 존재를 배제하지 않는다. 본 발명은 몇몇 별개의 소자들을 포함하는 하드웨어 수단으로 구현될 수 있으며, 알맞게 프로그램된 컴퓨터로 구현될 수 있다. 몇몇 수단들을 열거하는 장치 청구항에서, 이러한 수단의 몇몇은 하나 및 동일한 아이템의 하드웨어로 구현될 수 있다. It should be noted that the above-mentioned embodiments somewhat limit the present invention, and that those skilled in the art can design numerous alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. "Include" does not exclude the presence of other elements or steps than those listed in the claims. The invention can be implemented in hardware means comprising several separate elements, and can be implemented in a suitably programmed computer. In the device claim enumerating several means, several of these means may be embodied in one and the same item of hardware.

Claims

A method of encoding an input signal according to a gradual degradation principle to obtain an information stream, wherein the information stream is configured as a sequence of fields or pictures and there is a need for temporal non-uniform data processing.

Determining 44 one or more control parameters and an information stream associated processing load, and

Adjusting 48 one or more of the control parameters in the information stream before processing 50 under the control of the determined processing load;

The information stream consists of fields or pictures each formed from an even sequence of blocks,

The processing load is measured with respect to the progression in the sequences of blocks compared to the time progression in the actually assigned field or picture time,

The one or more control parameters are adjusted for one or more subsequent blocks in the information stream, under the control of a processing load on one or more previous blocks,

Wherein said one or more control parameters comprise an amount of redundancy (Q), and said step of adjusting for said amount of redundancy is performed by a quantizer (26).

delete

The method of claim 1,

And said processing load is measured by comparing said progression with a corresponding progression for a preceding field or picture.

The method of claim 1,

Non-evenly temporal progression of the sequence of blocks is allowed.

delete

The method of claim 1,

Monitoring load complexity based on a group of consecutive pictures (GOPs) having values of approximately uniform Q but with various different categories, and

And considering the various categories to assign different levels for checking the processing load.

The method of claim 1,

Dynamically checking whether a difference between the input signal and the motion-compensated version is below a predetermined value,

Forcing the difference to zero through coefficient clipping for the processing step waiting for the difference when the difference is less than or equal to the predetermined value;

Signaling previous calculations intended to find the motion-compensated version via a motion estimator.

The method of claim 1,

Checking whether a particular image can be recovered, and

When the particular picture cannot be recovered, initiating finding an average picture between the immediately preceding picture and the next picture.

An encoder for encoding an input signal according to a gradual degradation principle in order to obtain an information stream, wherein the information stream is configured as a sequence of fields or pictures and there is a need for temporal non-uniform data processing.

Means (34) for determining one or more control parameters and an information stream associated processing load, and

Means 26 for adjusting one or more of the control parameters in the information stream prior to processing under control 34 of the determined processing load,

Means for measuring the processing load with respect to progression in the sequences of blocks that are actually compared to time progression in the assigned field or picture time,

And said adjusting means comprises a quantizer (26) for adjusting the amount of redundancy (Q) for one or more subsequent blocks in the sequence of blocks under the control of said processing load on one or more previous blocks.