TW201244486A

TW201244486A - Video coding and decoding devices and methods preserving PPG relevant information

Info

Publication number: TW201244486A
Application number: TW101100492A
Authority: TW
Inventors: Ihor Olehovych Kirenko; Haan Gerard De; Leest Adriaan Johan Van
Original assignee: Koninkl Philips Electronics Nv
Priority date: 2011-01-05
Filing date: 2012-01-05
Publication date: 2012-11-01

Abstract

The present invention relates to a video encoding device (10, 10', 10'') and method for encoding video data and to a corresponding video decoding device (60, 60') and method. To preserve PPG relevant information after encoding without requiring a large amount of additional data for the video encoder output stream, the proposed video encoding device comprises a selection unit (20, 20') for selecting a region of interest (101) in input video data (100) providing a strong PPG signal, a first encoding unit (30, 30') for encoding said selected region of interest (101) of said input video data (100) according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG-relevant information in the encoded region of interest, a second encoding unit (40, 40') for encoding remaining parts (103) of said input video data (100) according to said predetermined encoding scheme with a second setting of the encoding, and an encoder combination unit (50) for combining the encoded region of interest (102) and the encoded remaining parts (104) of said input video data into an encoder output video stream (105).

Description

VI. Description of the invention:

[Technical field]

The present invention relates to a video encoding device and a corresponding video encoding method for encoding video data so as to preserve PPG (photoplethysmographic imaging) relevant information. The invention further relates to a video decoding device and a corresponding video decoding method for decoding encoded video data. Still further, the invention relates to a video coding system for encoding and decoding video data and to a computer program for implementing said methods.

[Prior art]

There is a growing demand for technical solutions that provide robust, continuous monitoring of a person's biometrical signals. This demand is a result of the increasing awareness among younger generations of the importance of a healthy and active lifestyle. In addition, the ageing of the population caused by increasing life expectancy creates a pressing need for health monitoring systems that interfere as little as possible with a person's daily activities. Unobtrusive monitoring of biometrical signals can provide virtually instant feedback on a person's physical and mental condition at any time, and allows changes in a person's state of health to be assessed as early as possible.

Conventional devices and methods for measuring biometrical signals (e.g. heart rate, respiration rate, blood pressure, skin oxygenation) require the user to wear obtrusive body sensors, which can be inconvenient in everyday life. In recent years, attempts have therefore been made to develop contactless techniques for remote monitoring of human vital signs. Recent developments have shown that unobtrusive remote monitoring can be implemented with consumer-grade imaging sensors (webcams) or with broadcast video. One such method was published in Optics Express, Vol. 16, No. 26, in December 2008.

Wim Verkruysse, Lars O. Svaasand and J. Stuart Nelson, in "Remote plethysmographic imaging using ambient light", describe a method for measuring skin color variations, called photoplethysmographic (PPG) imaging. The method is based on the principle that temporal variations of the blood volume in the skin lead to variations in the light absorption of the skin. Such variations can be registered by a video camera that takes images of a skin area (e.g. the face), while processing calculates the pixel average over a manually selected region (typically part of the cheek in this system). By looking at periodic variations of this average signal, the heart beat rate and the respiratory rate can be extracted.

Known systems for remote measurement of heart beat rate or respiratory rate signals are based on the analysis of uncompressed, unprocessed video sequences directly after image sensing. In most "real life" applications, however, video sequences are stored or transmitted in compressed form. Compression of a video signal assumes the removal of some information that is redundant from the point of view of visual perception. Unfortunately, information that is unimportant for visual perception can be essential for the detection of biometrical signals. For example, the MPEG compression standards use inter-frame prediction, which slightly changes the temporal information of a video signal. These changes make the detection of temporal biometrical signals more difficult or even impossible.

For many applications, however, the extraction of a heart beat signal from a video has to be performed after the video has been recorded. In these cases, compressed video will be processed. If a video is compressed at a high bit rate, PPG relevant information can be preserved in the coded bitstream. However, compressing a video with a low compression ratio increases the size of the stored file or the required transmission bandwidth. There is therefore a need to preserve, during video recording and compression, in particular according to one of the known video coding standards, the information required for offline extraction of biometrical signals.

Standard video coding techniques (such as MPEG2, MPEG4, H.264) achieve efficient compression of video information by applying temporal prediction. Most frames in a video sequence (types B and P, where B means "bidirectionally predicted frame" and P means "forward predicted frame") are coded as the quantized difference between an original frame and a motion-compensated inter-coded frame (type B or P). Some visual information is lost due to quantization and motion prediction. Although this information is unimportant from the point of view of visual perception, it contains data that are essential for the extraction of biometrical signals such as the heart beat.

If a video is compressed at a high bit rate without applying temporal prediction and/or a deblocking filter (H.264), PPG information can be preserved in the video sequence. For example, MJPEG or MJPEG2K, which are based on intra-frame coding only, can be applied to compress a video while preserving the PPG signal. However, intra coding of all frames cannot provide the compression ratio required by most multimedia applications. There is therefore a particular need for a method and a device that allow a video to be compressed with standard lossy video compression techniques while preserving the image information required for extracting a PPG signal after the video has been decoded.

[Summary of the invention]

It is an object of the present invention to provide a video encoding device and a corresponding video encoding method for encoding video data by which PPG relevant information is preserved without requiring a large amount of additional data. It is a further object of the present invention to provide a corresponding video decoding device and method, a video coding system, and a computer program for implementing said methods.

In a first aspect of the present invention, a video encoding device is presented that comprises:

- a selection unit for selecting a region of interest in input video data providing a strong PPG signal;
- a first encoding unit for encoding said selected region of interest of said input video data (100) according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG relevant information in the encoded region of interest;
- a second encoding unit for encoding remaining parts of said input video data according to said predetermined encoding scheme with a second setting of the encoding; and
- an encoder combination unit for combining the encoded region of interest and the encoded remaining parts of said input video data into an encoder output video stream.

In a further aspect of the present invention, a video decoding device for decoding an encoded video stream is presented, said encoded video stream comprising encoded video data in which a region of interest of input video data has been encoded according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG relevant information in the encoded region of interest, and in which remaining parts of said input video data have been encoded according to said predetermined encoding scheme with a second setting of the encoding, said video decoding device comprising:

- a first decoding unit for decoding the encoded region of interest according to a decoding scheme complementary to the encoding scheme that has been used for encoding the region of interest, and

- a PPG extraction unit for extracting a PPG signal from the decoded region of interest.

In further aspects of the present invention, a corresponding video encoding method, a corresponding video decoding method, a video coding system, and a computer program comprising program code means are presented, the computer program causing a computer to carry out the steps of the proposed methods when it is run on that computer.

Preferred embodiments of the invention are defined in the dependent claims. It shall be understood that the claimed video decoding device, video coding system, methods and computer program have similar and/or identical preferred embodiments as the claimed video encoding device and as defined in the dependent claims.

The present invention is based on the idea of encoding a selected region of interest, which contains an area with PPG relevant information and from which a strong PPG signal (in particular, the strongest PPG signal) can be derived, differently from the other areas of the video data, namely substantially without loss of said PPG relevant information, so that the PPG relevant information is preserved in the encoded video signal. From those other areas no PPG signal shall be (or even can be) extracted. In particular, local coding parameters (generally, a particular setting of the encoder) are set for encoding the selected region of interest, and a bit budget can be allocated to the one or more spatial image areas that are useful for extracting a PPG signal (i.e. the one or more regions of interest), while providing an optimal trade-off between the coding (e.g. the compression ratio) and the quality of the PPG signal extracted from the (at least partly) decoded signal.

Biometrical signals can be detected from video sequences using the principle of photoplethysmography (PPG), for instance from video sequences that are streamed from a video camera or recorded uncompressed. As mentioned above, this assumption does not always hold in practical applications. The present invention makes it possible to preserve, during video compression (e.g. by a standard video encoder), the PPG relevant visual information needed to extract PPG signals or biometrical signals, while still allowing compression at a low bit rate. Preferably, the invention allows a standard-compatible coded bitstream to be generated, e.g. for storage on a data carrier or for transmission over a transmission line (e.g. the Internet) or through a mobile communication system.

In this context, the expression "PPG relevant information" shall be understood as information that is relevant for obtaining a PPG signal. This PPG relevant information may include information contained in the original video data that is not recognizable to the human eye, e.g. slight color changes of a person's skin.

In this context, the expression "PPG signal" generally means any signal that can be obtained by photoplethysmographic analysis, such as temporal biometrical signals, e.g. heart beat, cardiac cycle, respiratory rate, depth of anesthesia, or hypovolemia and hypervolemia.

In a preferred embodiment, the encoding device further comprises a region selection unit for selecting an area, in particular a skin area, in the input video data as the region of interest, wherein the video data comprise a sequence of video frames that are divided into spatial blocks, and a block selection unit for determining the spatial blocks of the selected area, the determined spatial blocks representing the region of interest. Generally, video data are available as a sequence of video frames, and each frame is divided into spatial blocks (e.g. with a size of 4x4 to 16x16 pixels). For the subsequent encoding according to this embodiment, the spatial blocks that are best encoded by the first encoding unit are thus found.
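The mapping from a selected area to the spatial blocks that cover it can be illustrated with a minimal sketch. The function name `blocks_covering`, the rectangle representation of the area and the 16x16 default block size are assumptions made for illustration only; the text does not prescribe how the block selection unit is implemented.

```python
def blocks_covering(area, block_size=16):
    """Return (row, col) indices of all blocks that overlap a rectangular area.

    area is (x, y, width, height) in pixels. The rectangle form is an
    assumption; the description only speaks of "an area" in the frame.
    """
    x, y, w, h = area
    if w <= 0 or h <= 0:
        return []
    # Integer division maps a pixel coordinate to its block index.
    first_col, last_col = x // block_size, (x + w - 1) // block_size
    first_row, last_row = y // block_size, (y + h - 1) // block_size
    return [(r, c)
            for r in range(first_row, last_row + 1)
            for c in range(first_col, last_col + 1)]


# A 30x10 pixel area starting at (20, 10) touches blocks in two rows
# and three columns of a 16x16 block grid.
print(blocks_covering((20, 10, 30, 10)))
```

Every block that overlaps the area, even partially, is returned here. A stricter variant could keep only blocks lying entirely inside the area, which would avoid mixing skin and background pixels at the cost of a smaller region of interest.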
According to a further embodiment, the region selection unit comprises a detection unit for detecting a set of potentially useful areas, in particular skin areas, that could serve as the region of interest in the input video data, and an analysis unit for analyzing the set of detected potentially useful areas and selecting one area as the region of interest based on one or more predetermined selection criteria. For example, such an analysis unit may comprise a face and/or skin detector for detecting face and/or skin areas in the video data, in particular in one or more video frames. Face or skin areas are thus preferably regarded as potentially useful. Preferably, the most stable (in particular, the most temporally stable) face and/or skin area is selected as the region of interest, but other selection criteria, such as spatial size, illumination stability and/or skin color stability, can be used as well. Such a detector is described, for example, in "Robust Real-time Object Detection" by Paul Viola and Michael Jones.

The Viola and Jones detector was presented at the 2nd International Workshop on Statistical and Computational Theories of Vision, Vancouver, Canada, 2001.

In another embodiment, the analysis unit comprises a PPG extraction unit for extracting a PPG signal from each of the detected potentially useful areas and for selecting one area as the region of interest based on the quality and/or content of the extracted PPG signals. The analysis unit can thus better predict which of the potentially useful areas will provide a strong PPG signal and select the region of interest accordingly.

Preferably, the PPG extraction unit is adapted to determine, based on the extracted PPG signal, one or more parameters of the first setting of the encoding to be used by the first encoding unit for encoding the selected region of interest, and the first encoding unit is adapted to use said one or more parameters of the first setting for encoding the selected region of interest. The result of the PPG extraction is thus used to control the encoding of the selected region of interest, so that the best encoder settings are used and the best possible PPG signal can be extracted from the encoded region of interest at the decoder. Such parameters of the first setting of the encoding unit may include one or more of: a compression rate, the intra- or inter-coding mode of a block, field or frame, the number of AC coefficients used, the quantizer scale, the intra DC precision, and a custom quantizer matrix.

In an embodiment, the first encoding unit is adapted to encode at least the chrominance components, in particular only the chrominance components, of the selected region of interest, and the second encoding unit is adapted to encode the luminance component of the selected region of interest and the chrominance and luminance components of the remaining parts of the input video data. This helps to reduce the amount of data used for encoding the video data of the region of interest. Preferably, though not in every case, only the chrominance components are selected and encoded.

According to another embodiment, the first encoding unit is adapted to encode the selected region of interest by intra-block coding, and the second encoding unit is adapted to encode the remaining parts of the input video data by inter-block coding and/or intra-block coding. This provides a substantially lossless encoding of the region of interest. Intra-block coding and inter-block coding are generally known techniques that are commonly used, for example, in MPEG encoders. Further details are therefore not explained here, since they are known to the person skilled in the art.

Still further, in an embodiment, the first encoding unit is adapted to encode only the DC components of the inter-coded or intra-coded blocks of at least the chrominance components, in particular only the chrominance components, of the selected region of interest. In particular, if only the DC components of the inter-coded or intra-coded blocks of the chrominance components are encoded, this further helps to reduce the amount of data used for encoding the region of interest. PPG relevant information is generally carried by all pixels, but the spatial information is generally of little interest. Rather, to improve the signal-to-noise ratio of the desired PPG signal (e.g. the heart beat), which is weak in individual pixels, it is only necessary to average as many pixels as possible. The PPG relevant information or PPG signal is often even smaller than the quantization step of an uncompressed 8-bit video signal. This averaging can be based on the DC components, and it is not absolutely necessary to know the individual pixel values, although knowing them can help in blocks that contain skin as well as some other image parts (e.g. at the boundary of a face).

Still further, in an embodiment, the selection unit is adapted to select two or more regions of interest in the input video data that provide strong PPG signals (in particular, the strongest PPG signals), and the first encoding unit is adapted to encode said selected regions of interest. Thus, not just a single region of interest but several regions of interest are available for evaluating and extracting the PPG signal during decoding, which increases reliability. For example, in an embodiment, a PPG signal can be extracted from each of said regions of interest, and it can thereafter be evaluated which PPG signal has the highest reliability, or an average of all PPG signals can be taken.
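The averaging argument above can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the extraction method of the cited paper: the frame rate, the noise level, the pulse amplitude of 0.3 gray levels and all function names are hypothetical, and a plain discrete Fourier transform stands in for whatever spectral analysis an actual PPG extraction unit would use.

```python
import cmath
import math
import random


def roi_mean_trace(frames):
    """Average all pixel values of the region of interest, frame by frame."""
    return [sum(pixels) / len(pixels) for pixels in frames]


def dominant_rate_bpm(trace, fps, lo_hz=0.7, hi_hz=4.0):
    """Return the strongest periodic component within [lo_hz, hi_hz], in beats per minute."""
    n = len(trace)
    mean = sum(trace) / n
    centered = [v - mean for v in trace]
    best_k, best_mag = None, -1.0
    for k in range(1, n // 2):
        freq = k * fps / n
        if not lo_hz <= freq <= hi_hz:
            continue
        # One DFT coefficient, computed directly (fine for short traces).
        coeff = sum(v * cmath.exp(-2j * math.pi * k * i / n)
                    for i, v in enumerate(centered))
        if abs(coeff) > best_mag:
            best_k, best_mag = k, abs(coeff)
    return best_k * fps * 60.0 / n


def synthetic_roi(fps=20, seconds=10, pixels=400, pulse_hz=1.2, amplitude=0.3):
    """Simulated 8-bit ROI samples whose pulse is weaker than one gray level."""
    rng = random.Random(0)  # fixed seed so the example is repeatable
    frames = []
    for i in range(fps * seconds):
        pulse = amplitude * math.sin(2 * math.pi * pulse_hz * i / fps)
        # Sensor noise is added before rounding to integer gray levels.
        frames.append([min(255, max(0, round(120 + pulse + rng.gauss(0, 1))))
                       for _ in range(pixels)])
    return frames


trace = roi_mean_trace(synthetic_roi())
print(dominant_rate_bpm(trace, fps=20))
```

In this simulation the pulse is not visible in any single 8-bit pixel, but the mean over a few hundred pixels retains it, which is why a per-block average such as a DC component can be sufficient for the region of interest.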

In an embodiment, ROI information, comprising information about the location of the region of interest, can be generated, in particular by the selection unit, and can be included in the encoder output video data. The decoding device can then use this ROI information to easily find the region of interest, decode it, and extract the PPG signal from it.

During decoding, the video decoding device is able to decode at least the encoded region of interest from the decoder input video data and to extract a PPG signal from the decoded region of interest. For this purpose, the PPG extraction uses generally known methods, e.g. as described in the above-mentioned paper on PPG or in other references describing the basics of PPG. In further embodiments, however, the decoding unit can also be adapted to decode the complete video data, in particular according to a decoding scheme complementary to the encoding scheme used during encoding. The encoding performed in the video encoding device must therefore be adapted to ensure that such decoding is possible.

[Embodiments]

These and other aspects of the invention will be apparent from and elucidated with reference to the embodiments described hereinafter.

Fig. 1 shows a schematic block diagram of a first, general embodiment of a video encoding device 10 according to the present invention. According to this embodiment, an original video stream 100 (also called input video data) is provided to a selection unit 20, which selects a region of interest 101 in the input video data 100 that provides a strong PPG signal. The selected region of interest 101 is provided to a first encoding unit 30, which encodes it according to a predetermined encoding scheme with a first setting of the encoding. In parallel, the remaining parts 103 of the input video data 100 are encoded by a second encoding unit 40 according to said predetermined encoding scheme with a second setting of the encoding. In an encoder combination unit 50, the encoded region of interest 102 and the encoded remaining parts 104 of the input video data 100 are combined into an encoder output video stream 105.
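The data flow of Fig. 1 can be sketched in a few lines. This is a toy model under stated assumptions: a uniform quantizer with two different step sizes stands in for the "first setting" and "second setting" of a real encoding scheme, and the names `encode_frame`, `decode_block` and the dictionary-based stream layout are hypothetical, not part of any video coding standard.

```python
def quantize(values, step):
    """Uniform quantization: map each sample to the index of its nearest level."""
    return [round(v / step) for v in values]


def encode_frame(blocks, roi_blocks, roi_step=1, rest_step=16):
    """Encode ROI blocks with a fine setting and all other blocks with a coarse one.

    blocks maps a block index to its list of samples; roi_blocks is the set of
    block indices chosen by the selection unit. The returned dictionary plays
    the role of the combined encoder output stream, including ROI information.
    """
    stream = {"roi": sorted(roi_blocks), "blocks": {}}
    for index, samples in blocks.items():
        step = roi_step if index in roi_blocks else rest_step
        stream["blocks"][index] = (step, quantize(samples, step))
    return stream


def decode_block(stream, index):
    """Reconstruct one block from its quantization step and level indices."""
    step, levels = stream["blocks"][index]
    return [level * step for level in levels]


frame = {(0, 0): [100, 101, 102, 103], (0, 1): [100, 101, 102, 103]}
stream = encode_frame(frame, roi_blocks={(0, 0)})
print(decode_block(stream, (0, 0)))  # fine setting keeps the small variations
print(decode_block(stream, (0, 1)))  # coarse setting flattens them
```

With these example values, the block in the region of interest is reconstructed exactly, while the other block collapses to a single gray level. The small sample-to-sample variations that a PPG extraction would rely on survive only under the first setting.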

設定用於編碼選定關注區域101，實質於包含在該選定關注區域101中之PPG 藉由使用該第一上確保在無至少關相關資訊之損失之情況下，實質上編碼該選定關注區域 101 ’使得可在解碼裝置中自該選定關注區域1〇1棟取一強 PPG信號。分開地以—第二編碼設定（例如，依—低位元速率或至少對感知為最佳但對PPG擷取為不足之一位元速率）編碼輸入視訊資料1 〇〇之剩餘部分丨〇3。圖2展示根據本發明之一視訊解碼裝置之一第一一妒實施例之-示意方塊圖。根據此實施例，解碼__經接^ 碼視訊串流16〇。該經編碼視訊串流16〇(除了在儲存及/或 161403.doc •13- 201244486 傳輸期間引進的擾動外）應對應於編碼器輸出視訊串流 10 5且包括經編碼視戒咨a | °貢料’該經編碼視訊資料包含輸入視訊資料100之蛵編级Ha 寸匕3物八關注區域161及經編碼剩餘部分 162 ° 視訊解碼裝置60句杯· ^λ 、 .〜第一解碼單元7〇，其用於根據互補於已於視訊編碼奘番，置丨〇中編碼該關注區域1 〇 1之編碼方案的一解碼方幸組m 來解竭經編碼關注區域161 ;及一PPG擷取單元80，其用於ό分、邊解碼關注區域163擷取一 ppG信號 164。為了定義Λ卜—M 4 ° 、肖注區域’例如較佳藉由讀取包含在視訊解碼器輸入串灼丨+ 巟160中之一 R〇I資訊或藉由影像分析 (例如#由檢查可藉以辨別經編碼關注區域與經編碼剩餘區域之量化位準)而自—對應⑽資訊獲得該關注區域之座標。視需要，一分開單元9〇可經提供用於分開經編碼關注區域161與經編碼剩餘部分⑻或至少用於自解碼器輸入視訊資料16_取經編碼關注區域⑹。此外視需要，—第二解碼單兀75可經提供用於根據該解碼方案而解碼該輸入視訊資料之經編碼剩餘部分162，且接著可提供一解碼器組合單元95用於將解碼關注區域丨63及解碼剩餘部分165組合成一解碼器輸出視訊串流丨66。圖6展示根據本發明之一視訊解碼裝置60，，之一第二、更簡單實施例之-示意方塊圖。根據此實施例，輸入視訊串流160並非如在圖2中展示的實施例般分割。首先，在一共同解碼單元71中，解碼輸入視訊串流丨接著，在解碼 161403.doc •14- 201244486 視。fl U67中’在-選擇單元72中選擇關注區域i68，由 PPG擷取單it⑽自该關〉主區域168操取—ppG信號164。圖3展不根據本發明之一視訊編碼裝置1〇,之一第二更詳細實施例之一示意方塊圖，其包括選擇單元2〇，之一較佳實施例。特定言之，該選擇單元2〇,包括一區域選擇單元21，該區域選擇單元21用於選擇輸人視訊資料100中之-區域 123(特定言之’―皮膚區域）作為關注區域，其中該視訊資料包括-序列視訊圖框，該等圖框分成空間區塊。此外，該選擇單TC2G包括-區塊選擇單元24，該區塊選擇單元用於判錢選定區域123之空間區塊1G1，該等經判定空間區塊101表示關注區域。在一進一步修飾中，如在圖3中展示，區域選擇單元21 包括.一偵測單元22，其用於偵測輸入視訊資料丨〇〇中可用作為關注區域之一組潛在可用區域122(特定言之，皮膚區域），及分析單元23，其用於分析該組經偵測潛在有用區域122及基於一或多個預定選擇準則而選擇一區域123 作為關注區域4該選定關注區域，接著在區塊選擇單元 24中判定對應空間區塊101，璉後由第一編碼單元扣，如上文描述般編碼該等對應空間區塊1〇1。潛在有用區域之偵測較佳經調適用於（特定言之）藉由用於皮膚㈣之一可用方*而们則臉部或皮膚區域。取決於一特定視訊内容，經偵測皮膚區域可佔有一視訊圖框之小邛分或一整個視訊圖框。在第二種情況下，例如，使用區塊内編碼以編碼整個經偵測皮膚區域將致使壓縮效率之一 I61403.doc •15· 201244486 明顯降低。此外’一般不是整個皮膚區域可用於擷取一 PPG信號。例如，在一特定時間週期内，僅一皮膚區域之一小部分在時間上為穩定β因此’僅一皮膚區域之此部分應用於PPG 化號擷取《因此，分析單元23分析由偵測單元22在視訊圖樞中偵測到的所有皮膚區域且基於若干準則（包含空間大小 '時間性穩定性、照明穩定性及/或膚色穩定性）之一或多者而僅選擇最佳的部分。因此’分析單元23較佳搜尋最穩定臉部及/或皮膚區域’此係因為此等穩定區域一般應提供最強PPG信號。該單元23可選擇一最小ROI，該最小R〇I能夠提供一 ppG信號。可藉由分析ROI内的一空間像素均勻性或藉由偵測一較佳臉部區域（例如，前額、臉頰）而分析一 pPG信號之預期強度。分析單元23之輸出係關於關注區域之位置之一資訊（例如’依一 ROI資訊之形式），該資訊提供給區塊選擇單元24以選擇輸入視訊資料丨〇〇中屬於選定關注區域之空間區塊。Setting to encode the 
selected region of interest 101, substantially equal to the PPG included in the selected region of interest 101, by using the first aspect, to substantially encode the selected region of interest 101' without loss of at least related information This makes it possible to take a strong PPG signal from the selected area of interest in the decoding device. The remaining portion 输入3 of the input video material 1 is encoded separately by a second encoding setting (e.g., at a low bit rate or at least for the perception but at a bit rate that is less than one bit rate for the PPG). Figure 2 is a block diagram showing a first embodiment of a video decoding apparatus in accordance with the present invention. According to this embodiment, the decoded video stream 16 解码 is decoded. The encoded video stream 16〇 (in addition to the disturbance introduced during storage and/or 161403.doc •13-201244486 transmission) shall correspond to the encoder output video stream 105 and include the encoded video protocol a | ° The tribute 'the encoded video data includes the input video data 100, the level of the Ha 匕匕 3 object eight area of interest 161 and the remaining part of the coded 162 ° video decoding device 60 sentence cup · ^ λ, ~ ~ first decoding unit 7 That is, it is used to decompress the encoded attention area 161 according to a decoding partner group 161 complementary to the coding scheme of the attention area 1 〇1 already encoded in the video coding system; and a PPG capture The unit 80 is configured to capture and decode the region of interest 163 to retrieve a ppG signal 164. For example, it is preferable to read the R〇I information contained in the video decoder input string 丨丨巟 160 or by image analysis (for example, # by inspection) The coordinates of the region of interest are obtained from the corresponding (10) information by discriminating the quantized level of the encoded region of interest and the encoded remaining region. 
Optionally, a separation unit 90 can be provided for separating the encoded region of interest 161 from the encoded remaining parts 162, or at least for extracting the encoded region of interest 161 from the decoder input video data 160. Further, optionally, a second decoding unit 75 can be provided for decoding the encoded remaining parts 162 of the input video data according to said decoding scheme, and a decoder combination unit 95 can then be provided for combining the decoded region of interest 163 and the decoded remaining parts 165 into a decoder output video stream 166.

Fig. 6 shows a schematic block diagram of a second, simpler embodiment of a video decoding device 60'' according to the present invention. According to this embodiment, the input video stream 160 is not split as in the embodiment shown in Fig. 2. Instead, the input video stream is first decoded in a common decoding unit 71. Then, in the decoded video stream 167, the region of interest 168 is selected in a selection unit 72, and a PPG signal 164 is extracted from said region of interest 168 by the PPG extraction unit 80.

Fig. 3 shows a schematic block diagram of a second, more detailed embodiment of a video encoding device 10' according to the present invention, including a preferred embodiment of the selection unit 20'. In particular, the selection unit 20' comprises a region selection unit 21 for selecting a region 123 (in particular, a skin region) in the input video data 100 as the region of interest, wherein the video data comprise a sequence of video frames that are divided into spatial blocks. Further, the selection unit 20' comprises a block selection unit 24 for determining the spatial blocks 101 of the selected region 123, the determined spatial blocks 101 representing the region of interest.

In a further refinement, also shown in Fig. 3, the region selection unit 21 comprises a detection unit 22 for detecting a set of potentially useful regions 122 (in particular, skin regions) in the input video data that could be used as the region of interest, and an analysis unit 23 for analyzing the set of detected potentially useful regions 122 and selecting a region 123 as the region of interest based on one or more predetermined selection criteria. The block selection unit 24 then determines the corresponding spatial blocks 101, which are subsequently encoded by the first encoding unit 30' as described above.

The detection of potentially useful regions preferably uses one of the generally available (in particular, adaptive) methods for detecting skin and for detecting face or skin regions. Depending on the particular video content, the detected skin region may occupy a small part of a video frame or the entire video frame. In the latter case, encoding the entire detected skin region with, for example, intra-block coding would significantly reduce compression efficiency. Moreover, it is generally not the case that the entire skin region can be used for extracting a PPG signal. For example, during a particular time period only a small part of a skin region may be temporally stable, so that only this part of the skin region is suitable for PPG signal extraction. Therefore, the analysis unit 23 analyzes all skin regions detected by the detection unit 22 in a video frame and selects the best parts based on one or more of several criteria, including spatial size, temporal stability, illumination stability, and/or skin-tone stability. The analysis unit 23 thus preferably searches for the most stable face and/or skin regions, since stable regions should generally provide the strongest PPG signal. The analysis unit 23 may select the smallest ROI that can still provide a PPG signal.
The expected strength of a PPG signal can be assessed by analyzing the spatial pixel uniformity within the ROI or by detecting preferred facial regions (e.g., forehead, cheeks). The output of the analysis unit 23 is information about the position of the region of interest (for example, in the form of ROI information), which is supplied to the block selection unit 24 for selecting the spatial blocks of the input video data that belong to the selected region of interest.
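
The selection criteria named for the analysis unit 23 (spatial size, temporal stability, illumination stability, skin-tone stability) could be combined into a single score. The following is only a minimal sketch under stated assumptions, not the method of the invention: the `Candidate` structure, the weights, the minimum area, and the linear scoring rule are all hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from statistics import pstdev


@dataclass
class Candidate:
    """One detected skin region tracked over a few frames (hypothetical structure)."""
    name: str
    area: int          # spatial size in pixels
    centers: list      # (x, y) centre of the region in each frame
    mean_luma: list    # mean luminance of the region in each frame
    mean_hue: list     # mean skin tone (hue) of the region in each frame


def spread(values):
    """Population standard deviation; 0.0 means perfectly stable."""
    return pstdev(values) if len(values) > 1 else 0.0


def score(c, min_area=256):
    """Higher is better; the weights are illustrative assumptions."""
    if c.area < min_area:
        return float("-inf")  # too small to be useful
    motion = spread([x for x, _ in c.centers]) + spread([y for _, y in c.centers])
    return -(motion + 0.5 * spread(c.mean_luma) + 0.5 * spread(c.mean_hue))


def select_region(candidates):
    """Return the most stable candidate, or None if none is large enough."""
    usable = [c for c in candidates if score(c) != float("-inf")]
    return max(usable, key=score) if usable else None
```

In this sketch a still, evenly lit forehead region would be preferred over a larger but moving hand region, which mirrors the preference for stable regions stated above.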

This information is needed in particular if the video frames of the input video data 100 are divided into spatial blocks (with a size of, for example, 4 x 4 to 16 x 16 pixels, depending on the respective compression scheme). The analysis unit 23 then provides the coordinates of the best skin region 123 to the block selection unit 24, which selects the blocks 101 containing the best skin region, i.e., the blocks representing the selected region of interest. If several regions of interest are used, this provides, during PPG signal extraction, an improved ability to select the best PPG signal, or the option of averaging the PPG signals obtained from the different regions.

The selected skin region is compressed in a way that guarantees that the PPG-relevant information is preserved after encoding and (later) after decoding/decompression. The PPG signal 164 (see Fig. 2) is mostly extracted from the chrominance channels of a video stream. Therefore, in one embodiment, in order to preserve the PPG-relevant information, these blocks 101 are encoded as intra-coded blocks by the first encoding unit 30'.
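
The mapping from the coordinates of the best skin region to the spatial blocks that represent it can be sketched as follows. This is an illustrative assumption about how a block selection step might work, not the implementation of the block selection unit 24: the function names, the rectangular `(x, y, w, h)` ROI format, the default block size, and the mode labels `"intra"` and `"auto"` are all hypothetical.

```python
def roi_blocks(roi, frame_w, frame_h, block=16):
    """Return the (row, col) indices of all blocks overlapping roi = (x, y, w, h)."""
    x, y, w, h = roi
    # Clip the rectangle to the frame.
    x0, y0 = max(x, 0), max(y, 0)
    x1, y1 = min(x + w, frame_w), min(y + h, frame_h)
    if x1 <= x0 or y1 <= y0:
        return []  # ROI lies entirely outside the frame
    rows = range(y0 // block, (y1 - 1) // block + 1)
    cols = range(x0 // block, (x1 - 1) // block + 1)
    return [(r, c) for r in rows for c in cols]


def coding_modes(roi, frame_w, frame_h, block=16):
    """Mark ROI blocks as 'intra'; leave every other block to the encoder ('auto')."""
    n_rows = -(-frame_h // block)  # ceiling division
    n_cols = -(-frame_w // block)
    modes = [["auto"] * n_cols for _ in range(n_rows)]
    for r, c in roi_blocks(roi, frame_w, frame_h, block):
        modes[r][c] = "intra"
    return modes
```

Because a block is selected as soon as it overlaps the region, the encoded region of interest is slightly larger than the skin region itself; a stricter rule (only blocks fully inside the region) would trade coverage for fewer intra-coded blocks.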
The other frame blocks 103 (i.e., the blocks of the remaining parts) are encoded by a standard encoder in the second encoding unit 40' as inter-coded or intra-coded blocks, depending, for example, on the setting and type of the second encoding unit 40'. Video coding standards allow the intra or inter coding mode to be selected on a per-block basis. The proposed algorithm therefore allows a standard-compatible encoded bitstream 105 with preserved PPG-relevant information to be generated. The analysis unit 23 and the block selection unit 24 will find the best trade-off between the size of the skin region required for reliable PPG signal extraction and the loss in compression ratio caused by the large bit budget allocated to the intra-coded blocks of the skin region.

In another embodiment, the analysis unit 23 may (optionally) include a PPG signal extraction unit 25, and possibly a PPG quality metric, to guide the selection of the skin region.

Extracting a PPG signal generally requires error-free temporal chrominance information, which, as provided in yet another embodiment, can be achieved by encoding the chrominance blocks at a higher bit rate. In particular, in yet another embodiment of the video encoding device 10'' shown in Fig. 4, the first encoding unit 30'' is adapted for encoding at least (preferably only) the chrominance component 101a of the selected region of interest 101, while the second encoding unit 40'' is adapted for encoding the luminance component 101b of the selected region of interest 101 and for encoding the chrominance and luminance components of the remaining parts 103 of the input video data 100.

In principle, inter-block coding can be used for the chrominance coding of the selected blocks, provided that the DC component is compressed without loss of information (losslessly), while any artifacts stem from the quantization of the AC components. The luminance blocks can be encoded with loss of information, because their contribution to the PPG extraction process is significantly smaller than that of the chrominance components.
According to yet another embodiment, only the chrominance components of the selected region of interest are encoded as intra-coded blocks, or both the chrominance and luminance components associated with the selected region of interest are encoded as intra-coded blocks. In this embodiment, if a selected skin region (i.e., the region of interest) does not move, there is no need to spend more bits on encoding the blocks as intra-coded blocks. If, however, a selected skin region moves, no artifacts are introduced, so that this embodiment is more efficient.

The proposed decoding procedure not only allows a video stream to be reconstructed, e.g., according to a video coding standard, but also allows a PPG signal to be extracted from a partly decoded video stream (in particular, from the decoded region of interest).

Fig. 5 shows a second, more detailed embodiment of a video decoding device 60' according to the present invention, which substantially corresponds to the complementary video encoding device 10' shown in Fig. 3. In this embodiment, the decoder input video data 160 are provided to both the first decoding unit 70 and the second decoding unit 75'. While the first decoding unit 70 is substantially identical to the first decoding unit 70 explained above and outputs the decoded region of interest 163, the second decoding unit 75' decodes not only the remaining regions but the complete decoder input video data 160 and outputs the complete decoder output video data 166, i.e., all video data are decoded (e.g., conventionally) in the second decoding unit 75'.

Thus, the standard procedure for decoding the input bitstream is first applied up to the level of coded-block extraction. Thereafter, the entire bitstream and/or the intra-coded blocks are decoded further. These intra-coded blocks correspond to the best skin region selected at the encoder side.
To obtain the PPG signal 164, the PPG signal extraction unit 80' comprises a block extraction unit 81, which extracts from the decoded region of interest 163 the blocks of the region of interest that were encoded by the first encoding unit 30' of the video encoding device 10'. Subsequently, using block information 181 provided by the block extraction unit 81, a reconstruction unit 82 reconstructs the region of interest (e.g., one or more skin regions) from the intra-coded blocks of the decoded region of interest. For example, if at least (preferably only) the chrominance components of the region of interest are decoded in the first decoding unit 70, the chrominance components of the region of interest are reconstructed in the reconstruction unit 82. Finally, in a PPG signal extraction unit 83, a PPG signal extraction algorithm is applied to the reconstructed region of interest 182 (for example, only to the chrominance components, provided that the chrominance components were encoded without loss of PPG-relevant information) to obtain the desired PPG signal 164.

In another embodiment of the video decoding device, if both the chrominance and luminance components have been encoded by the video encoding device, e.g., as intra-coded blocks, the PPG signal 164 can be extracted from the chrominance channels, the luminance channel, or both. The preferred embodiment of the video encoding device and the video decoding device can therefore be chosen based on the approach used for reconstructing the PPG signal.

As mentioned, the PPG signal extraction unit 83 detects and extracts the PPG signal 164 from the reconstructed region of interest (e.g., the reconstructed skin region). In principle, only the reconstructed region of interest is used for extracting the PPG signal.
Therefore, it is not mandatory to decode a video sequence at its full original resolution; decoding only the region of interest (e.g., its intra-coded blocks) is generally sufficient to obtain the PPG signal. Thus, if only the PPG signal is to be extracted and the video data need not be fully decoded, the computational power otherwise required for motion compensation and reconstruction of all inter-coded blocks can be saved.
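
To illustrate what extracting a PPG signal from the reconstructed region of interest could look like, the sketch below averages one chrominance plane over the ROI blocks of each frame and then searches for the dominant frequency in a plausible heart-rate band. This is an assumption-laden example, not the extraction algorithm of the invention (the text deliberately leaves the algorithm open): the function names, the 0.7 to 4.0 Hz band, the 0.01 Hz search step, and the plain DFT scan are all hypothetical choices.

```python
import math


def roi_mean(plane, blocks, block=8):
    """Average a chrominance plane (list of pixel rows) over the given (row, col) blocks."""
    total, count = 0.0, 0
    for r, c in blocks:
        for y in range(r * block, (r + 1) * block):
            for x in range(c * block, (c + 1) * block):
                total += plane[y][x]
                count += 1
    return total / count


def dominant_rate_bpm(samples, fps, lo_hz=0.7, hi_hz=4.0, step_hz=0.01):
    """Return the strongest periodic component of `samples` in beats per minute.

    `samples` holds one ROI chrominance mean per frame; the band limits are
    illustrative assumptions for a resting-to-exercising heart rate.
    """
    mean = sum(samples) / len(samples)
    centred = [s - mean for s in samples]  # remove the DC offset
    best_f, best_power = lo_hz, -1.0
    steps = int(round((hi_hz - lo_hz) / step_hz))
    for k in range(steps + 1):
        f = lo_hz + k * step_hz
        w = 2.0 * math.pi * f / fps
        re = sum(v * math.cos(w * n) for n, v in enumerate(centred))
        im = sum(v * math.sin(w * n) for n, v in enumerate(centred))
        power = re * re + im * im
        if power > best_power:
            best_f, best_power = f, power
    return best_f * 60.0
```

Only the ROI blocks are read here, which matches the point made above that the remaining blocks need not be decoded when only the PPG signal is of interest.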

The particular methods and parameters used for extracting the PPG signal can be defined and modified during decoding and extraction of the PPG signal. In other words, the proposed video encoding device restricts neither the choice of PPG signal extraction method nor the choice of monitored subject. Once encoded, a video sequence can be processed with different PPG extraction methods during or after decoding, and different vital signs can be extracted (e.g., heart rate, heart rate variability, SpO2, respiration, PPG imaging, etc.). The proposed PPG-friendly video decoding device can be upgraded with new PPG extraction algorithms, which will allow PPG signals to be extracted in a better way from video sequences that have already been encoded.
The same encoded video sequence can also be decoded by a standard video decoding device that has no embedded algorithm for PPG signal extraction, which maintains backward compatibility with existing video decoding devices.

If a standard codec used in the proposed scheme contains an in-loop deblocking filter to reduce coding artifacts, this deblocking filter should be switched off for at least the chrominance components of the blocks associated with the selected region of interest. Otherwise, the in-loop deblocking filter may suppress visual information that is essential for extracting the PPG signal.

The PPG extraction algorithm may run in real time or offline, with manually tuned parameters. Furthermore, the invention generally allows any particular method of physiological signal extraction to be selected after the video data have been recorded, depending on the particular application. The same video can therefore be used to extract different physiological signals (e.g., heart rate, heart rate variability, SpO2, respiration, PPG imaging).

Fig. 7 schematically depicts yet another embodiment of a video encoding device 10''' according to the present invention. This embodiment is quite similar to the embodiment of the video encoding device 10 shown in Fig. 1, but in addition a decoding unit 35 and a PPG signal extraction unit 36 are provided in a feedback loop formed with the first encoding unit 30'''. This feedback loop controls the number of bits allocated to the selected region of interest 101, i.e., it controls the encoding setting used for encoding the selected region of interest 101 to ensure that the PPG-relevant information is preserved in the encoded region of interest 102.
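
The feedback loop of Fig. 7 can be pictured as an encode, decode, and check cycle that spends more bits on the region of interest until the recovered PPG signal is good enough. The sketch below is a toy model under explicit assumptions and not the control scheme of the invention: the "codec" is a plain uniform quantizer (`toy_encode`, `toy_decode`), the single setting being tuned is a hypothetical quantization parameter `qp`, and quality is measured as the Pearson correlation with the unencoded signal, which is one possible metric among many.

```python
import math


def pearson(a, b):
    """Pearson correlation of two equally long sequences; 0.0 if either is constant."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    cov = sum((x - ma) * (y - mb) for x, y in zip(a, b))
    va = sum((x - ma) ** 2 for x in a)
    vb = sum((y - mb) ** 2 for y in b)
    return cov / math.sqrt(va * vb) if va > 0 and vb > 0 else 0.0


def toy_encode(samples, qp):
    """Toy stand-in for the first encoding unit: uniform quantization, coarser for larger qp."""
    step = 1 + qp
    return [round(s / step) for s in samples], step


def toy_decode(encoded):
    """Toy stand-in for the decoding unit in the loop: inverse of toy_encode."""
    levels, step = encoded
    return [level * step for level in levels]


def tune_roi_setting(samples, qp_start=10, qp_min=0, target=0.95):
    """Lower qp (spend more bits) until the decoded signal is good enough or qp_min is reached."""
    qp = qp_start
    while True:
        encoded = toy_encode(samples, qp)
        quality = pearson(samples, toy_decode(encoded))
        if quality >= target or qp <= qp_min:
            return qp, encoded, quality
        qp -= 1
```

Starting from a coarse setting and refining only as far as needed keeps the bit budget of the region of interest as small as the quality target allows, which is the trade-off the feedback loop is meant to manage.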
Accordingly, the decoding unit 35 decodes the encoded region of interest 102 (applying a decoding scheme complementary to the first encoding scheme applied by the first encoding unit 30'''), and the PPG signal extraction unit 36 extracts a PPG signal 107 from the decoded region of interest 106. The first encoding unit 30''' can then decide whether the PPG signal is of sufficient quality or whether the encoding settings need to be changed (e.g., whether more bits need to be assigned to the encoded region of interest and/or whether the compression rate needs to be reduced) to improve the quality of the extracted PPG signal. It can thus be ensured that a PPG signal of sufficient quality can be extracted in a decoding device.

In order to enable the extraction of vital signs, the present invention thus modifies the known concept of SNR or quality scalability in video compression. The invention can be used for video streaming as well as for the storage of compressed video material. Normally, only the bitstream comprising the encoded video data is transmitted or decompressed to obtain video data at a basic quality, with all video data encoded in the same way (i.e., with a single encoding scheme and the same encoding parameter settings). According to the present invention, additional data that preserve the information essential for PPG are included in the encoded bitstream, which is transmitted or decompressed whenever physiological signals are to be extracted. In this way, the best trade-off between compression efficiency and preservation of the physiological information in compressed video can be achieved.

In summary, the present invention allows PPG signals to be extracted after video compression (decompression). The complexity and accuracy of the PPG extraction algorithm can be chosen based on the specific application.
For example, some applications may only require heart rate information to be extracted, while other applications may require a beat-to-beat accurate heartbeat signal, and/or respiration, and/or SpO2 (oxygenation). Furthermore, the invention allows PPG signals to be extracted offline (not in real time) from a compressed video, possibly with manual selection and tuning of optimal parameters.

In general, the invention is not limited to a particular encoding/decoding scheme. Generally, the first encoding, used for encoding the one or more selected regions of interest, is less lossy than the second encoding, used for encoding the remaining data. In particular embodiments, intra-block and/or inter-block coding is used to encode the PPG-relevant visual information, while inter-frame coding is used to encode the other visual information (which is not essential for extracting physiological signals). PPG information can thus be extracted quickly and at low cost from intra-coded blocks during video decoding, without the need to decode complete video frames.

While the invention has been illustrated and described in the drawings and the foregoing description, such illustration and description are to be considered illustrative or exemplary and not restrictive; the invention is not limited to the disclosed embodiments. Other variations of the disclosed embodiments can be understood and effected by those skilled in the art in practicing the invention, from a study of the drawings, the disclosure, and the appended claims.

In the claims, the word "comprising" does not exclude other elements or steps, and the indefinite article "a" or "an" does not exclude a plurality. A single element or other unit may fulfill the functions of several items recited in the claims. The mere fact that certain measures are recited in mutually different dependent claims does not indicate that a combination of these measures cannot be used to advantage.
A computer program may be stored/distributed on a suitable non-transitory medium, such as an optical storage medium or a solid-state medium supplied together with or as part of other hardware, but it may also be distributed in other forms, such as via the Internet or other wired or wireless telecommunication systems. Any reference signs in the claims should not be construed as limiting the scope.

[Brief description of the drawings]

Fig. 1 shows a schematic block diagram of a first embodiment of a video encoding device according to the present invention,
Fig. 2 shows a schematic block diagram of a first embodiment of a video decoding device according to the present invention,
Fig. 3 shows a schematic block diagram of a second embodiment of a video encoding device according to the present invention,
Fig. 4 shows a schematic block diagram of a third embodiment of a video encoding device according to the present invention,
Fig. 5 shows a schematic block diagram of a second embodiment of a video decoding device according to the present invention,
Fig. 6 shows a schematic block diagram of a third embodiment of a video decoding device according to the present invention, and
Fig. 7 shows a schematic block diagram of a fourth embodiment of a video encoding device according to the present invention.
[Description of main element symbols]

10 video encoding device
10' video encoding device
10'' video encoding device
10''' video encoding device
20 selection unit
20' selection unit
21 region selection unit
22 detection unit
23 analysis unit
24 block selection unit
25 PPG (photoplethysmographic imaging) extraction unit
30 first encoding unit
30' first encoding unit
30'' first encoding unit
30''' first encoding unit
35 decoding unit
36 PPG signal extraction unit
40 second encoding unit
40' second encoding unit
40'' second encoding unit
50 encoder combination unit
60 video decoding device
60' video decoding device
60'' video decoding device
70 first decoding unit
71 common decoding unit
72 selection unit
75 second decoding unit
75' second decoding unit
80 PPG signal extraction unit
80' PPG signal extraction unit
81 block extraction unit
82 reconstruction unit
83 PPG signal extraction unit
90 separation unit
95 decoder combination unit
100 input video data
101 selected region of interest of the input video data
101a chrominance component of the selected region of interest
101b luminance component of the selected region of interest
102 encoded region of interest of the input video data
103 remaining parts of the input video data
104 encoded remaining parts of the input video data
105 encoder output video stream
106 decoded region of interest
122 potentially useful regions
123 region
160 encoded video stream
161 encoded region of interest
162 encoded remaining parts of the input video data
163 decoded region of interest
164 PPG signal
165 decoded remaining parts
166 decoder output video stream
167 decoded video stream
168 region of interest
181 block information
182 reconstructed region of interest

Claims

VII. Claims:

1. A video encoding device (10, 10', 10'', 10''') for encoding video data, comprising:
a selection unit (20, 20') for selecting a region of interest (101) in input video data (100) providing a strong PPG (photoplethysmographic imaging) signal,
a first encoding unit (30, 30', 30'', 30''') for encoding said selected region of interest (101) of said input video data (100) according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG-relevant information in the encoded region of interest,
a second encoding unit (40, 40', 40'') for encoding remaining parts (103) of said input video data (100) according to said predetermined encoding scheme with a second setting of the encoding, and
an encoder combination unit (50) for combining the encoded region of interest (102) and the encoded remaining parts (104) of said input video data into an encoder output video stream (105).

2. The video encoding device (10') of claim 1, wherein said selection unit (20') comprises:
a region selection unit (21) for selecting a region (123), in particular a skin region, in said input video data (100) as the region of interest, wherein said video data comprise a sequence of video frames, said frames being divided into spatial blocks, and
a block selection unit (24) for determining the spatial blocks of said selected region (123), the determined spatial blocks representing said region of interest.

3. The video encoding device (10') of claim 2, wherein said region selection unit (21) comprises:
a detection unit (22) for detecting a set of potentially useful regions (122), in particular skin regions, in the input video data that can be used as the region of interest, and
an analysis unit (23) for analyzing said set of detected potentially useful regions (122) and selecting a region (123) as the region of interest based on one or more predetermined selection criteria.

4. The video encoding device (10') of claim 3, wherein said analysis unit (23) is adapted for using spatial size, temporal stability, illumination stability, and/or skin-tone stability as selection criteria.

5. The video encoding device (10') of claim 3, wherein said analysis unit (23) comprises a PPG extraction unit (25) for extracting a PPG signal (180) from the detected potentially useful regions and for selecting a region (123) as the region of interest based on the quality and/or content of the extracted PPG signal.

6. The video encoding device (10') of claim 5, wherein said PPG extraction unit (25) is adapted for determining, based on the extracted PPG signal, one or more parameters of said first setting of the encoding for use by the first encoding unit (30, 30') in encoding the selected region of interest, and wherein the first encoding unit (30, 30') is adapted for using said one or more parameters of the first setting of the encoding for encoding said selected region of interest.

7. The video encoding device (10'') of claim 1, wherein said first encoding unit (30'') is adapted for encoding at least the chrominance component (101a) of said selected region of interest (101), in particular only said chrominance component (101a), and wherein said second encoding unit is adapted for encoding the luminance component (101b) of said selected region of interest (101) and for encoding the chrominance and luminance components of the remaining parts of said input video data (100).

8. The video encoding device (10, 10', 10''') of claim 1, wherein said first encoding unit (30, 30') is adapted for encoding said selected region of interest (101) using intra-block coding, and wherein said second encoding unit (40, 40') is adapted for encoding the remaining parts (103) of said input video data (100) using inter-block coding and/or intra-block coding.
9. The video encoding device (10''') of claim 1, wherein said first encoding unit (30) is adapted for encoding only the DC components of inter-coded or intra-coded blocks of at least the chrominance component (101a), in particular only the chrominance component (101a), of said selected region of interest (101).

10. A video encoding method for encoding video data, comprising the steps of:
selecting a region of interest (101) in input video data (100) providing a strong PPG signal,
encoding said selected region of interest (101) of said input video data (100) according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG-relevant information in the encoded region of interest,
encoding remaining parts (103) of said input video data (100) according to said predetermined encoding scheme with a second setting of the encoding, and
combining the encoded region of interest (102) and the encoded remaining parts (104) of said input video data into an encoder output video stream (105).

11. A video decoding device (60, 60', 60'') for decoding an encoded video stream (160), said encoded video stream (160) comprising encoded video data in which a region of interest (101) of input video data (100) has been encoded according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG-relevant information in the encoded region of interest (102), and in which remaining parts (103) of said input video data (100) have been encoded according to said predetermined encoding scheme with a second setting of the encoding, said video decoding device comprising:
a first decoding unit (70) for decoding the encoded region of interest (161) according to a decoding scheme complementary to the encoding scheme that has been used for encoding said region of interest, and
a PPG extraction unit (80, 80') for extracting a PPG signal (164) from the decoded region of interest (163).

12. The video decoding device of claim 11, further comprising:
a second decoding unit (75) for decoding the encoded remaining parts (162) of said encoded video data according to said decoding scheme, and
a decoder combination unit (95) for combining the decoded region of interest (163) and the decoded remaining parts (165) into a decoder output video stream (166).

13. A video decoding method for decoding an encoded video stream, said encoded video stream (160) comprising encoded video data in which a region of interest (101) of input video data (100) has been encoded according to a predetermined encoding scheme with a first setting of the encoding to preserve PPG-relevant information in the encoded region of interest (102), and in which remaining parts (103) of said input video data have been encoded according to said predetermined encoding scheme with a second setting of the encoding, said video decoding method comprising the steps of:
decoding the encoded region of interest (161) according to a decoding scheme complementary to the encoding scheme that has been used for encoding said region of interest, and
extracting a PPG signal (164) from the decoded region of interest (163).

14. A video coding system for encoding and decoding video data, comprising:
a video encoding device (10, 10', 10'') as claimed in claim 1 for encoding input video data, and
a video decoding device (60, 60') as claimed in claim 11 for decoding video data encoded by said video encoding device.

15. A computer program comprising program code means for causing a computer to carry out the steps of the method of claim 10 or 13 when said computer program is carried out on the computer.