JP2000076462A

JP2000076462A - Moving image scene detector

Info

Publication number: JP2000076462A
Application number: JP11244080A
Authority: JP
Inventors: Setsu Kunitake; 節國武; Isao Uesawa; 功上澤
Original assignee: Fuji Xerox Co Ltd
Current assignee: Fujifilm Business Innovation Corp
Priority date: 1992-12-15
Filing date: 1999-08-30
Publication date: 2000-03-14

Abstract

PROBLEM TO BE SOLVED: To provide a moving image scene detector capable of decoding only attribute information in encoded/recorded moving image information and detecting the candidate of a characteristic scene through the use of this attribute information. SOLUTION: Attribute information in code information stored in an information storing means 101 is decoded by an attribute information decoding means 102, the candidate of the characteristic scene in the moving image is detected by a candidate scene detecting means 103 through the use of this decoded attribute information, image information in the code information is decoded by an image information decoding means 104 and this decoded image information is displayed by an image information display means 105.

Description

DETAILED DESCRIPTION OF THE INVENTION

【０００１】[0001]

【発明の属する技術分野】本発明は、ディジタル記録媒
体に圧縮記録された動画像情報から特徴的なシーンを検
出し、記録する装置に関するものである。BACKGROUND OF THE INVENTION 1. Field of the Invention The present invention relates to an apparatus for detecting and recording a characteristic scene from moving image information compressed and recorded on a digital recording medium.

【０００２】[0002]

【従来の技術】たとえば、映画やテレビジョンにおいて
動画像を編集する際には、素材フィルム或いは素材ビデ
オテープの中から所望の箇所を切り出してつなぎ合わせ
る。2. Description of the Related Art For example, when editing a moving image in a movie or television, a desired portion is cut out from a material film or a material video tape and connected.

【０００３】動画像の基本編集は、主に以下の３つの手
順から成っている。[0003] Basic editing of a moving image mainly includes the following three procedures.

【０００４】１）複数の映像情報から所望のシーンを検
索する。1) A desired scene is searched from a plurality of pieces of video information.

【０００５】２）検索されたシーンを切り出す。2) Cut out the searched scene.

【０００６】３）切り出されたシーンを所望のシーケン
スにつなぐ。3) Connect the extracted scene to a desired sequence.

【０００７】この３つの中で、編集作業の効率上最も問
題となるのは１）のシーン検索である。その理由として
は、以下の２つが挙げられる。[0007] Of these three, the most problematic in the efficiency of editing work is the scene search 1). The reasons are as follows.

【０００８】従来は目視によってシーン検索を行っ
ていたため手間と時間がかかる。シーン検索には編集者の習熟が必要とされる。[0008] Conventionally, a scene search is performed visually, which takes time and effort. Scene search requires the skill of the editor.

【０００９】一方、近年ディジタル動画像の符号化技術
が進歩し、蓄積系動画像符号化の国際標準として検討さ
れているＭＰＥＧ（ＭｏｔｉｏｎＰｉｃｔｕｒｅＥ
ｘｐｅｒｔＧｒｏｕｐ）方式等の符号化技術を導入す
ることによって、約１時間分の動画像を１枚のＣＤ−Ｒ
ＯＭに記憶できる様になってきた。この様な状況におい
ては、記憶媒体の経済性とあいまって、大量の映像情報
を取り扱う要求が高まり、さらに１）のシーン検索の効
率が大きな問題になると考えられる。On the other hand, in recent years, the technology for encoding digital moving pictures has been advanced, and MPEG (Motion Picture E), which is being studied as an international standard for storage-based moving picture encoding, has been studied.
xpert Group), a moving image for about one hour can be converted into one CD-R.
It has become possible to memorize in OM. In such a situation, the demand for handling a large amount of video information is increasing in combination with the economics of the storage medium, and the efficiency of the 1) scene search is considered to be a major problem.

【００１０】この様な背景の下、シーン検索を自動的に
行うための手法が従来から検討されている。Against this background, techniques for automatically performing scene retrieval have been studied.

【００１１】たとえば、上田：“インタラクティブな動
画編集方式の提案”，信学技報，Ｖｏｌ．ＩＥ９０−
６，１９９０には、画像の特徴量を用いてシーン検索を
行う手法の概論が開示されている。For example, Ueda: "Proposal of Interactive Video Editing System", IEICE Technical Report, Vol. IE90-
No. 6,1990 discloses an outline of a technique for performing a scene search using a feature amount of an image.

【００１２】また、長坂，田中：“ビデオ作品の場面変
わりの自動検出法”，情報処理学会第４０回（平成２年
前期）全国大会講演論文集，１Ｑ−５，ｐｐ．６４２−
６４３，１９９０には、フレーム間相関が特に低くなる
ところをシーンチェンジとして検出する手法が開示され
ている。Also, Nagasaka and Tanaka: "Automatic detection of scene change in video work", Proc. Of the 40th Annual Meeting of the Information Processing Society of Japan (Early 1990), 1Q-5, p. 642-
643, 1990 discloses a method of detecting a place where inter-frame correlation becomes particularly low as a scene change.

【００１３】また、特開平０２ー４０４２７２号公報に
おいては、フレームとの差分情報の大きさからシーンチ
ェンジ検出をサポートする装置が提案されている。この
装置は、図１３の様にシーンチェンジ判定部８０１、動
画像データ部８０２、属性情報復号手段８０３、データ
伸長部８０４、表示部８０５から構成され、動画像デー
タ部８０２には図１４に示す様な、動画像情報本体と情
報量やフレーム位置等からなる属性情報がフレーム単位
で記録されている。例えば、動画像データ部８０２に、
動画像情報として前フレームとの差分情報を符号化した
情報が、属性情報としてフレーム毎の符号量とフレーム
位置が記録されている場合には、符号量の大小を閾値処
理することによってシーン・チェンジの候補が検出でき
る。すなわち符号量（前フレームとの差分情報量）が大
きいフレームは、前フレームから大きく変化しているフ
レームであることから、シーンチェンジが発生している
可能性が高いと考えられる。但し、動く物体の領域が大
きい場合等にも前フレームとの差分情報量は大きくなる
ため、符号量の閾値処理によって必ずシーンチェンジが
検出されるわけではない。Japanese Patent Application Laid-Open No. 02-404272 proposes a device that supports scene change detection based on the size of difference information from a frame. As shown in FIG. 13, this apparatus comprises a scene change determination unit 801, a moving image data unit 802, an attribute information decoding unit 803, a data decompression unit 804, and a display unit 805. Such attribute information including a moving image information body and an information amount, a frame position, and the like are recorded in frame units. For example, in the moving image data unit 802,
If the information obtained by encoding the difference information from the previous frame as the moving image information is recorded with the code amount and the frame position for each frame as the attribute information, scene change is performed by performing threshold processing on the code amount. Can be detected. That is, a frame having a large code amount (the amount of difference information from the previous frame) is a frame that has greatly changed from the previous frame, and thus it is considered that there is a high possibility that a scene change has occurred. However, even when the area of the moving object is large, the amount of difference information from the previous frame is large, so that a scene change is not necessarily detected by the code amount threshold processing.

【００１４】また、長谷山，田中，大庭：“符号化情報
量による動画像検索の検討”，１９９２年電子情報通信
学会春季大会講演論文集，Ｄ−２９２には、フレーム間
予測ＤＣＴ符号化された符号化動画像を対象として、フ
レーム毎の符号量に基づいてシーン検索を行う手法が開
示されている。Haseyama, Tanaka, and Oba: "Study of Moving Image Retrieval Using Encoded Information Amount", Proceedings of the 1992 IEICE Spring Conference, D-292, show that interframe predictive DCT coding is used. A technique for performing a scene search on an encoded moving image based on a code amount for each frame is disclosed.

【００１５】[0015]

【発明が解決しようとする課題】目視確認によって編集
やシーンの検出作業を行う場合、動画像情報全体の再生
（または早送り）が必要となる。このため作業時間が長
くかかる、複数の動画像情報を用いて同時に作業を進め
ることは困難である等の効率上の問題がある。また動画
像情報を符号化しない状態で保持する場合には、記憶装
置の容量が問題となり、一方、符号化して保持する場合
には、検出のために対象動画像情報全部を復号しなけれ
ばならず復号処理時間等が問題である。When editing or scene detection is performed by visual confirmation, it is necessary to reproduce (or fast-forward) the entire moving image information. For this reason, there are problems in efficiency, such as a long working time and difficulty in performing the work simultaneously using a plurality of pieces of moving image information. In addition, when moving image information is stored in a non-encoded state, the capacity of the storage device becomes a problem. On the other hand, when the moving image information is encoded and stored, the entire target moving image information must be decoded for detection. However, the decoding processing time is a problem.

【００１６】前記の上田：“インタラクティブな動画編
集方式の提案”，信学技報，Ｖｏｌ．ＩＥ９０−６，１
９９０、及び、前記の長坂，田中：“ビデオ作品の場面
変わりの自動検出法”，情報処理学会第４０回（平成２
年前期）全国大会講演論文集，１Ｑ−５，ｐｐ．６４２
−６４３，１９９０に開示されている方法では、符号化
されていない動画像を対象としているため、処理システ
ムには高い処理能力と大規模な蓄積容量が要求されると
いう問題がある。Ueda: "Proposal of Interactive Video Editing System", IEICE Technical Report, Vol. IE90-6,1
990, and Nagasaka and Tanaka: "Automatic Detection of Scene Change in Video Works", IPSJ 40th (Heisei 2
1st semester), 1Q-5, pp. 642
In the method disclosed in US Pat. No. 6,643,1990, since a moving image that is not coded is targeted, there is a problem that the processing system requires a high processing capability and a large storage capacity.

【００１７】また、前記特開平０２ー４０４２７２号公
報で示されている装置では、シーンチェンジ判定部８０
１でフレーム間の差分情報の情報量（Ｄａｔａ）と閾値
（Ｔｈ）を用いてＤａｔａ＞Ｔｈの判定を行うだけなの
で、検出できるシーンはシーンチェンジ１種類だけであ
る。また、検出したシーンに関する記録が行われないた
め、利用者は編集作業の度に該シーンチェンジ検出サポ
ート装置でシーン検出処理を行う必要がある。In the apparatus disclosed in Japanese Patent Application Laid-Open No. 02-404272, a scene change determination unit 80 is provided.
Since the determination of Data> Th is performed only by using the information amount (Data) of the difference information between frames and the threshold value (Th) in 1, only one type of scene change can be detected. Further, since the recording of the detected scene is not performed, the user needs to perform the scene detection processing by the scene change detection support device every time the editing operation is performed.

【００１８】また、前記の長谷山，田中，大庭：“符号
化情報量による動画像検索の検討”，１９９２年電子情
報通信学会春季大会講演論文集，Ｄ−２９２で示されて
いる方法のように、単純にフレーム毎の符号量のみに基
づいてシーン検索を行う場合には、発生情報量が一定と
なるよう符号量制御が行われる符号化方式においては、
精度の高いシーン検出を行うことができない。Further, as described in the method of Haseyama, Tanaka, and Ohba, "Examination of Moving Image Retrieval Using Encoded Information Amount", Proc. Of the 1992 IEICE Spring Conference, D-292. In a case where a scene search is simply performed based only on the code amount for each frame, in a coding method in which the code amount is controlled so that the generated information amount is constant,
Highly accurate scene detection cannot be performed.

【００１９】本発明は、以上のような問題を解決するた
めになされたもので、その目的とするところは、符号化
されて記録されている動画像情報のうち属性情報のみを
復号し、この属性情報を用いて特徴的なシーンの候補を
検出することができる動画像シーン検出装置を提供する
ことにある。SUMMARY OF THE INVENTION The present invention has been made to solve the above-described problems. It is an object of the present invention to decode only attribute information of encoded and recorded moving image information. It is an object of the present invention to provide a moving image scene detection device capable of detecting a characteristic scene candidate using attribute information.

【００２０】また、本発明の他の目的は、前記属性情報
から得られる各種の統計量に基づいて精度の高いシーン
チェンジ検出を行うことができる動画像シーン検出装置
を提供することにある。It is another object of the present invention to provide a moving image scene detecting device capable of performing highly accurate scene change detection based on various statistics obtained from the attribute information.

【００２１】[0021]

【課題を解決するための手段】本発明の動画像シーン検
出装置は、前記目的を達成するため、符号化された動画
像情報である動画像情報本体と該動画像情報本体の属性
を示す属性情報とからなる符号情報を記憶する情報記憶
手段と、前記符号情報中の属性情報を復号する属性情報
復号手段と、前記属性情報復号手段から出力される属性
情報を用いて動画像中の特徴的なシーンの候補を検出す
る候補シーン検出手段と、前記符号情報中の画像情報を
復号する画像情報復号手段と、前記画像情報復号手段か
ら出力される画像情報を表示する画像情報表示手段とを
備えていることを特徴とする。In order to achieve the above object, a moving image scene detecting apparatus according to the present invention has a moving image information body which is encoded moving image information and an attribute indicating an attribute of the moving image information body. Information storage means for storing code information comprising information; attribute information decoding means for decoding attribute information in the code information; and characteristic information in a moving image using attribute information output from the attribute information decoding means. Candidate image detecting means for detecting a suitable scene candidate, image information decoding means for decoding image information in the code information, and image information display means for displaying image information output from the image information decoding means. It is characterized by having.

【００２２】[0022]

【発明の実施の形態】本発明の実施の形態を具体的に例
を挙げて説明する。DESCRIPTION OF THE PREFERRED EMBODIMENTS Embodiments of the present invention will be specifically described with reference to examples.

【００２３】本発明の動画像シーン検出装置は、図１に
示す様に、符号化された動画像情報と属性情報からなる
符号情報と編集情報を記憶する情報記憶手段１０１と、
前記情報記憶手段１０１に記憶された符号情報中の属性
情報を復号する属性情報復号手段１０２と、前記属性情
報復号手段１０２から出力される属性情報を用いて動画
像中の特徴的なシーンの候補を検出する候補シーン検出
手段１０３と、前記符号情報記憶手段１０１に記憶され
た符号情報中の画像情報を復号する画像情報復号手段１
０４と、前記画像情報復号手段１０４から出力される復
号画像情報を表示する画像情報表示手段１０５とを有す
る。As shown in FIG. 1, the moving picture scene detecting apparatus according to the present invention comprises an information storage means 101 for storing code information composed of coded moving picture information and attribute information and editing information;
Attribute information decoding means 102 for decoding attribute information in code information stored in the information storage means 101, and characteristic scene candidates in a moving image using attribute information output from the attribute information decoding means 102 Candidate image detecting means 103 for detecting image information, and image information decoding means 1 for decoding image information in the code information stored in the code information storage means 101
04, and image information display means 105 for displaying decoded image information output from the image information decoding means 104.

【００２４】以上の構成において、情報記憶手段１０１
から各フレーム毎の属性情報のみを読み出し、属性情報
復号手段１０２において復号し、前記属性情報復号手段
１０２から出力される復号属性情報を用いて動画像中の
特徴的なシーンの候補（以後、候補シーンと略記する）
を候補シーン検出手段１０３において検出する。候補シ
ーンが検出される度に、該検出された候補シーンとその
前後数フレームの画像情報を前記情報記憶手段１０１か
ら順次読み出して画像情報復号手段１０４において復号
し、該復号画像情報を画像情報表示手段１０５に並べて
表示する。利用者は前記画像情報表示手段１０５に並べ
て表示された画像情報を見て、前記検出されたシーンの
種類を判断し、シーンの種類とフレーム位置を前記符号
情報の編集情報として前記情報記憶手段１０１に記録す
る。このとき、シーンの種類の判定は利用者によって行
われるため一種類には限らない。以上の操作を動画像情
報全体に対して行うことにより、動画像中の特徴的なシ
ーンの種類および位置情報である編集情報が前記情報記
憶手段１０１に動画像情報とともに記憶される。従っ
て、利用者が編集作業等を行う場合には、あらためてシ
ーン検出を行うことなく該編集情報を利用することがで
き、効率化を図ることが可能となる。In the above configuration, the information storage means 101
, Only the attribute information for each frame is read out, decoded by the attribute information decoding means 102, and a characteristic scene candidate (hereinafter referred to as a candidate) in a moving image using the decoded attribute information output from the attribute information decoding means 102. (Abbreviated as scene)
Is detected by the candidate scene detecting means 103. Every time a candidate scene is detected, the detected candidate scene and image information of several frames before and after the detected candidate scene are sequentially read out from the information storage means 101 and decoded by an image information decoding means 104, and the decoded image information is displayed in an image information display. They are displayed side by side on the means 105. The user looks at the image information displayed side by side on the image information display means 105, judges the type of the detected scene, and sets the type of the scene and the frame position as edit information of the code information in the information storage means 101. To record. At this time, the scene type is determined by the user and is not limited to one type. By performing the above operation on the entire moving image information, the editing information that is the type and position information of the characteristic scene in the moving image is stored in the information storage unit 101 together with the moving image information. Therefore, when the user performs editing work or the like, the editing information can be used without performing scene detection again, and efficiency can be improved.

【００２５】[0025]

【実施例】図２は、本発明の動画像シーン検出装置の実
施例の構成を、図３は本実施例における符号情報の構成
を、図５は該装置によって動画像シーン検出記録を行う
際の信号の流れを示している。FIG. 2 shows the configuration of an embodiment of a moving picture scene detecting apparatus according to the present invention, FIG. 3 shows the configuration of code information in the present embodiment, and FIG. 3 shows the flow of signals.

【００２６】図２において、１は利用者が指示を入力す
る指示入出力部、２は情報記憶部、３は属性情報復号
部、４は候補シーン検出部、５は画像情報復号部、６は
画像情報表示部である。また、図５において、７は利用
者が検出記録開始を指示する検出記録スタート指示信
号、８は情報記憶部２から読み出される属性情報、９は
属性情報復号部３から出力される復号属性情報、１０は
情報記憶部２から属性情報の読み出しを指示する属性情
報読み出し指示信号、１１は候補シーン検出部４が検出
したフレーム位置の画像情報の表示処理を指示する指定
フレーム画像表示指示信号、１２は指定フレーム画像表
示指示信号１１により指定され情報記憶部２から読み出
された画像情報、１３は画像情報復号部５から出力され
る復号画像情報、１４は編集情報の記録と属性情報読み
出し再開を指示する記録再開指示信号、１５は検出した
候補シーンが最終フレームであったことを示す最終フレ
ーム指示信号、１６は編集情報の記録と検出処理の終了
を指示する記録検出終了指示信号、１７は検出処理の終
了のみを指示する検出終了指示信号、１８はフレームの
位置を示すフレーム位置符号、１９はフレーム間符号化
／フレーム内符号化いずれの手法により符号化されてい
るかを示すフレーム間／内符号、２０はフレームの符号
量を表す情報量符号、２１はフレーム内の非零の動き補
償ベクトルの数をあらわすベクトル数符号である。In FIG. 2, 1 is an instruction input / output unit for inputting an instruction by a user, 2 is an information storage unit, 3 is an attribute information decoding unit, 4 is a candidate scene detection unit, 5 is an image information decoding unit, and 6 is an image information decoding unit. It is an image information display section. In FIG. 5, reference numeral 7 denotes a detection recording start instruction signal for instructing the user to start detection recording, 8 denotes attribute information read from the information storage unit 2, 9 denotes decoding attribute information output from the attribute information decoding unit 3, Reference numeral 10 denotes an attribute information reading instruction signal for instructing reading of attribute information from the information storage unit 2, reference numeral 11 denotes a designated frame image display instruction signal for instructing display processing of image information at a frame position detected by the candidate scene detecting unit 4, and reference numeral 12 denotes Image information designated by the designated frame image display instruction signal 11 and read from the information storage unit 2, 13 is decoded image information output from the image information decoding unit 5, and 14 is a command to record editing information and restart reading attribute information. 15 is a last frame instruction signal indicating that the detected candidate scene is the last frame, and 16 is recording and detection of editing information. A recording detection end instruction signal for instructing the end of the processing, a detection end instruction signal for instructing only the end of the detection processing, a frame position code indicating a frame position, and an inter-frame coding / intra-frame coding. , 20 is an information amount code indicating the code amount of the frame, and 21 is a vector number code indicating the number of non-zero motion compensation vectors in the frame.

【００２７】図３に示す符号情報は、動画像情報を動き
補償フレーム間／フレーム内適応予測符号化手法を用い
て符号化した結果得られたものである。フレーム間／フ
レーム内適応予測符号化手法とは、前フレームとの差分
情報を符号化するフレーム間符号化手法と符号化対象フ
レームの情報をそのまま符号化するフレーム内符号化手
法とを適応的に切り替えて用いる手法である。この手法
では、前フレームからの変化が大きい、すなわち、差分
情報の情報量が大きいフレームに対して、フレーム内符
号化手法が適用される。また、符号化誤差の蓄積による
画質劣化の軽減と復号の際の利便性から、一定の周期で
強制的にフレーム内符号化手法を適用する周期的リフレ
ッシュが行われる。動き補償符号化手法の原理を図４に
示す。この手法では、前フレームとの差分を算出する際
に、物体の動きを考慮して、すなわち、動きベクトルを
検出して、位置をシフトさせて前フレームとの差分を最
小となるようにする。従って、背景部や動かない物体の
領域で動きベクトルの大きさが零となり、動いている物
体の領域が大きいほど、フレーム内の非零の動きベクト
ルの数が多くなる。特にパンニングやズーム等が発生し
ている場合には、大部分の領域で動きベクトルが非零と
なると考えられる。The code information shown in FIG. 3 is obtained as a result of encoding the moving image information using the motion-compensated inter-frame / intra-frame adaptive prediction encoding method. The inter-frame / intra-frame adaptive predictive encoding method is an adaptive method between an inter-frame encoding method for encoding difference information from a previous frame and an intra-frame encoding method for encoding information of an encoding target frame as it is. This is a technique used by switching. In this method, the intra-frame coding method is applied to a frame having a large change from the previous frame, that is, a frame having a large amount of difference information. In addition, from the viewpoint of reducing image quality deterioration due to accumulation of coding errors and convenience in decoding, periodic refresh forcibly applying an intra-frame coding method is performed at a fixed cycle. FIG. 4 shows the principle of the motion compensation coding method. In this method, when calculating the difference from the previous frame, the motion of the object is taken into account, that is, a motion vector is detected, and the position is shifted to minimize the difference from the previous frame. Therefore, the magnitude of the motion vector is zero in the background portion and the area of the stationary object, and the larger the area of the moving object, the larger the number of non-zero motion vectors in the frame. In particular, when panning, zooming, or the like has occurred, the motion vector is considered to be non-zero in most areas.

【００２８】次に動作について説明する。図５におい
て、利用者が指示入出力部１から検出記録開始の指示と
画像情報名を入力すると、情報記憶部２に検出記録スタ
ート指示信号７が送られ、指定された画像情報の先頭の
フレームの属性情報が情報記憶部２から読み出され属性
情報８として属性情報復号部３に送られ、属性情報復号
部３において復号された属性情報は復号属性情報９とし
て候補シーン検出部４に送られる。候補シーン検出部４
での処理の流れを図６に示す。Next, the operation will be described. In FIG. 5, when the user inputs a detection recording start instruction and an image information name from the instruction input / output unit 1, a detection recording start instruction signal 7 is sent to the information storage unit 2, and the first frame of the designated image information is transmitted. Is read from the information storage unit 2 and sent to the attribute information decoding unit 3 as attribute information 8, and the attribute information decoded by the attribute information decoding unit 3 is sent to the candidate scene detection unit 4 as decoded attribute information 9. . Candidate scene detection unit 4
FIG. 6 shows the flow of the processing in.

【００２９】まず候補シーン検出部４では復号属性情報
９を受け取ると、フレーム間／内符号１９の復号結果を
調べる。なお、本実施例では、フレーム間／内符号１９
の復号結果をＩｎｔｅｒ＿Ｉｎｔｒａとし、Ｉｎｔｅｒ
＿Ｉｎｔｒａ＝０の場合はフレーム間符号化で、Ｉｎｔ
ｅｒ＿Ｉｎｔｒａ＝１の場合はフレーム内符号化で符号
化が行われたことを示しているとする。Ｉｎｔｅｒ＿Ｉ
ｎｔｒａ＝１は、周期的リフレッシュのフレームでなけ
れば前フレームからの変化が大きいフレームであること
を示していることから、シーンチェンジ・物体の出現等
の特徴的なシーンである可能性が高い。従って、フレー
ム位置符号の復号結果に基づく指定フレーム画像表示指
示信号１１を情報記憶部２に送る。Ｉｎｔｅｒ＿Ｉｎｔ
ｒａ＝０である場合と周期的リフレッシュのフレームで
ある場合は、次の判定処理を行う（ステップ６０１）。First, upon receiving the decoding attribute information 9, the candidate scene detecting section 4 checks the decoding result of the interframe / inner code 19. In this embodiment, the inter-frame / inner code 19
Is defined as Inter_Intra, and
If _Intra = 0, interframe coding is performed, and Int
When er_Intra = 1, it indicates that encoding has been performed by intra-frame encoding. Inter_I
Since ntra = 1 indicates that the frame is a large change from the previous frame unless the frame is a periodic refresh frame, it is highly likely that the scene is a characteristic scene such as a scene change or the appearance of an object. Therefore, the designated frame image display instruction signal 11 based on the decoding result of the frame position code is sent to the information storage unit 2. Inter_Int
If ra = 0 and if the frame is a periodic refresh, the following determination processing is performed (step 601).

【００３０】次に、属性情報９の内の情報量符号２０の
復号結果（これをａｍｏｕｎｔとする）を調べる。ａｍ
ｏｕｎｔと予め設定した閾値Ｔｈ１との間にａｍｏｕｎ
ｔ≧Ｔｈ１の関係が成立していれば、前フレームからの
変化がある程度大きいフレームであることを示している
ことから、シーンチェンジ・物体の出現等の特徴的なシ
ーンである可能性が高い。従って、フレーム位置符号の
復号結果に基づく指定フレーム画像表示指示信号１１を
情報記憶部２に送る。ａｍｏｕｎｔ＜Ｔｈ１であれば次
の判定処理を行う（ステップ６０２）。Next, the result of decoding the information amount code 20 in the attribute information 9 (this is referred to as mount) is examined. am
between the threshold value and the preset threshold value Th1
If the relationship t ≧ Th1 holds, it indicates that the frame has a large change from the previous frame to some extent, and thus it is highly possible that the scene is a characteristic scene such as a scene change or the appearance of an object. Therefore, the designated frame image display instruction signal 11 based on the decoding result of the frame position code is sent to the information storage unit 2. If amount <Th1, the next determination process is performed (step 602).

【００３１】その後、属性情報９の内のベクトル数符号
２１の復号結果（これをＮｕｍｂｅｒとする）を調べ、
Ｎｕｍｂｅｒと予め設定した閾値Ｔｈ２との間にＮｕｍ
ｂｅｒ≧Ｔｈ２の関係が成立していれば、パンニング等
の特徴的なシーンである可能性が高い。従って、フレー
ム位置符号の復号結果に基づく指定フレーム画像表示指
示信号１１を情報記憶部２に送る。Ｎｕｍｂｅｒ＜Ｔｈ
２であれば次の処理を行う（ステップ６０３）。Thereafter, the decoding result of the vector number code 21 in the attribute information 9 (this is referred to as Number) is examined.
Num between Number and a preset threshold Th2
If the relationship of ber ≧ Th2 holds, it is highly possible that the scene is a characteristic scene such as panning. Therefore, the designated frame image display instruction signal 11 based on the decoding result of the frame position code is sent to the information storage unit 2. Number <Th
If it is 2, the next processing is performed (step 603).

【００３２】最後に、処理中のフレームが最終フレーム
であるか否かを調べ、最終フレームであれば最終フレー
ム指示信号１５を指示入出力部１に送る。最終フレーム
でなければフレーム属性情報読み出し指示信号１０を情
報記憶部２に送る（ステップ６０４）。Finally, it is checked whether or not the frame being processed is the last frame. If it is the last frame, the last frame instruction signal 15 is sent to the instruction input / output unit 1. If it is not the last frame, a frame attribute information read instruction signal 10 is sent to the information storage unit 2 (step 604).

【００３３】図５において情報記憶部２に属性情報読み
出し指示信号１０が送られると、最後に属性情報を読み
出したフレームの次のフレームに対して上記と同様の処
理を行うため、属性情報８が属性情報復号部３に送ら
れ、属性情報復号部３から復号属性情報９が候補シーン
検出部４に送られてシーン検出が続けられる。In FIG. 5, when the attribute information read instruction signal 10 is sent to the information storage unit 2, the same processing as described above is performed on the frame next to the frame from which the attribute information was last read, so that the attribute information 8 is The attribute information is sent to the attribute information decoding unit 3, and the attribute information decoding unit 3 sends the decoded attribute information 9 to the candidate scene detection unit 4 to continue the scene detection.

【００３４】候補シーン検出部４から指定フレーム画像
表示指示信号１１が情報記憶部２に送られると、指定フ
レームとその前後数フレーム（この時のフレーム数は予
め設定しておく）が順次画像情報１２として画像情報復
号部５に送られ、画像情報復号部５で順次復号され復号
画像情報１３として画像情報表示部６に送られ、時間の
推移に合わせて並べて表示される。但し、フレーム間符
号化手法により符号化されたフレームは前フレームとの
差分情報のみが符号化されているため、フレーム内符号
化手法により符号化されたフレームを起点に順次復号を
行っていかなければ再生画像は得られない。When the designated frame image display instruction signal 11 is sent from the candidate scene detecting section 4 to the information storage section 2, the designated frame and several frames before and after the designated frame (the number of frames at this time are set in advance) are sequentially stored in the image information. The image information is transmitted to the image information decoding unit 5 as 12, sequentially decoded by the image information decoding unit 5, transmitted to the image information display unit 6 as decoded image information 13, and displayed side by side with time. However, since only the difference information from the previous frame is encoded in the frame encoded by the inter-frame encoding method, it is necessary to sequentially decode the frames encoded by the intra-frame encoding method as a starting point. No reproduced image can be obtained.

【００３５】画像表示部６における表示例を図７に示
す。図７において、７０１は検出された候補シーン、７
０２は候補シーン７０１の時間的に前に位置している指
定された枚数の画像、７０３は候補シーン７０１の時間
的に後ろに位置している指定された枚数の画像、７０４
はフレーム内符号化手法により符号化されたフレームの
うち検出された候補シーン７０１の二つ前に位置してい
るものを示している。FIG. 7 shows a display example on the image display unit 6. 7, reference numeral 701 denotes a detected candidate scene;
02, the designated number of images temporally preceding the candidate scene 701; 703, the designated number of images temporally behind the candidate scene 701;
Indicates a frame located two places before the detected candidate scene 701 among frames encoded by the intra-frame encoding method.

【００３６】利用者は、画像表示部６に示された候補シ
ーンを中心とする数フレーム分の画像からシーンの種類
を判断する。この時、表示したシーンが最終フレームに
達していなければ、指示入出力部１からシーンの種類と
フレーム位置の記録、および、検出処理の再開を指示す
ることにより、記録検出再開指示信号１４が情報記憶部
２に送られて編集情報を記録した後検出処理が再開され
る。表示したシーンが最終フレームに達していた場合に
は、指示入出力部１から編集情報の記録と処理終了を指
示することにより、記録検出終了指示信号１６が情報記
憶部２に送られファイルクローズ等の後処理が行われ
る。The user determines the type of the scene from the images of several frames centered on the candidate scene shown on the image display section 6. At this time, if the displayed scene has not reached the last frame, the recording input / output unit 1 instructs the recording of the scene type and the frame position and the restart of the detection process, so that the recording detection restart instruction signal 14 After being sent to the storage unit 2 and recording the editing information, the detection process is restarted. When the displayed scene has reached the last frame, the recording / editing end instruction signal 16 is sent to the information storage unit 2 by instructing the recording and editing of the editing information from the instruction input / output unit 1 to close the file. Is performed.

【００３７】前記ステップ６０４により最終フレーム指
示信号１５が指示入出力部１に送られると、利用者に対
して処理終了指示の入力をうながすメッセージが表示さ
れ、利用者が検出処理の終了を指示すると、検出終了指
示信号１７が情報記憶部２に送られファイルクローズ等
の後処理が行われる。When the last frame instruction signal 15 is sent to the instruction input / output unit 1 in step 604, a message prompting the user to input a processing end instruction is displayed, and when the user instructs the end of the detection processing. , A detection end instruction signal 17 is sent to the information storage unit 2, and post-processing such as file closing is performed.

【００３８】また、検出／判別を行った結果、情報記憶
部２に記録された編集情報を用いて図８の様に編集情報
を視覚的に理解できる様表示させ、動画像情報中の特徴
的なシーンの位置や種類が利用者に簡単に把握できるよ
うにすることが可能である。図８は、動画像情報中の特
徴的なシーンに関する情報のみを表示させた例で、０８
秒でシーンチェンジが、３０秒でパンニングが、１分２
０秒でシーンチェンジが発生していることを示してい
る。As a result of the detection / determination, the editing information recorded in the information storage unit 2 is displayed so that the editing information can be visually understood as shown in FIG. It is possible to make it easy for the user to grasp the position and type of a scene. FIG. 8 shows an example in which only information relating to a characteristic scene in moving image information is displayed.
Scene change in seconds, panning in 30 seconds, 1 minute 2
At 0 seconds, a scene change has occurred.

【００３９】次に、本発明の動画像シーン検出装置の他
の実施例について説明する。Next, another embodiment of the moving picture scene detecting apparatus according to the present invention will be described.

【００４０】先ず、本発明の動画像シーン検出装置によ
ってシーンチェンジの検出が行われる符号化画像を生成
するための動画像符号化器について、図９を参照して説
明する。図９に示す動画像符号化器では、符号化方式と
して、「フレーム間予測」と、「ＤＣＴ（離散コサイン
変換）」を組み合わせた方式を採用している。First, a moving picture encoder for generating a coded picture in which a scene change is detected by the moving picture scene detecting apparatus of the present invention will be described with reference to FIG. The moving picture encoder shown in FIG. 9 employs a coding scheme that combines “inter-frame prediction” and “DCT (discrete cosine transform)”.

【００４１】ブロック抽出回路３０において、入力画像
（現フレーム画像）から一定数の画素よりなる現フレー
ムの画像ブロックが抽出され減算器３１に供給される。
減算器３１において、現フレーム画像ブロックから前フ
レーム画像ブロックが減算され、差分の画像ブロックは
離散コサイン変換（ＤＣＴ）回路３２を介して量子化回
路３３に供給され変換係数の量子化インデックス（以
後、変換係数情報と呼ぶ）が得られる。この変換係数情
報は、可変長符号化部３４に供給されて可変長符号化さ
れる。また、量子化回路３３からの変換係数情報は、逆
量子化回路３５を介して逆離散コサイン変換（ＩＤＣ
Ｔ）回路３６に供給され差分の画像ブロックが再生され
る。この差分の画像ブロックは、加算器３７を介してフ
レームメモリ３８に供給され、このフレームメモリ３８
からの出力、すなわち、前フレーム画像ブロックは減算
器３１及び加算器３７に供給される。なお、ここまでの
構成は、一般的な動画像符号化器の構成と同様である。In the block extracting circuit 30, an image block of the current frame consisting of a fixed number of pixels is extracted from the input image (current frame image) and supplied to the subtracter 31.
In the subtracter 31, the previous frame image block is subtracted from the current frame image block, and the difference image block is supplied to the quantization circuit 33 via the discrete cosine transform (DCT) circuit 32, and the quantization index (hereinafter, referred to as “transform coefficient”) (Referred to as conversion coefficient information). This transform coefficient information is supplied to the variable length coding unit 34 and is subjected to variable length coding. The transform coefficient information from the quantization circuit 33 is supplied to the inverse discrete cosine transform (IDC) via the inverse quantization circuit 35.
T) The difference image block supplied to the circuit 36 is reproduced. The difference image block is supplied to the frame memory 38 via the adder 37, and the frame memory 38
, Ie, the previous frame image block, is supplied to the subtractor 31 and the adder 37. The configuration up to this point is the same as the configuration of a general video encoder.

【００４２】図９に示す動画像符号化器においては、符
号化モード制御部４０には減算器３１からの差分画像ブ
ロックが、有意／無意ブロック制御部３９には量子化回
路３３からの変換係数情報が供給される。In the moving picture encoder shown in FIG. 9, the difference mode block from the subtractor 31 is stored in the coding mode control unit 40, and the transform coefficient from the quantization circuit 33 is stored in the significant / insignificant block control unit 39. Information is provided.

【００４３】符号化モード制御部４０は、差分が特に大
きいブロックに対してはフレーム間予測が有効に機能し
ないのでフレーム内符号化を適用するべきである（フレ
ーム内符号化モード）と判定し、それ以外のブロックに
対してはフレーム間予測を適用するベきである（フレー
ム間予測モード）と判定し、この判定結果は符号化モー
ド情報として第１スイッチ回路４１，第２スイッチ回路
４２、およぴ、可変長符号化部３４に供給する。The coding mode control unit 40 determines that intra-frame coding should be applied (intra-frame coding mode) since inter-frame prediction does not function effectively for a block having a particularly large difference. It is determined that the inter-frame prediction should be applied to other blocks (inter-frame prediction mode), and the result of this determination is used as encoding mode information as the first switch circuit 41, the second switch circuit 42, and the like. Well, it is supplied to the variable length coding unit 34.

【００４４】符号化モード制御部４０からの符号化モー
ド情報がフレーム内符号化モードである場合には、第１
スイッチ回路４１においてスイッチは端子４１ａに接続
され、ＤＣＴ回路３２には現フレーム画像ブロックが供
給される。符号化モード情報がフレーム間予測モードで
ある場合には、第１スイッチ回路４１においてスイッチ
は端子４１ｂに接続され、ＤＣＴ回路３２には差分画像
ブロックが供給される。When the encoding mode information from the encoding mode control unit 40 is the intra-frame encoding mode, the first
In the switch circuit 41, the switch is connected to the terminal 41a, and the DCT circuit 32 is supplied with the current frame image block. When the encoding mode information is the inter-frame prediction mode, the switch in the first switch circuit 41 is connected to the terminal 41b, and the differential image block is supplied to the DCT circuit 32.

【００４５】また、符号化モード制御部４０からの符号
化モード情報がフレーム内符号化モードである場合に
は、第２スイッチ回路４２においてスイッチは端子４２
ａに接続され、加算器３７にはブロック内の値がすべて
零である「零ブロック」が供給される。符号化モード情
報がフレーム間予測モードである場合には、第２スイッ
チ回路４２においてスイッチは端子４２ｂに接続され、
加算器３７にはフレームメモリ３８からの出力（前フレ
ーム画像ブロック）が供給される。When the coding mode information from the coding mode control section 40 is the intra-frame coding mode, the switch in the second switch circuit 42 is connected to the terminal 42.
a, and the adder 37 is supplied with a "zero block" in which the values in the block are all zero. When the encoding mode information is the inter-frame prediction mode, the switch in the second switch circuit 42 is connected to the terminal 42b,
The output (previous frame image block) from the frame memory 38 is supplied to the adder 37.

【００４６】有意／無意ブロック制御部３９は、符号化
モード制御部４０からの符号化モード情報がフレーム内
符号化モードである場合に、量子化回路３３からの変換
係数情報が略零となるブロックを無意ブロック、その他
のブロックを有意ブロックと判定し、その判定結果を有
意／無意ブロック情報として可変長符号化部３４に供給
する。When the coding mode information from the coding mode control section 40 is the intra-frame coding mode, the significant / insignificant block control section 39 controls the block where the transform coefficient information from the quantization circuit 33 becomes substantially zero. Is determined as an insignificant block, and the other blocks are determined as significant blocks, and the determination result is supplied to the variable length coding unit 34 as significant / insignificant block information.

【００４７】可変長符号化部３４は、有意／無意ブロッ
ク制御部３９からの有意／無意ブロック情報、および、
符号化モード制御部４０からの符号化モード情報、およ
び、量子化回路３３からの変換係数情報を可変長符号化
して符号化画像を生成し、この符号化画像を伝送線を介
して画像復号装置に供給するか、或いは、記憶装置に蓄
積する。ただし、有意／無意ブロック制御部３９からの
有意／無意ブロック情報が無意ブロックである場合には
量子化回路３３からの変換係数情報は可変長符号化しな
い。なお、無意ブロックである場合、復号の際には、前
フレームの当該プロックで補充する。また、ここでは有
意／無意ブロック情報と符号化モード情報が符号化動割
像情報中の属性情報である。The variable length coding unit 34 includes significant / insignificant block information from the significant / insignificant block control unit 39 and
The coding mode information from the coding mode control unit 40 and the transform coefficient information from the quantization circuit 33 are subjected to variable-length coding to generate a coded image, and the coded image is transmitted to the image decoding device via a transmission line. Or store it in a storage device. However, when the significant / insignificant block information from the significant / insignificant block control unit 39 is an insignificant block, the transform coefficient information from the quantization circuit 33 is not subjected to variable length coding. If the block is an insignificant block, it is supplemented by the block of the previous frame at the time of decoding. Also, here, the significant / insignificant block information and the encoding mode information are the attribute information in the encoded moving image information.

【００４８】次に、上述した動画像符号化器において使
用される属性情報を利用して、シーンチェンジ検出を行
う原理について説明する。Next, the principle of detecting a scene change using the attribute information used in the above-described moving picture encoder will be described.

【００４９】以下に説明するように、属性情報から画像
の特徴がある程度推定できる。先の説明から明らかなよ
うに、有意ブロックは、前フレームから変化があったブ
ロックである。したがって、通常、有意ブロックはフレ
ーム中の動領域である。しかし、１フレーム中の有意ブ
ロック数が特に多いフレームは、単なる動領域ではな
く、シーンチェンジが発生している可能性が高い。As described below, the characteristics of an image can be estimated to some extent from attribute information. As is clear from the above description, a significant block is a block that has changed from the previous frame. Therefore, a significant block is usually a moving area in a frame. However, a frame in which the number of significant blocks in one frame is particularly large is not a mere moving area, and a possibility that a scene change has occurred is high.

【００５０】また、フレーム内符号化モードが選択され
るのは、有意ブロック中でも前フレームからの変化が特
に激しいブロックである。従って、有意ブロック中のフ
レーム内符号化適用ブロックの割合が特に大きいフレー
ムは、シーンチェンジが発生している可能性が高い。Also, the intra-frame coding mode is selected for a block in which significant changes from the previous frame are particularly severe even among significant blocks. Therefore, there is a high possibility that a scene change has occurred in a frame in which the ratio of intra-frame coding application blocks in a significant block is particularly large.

【００５１】そこで本発明においては、この属性情報と
シーンチェンジの相関に着目してシーンチェンジの検出
を行う。Therefore, in the present invention, a scene change is detected by focusing on the correlation between the attribute information and the scene change.

【００５２】図１０は、本発明の動画シーン検出装置の
実施例を示すブロック図である。FIG. 10 is a block diagram showing an embodiment of the moving picture scene detecting apparatus according to the present invention.

【００５３】伝送線を介して供給されるか、或いは、記
憶装置から読み出された可変長符号化された符号化画像
は、可変長復号部５１において復号され、変換係数情報
及び属性情報（有意／無意ブロック情報，符号化モード
情報）が得られる。これらの変換係数情報及び属性情報
は復号部５２に供給され復号画像が得られる。また、属
性情報は、シーンチェンジ検出フィルタ５３にも供給さ
れる。この属性情報は、図９に示される動画像符号化器
において説明した有意／無意ブロック情報及び符号化モ
ード情報そのものである。The variable-length coded image supplied via the transmission line or read from the storage device is decoded by the variable-length decoding unit 51 and converted coefficient information and attribute information (significant information). / Insignificant block information, coding mode information). These transform coefficient information and attribute information are supplied to the decoding unit 52 to obtain a decoded image. The attribute information is also supplied to the scene change detection filter 53. This attribute information is the significant / insignificant block information and the encoding mode information itself described in the video encoder shown in FIG.

【００５４】シーンチェンジ検出フィルタ５３は、属性
情報が供給される二つの統計量算出回路５４ａ，５４ｂ
と、統計量算出回路５４ａ，５４ｂの出力を閾値Ｔ
Ｈ₁，ＴＨ₂と比較する比較器５５ａ，５５ｂと、比較器
５５ａ，５５ｂの出力の論理積をとるＡＮＤゲート５６
とから構成されている。The scene change detection filter 53 includes two statistic calculation circuits 54a and 54b to which attribute information is supplied.
And the output of the statistic calculation circuits 54a and 54b
Comparators 55a and 55b for comparing with H ₁ and TH _2, and an AND gate 56 for taking the logical product of the outputs of the comparators 55a and 55b
It is composed of

【００５５】統計量算出回路５４は、統計量としてフレ
ーム中の有意ブロック数を求め、統計量算出回路５５
は、統計量として有意ブロック中のフレーム内符号化モ
ードの割合を求める。The statistic calculation circuit 54 calculates the number of significant blocks in the frame as the statistic, and calculates the statistic calculation circuit 55.
Calculates the ratio of the intra-frame coding mode in the significant block as a statistic.

【００５６】シーンチェンジ検出フィルタ５３の特性
は、以下の式で表される。The characteristics of the scene change detection filter 53 are represented by the following equations.

【００５７】[0057]

【数１】但し、Ｄ：シーンチェンジ検出結果ｄ₁：統計量ｆ_iに関する判別結果ｆ_i：ｉ番目の統計量関数ｃ（ｎ）：ｎ番目のフレームのブロック特性情報ＴＨ_i ：ｉ番目の統計量に対する閾値したがって、図１０に示されるシーンチェンジ検出フィ
ルタ５３においては、フレーム中の有意ブロック数が閾
値ＴＨ₁よりも大きく、且つ、有意ブロック中のフレー
ム内符号化モードの割合が閾値ＴＨ₂よりも大きい場合
には、シーンチェンジが検出されたことを示す検出結果
が出力される。(Equation 1) D: scene change detection result d ₁ : determination result regarding statistic f _i f _i : i-th statistic function c (n): block characteristic information of n-th frame TH _i : threshold value for i-th statistic Accordingly, the scene change detection filter 53 shown in FIG. 10, the number of significant blocks in a frame is greater than the threshold value TH _1, and, when the ratio of the intraframe coding mode in significant blocks is greater than the threshold value TH ₂ Outputs a detection result indicating that a scene change has been detected.

【００５８】図１０に示す実施例においては、フレーム
中の有意ブロック数と、有意ブロック中のフレーム内符
号化モードの割合の二つの統計量からシーンチェンジ検
出結果を得ているので、誤りのないシーンチェンジ検出
を行うことができる。In the embodiment shown in FIG. 10, a scene change detection result is obtained from two statistics, that is, the number of significant blocks in a frame and the ratio of the intra-frame coding mode in the significant block. Scene change detection can be performed.

【００５９】次に、本発明の動画像シーン検出装置の更
に他の実施例について図１１を参照して説明する。な
お、図１０に示す実施例と対応する部分には同一符号を
付している。図１１に示す実施例においては、統計量と
して、フレーム中の有意ブロック数と有意ブロック中の
フレーム内符号化モードの割合に加えて、フレームの符
号量を採用している。なお、フレームの符号量とは、可
変長符号化部からの出力であるブロック単位の符号量の
１フレーム分の総量を意味している。この符号量の情報
は、可変長符号化された画像の１フレーム分の符号長を
調べることで得られる。Next, still another embodiment of the moving picture scene detecting apparatus according to the present invention will be described with reference to FIG. Parts corresponding to those in the embodiment shown in FIG. 10 are denoted by the same reference numerals. In the embodiment shown in FIG. 11, in addition to the number of significant blocks in the frame and the ratio of the intra-frame coding mode in the significant block, the code amount of the frame is employed as the statistic. Note that the code amount of a frame means the total amount of one frame of the code amount in block units output from the variable length coding unit. This code amount information is obtained by examining the code length of one frame of the image subjected to the variable length coding.

【００６０】シーンチェンジ検出フィルタ５３の統計量
算出回路５４ｃは、符号化画像から１フレーム分の符号
量を求める。この符号量は、比較器５５ｃで閾値ＴＨ₃
と比較され、この閾値ＴＨ₃より大きい場合にはｄ₃＝１
が出力される。比較器５５ｃの出力ｄ₃は、他の二つの
比較器５５ａ，５５ｂの出力ｄ₁，ｄ₂と共に、ＡＮＤゲ
ート５６に供給されているので、シーンチェンジ検出結
果Ｄは、Ｄ＝ｄ₁・ｄ₂・ｄ₃ で表される。The statistic calculation circuit 54c of the scene change detection filter 53 obtains the code amount for one frame from the coded image. This code amount is compared with the threshold value TH _{3 by the} comparator 55c.
Is compared with the threshold value TH _3, and d ₃ = 1
Is output. The output d ₃ of the comparator 55c, the other two comparators 55a, 55b with the output d _1, d ₂ of, because it is supplied to the AND gate 56, the scene change detection result D is D = d ₁ · d It is expressed by ₂ · d ₃ .

【００６１】このように、図１１に示す実施例において
は、フレーム中の有意ブロック数と、有意ブロック中の
フレーム内符号化モードの割合と、フレームの符号量の
三つの統計量からシーンチェンジ検出結果を得ているの
で、一層誤りのないシーンチェンジ検出を行うことがで
きる。As described above, in the embodiment shown in FIG. 11, the scene change detection is performed based on the three statistics of the number of significant blocks in the frame, the ratio of the intra-frame coding mode in the significant block, and the code amount of the frame. Since the result is obtained, it is possible to detect a scene change without errors.

【００６２】上記三つの統計量に基づいたシーンチェン
ジ検出の効果を確認するために実験を行った。実験条件
を表１に示す。An experiment was conducted to confirm the effect of scene change detection based on the above three statistics. Table 1 shows the experimental conditions.

【００６３】[0063]

【表１】 [Table 1]

【００６４】実験結果を図１２に示す。同図（ａ）は実
験対象画像のフレーム間差分電力、同図（ｂ），
（ｃ），（ｄ）は符号量の判別結果、有意ブロック数の
判別結果、及びフレーム内符号化モードの割合のそれぞ
れの統計量に関する個別の判別結果ｄ₃，ｄ₁，ｄ₂、同
図（ｅ）は、検出フィルタの出力Ｄである。FIG. 12 shows the experimental results. FIG. 7A shows the difference power between frames of the image to be tested, and FIGS.
(C) and (d) show the determination results of the code amount, the determination result of the number of significant blocks, and the individual determination results d ₃ , d ₁ , and d _{2 for} the respective statistics of the ratio of the intra-frame coding mode. (E) is the output D of the detection filter.

【００６５】実験対象画像シーケンスでは、シーンチェ
ンジに対応して差分電力が大きくなっている。シーンチ
ェンジの検出漏れを防ぐために、それぞれの比較器５５
ａ〜５５ｃの閾値ＴＨ_iを低く設定した。このため、個
別の判別結果には、誤検出が含まれている。しかし、３
つの判別結果の論理積をとることにより、検出フィルタ
５３の出力においては、誤検出がなくなっている。In the image sequence for the experiment, the difference power is increased in response to the scene change. In order to prevent omission of scene change detection, each comparator 55
the threshold TH _i of a~55c was set lower. For this reason, an individual detection result includes an erroneous detection. But 3
By taking the logical product of the two determination results, there is no erroneous detection in the output of the detection filter 53.

【００６６】図１２から判るように、属性情報から得ら
れる統計量とフレームの符号量を用いたシーンチェンジ
検出フィルタを設けることにより、実験対象画像シーケ
ンスのシーンチェンジが正確に検出できた。As can be seen from FIG. 12, by providing a scene change detection filter using the statistic obtained from the attribute information and the code amount of the frame, the scene change of the image sequence to be tested can be accurately detected.

【００６７】これにより、従来必要とされていた目視に
よる確認が不要となり、動画像編集の作業効率を著しく
高めることができる。This eliminates the need for visual confirmation, which was conventionally required, and can significantly improve the efficiency of moving image editing.

【００６８】なお、図１０、図１１に示す実施例におい
ても、図２に示す実施例と同様に、シーンチェンジ検出
の結果を符号化動画像と共に情報記憶部に記録するよう
にしてもよい。In the embodiments shown in FIGS. 10 and 11, similarly to the embodiment shown in FIG. 2, the result of the scene change detection may be recorded in the information storage together with the encoded moving image.

【００６９】[0069]

【発明の効果】以上の様に本発明によれば、符号動画像
情報中の属性情報を復号し、復号した属性情報を元に動
画像中の特徴的なシーンの候補を検出することが可能に
なる。従って、目視に依らずある程度自動的なシーン検
出が可能となり、また検出のために動画像情報を全て復
号する必要もないため、作業時間が短縮できる。As described above, according to the present invention, it is possible to decode attribute information in coded video information and detect characteristic scene candidates in the video based on the decoded attribute information. become. Therefore, it is possible to detect scenes automatically to some extent without relying on visual observation, and it is not necessary to decode all moving image information for detection, so that work time can be reduced.

【００７０】更に、本発明においては、符号化パラメー
タを利用してシーンチェンジを検出しているので、正確
にシーンチェンジを検出することができる。Further, in the present invention, since a scene change is detected by using an encoding parameter, a scene change can be detected accurately.

【００７１】また、検出された候補シーンについては、
この候補シーンと前後数フレームの画像情報を復号して
表示させた後、利用者がシーンの種類の判別を行うた
め、検出結果として得られるシーンの種類は一種類だけ
ではなく、複数のシーンを検出することができる。ま
た、検出／判別結果、すなわち、検出されたシーンの種
類とフレーム位置を編集情報として動画像情報とともに
記録しておくことにより、動画像編集の際の編集情報と
して再利用することが可能となる。For the detected candidate scene,
After decoding and displaying this candidate scene and the image information of several frames before and after, the user determines the type of the scene, so that not only one type of scene obtained as a detection result but a plurality of scenes is obtained. Can be detected. In addition, by recording the detection / determination result, that is, the detected scene type and frame position together with the moving image information as editing information, it is possible to reuse the edited scene as moving image information. .

[Brief description of the drawings]

【図１】本発明の動画像シーン検出装置の基本構成図
である。FIG. 1 is a basic configuration diagram of a moving image scene detection device of the present invention.

【図２】本発明の動画像シーン検出装置の実施例を示
す概略構成図である。FIG. 2 is a schematic configuration diagram showing an embodiment of a moving image scene detection device of the present invention.

【図３】図２の実施例における符号情報の構成図であ
る。FIG. 3 is a configuration diagram of code information in the embodiment of FIG. 2;

【図４】動き補償の原理図である。FIG. 4 is a principle diagram of motion compensation.

【図５】図２の実施例における信号の流れを示すブロ
ック図である。FIG. 5 is a block diagram showing a signal flow in the embodiment of FIG. 2;

【図６】図２の実施例における候補シーン検出部での
処理の流れを示す図である。6 is a diagram showing a flow of processing in a candidate scene detection unit in the embodiment of FIG.

【図７】画像表示部における表示例を示す図である。FIG. 7 is a diagram illustrating a display example on an image display unit.

【図８】編集情報の利用例を示す図である。FIG. 8 is a diagram illustrating an example of using editing information.

【図９】シーン検出が行われる符号化画像を生成する
ための動画像符号化器を示すブロック図である。FIG. 9 is a block diagram illustrating a moving image encoder for generating an encoded image on which scene detection is performed.

【図１０】本発明の動画シーン検出装置の他の実施例
を示すブロック図である。FIG. 10 is a block diagram showing another embodiment of the moving image scene detection device of the present invention.

【図１１】本発明の動画像シーン検出装置の更に他の
実施例を示すブロック図である。FIG. 11 is a block diagram showing still another embodiment of the moving image scene detection device of the present invention.

【図１２】シーンチェンジ検出の実験結果を示す波形
図である。FIG. 12 is a waveform chart showing an experimental result of scene change detection.

【図１３】従来例におけるシーン検出サポート装置の
概略構成図である。FIG. 13 is a schematic configuration diagram of a scene detection support device in a conventional example.

【図１４】従来例における動画像情報を示す図であ
る。FIG. 14 is a diagram showing moving image information in a conventional example.

[Explanation of symbols]

１…指示入出力部、２…情報記憶部、３…属性情報復号
部、４…候補シーン検出部、５…画像情報復号部、６…
画像情報表示部、７…検出記録スタート指示信号、８…
属性情報、９…複合属性情報、１０…属性情報読み出し
指示信号、１１…指定フレーム画像表示指示信号、１２
…画像情報、１３…復号画像情報、１４…記録検出再開
指示信号、最終フレーム指示信号、１６…記録検出終了
指示信号、１７…検出終了指示信号、フレーム位置符
号、１９…フレーム間／内符号、２０…符号量符号、２
１…ベクトル数符号、３０…ブロック抽出回路、３１…
減算器、３２…離散コサイン変換回路、３３…量子化回
路、３４…可変長符号化部、３５…逆量子化回路、３６
…逆離散コサイン変換回路、３７…加算器、３８…フレ
ームメモリ、３９…有意／無意ブロック制御部、４０…
符号化モード制御部、４１…第１スイッチ回路、４２…
第２スイッチ回路、５１…可変長復号部、５２…復号
部、５３…シーンチェンジ検出フィルタ、５４ａ，５４
ｂ，５４ｃ…統計量算出回路、５５ａ，５５ｂ，５５ｃ
…比較器、５６…ＡＮＤゲート、１０１…情報記憶手
段、１０２…属性情報復号手段、１０３…候補シーン検
出手段、１０４…画像情報復号手段、１０５…画像情報
表示手段DESCRIPTION OF SYMBOLS 1 ... Instruction input / output part, 2 ... Information storage part, 3 ... Attribute information decoding part, 4 ... Candidate scene detection part, 5 ... Image information decoding part, 6 ...
Image information display section, 7 ... Detection recording start instruction signal, 8 ...
Attribute information, 9: composite attribute information, 10: attribute information read instruction signal, 11: designated frame image display instruction signal, 12
... image information, 13 ... decoded image information, 14 ... recording detection restart instruction signal, last frame instruction signal, 16 ... recording detection end instruction signal, 17 ... detection end instruction signal, frame position code, 19 ... inter-frame / inner code, 20 code amount code, 2
1 ... vector number code, 30 ... block extraction circuit, 31 ...
Subtracter, 32 discrete cosine transform circuit, 33 quantization circuit, 34 variable length coding unit, 35 inverse quantization circuit, 36
... Inverse discrete cosine transform circuit, 37 ... Adder, 38 ... Frame memory, 39 ... Significant / insignificant block control unit, 40 ...
Encoding mode control unit 41 41 first switch circuit 42 42
Second switch circuit, 51: variable length decoding unit, 52: decoding unit, 53: scene change detection filter, 54a, 54
b, 54c... statistic calculation circuit, 55a, 55b, 55c
... comparator, 56 ... AND gate, 101 ... information storage means, 102 ... attribute information decoding means, 103 ... candidate scene detection means, 104 ... image information decoding means, 105 ... image information display means

Claims

[Claims]

An information storage unit for storing code information including a moving image information main body, which is encoded moving image information, and attribute information indicating an attribute of the moving image information main body; and attribute information in the code information. Attribute information decoding means for decoding the attribute information, candidate scene detection means for detecting a candidate for a characteristic scene in a moving image using attribute information output from the attribute information decoding means, and image information in the code information. A moving image scene detection apparatus comprising: image information decoding means for decoding; and image information display means for displaying image information output from the image information decoding means.

2. The method according to claim 1, wherein the code information is coded using coding that does not perform code amount control for each frame, and the attribute information is a code amount for each frame. The moving image scene detection device according to claim 1.