JPH0846864A - Detection of video cutting point - Google Patents

Detection of video cutting point

Info

Publication number
JPH0846864A
Authority
JP
Japan
Prior art keywords
time
cut point
image
axis
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
JP18202494A
Other languages
Japanese (ja)
Inventor
Yukinobu Taniguchi
Yoshinobu Tonomura
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Nippon Telegraph and Telephone Corp
Original Assignee
Nippon Telegraph and Telephone Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Nippon Telegraph and Telephone Corp filed Critical Nippon Telegraph and Telephone Corp
Priority to JP18202494A priority Critical patent/JPH0846864A/en
Publication of JPH0846864A publication Critical patent/JPH0846864A/en
Pending legal-status Critical Current

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/85 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression
    • H04N 19/87 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using pre-processing or post-processing specially adapted for video compression involving scene cut or scene change detection in combination with video compression
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N 19/134 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the element, parameter or criterion affecting or controlling the adaptive coding
    • H04N 19/142 Detection of scene cut or scene change
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N 19/00 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals
    • H04N 19/10 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding
    • H04N 19/169 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding
    • H04N 19/179 Methods or arrangements for coding, decoding, compressing or decompressing digital video signals using adaptive coding characterised by the coding unit, i.e. the structural portion or semantic portion of the video signal being the object or the subject of the adaptive coding, the unit being a scene or a shot

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Studio Circuits (AREA)

Abstract

PURPOSE: To provide a video cut point detection method that reduces detection errors caused by subject motion and camera shake and therefore has superior detection performance. CONSTITUTION: The video data sequence is arranged (101) into a spatiotemporal image, a three-dimensional array over the image-plane coordinates x, y and the time t, and the gradient vectors (ΔxI, ΔyI, ΔtI) of the spatiotemporal image at a time t are computed (102). At a cut point the picture changes abruptly, so the change in luminance and color is small in the x and y directions and large along the t axis, making the gradient vector nearly parallel to the t axis. A value representing how many gradient vectors are nearly parallel to the t axis is therefore obtained as diff = count2/count1 from the counts count1 and count2 of pixels satisfying the conditional expressions (103-105). When diff exceeds a threshold θ, that time is judged to be a cut point (106). This operation is performed successively over time t (107).

Description

Detailed Description of the Invention

[0001]

BACKGROUND OF THE INVENTION

1. Field of the Invention

The present invention relates to a method for detecting cut points (points at which the scene changes) from a video, that is, from a sequence of image data.

[0002]

2. Description of the Related Art

A point at which the scene changes in a video is called a cut point. Video cut point detection is also called scene change detection, and a variety of methods have been proposed.

[0003]

In a typical method, the difference in luminance between corresponding pixels of two temporally adjacent images I(t) and I(t-1) is computed, the sum of the absolute differences is taken as D(t), and t is regarded as a cut point when D(t) exceeds a given threshold (Otsuji, Tonomura, Ohba: Moving image browsing using luminance information. IEICE Technical Report, IE90-103, 1991).
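As a concrete reference for this baseline, the following is a minimal sketch of frame-difference cut detection; the function name and the threshold are illustrative assumptions, not from the patent.

    import numpy as np

    def detect_cuts_frame_diff(frames, threshold):
        """Conventional baseline: flag t as a cut when D(t), the sum of
        absolute luminance differences between frames t-1 and t, exceeds
        a given threshold."""
        cuts = []
        for t in range(1, len(frames)):
            d = np.abs(frames[t].astype(np.int64)
                       - frames[t - 1].astype(np.int64)).sum()  # D(t)
            if d > threshold:
                cuts.append(t)
        return cuts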

[0004]

In place of the inter-frame difference, measures such as the changed-pixel area, the luminance histogram difference, block-wise color correlation, and the χ² test statistic are also used as D(t) (Otsuji, Tonomura: A study of automatic video cut detection methods. ITE Technical Report, Vol. 16, No. 43, pp. 7-12).
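Among these alternatives, the luminance histogram difference is the easiest to make concrete; the sketch below is an illustration only, with the bin count chosen arbitrarily and 8-bit grayscale frames assumed.

    import numpy as np

    def histogram_diff(frame_a, frame_b, bins=64):
        """Alternative D(t): L1 distance between the luminance histograms
        of two adjacent frames."""
        h_a, _ = np.histogram(frame_a, bins=bins, range=(0, 256))
        h_b, _ = np.histogram(frame_b, bins=bins, range=(0, 256))
        return int(np.abs(h_a - h_b).sum())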

[0005]

There is also a method that, rather than thresholding D(t) directly, thresholds the result of applying various temporal filters to D(t). This method has the property that false detections are unlikely even when the video contains rapidly moving objects or flash light (K. Otsuji and Y. Tonomura: Projection Detecting Filter for Video Cut Detection. Proc. of ACM Multimedia 93, 1993, pp. 251-257).

[0006]

Problems to Be Solved by the Invention

In the conventional techniques described above, when the subject moves, or when the entire frame moves because of camera shake, the change measures described above rise, and detection performance deteriorates.

[0007]

This problem is explained with reference to FIGS. 7 and 8. Consider the image data sequence shown in FIG. 7(a). In this example, a white (luminance value 255) rectangle of height H and width W moves left by Δx per frame over a black (luminance value 0) background, and at time tc the color of the rectangle changes to gray (luminance value 127). Time tc is thus the cut point. Here the change measure D(t) is computed as the sum of the absolute differences in luminance between corresponding pixels of the temporally adjacent images I(t) and I(t-1). At a time t (≠ tc), D(t) is the area of the displaced portion of the rectangle in FIG. 7(b) multiplied by the black-white luminance difference 255, so D(t) = 255*2*Δx*H. At the cut point t = tc, the luminance difference between white and gray is 255 − 127 = 128, so D(tc) = 128*W*H. In the graphs of FIGS. 8(a) and 8(b), the horizontal axis is time t and the vertical axis is the change measure D(t). When the speed Δx of the rectangle is small, D(t) is sufficiently smaller than D(tc), as shown in FIG. 8(a), and the cut point is detected correctly. As Δx increases, however, D(t) rises overall, as shown in FIG. 8(b), and distinguishing D(tc) from D(t) becomes difficult. For this reason, conventional methods could mistake intervals of fast subject motion for cut points. Camera shake produces the same situation and was one cause of degraded cut point detection performance.
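To see how close the two values can get, take illustrative numbers (not from the patent): H = W = 100 pixels and Δx = 25 pixels per frame. Then during the motion D(t) = 255*2*25*100 = 1,275,000, while at the cut D(tc) = 128*100*100 = 1,280,000; the two differ by less than one percent, so no threshold on D(t) can separate the moving shot from the actual cut.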

[0008]

Further, the conventional method that prevents detection errors by applying various temporal filters rather than thresholding D(t) directly assumes that the subject's motion is continuous; a detection error therefore still occurs when the subject suddenly moves discontinuously.

[0009]

SUMMARY OF THE INVENTION

An object of the present invention is therefore to solve the above problems of the prior art by providing a cut point detection method with good detection performance, capable of reducing the detection errors caused by subject motion and camera shake.

[0010]

Means for Solving the Problems

To achieve the above object, the present invention provides a video cut point detection method for detecting cut points from the image data sequence constituting a video, the method comprising: a step of computing gradient vectors from the image data sequence, with the image-plane coordinates x, y and the time t regarded as variables; and a step of outputting, as a cut point, a time at which many of the gradient vectors are nearly parallel to the t axis.

[0011]

In the above method, it is preferable that the step of computing gradient vectors from the image data sequence apply a smoothing filter to the image data sequence and compute the gradient vectors of the smoothed spatiotemporal image, since this eliminates the influence of noise and stabilizes detection performance.

[0012]

Further, in the above method, it is preferable to implement the smoothing filter as a digital filter, which reduces the amount of computation and makes detection easy.

[0013]

Operation

Consider the spatiotemporal image obtained by arranging the image data sequence of a video as a three-dimensional array over the image-plane coordinates x, y and the time t. Taking luminance change as an example, at a cut point the gradient vector of this spatiotemporal image is small in the x and y directions and large in the t direction, because the picture changes abruptly there; the gradient vector is therefore nearly parallel to the t axis. Within a shot between cut points, by contrast, the angle between the gradient vector and the t axis is determined by the motion of the subject (if the subject is stationary, the gradient is perpendicular to the t axis); in either case, the gradient vector is rarely parallel to the t axis. The video cut point detection method of the present invention exploits this property that the gradient vectors become nearly parallel to the t axis at cut points: it computes the gradient vectors at each time in turn and detects as cut points the times at which many of them are nearly parallel to the t axis, thereby improving cut point detection performance.

[0014]

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS

An embodiment of the present invention is described in detail below with reference to the drawings.

[0015]

First, the principle of the present invention is explained with reference to FIG. 6. In the video cut point detection method of the present invention, which detects cut points from the image data sequence constituting a video, a three-dimensional array is formed with the image-plane coordinates x, y and the time t regarded as variables. The image data sequence viewed in this way as a three-dimensional volume is called a spatiotemporal image. From the image data of this spatiotemporal image, the gradient vector (ΔxI, ΔyI, ΔtI) is computed successively for each time t, and a time t at which many of the gradient vectors are nearly parallel to the t axis is output as a cut point. The invention exploits the property that the gradient vectors become nearly parallel to the t axis at a cut point.

[0016]

Note that the present invention differs from the prior art in that it considers not only the magnitude of ΔtI among the components of the spatiotemporal gradient vector (ΔxI, ΔyI, ΔtI), but also the spatial differences ΔxI and ΔyI. ΔtI quantifies, for a given pixel, how much the luminance or color changes around time t; the prior art looked only at the temporal difference, that is, |ΔtI|, and did not take the spatial differences ΔxI and ΔyI into account.

[0017]

The property that the gradient vectors of the spatiotemporal image become nearly parallel to the t axis at cut points is explained using FIG. 6, which shows the spatiotemporal image sliced by a plane parallel to the x-t plane. Within a continuous video segment delimited by cuts, that is, within a shot, the motion of the subject appears as band-like streaks. At a cut point the picture changes abruptly, so an edge perpendicular to the t axis appears. Gradient vectors (shown as arrows in the figure) cross edges roughly at right angles, so at the cut point they are nearly parallel to the t axis, as shown at 72 (not all of them are exactly parallel). Taking pixel 70 at the cut point as an example: near pixel 70 the luminance change in the x direction is small, so |ΔxI| is small, while the luminance change in the t direction is large, so |ΔtI| is large; the gradient vector at pixel 70 is therefore nearly parallel to the t axis. Within a shot, the angle between the gradient vector and the t axis is determined by the motion of the subject (perpendicular to the t axis if the subject is stationary), but in any case the gradient vector is rarely parallel to the t axis, as shown at 71. Because the present invention detects as cut points the times at which many of the gradient vectors are nearly parallel to the t axis, detection performance improves, and subject motion or camera shake is less often mistaken for a cut point.

[0018]

Next, an embodiment of the present invention based on this principle is described. FIG. 1 is a processing flowchart of the embodiment.

[0019]

Here the image data sequence is digitized and treated as a three-dimensional array (a spatiotemporal image) that assigns a luminance value I(x, y, t) to the coordinates x, y and the time t. Although only the luminance value is considered in this embodiment, color information, for example RGB values, can be treated in the same way.
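As a concrete data layout for the sketches that follow (an assumption for illustration, not specified by the patent), the spatiotemporal image can be held as a single array indexed by (t, y, x):

    import numpy as np

    def build_spatiotemporal_image(frames):
        """Stack grayscale frames into a 3-D array I indexed as [t, y, x]."""
        return np.stack(frames).astype(np.float64)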

[0020]

In step 101, a smoothed spatiotemporal image is constructed by applying a filter with a smoothing action to the spatiotemporal image. Smoothing makes it possible to estimate the gradient vectors stably (without smoothing, noise prevents the gradients from being obtained stably). As a filter with a smoothing action, for example, the Gaussian filter

[0021]

G(x, y, t) = (1 / ((2π)^(3/2) σx σy σt)) exp(−x²/(2σx²) − y²/(2σy²) − t²/(2σt²))   (Equation 1)

[0022]

can be used. The convolution of the Gaussian filter G with the spatiotemporal image I gives the smoothed spatiotemporal image I′ (= G*I). σx, σy, and σt are the smoothing scales in the x, y, and t directions, respectively. The larger σ is, the smoother the filtered spatiotemporal image becomes, and the more noise is removed, at the cost of losing fine detail. To reduce the amount of computation, a digital filter such as the following can also be used:

[0023]

[Equation 2]

[0024]

where W, H, and T are the smoothing scales.
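The following sketch covers step 101 under the data layout above; scipy's Gaussian filter stands in for Equation 1, and since Equation 2 is not reproduced in this text, the box-average alternative named in the comment is an assumption about the digital filter.

    import numpy as np
    from scipy import ndimage

    def smooth_volume(video, sigma_x=1.0, sigma_y=1.0, sigma_t=1.0):
        """Step 101: smooth the spatiotemporal image I[t, y, x].

        sigma_x, sigma_y, sigma_t correspond to the scales of Equation 1.
        ndimage.uniform_filter(video, size=(T, H, W)) would be one digital
        (box-average) alternative in the spirit of Equation 2.
        """
        return ndimage.gaussian_filter(video, sigma=(sigma_t, sigma_y, sigma_x))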

[0025]

In step 102, the gradient (ΔxI, ΔyI, ΔtI) is computed at time t for each pixel of the smoothed spatiotemporal image. In the case of the Gaussian filter, since

[0026]

∂(G*I)/∂x = (∂G/∂x)*I   (Equation 3)

[0027]

holds, the derivative filter ∂G/∂x may be convolved directly instead of convolving the smoothing filter and then differentiating. Other filters with a differentiating action, for example

[0028]

[Equation 4]

[0029]

may also be used.
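For step 102, one simple differentiating filter is the central difference, sketched below with numpy's gradient; this is one admissible choice of filter, not the specific filter of the patent.

    import numpy as np

    def gradient_volume(smoothed):
        """Step 102: gradient (dxI, dyI, dtI) of the smoothed volume.

        Central differences along each axis of the [t, y, x] array act
        as the differentiating filter.
        """
        dt_i, dy_i, dx_i = np.gradient(smoothed)  # axis order: t, y, x
        return dx_i, dy_i, dt_i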

[0030]

In step 103, the number of pixels in the region (called region A) where the gradient vector satisfies

|ΔxI| < θ1 and |ΔyI| < θ1   (1)

is counted, and the count is taken as count1. |ΔxI| takes a large value when a vertical edge is nearby, and |ΔyI| takes a large value when a horizontal edge is nearby, so region A, which satisfies expression (1), can be said to have no edges in its neighborhood. For the image of FIG. 2(a), region A is shown hatched in FIG. 2(b); qualitatively, region A is the image excluding the neighborhoods of the subject's edges. Instead of expression (1), an expression such as |ΔxI| + |ΔyI| < θ1 or √(|ΔxI|² + |ΔyI|²) < θ1 may be used.

[0031]

In step 104, the number of pixels in the region (called region B) where the gradient vector satisfies both condition (1) and

|ΔtI| > θ2   (2)

is counted, and the count is taken as count2. If the region satisfying only condition (2) is called region C, region B is the intersection of region C and region A. Conventionally, there was a method that defined diff′ as the area of region C divided by the total number of pixels and regarded a time as a cut point when diff′ exceeded a threshold. As shown in FIG. 4, this method has the problem that fast subject motion raises diff′ even away from cut points, so subject motion and camera shake cause many detection errors. This is because subject motion or camera shake makes the inter-frame difference large (|ΔtI| large) near the subject's edges. In one embodiment of the present invention, therefore, the area of region B is counted instead of the area of region C (diff′). Interpreted geometrically, the joint satisfaction of conditions (1) and (2) means that the gradient vector is nearly parallel to the t axis. As an example, consider the scene switching from the image of FIG. 3(a) to the image of FIG. 3(b). In FIG. 3(c), region A is shown hatched and region B in black. Cut points are detected by exploiting the property that, at a cut point, region B occupies a large part of region A.

[0032]

In step 105, diff = count2/count1 is computed; that is, diff is the ratio of the area of region B to the area of region A. Since 0 ≤ diff ≤ 1 and diff takes a value close to 1 at a cut point, step 106 judges that there is a cut point when diff > θ3, and that there is no cut point otherwise. FIG. 5 shows a typical diff curve. Unlike the curve of FIG. 4, diff does not rise under subject motion or camera shake, so cut points can be detected stably by thresholding.
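Putting steps 103 to 107 together, the sketch below reuses smooth_volume and gradient_volume from the earlier sketches and thresholds diff = count2/count1 frame by frame; video is the [t, y, x] array built by build_spatiotemporal_image, and all threshold values are illustrative assumptions, not from the patent.

    import numpy as np

    def detect_cuts(video, theta1=8.0, theta2=32.0, theta3=0.5):
        """Steps 103-107: report the times where many gradient vectors
        are nearly parallel to the t axis."""
        dx_i, dy_i, dt_i = gradient_volume(smooth_volume(video))
        cuts = []
        for t in range(video.shape[0]):
            # Region A, condition (1): no strong spatial edge nearby.
            region_a = (np.abs(dx_i[t]) < theta1) & (np.abs(dy_i[t]) < theta1)
            # Region B, conditions (1) and (2): gradient nearly parallel to t axis.
            region_b = region_a & (np.abs(dt_i[t]) > theta2)
            count1, count2 = region_a.sum(), region_b.sum()
            if count1 > 0 and count2 / count1 > theta3:  # diff > theta3
                cuts.append(t)
        return cuts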

[0033]

In step 107, t is advanced and the processing returns to step 102.

[0034]

Effects of the Invention

As described above, according to the video cut point detection method of the present invention, a video is treated as a spatiotemporal image with the image-plane coordinates and time as variables, and cut points are detected by computing the gradient vectors of that spatiotemporal image and considering not only the temporal differences but also the spatial differences. Cut point detection performance is therefore improved, and the detection errors caused by subject motion and camera shake in the spatiotemporal image are reduced.

[0035]

Further, when a smoothing filter is applied to the spatiotemporal image as described above, the influence of noise is eliminated and cut point detection becomes particularly stable.

[0036]

Furthermore, when the smoothing filter is implemented as a digital filter as described above, the amount of computation is particularly reduced.

Brief Description of the Drawings

FIG. 1 is a processing flowchart of an embodiment of the present invention.

FIGS. 2(a) and 2(b) are explanatory diagrams of the region (region A) satisfying condition (1) in the embodiment.

FIGS. 3(a), 3(b), and 3(c) are explanatory diagrams of the region satisfying condition (1) (region A) and the region satisfying conditions (1) and (2) (region B) in the embodiment.

FIG. 4 shows a typical time variation of the area measure (diff′) of the region satisfying only condition (2) (region C) in the embodiment.

FIG. 5 shows a typical time variation of the area measure (diff) of region B.

FIG. 6 is an explanatory diagram of a cross-section of the spatiotemporal image and its gradient vectors, used to explain the principle of the present invention.

FIGS. 7(a) and 7(b) are diagrams for explaining the problems of the conventional method: (a) shows the input image sequence, and (b) explains how the change measure D(t) is computed.

FIGS. 8(a) and 8(b) are diagrams for explaining the problems of the conventional method: (a) is a graph of the change measure D(t) when the displacement Δx is small, and (b) is the graph when Δx is large.

Explanation of Reference Numerals

101-107: processing steps

Claims (3)

What is claimed is:

1. A video cut point detection method for detecting cut points from an image data sequence constituting a video, comprising: a step of computing gradient vectors from the image data sequence, with the image-plane coordinates x, y and the time t regarded as variables; and a step of outputting, as a cut point, a time at which many of the gradient vectors are nearly parallel to the t axis.
2. The video cut point detection method according to claim 1, wherein the step of computing gradient vectors from the image data sequence applies a smoothing filter to the image data sequence and computes the gradient vectors of the smoothed spatiotemporal image.
3. The video cut point detection method according to claim 2, wherein the smoothing filter is a digital filter.
JP18202494A 1994-08-03 1994-08-03 Detection of video cutting point Pending JPH0846864A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
JP18202494A JPH0846864A (en) 1994-08-03 1994-08-03 Detection of video cutting point

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
JP18202494A JPH0846864A (en) 1994-08-03 1994-08-03 Detection of video cutting point

Publications (1)

Publication Number Publication Date
JPH0846864A (en) 1996-02-16

Family

ID=16111012

Family Applications (1)

Application Number Title Priority Date Filing Date
JP18202494A Pending JPH0846864A (en) 1994-08-03 1994-08-03 Detection of video cutting point

Country Status (1)

Country Link
JP (1) JPH0846864A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR100525678B1 (en) * 2000-06-06 2005-11-03 Method and system for compressing motion image information

