WO2016031573A1 - Image-processing device, image-processing method, program, and recording medium - Google Patents


Info

Publication number
WO2016031573A1
Authority
WO
WIPO (PCT)
Prior art keywords
person
still image
still
motion
noted
Prior art date
Application number
PCT/JP2015/072818
Other languages
French (fr)
Japanese (ja)
Inventor
学斌 胡
Original Assignee
FUJIFILM Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by FUJIFILM Corporation
Publication of WO2016031573A1 publication Critical patent/WO2016031573A1/en

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/20 Analysis of motion
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00 Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60 Control of cameras or camera modules
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04N PICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00 Details of television systems
    • H04N5/76 Television signal recording
    • H04N5/91 Television signal processing therefor

Definitions

  • the present invention relates to an image processing apparatus, an image processing method, a program and a recording medium for extracting and outputting still image data from moving image data.
  • A moving image may include a best-shot scene that is difficult to capture as a still photograph, such as the moment a child blows out the candles on a birthday cake, that is, a scene that well represents the motion of a person captured in the moving image.
  • On the other hand, moving images also include those with little movement of persons, those of low importance, those with poor composition, those with poor image quality, and the like.
  • Patent Document 1 relates to a person action search device that enables moving image data to be quickly reproduced from the recorded position at which a person is recorded.
  • In Patent Document 1, a representative image of a recognized person is extracted, and a bird's-eye-view image is created by superimposing on the representative image a tracking line, that is, the movement trajectory of the virtual center of gravity of the person image from when it appears in the captured image until it disappears from the captured image.
  • Patent Document 2 relates to a technique for extracting, from moving image data, a representative frame that clearly represents the video contained in the moving image data.
  • In Patent Document 2, the frame image with the largest evaluation value output by a face state determination unit is taken as the representative frame image.
  • An object of the present invention is to solve the problems of the prior art and to provide an image processing apparatus, an image processing method, a program, and a recording medium capable of automatically extracting and outputting, from moving image data, still image data of a still image corresponding to a best-shot scene.
  • To achieve the above object, the present invention provides an image processing apparatus comprising: a still image data extraction unit that extracts still image data of a plurality of frames from moving image data;
  • a person-of-interest detection unit that detects a person of interest, that is, a person to be processed, from each of a plurality of still images corresponding to the still image data of the plurality of frames;
  • a motion trajectory detection unit that detects the motion trajectory of the person of interest by tracking the movement of the person of interest in the moving image corresponding to the moving image data, based on the detection results for the person of interest in the plurality of still images;
  • a motion analysis unit that analyzes the motion of the person of interest in the moving image based on the motion trajectory and calculates, for each of the plurality of still images, an evaluation value for the motion of the person of interest; and a still image data output unit that outputs, from the still image data of the plurality of frames, still image data of a still image whose evaluation value for the motion of the person of interest is equal to or greater than a threshold.
  • Preferably, the apparatus further comprises a person registration unit for registering, as a registered person, a person to be processed among the persons photographed in the moving image, and
  • the person-of-interest detection unit detects, as the person of interest from each of the plurality of still images, a person who matches the registered person or whose similarity to the registered person is equal to or greater than a threshold.
  • Preferably, the person-of-interest detection unit extracts the faces of persons from each of the plurality of still images, performs main-person determination on the extracted face images, and identifies as the person of interest the person determined to be the main person.
  • Preferably, the person-of-interest detection unit further detects the face region of the person of interest in the still image.
  • Preferably, the motion trajectory detection unit tracks the movement of the person of interest in the moving image by setting, based on the face region of the person of interest, detection regions at arbitrary positions in the still image of the frame next to the current frame, each corresponding to the face region of the person of interest in the still image of the current frame, comparing the similarity between the face region of the person of interest in the still image of the current frame and the detection region at each position, and thereby detecting to which detection region of the still image of the next frame the face region of the person of interest has moved.
  • The motion trajectory detection unit preferably tracks the movement of the person of interest for each of the regions obtained by dividing the upper-body region of the person of interest into a predetermined number of regions.
  • Preferably, the motion trajectory detection unit generates an integral image of the still image of the next frame and, using the generated integral image, sequentially repeats the calculation of the sum of the luminance values of all pixels included in a detection region at an arbitrary position in the still image of the next frame, for detection regions at a plurality of positions.
  • It is preferable that the motion trajectory detection unit tracks the movement of the person of interest by using the mean shift method.
  • Preferably, the motion analysis unit defines in advance a motion trajectory for a motion of the person of interest, analyzes the motion of the person of interest by detecting, from the motion trajectory detected by the motion trajectory detection unit, a portion similar to the predefined motion trajectory, and calculates the evaluation value for the motion of the person of interest according to the type of motion.
  • Preferably, the motion analysis unit analyzes the motion of the person of interest based on a motion history image of the person of interest serving as the motion trajectory, and calculates the evaluation value for the motion of the person of interest.
  • Preferably, the person-of-interest detection unit further detects the position of the person of interest in the still image, the size of the person of interest in the still image, and the region of the person of interest in the still image.
  • Preferably, the motion trajectory detection unit further detects the length of the motion trajectory of the person of interest and the movement pattern of the person of interest, and the apparatus further comprises: an importance determination unit that determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the person of interest, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculates an evaluation value of importance for each of the plurality of still images based on the determined importance;
  • a composition analysis unit that analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculates an evaluation value of composition for each of the plurality of still images based on the analyzed quality; and an image quality determination unit that determines the image quality of each of the plurality of still images based on the region of the person of interest in the still image, and calculates an evaluation value of image quality for each of the plurality of still images based on the determined image quality.
  • In this case, the still image data output unit preferably outputs still image data of one or more still images, among the plurality of still images, whose overall evaluation value is equal to or greater than a threshold, the overall evaluation value combining the evaluation value for the motion of the person of interest with at least one of the evaluation value of importance, the evaluation value of composition, and the evaluation value of image quality.
  • Preferably, the composition analysis unit defines in advance a movement pattern of the person of interest, detects, from the motion trajectory detected by the motion trajectory detection unit, a portion where the person of interest is moving with the predefined movement pattern, analyzes the composition of the still images corresponding to that portion as good, and calculates the evaluation value of the composition of the still images analyzed as good to be higher than that of the still images not analyzed as good.
  • Preferably, the person-of-interest detection unit further detects the orientation of the face of the person of interest in the still image, and the apparatus further comprises a top-bottom correction unit that, based on the detected face orientation, corrects the top and bottom of the still image corresponding to the still image data output from the still image data output unit so as to match the top and bottom of the photographing apparatus at the time the moving image was captured.
  • Further, the present invention provides an image processing method comprising: a step in which the still image data extraction unit extracts still image data of a plurality of frames from moving image data; a step in which the person-of-interest detection unit detects a person of interest, that is, a person to be processed, from each of a plurality of still images corresponding to the still image data of the plurality of frames;
  • a step in which the motion trajectory detection unit detects the motion trajectory of the person of interest by tracking the movement of the person of interest in the moving image corresponding to the moving image data, based on the detection results for the person of interest in the plurality of still images;
  • a step in which the motion analysis unit analyzes the motion of the person of interest in the moving image based on the motion trajectory and calculates, for each of the plurality of still images, an evaluation value for the motion of the person of interest based on the analyzed motion;
  • and a step in which the still image data output unit outputs, from the still image data of the plurality of frames, still image data of a still image whose evaluation value for the motion of the person of interest is equal to or greater than a threshold.
  • Preferably, the method further comprises: a step in which the motion trajectory detection unit detects the length of the motion trajectory of the person of interest and the movement pattern of the person of interest;
  • a step in which the importance determination unit determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculates an evaluation value of importance for each still image based on the determined importance;
  • a step in which the composition analysis unit analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculates an evaluation value of composition for each still image based on the analyzed quality;
  • a step in which the image quality determination unit determines the image quality of each of the plurality of still images based on the region of the person of interest in the still image, and calculates an evaluation value of image quality for each still image based on the determined image quality;
  • and a step in which the still image data output unit outputs still image data of one or more still images whose overall evaluation value, combining the evaluation value for the motion of the person of interest with the evaluation values of importance, composition, and image quality, is equal to or greater than a threshold.
  • Preferably, the method further comprises a step in which the top-bottom correction unit corrects, based on the face orientation of the person of interest detected by the person-of-interest detection unit, the top and bottom of the still image corresponding to the still image data output from the still image data output unit so as to match the top and bottom of the photographing apparatus at the time the moving image was captured.
  • the present invention also provides a program for causing a computer to execute the steps of the image processing method described above.
  • the present invention also provides a computer readable recording medium having recorded thereon a program for causing a computer to execute the steps of the image processing method described above.
  • According to the present invention, a best-shot scene is automatically detected from a moving image, and still image data of a still image corresponding to the best-shot scene can be output from the still image data of a plurality of frames extracted from the moving image data.
  • FIG. 1 is a block diagram of an embodiment showing the configuration of the image processing apparatus of the present invention.
  • In FIGS. 2A to 2C, the left side of each figure is a conceptual diagram of an example showing the motion trajectory of the person of interest, and the right side is a conceptual diagram of an example showing the motion history image of the person of interest.
  • FIG. 3A is a conceptual diagram of an example showing a still image rotated 90° to the left, and FIG. 3B shows a still image whose top and bottom have been corrected by rotating the still image shown in FIG. 3A by 90° to the right.
  • FIG. 1 is a block diagram of an embodiment showing the configuration of the image processing apparatus of the present invention.
  • the image processing apparatus 10 shown in the figure automatically detects a scene of a best shot from a moving image, and outputs still image data of a still image corresponding to the scene of the best shot.
  • The image processing apparatus 10 comprises a person-of-interest registration unit 12, a still image data extraction unit 14, a person-of-interest detection unit 16, a motion trajectory detection unit 18, a motion analysis unit 20, an importance determination unit 22, a composition analysis unit 24, an image quality determination unit 26, a still image data output unit 28, and a top-bottom correction unit 30.
  • The person-of-interest registration unit 12 registers, as a registered person, the person of interest to be processed among the persons photographed in the moving image corresponding to the moving image data.
  • The person-of-interest registration unit 12 can register, for example, a person designated by the user among the persons photographed in the moving image as the registered person.
  • The person-of-interest registration unit 12 can register an image of the registered person (for example, a face image for identifying the person of interest).
  • the still image data extraction unit 14 extracts still image data of a plurality of frames from the moving image data.
  • The still image data extraction unit 14 can extract, for example, still image data of all frames (each frame) of the moving image data.
  • Alternatively, still image data of one frame may be extracted every fixed number of frames, for example, every two frames.
  • Still image data may also be extracted only from the frames of an arbitrary section of the moving image corresponding to the moving image data.
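As a non-normative illustration of these extraction policies, the choice of frames reduces to picking frame indices; the function name and parameters below are hypothetical, not from the patent:

```python
def frames_to_extract(total_frames, step=1, section=None):
    """Return the indices of the frames whose still image data should be
    extracted: every `step`-th frame, optionally restricted to a
    [start, end) section of the moving image."""
    start, end = section if section is not None else (0, total_frames)
    return list(range(start, min(end, total_frames), step))

all_frames = frames_to_extract(10)                # frames 0..9
every_2nd = frames_to_extract(10, step=2)         # [0, 2, 4, 6, 8]
section = frames_to_extract(10, section=(3, 7))   # [3, 4, 5, 6]
```

The actual decoding of the selected frames would be done by a video decoder and is omitted here.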
  • The person-of-interest detection unit 16 detects the person of interest, that is, the person to be processed, from each of the plurality of still images corresponding to the still image data of the plurality of frames extracted from the moving image data by the still image data extraction unit 14.
  • For example, the person-of-interest detection unit 16 detects the presence or absence of persons in each of the plurality of still images and compares an image of each detected person (for example, a face image) with the image of the registered person registered in the person-of-interest registration unit 12, thereby identifying as the person of interest a detected person who matches or is similar to the registered person (a person whose similarity is equal to or greater than a threshold).
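The comparison against the registered person can be sketched as a similarity test between face feature vectors; the feature extractor itself is out of scope here, and the cosine measure and the threshold value are illustrative assumptions, not part of the patent:

```python
import numpy as np

def is_person_of_interest(candidate_vec, registered_vec, threshold=0.8):
    """Hypothetical matching step: compare a face feature vector of a
    detected person against the registered person's vector, and treat
    the person as the person of interest when the cosine similarity is
    at least `threshold`."""
    a = np.asarray(candidate_vec, dtype=float)
    b = np.asarray(registered_vec, dtype=float)
    sim = float(a @ b / (np.linalg.norm(a) * np.linalg.norm(b)))
    return sim >= threshold, sim
```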
  • Alternatively, the person-of-interest detection unit 16 can extract the faces of persons from each of the plurality of still images, perform main-person determination on the extracted face images, and identify as the person of interest the person determined by the main-person determination to be the main person.
  • In the main-person determination, same-person determination processing is performed on the plurality of face images, and the face images are classified into image groups each containing the face images of the same person. Subsequently, one or more of the classified persons are determined to be the main character, and one or more persons highly related to the main character among the other persons are determined to be important persons. The person corresponding to each image group can be identified based on the face images of the registered persons registered in the person-of-interest registration unit 12.
  • For example, the person with the largest number of detected face images can be determined to be the main character, and a person other than the main character who appears in a large number of still images together with the main character can be determined to be an important person.
  • The distance between the face image of the main character and the face image of another person photographed in the same still image may also be calculated, and a person whose face image is close to that of the main character may be determined to be an important person.
  • In addition, an important person may be determined based on one or both of the difference between the shooting date and time information of the still images in which the main character is photographed and that of the still images in which another person is photographed, and the difference between the corresponding shooting position information.
  • Furthermore, the person-of-interest detection unit 16 can detect, in the still image, the position of the person of interest, the size of the person of interest, the region of the person of interest, the upper-body region of the person of interest, the position of the face of the person of interest, the size of the face of the person of interest, the face region of the person of interest, the orientation of the face of the person of interest, and the like.
  • Since methods of detecting a person and a person's face in a still image are known, their detailed description is omitted here; the specific detection methods are not limited in any way.
  • The motion trajectory detection unit 18 detects the motion trajectory of the person of interest by tracking the movement of the person of interest in the moving image corresponding to the moving image data, based on the detection results for the person of interest in the plurality of still images obtained by the person-of-interest detection unit 16. By detecting the motion trajectory, the motion trajectory detection unit 18 can also detect the length of the motion trajectory of the person of interest, the movement pattern of the person of interest, and the like.
  • As the motion trajectory of the person of interest, a line representing the trajectory of movement of a region of interest (ROI: Region of Interest), such as a face region, can be used, as shown on the left side of FIGS. 2A to 2C.
  • Alternatively, a motion history image (MHI: Motion History Image), as shown on the right side of FIGS. 2A to 2C, can be used.
  • The motion history image represents the motion history of the person of interest, for example, by changing the color at regular time intervals.
  • For example, based on the face region of the person of interest, the motion trajectory detection unit 18 sets detection regions at arbitrary positions in the still image of the next frame, each corresponding to the face region of the person of interest in the still image of the current frame, compares the similarity between the face region of the person of interest in the still image of the current frame and the detection region at each position, and tracks the movement of the person of interest in the moving image by detecting to which detection region of the still image of the next frame the face region of the person of interest has moved.
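A minimal sketch of this frame-to-frame tracking, assuming grayscale frames as NumPy arrays and using the sum of squared differences as the similarity measure (the patent leaves the measure unspecified):

```python
import numpy as np

def track_region(prev_frame, next_frame, box, search=5):
    """Find where the face region `box` = (top, left, h, w) of the
    current frame has moved in the next frame, by comparing the region
    with detection regions at nearby positions (lower SSD = more
    similar).  Returns the top-left corner of the best detection
    region in the next frame."""
    t, l, h, w = box
    template = prev_frame[t:t + h, l:l + w].astype(float)
    best, best_pos = None, (t, l)
    for dt in range(-search, search + 1):
        for dl in range(-search, search + 1):
            nt, nl = t + dt, l + dl
            if (nt < 0 or nl < 0 or nt + h > next_frame.shape[0]
                    or nl + w > next_frame.shape[1]):
                continue  # detection region falls outside the frame
            cand = next_frame[nt:nt + h, nl:nl + w].astype(float)
            ssd = float(((cand - template) ** 2).sum())
            if best is None or ssd < best:
                best, best_pos = ssd, (nt, nl)
    return best_pos
```

Repeating this per frame yields the sequence of region positions, i.e. the motion trajectory.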
  • In this case, the upper-body region of the person of interest can be divided into a fixed number of regions, for example four, and the movement of the person of interest can be tracked in the same way for each of the five regions in total (the face region plus the four upper-body regions), which can improve the tracking success rate.
  • Further, by generating an integral image of the still image of the next frame (that is, of each frame) and calculating the sums of luminance values using the generated integral image, the amount of calculation can be reduced and the processing can be speeded up.
  • In the integral image, assuming that the pixel coordinates of the still image increase from left to right and from top to bottom, the pixel at each coordinate holds the integral (sum) of the luminance values from the top-left pixel up to that pixel.
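Under those coordinate assumptions, the integral image and a constant-time region sum can be sketched with NumPy; the extra zero row and column are an implementation convenience, not part of the patent:

```python
import numpy as np

def integral_image(img):
    """Each entry holds the sum of luminance values from the top-left
    pixel down to that pixel (inclusive), computed with two cumulative
    sums.  A zero row/column is prepended so region sums need no
    boundary checks."""
    ii = np.zeros((img.shape[0] + 1, img.shape[1] + 1), dtype=np.int64)
    ii[1:, 1:] = np.cumsum(np.cumsum(img, axis=0), axis=1)
    return ii

def region_sum(ii, top, left, h, w):
    """Sum of all luminance values in a detection region, from just
    four corner lookups of the integral image."""
    return int(ii[top + h, left + w] - ii[top, left + w]
               - ii[top + h, left] + ii[top, left])
```

This is why repeating the luminance-sum calculation for detection regions at many positions stays cheap: each repetition costs four lookups instead of a full region scan.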
  • The method of tracking the movement of the person of interest is not limited to the use of an integral image for reducing the amount of calculation and speeding up the processing; various methods can be used, for example, the mean shift method. Since the mean shift method is also known, its detailed description is omitted.
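For illustration only, a bare-bones mean shift step over a precomputed weight map might look like the following; in practice the weights would come from something like a color-histogram back-projection of the person of interest, which is an assumption here, not a detail from the patent:

```python
import numpy as np

def mean_shift(weights, window, iters=20):
    """Iteratively move a (top, left, h, w) window to the centroid of
    the pixel weights inside it until it stops moving, and return the
    final (top, left) position."""
    t, l, h, w = window
    for _ in range(iters):
        patch = weights[t:t + h, l:l + w]
        total = patch.sum()
        if total == 0:
            break                      # no evidence inside the window
        ys, xs = np.mgrid[0:h, 0:w]
        cy = (ys * patch).sum() / total  # centroid inside the window
        cx = (xs * patch).sum() / total
        nt = int(round(t + cy - (h - 1) / 2))
        nl = int(round(l + cx - (w - 1) / 2))
        nt = min(max(nt, 0), weights.shape[0] - h)  # keep in frame
        nl = min(max(nl, 0), weights.shape[1] - w)
        if (nt, nl) == (t, l):
            break                      # converged
        t, l = nt, nl
    return t, l
```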
  • The motion analysis unit 20 analyzes the motion of the person of interest in the moving image based on the motion trajectory detected by the motion trajectory detection unit 18, for example, the motion trajectory of a region of interest such as the face region, and calculates, for each of the plurality of still images, the evaluation value for the motion of the person of interest based on the analyzed motion.
  • For example, the motion analysis unit 20 defines in advance a motion trajectory for a motion of the person of interest, for example, the motion trajectory of the person of interest running, and analyzes the motion of the person of interest by detecting, from the motion trajectory detected by the motion trajectory detection unit 18, a portion similar to the predefined motion trajectory. The motion analysis unit 20 can then calculate the evaluation value according to the type of motion, for example, assigning a predetermined evaluation value when the motion of the person of interest is a running motion.
  • The motion analysis unit 20 can also analyze the motion of the person of interest based on a motion history image, as shown on the right side of FIGS. 2A to 2C, serving as the motion trajectory, and calculate the evaluation value for the motion.
  • By analyzing the motion history image, the motion analysis unit 20 can recognize, for example, that the person of interest is running from right to left as shown on the right side of FIG. 2A, that the person of interest is standing still and moving only the right hand as shown on the right side of FIG. 2B, and that the person of interest is picking up something that has fallen on the ground as shown on the right side of FIG. 2C. The evaluation value for the motion of the person of interest can be calculated based on whether the person of interest is moving, at which position, in which direction, and the like.
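A minimal motion-history-image update consistent with this description, assuming per-frame binary motion masks (e.g., from frame differencing) and arbitrary timestamp units; the patent colors the history at regular intervals, which here corresponds to the graded timestamp values:

```python
import numpy as np

def update_mhi(mhi, motion_mask, timestamp, duration):
    """Update a motion history image: pixels where motion is detected
    in the current frame are set to the current timestamp, and pixels
    whose last motion is older than `duration` are cleared to 0, so
    newer motion appears 'brighter' than older motion."""
    mhi = np.where(motion_mask, float(timestamp), mhi)
    mhi[mhi < timestamp - duration] = 0.0
    return mhi
```

Feeding it one motion mask per frame yields a gradient along the direction of movement, which is what lets the direction of motion be read off the image.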
  • The importance determination unit 22 determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the person of interest, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculates the evaluation value of importance for each of the plurality of still images based on the determined importance.
  • For example, the importance determination unit 22 determines that a still image corresponding to a scene in the moving image in which the motion trajectory of the person of interest is long has high importance. It also determines that a still image in which the person of interest is photographed in the central portion, and a still image in which the person of interest is photographed large (the size of the person of interest is equal to or greater than a threshold), have high importance. The evaluation value of importance is calculated to be higher as the importance is higher.
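These rules could be toy-scored as follows; the thresholds and the equal weighting are illustrative assumptions, since the patent states only the criteria, not how they are combined:

```python
def importance_score(traj_len, center_dist, person_size,
                     len_thresh=100.0, size_thresh=0.25):
    """Score a still image from 0 to 3: one point each for a long
    motion trajectory, a person of interest near the image centre
    (normalized distance), and a large person of interest
    (fraction of the image area)."""
    score = 0
    if traj_len >= len_thresh:      # long trajectory -> important scene
        score += 1
    if center_dist <= 0.2:          # person in the central portion
        score += 1
    if person_size >= size_thresh:  # person photographed large
        score += 1
    return score
```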
  • The composition analysis unit 24 analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculates the evaluation value of composition for each of the plurality of still images based on the analyzed quality.
  • For example, the composition analysis unit 24 analyzes the composition of a still image in which the person of interest is photographed in the central portion, or in which the person of interest is photographed large (the size of the person of interest is equal to or greater than a threshold), as better than that of a still image in which the person of interest is not photographed in the central portion or is not photographed large. The evaluation value of the composition of a still image analyzed as good can then be calculated to be higher than that of a still image not analyzed as good.
  • The composition analysis unit 24 can also define in advance a movement pattern of the person of interest, for example, a pattern in which the person of interest moves from the left end to the right end of the moving image, detect, from the motion trajectory detected by the motion trajectory detection unit 18, a portion in which the person of interest is moving with the predefined movement pattern, analyze the composition of the still images corresponding to that portion as good, and calculate the evaluation value of the composition of the still images analyzed as good to be higher than that of the still images not analyzed as good.
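One hypothetical encoding of such a predefined movement pattern, here "moves from the left end to the right end", as a test on the x-coordinates of the detected trajectory (the margin and the 80% monotonicity tolerance are invented parameters):

```python
def moves_left_to_right(xs, width, margin=0.1):
    """Detect the pattern 'person moves from the left end to the right
    end of the moving image': the trajectory's x-coordinates must start
    within `margin` of the left edge, end within `margin` of the right
    edge, and be mostly increasing."""
    if not xs:
        return False
    starts_left = xs[0] <= margin * width
    ends_right = xs[-1] >= (1 - margin) * width
    increasing = sum(b >= a for a, b in zip(xs, xs[1:])) >= 0.8 * (len(xs) - 1)
    return starts_left and ends_right and increasing
```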
  • The image quality determination unit 26 determines the image quality of each of the plurality of still images based on the region of the person of interest in the still image, for example, a region of interest such as the face region, and calculates the evaluation value of image quality for each of the plurality of still images based on the determined image quality.
  • Depending on the compression method of the moving image data, a still image extracted from the moving image may or may not have high image quality.
  • A still image may also suffer from blur due to defocus or camera shake, or its luminance, color tone, contrast, or the like may not be appropriate.
  • If no such problems occur in the region of the person of interest, the image quality determination unit 26 determines that the image quality of the still image is good. For still images determined to have good image quality, the evaluation value of image quality can be calculated to be higher as the image quality is better.
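As one concrete, but not patent-specified, way to judge blur in the evaluated region, the variance of a discrete Laplacian can serve as a sharpness measure; the box filter below exists only to fabricate a blurred comparison image:

```python
import numpy as np

def sharpness(gray):
    """Variance of a 4-neighbour discrete Laplacian: blurred regions
    have weak second derivatives, so a low value suggests defocus or
    camera shake in the evaluated region."""
    g = gray.astype(float)
    lap = (-4 * g[1:-1, 1:-1] + g[:-2, 1:-1] + g[2:, 1:-1]
           + g[1:-1, :-2] + g[1:-1, 2:])
    return float(lap.var())

def box_blur(gray):
    """3x3 box filter, used here only to simulate a blurry frame."""
    g = gray.astype(float)
    return (g[:-2, :-2] + g[:-2, 1:-1] + g[:-2, 2:]
            + g[1:-1, :-2] + g[1:-1, 1:-1] + g[1:-1, 2:]
            + g[2:, :-2] + g[2:, 1:-1] + g[2:, 2:]) / 9.0
```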
  • The still image data output unit 28 outputs, from the still image data of the plurality of frames extracted from the moving image data by the still image data extraction unit 14, still image data of a still image corresponding to the best-shot scene, that is, a still image whose evaluation value for the motion of the person of interest, or whose overall evaluation value combining the evaluation value for the motion of the person of interest with at least one of the evaluation value of importance, the evaluation value of composition, and the evaluation value of image quality, is equal to or greater than a threshold.
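A sketch of combining the evaluation values into an overall value and thresholding it; the weights and the threshold are placeholders, as the patent does not fix how the values are combined:

```python
def select_best_shots(frames, weights=(1.0, 0.5, 0.5, 0.5), threshold=2.0):
    """Each entry of `frames` is a tuple of (motion, importance,
    composition, image-quality) evaluation values for one still image.
    Return the indices of the still images whose weighted overall
    evaluation value reaches the threshold."""
    selected = []
    for i, (motion, importance, composition, quality) in enumerate(frames):
        overall = (weights[0] * motion + weights[1] * importance
                   + weights[2] * composition + weights[3] * quality)
        if overall >= threshold:
            selected.append(i)
    return selected
```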
  • Based on the orientation of the face of the person of interest detected by the person-of-interest detection unit 16, the top-bottom correction unit 30 corrects the top and bottom of the still image corresponding to the still image data output from the still image data output unit 28 so as to match the top and bottom of the photographing apparatus at the time the moving image was captured.
  • FIG. 3A is a conceptual diagram of an example showing a still image rotated 90° to the left.
  • Such a still image is obtained when the photographing apparatus is rotated 90° to the right while capturing the moving image.
  • In this case, the top-bottom correction unit 30 rotates the still image shown in FIG. 3A by 90° to the right so that the top and bottom of the still image match the top and bottom of the photographing apparatus at the time the moving image was captured; as shown in FIG. 3B, the top and bottom of the still image can thus be corrected.
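This correction amounts to rotating the still image by a multiple of 90°. The sketch below assumes a hypothetical `face_orientation` angle, in degrees counter-clockwise from upright and restricted to multiples of 90°, derived from the detected face orientation; that representation is an illustration, not the patent's:

```python
import numpy as np

def correct_top_bottom(still, face_orientation):
    """Rotate a still image so its top and bottom match those of the
    photographing apparatus.  A frame captured with the camera rotated
    90° to the right appears rotated 90° to the left (faces oriented
    at 90°) and is fixed by rotating 90° to the right (clockwise)."""
    k = (face_orientation // 90) % 4  # number of 90° steps to undo
    return np.rot90(still, k=-k)      # negative k rotates clockwise
```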
  • When two or more persons of interest are registered, the person-of-interest detection unit 16 can detect the persons of interest from the plurality of still images and sequentially identify each detected person of interest. In this case, the motion trajectory detection unit 18, the motion analysis unit 20, the importance determination unit 22, the composition analysis unit 24, the image quality determination unit 26, the still image data output unit 28, and the top-bottom correction unit 30 sequentially perform their processing on each of the persons of interest.
  • First, a person designated by the user is registered as the person of interest by the person-of-interest registration unit 12 (step S1).
  • Subsequently, the still image data extraction unit 14 extracts, for example, still image data of all frames from the moving image data (step S2). That is, as shown in FIG. 5, still images of all frames are extracted from the moving image.
  • Subsequently, the person-of-interest detection unit 16 detects the person of interest registered in the person-of-interest registration unit 12 from each of the still images of all the frames extracted by the still image data extraction unit 14 (step S3).
  • As a result, the person of interest is identified in each of the still images of all the frames, and, as indicated by the frames in FIG. 6, the position of the person of interest, the size of the person of interest, the region of the person of interest, and the like are detected in each of those still images.
  • Subsequently, the motion trajectory detection unit 18 tracks the movement of the person of interest in the moving image, for example the movement of the region of interest indicated by the frames in FIG. 6, and thereby detects the motion trajectory of the person of interest (step S4). As a result, a motion trajectory of the person of interest, representing as a line the path of movement of a region of interest such as the face region, as shown on the left of FIGS. 2(A) to 2(C), and a motion history image, as shown on the right of FIGS. 2(A) to 2(C), can be obtained.
  • Subsequently, the motion analysis unit 20 analyzes the motion of the person of interest in the moving image based on the motion trajectory detected by the motion trajectory detection unit 18. Then, for each of the still images of all the frames, an evaluation value for the motion of the person of interest is calculated based on the analyzed motion (step S5-1).
  • The importance determination unit 22 determines the importance of each of the still images of all the frames based on the length of the motion trajectory of the person of interest, the position of the person of interest in the still image, and the size of the person of interest. Then, for each of those still images, an evaluation value of importance is calculated based on the determined importance (step S5-2).
  • The composition analysis unit 24 analyzes the quality of the composition of each of the still images of all the frames based on the position of the person of interest in the still image, the size of the person of interest, and the movement pattern of the person of interest. Then, for each of those still images, an evaluation value of the composition is calculated based on the analyzed quality of the composition (step S5-3).
  • The image quality determination unit 26 determines the image quality of each of the still images of all the frames based on the region of the person of interest in the still image. Then, for each of those still images, an evaluation value of the image quality is calculated according to the determined image quality, which in the present embodiment is the degree of blurring (step S5-4). For example, blurring of the region of interest indicated by the frames in FIG. 6 is determined, and the evaluation value of the image quality is calculated to be lower as the degree of blurring becomes larger.
  • The order in which the evaluation value for the motion of the person of interest, the evaluation value of importance, the evaluation value of the composition, and the evaluation value of the image quality are calculated is not limited at all; they can be calculated in any order, or in parallel, that is, simultaneously.
  • Subsequently, the still image data output unit 28 outputs, from among the still image data of all the frames, the still image data of at least one still image whose comprehensive evaluation value is equal to or greater than a threshold (step S6).
  • FIG. 7 is a graph of an example showing the comprehensive evaluation value of each of the still images of all the frames extracted from the moving image.
  • the vertical axis of the figure represents the comprehensive evaluation value of each still image, and the horizontal axis represents time (frame).
  • After the person of interest is detected by the person-of-interest detection unit 16 and the motion trajectory of the person of interest is detected by the motion trajectory detection unit 18, still image data of the still images whose comprehensive evaluation value is equal to or greater than the threshold is output.
  • Finally, the top-bottom correction unit 30 corrects the top and bottom of the output still image so that they become the same as the top and bottom of the imaging device when the moving image was captured (step S7).
  • In this way, based on the comprehensive evaluation value that combines, for example, the evaluation value for the motion of the person of interest in the moving image with the evaluation values of the importance, composition, and image quality of the still image, the image processing apparatus 10 can automatically detect the best-shot scene in the moving image and extract the still image data of the still image corresponding to that scene from the still image data of all the frames extracted from the moving image data.
  • each component of the device may be configured by dedicated hardware, or each component may be configured by a programmed computer.
  • the method of the present invention can be implemented, for example, by a program for causing a computer to execute each of the steps. It is also possible to provide a computer readable recording medium in which the program is recorded.
  • the present invention is basically as described above.
  • the present invention has been described above in detail, but the present invention is not limited to the above embodiment, and it goes without saying that various improvements and changes may be made without departing from the spirit of the present invention.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Image Analysis (AREA)
  • Television Signal Processing For Recording (AREA)
  • Studio Devices (AREA)

Abstract

 In this image-processing device, a person-of-interest-detecting unit detects a person of interest from each of a plurality of still images corresponding to still image data on a plurality of frames extracted from moving image data. A motion-trajectory-detecting unit detects a motion trajectory by tracking the movement of the person of interest in the moving images on the basis of the detection results for the person of interest. An action-analyzing unit analyzes the actions of the person of interest in the moving images on the basis of the motion trajectory and, for each of the plurality of still images, calculates evaluation values for the actions of the person of interest on the basis of those actions. In addition, a still-image-data-outputting unit outputs, from the still image data on the plurality of frames, still image data on still images that have an evaluation value for the actions of the person of interest at or above a threshold value.

Description

IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, PROGRAM, AND RECORDING MEDIUM
 The present invention relates to an image processing apparatus, an image processing method, a program, and a recording medium for extracting still image data from moving image data and outputting it.
 In recent years, many moving images have been captured even in ordinary households. A captured moving image may contain a best-shot scene that is difficult to capture as a still photograph, such as the moment a child blows out the candles on a birthday cake, that is, a scene that well represents the motion of a person captured in the moving image. On the other hand, moving images may also contain portions with little movement of persons, portions of low importance, portions with poor composition, portions with poor image quality, and the like.
 Therefore, finding the best-shot scene in a moving image and extracting it as a still image takes considerable time and effort.
 Here, Patent Documents 1 and 2 are prior art documents relevant to the present invention.
 Patent Document 1 relates to a person action search device that enables moving image data to be quickly played back from the recorded position at which a person is recorded. It describes that, when a person is recognized in a captured image, a representative image of the recognized person is extracted, and a bookmark image is created by superimposing on the representative image a tracking line, that is, the movement trajectory of the virtual center of gravity of the person image from when it appears in the captured image until it disappears from it.
 Patent Document 2 relates to a technique for extracting, from moving image data, representative frames that concisely represent the video contained in the moving image data. It describes extracting one or more representative frames that well represent the content (video) of a predetermined time section of the moving image data, and also describes extracting, as the representative frame image, the frame image for which the evaluation value output by a face state determination unit is largest.
Patent Document 1: JP 2009-75802 A
Patent Document 2: JP 2010-109592 A
 An object of the present invention is to solve the problems of the prior art and to provide an image processing apparatus, an image processing method, a program, and a recording medium capable of automatically extracting, from moving image data, still image data of a still image corresponding to a best-shot scene and outputting it.
 In order to achieve the above object, the present invention provides an image processing apparatus comprising: a still image data extraction unit that extracts still image data of a plurality of frames from moving image data; a person-of-interest detection unit that detects a person of interest, who is a person to be processed, from each of a plurality of still images corresponding to the still image data of the plurality of frames; a motion trajectory detection unit that tracks the movement of the person of interest in the moving image corresponding to the moving image data and detects the motion trajectory of the person of interest, based on the detection results for the person of interest in the plurality of still images; a motion analysis unit that analyzes the motion of the person of interest in the moving image based on the motion trajectory of the person of interest and, for each of the plurality of still images, calculates an evaluation value for the motion of the person of interest based on the analyzed motion; and a still image data output unit that outputs, from among the still image data of the plurality of frames, still image data of a still image whose evaluation value for the motion of the person of interest is equal to or greater than a threshold.
 Preferably, the apparatus further comprises a person registration unit that registers, as a registered person, a person to be processed among the persons captured in the moving image, and the person-of-interest detection unit detects, as the person of interest, a person who matches the registered person or whose similarity to the registered person is equal to or greater than a threshold, from each of the plurality of still images.
 Preferably, the person-of-interest detection unit extracts the faces of persons from each of the plurality of still images, performs a central-person determination on the face images of the extracted faces, and detects, as the person of interest, a person determined to be a central person among the persons whose faces were extracted.
 Preferably, the person-of-interest detection unit further detects the face region of the person of interest in the still image, and the motion trajectory detection unit compares, based on the face region, the face region of the person of interest in the still image of the current frame with detection regions at arbitrary positions, each corresponding to that face region, in the still image of the frame following the current frame, and tracks the movement of the person of interest in the moving image by detecting, based on the position of the detection region in the still image of the next frame whose similarity to the face region in the still image of the current frame is equal to or greater than a threshold, to which detection-region position in the still image of the next frame the face region of the person of interest has moved.
 Preferably, the motion trajectory detection unit tracks the movement of the person of interest not only for the face region but also for each of a fixed number of regions into which the upper-body region of the person of interest is divided.
 Preferably, the motion trajectory detection unit generates an integral image of the still image of the next frame and, using the generated integral image, sequentially repeats, for detection regions at a plurality of positions, the calculation of the sum of the luminance values of all the pixels included in a detection region at an arbitrary position in the still image of the next frame.
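As an illustration of this integral-image computation, the sum of luminance values over any rectangular detection region reduces to four table lookups once the cumulative image has been built. Below is a minimal sketch in Python with NumPy; the function names are ours, not taken from the embodiment:

```python
import numpy as np

def integral_image(gray):
    """Cumulative sum over rows and columns: ii[y, x] holds the sum of
    all pixels above and to the left of (y, x), inclusive."""
    return gray.cumsum(axis=0).cumsum(axis=1)

def region_sum(ii, top, left, height, width):
    """Sum of luminance values inside a detection region, obtained from
    four corner lookups on the integral image."""
    total = ii[top + height - 1, left + width - 1]
    if top > 0:
        total -= ii[top - 1, left + width - 1]
    if left > 0:
        total -= ii[top + height - 1, left - 1]
    if top > 0 and left > 0:
        total += ii[top - 1, left - 1]
    return total
```

With this, sliding the detection region over many candidate positions in the next frame costs four lookups per position instead of re-summing the whole window each time.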
 Preferably, the motion trajectory detection unit tracks the movement of the person of interest using the mean shift (average displacement) method.
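The average displacement (mean shift) tracking referred to here can be sketched as iteratively moving a search window to the weighted centroid of the samples under it until the window stops moving. The following minimal NumPy sketch assumes a precomputed per-pixel weight map (for example, similarity of each pixel to the tracked face region); all names and parameters are illustrative:

```python
import numpy as np

def mean_shift(weights, cx, cy, radius, iters=20, eps=0.5):
    """Iteratively move the window center (cx, cy) to the weighted
    centroid of the pixels within `radius`, climbing toward the peak
    of the weight map."""
    h, w = weights.shape
    ys, xs = np.mgrid[0:h, 0:w]
    for _ in range(iters):
        mask = (xs - cx) ** 2 + (ys - cy) ** 2 <= radius ** 2
        total = weights[mask].sum()
        if total == 0:
            break
        nx = (weights[mask] * xs[mask]).sum() / total
        ny = (weights[mask] * ys[mask]).sum() / total
        moved = abs(nx - cx) >= eps or abs(ny - cy) >= eps
        cx, cy = nx, ny
        if not moved:
            break
    return cx, cy
```

Starting the window at the face position found in the current frame and running this on the next frame's weight map yields the displaced face position, from which the trajectory point for that frame follows.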
 Preferably, the motion analysis unit defines in advance motion trajectories corresponding to motions of the person of interest, analyzes the motion of the person of interest by detecting, in the motion trajectory detected by the motion trajectory detection unit, portions similar to the predefined trajectories, and calculates the evaluation value for the motion of the person of interest according to the type of the motion.
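Matching a detected trajectory against a predefined one can be sketched as sliding the template along the trajectory and scoring the mean point-to-point distance after aligning starting points; the function and threshold below are illustrative assumptions, not the embodiment's actual matching method:

```python
import numpy as np

def find_similar_motion(trajectory, template, max_dist=0.5):
    """Return start indices in `trajectory` (list of (x, y) points)
    where a segment matches the predefined `template` trajectory.
    Both segments are translated so their first point is at the
    origin, so only the shape of the motion is compared."""
    t = np.asarray(template, dtype=float)
    t = t - t[0]
    traj = np.asarray(trajectory, dtype=float)
    n = len(t)
    hits = []
    for i in range(len(traj) - n + 1):
        seg = traj[i:i + n] - traj[i]
        if np.linalg.norm(seg - t, axis=1).mean() <= max_dist:
            hits.append(i)
    return hits
```

Each matched portion can then be assigned the evaluation value associated with the motion type that its template represents.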
 Preferably, the motion analysis unit analyzes the motion of the person of interest based on a motion history image of the person of interest as the motion trajectory, and calculates the evaluation value for the motion of the person of interest.
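A motion history image of the kind referred to here stamps the current timestamp wherever inter-frame motion is detected and discards stamps older than a fixed duration, so that the most recent movement appears brightest. A minimal sketch under our own naming assumptions:

```python
import numpy as np

def update_mhi(mhi, prev_frame, frame, timestamp, duration=10, thresh=15):
    """Stamp `timestamp` into the motion history image wherever the
    frame difference exceeds `thresh`, and erase stamps older than
    `duration` time units."""
    motion = np.abs(frame.astype(int) - prev_frame.astype(int)) > thresh
    mhi = mhi.copy()
    mhi[motion] = timestamp
    mhi[mhi < timestamp - duration] = 0
    return mhi
```

Called once per frame pair, this accumulates a grayscale image in which the gradient of intensities encodes the direction and recency of the person's movement.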
 Preferably, the person-of-interest detection unit further detects the position of the person of interest in the still image, the size of the person of interest in the still image, and the region of the person of interest in the still image; the motion trajectory detection unit further detects the length of the motion trajectory of the person of interest and the movement pattern of the person of interest; and the apparatus further comprises: an importance determination unit that determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the person of interest, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculates, for each of the plurality of still images, an evaluation value of importance based on the determined importance; a composition analysis unit that analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculates, for each of the plurality of still images, an evaluation value of the composition based on the analyzed quality; and an image quality determination unit that determines the image quality of each of the plurality of still images based on the region of the person of interest in the still image, and calculates, for each of the plurality of still images, an evaluation value of the image quality based on the determined image quality; wherein the still image data output unit outputs, from among the plurality of still images, still image data of one or more still images whose comprehensive evaluation value, combining the evaluation value for the motion of the person of interest with at least one of the evaluation value of importance, the evaluation value of the composition, and the evaluation value of the image quality, is equal to or greater than a threshold.
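The selection performed by the still image data output unit can be sketched as comparing a weighted comprehensive score per frame against a threshold; the particular weights and threshold below are illustrative assumptions only:

```python
def best_shot_frames(scores, weights=None, threshold=0.7):
    """Return indices of frames whose comprehensive evaluation value
    (a weighted sum of per-criterion scores) is at or above `threshold`.

    `scores` is a list of dicts with keys such as 'motion',
    'importance', 'composition', and 'quality' (each in 0.0-1.0)."""
    weights = weights or {'motion': 0.4, 'importance': 0.2,
                          'composition': 0.2, 'quality': 0.2}
    picked = []
    for i, s in enumerate(scores):
        total = sum(weights[k] * s.get(k, 0.0) for k in weights)
        if total >= threshold:
            picked.append(i)
    return picked
```

Dropping every key except 'motion' from `weights` reproduces the simpler variant in which only the motion evaluation value is compared with the threshold.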
 Preferably, the composition analysis unit defines movement patterns of the person of interest in advance, detects, in the motion trajectory detected by the motion trajectory detection unit, portions where the person of interest moves in a predefined movement pattern, analyzes the composition of the still images corresponding to those portions as good, and calculates the evaluation value of the composition of a still image analyzed as good to be higher than that of a still image not analyzed as good.
 Preferably, the person-of-interest detection unit further detects the orientation of the face of the person of interest in the still image, and the apparatus further comprises a top-bottom correction unit that, based on the detected orientation of the face, corrects the top and bottom of the still image corresponding to the still image data output from the still image data output unit so that they become the same as the top and bottom of the imaging device when the moving image was captured.
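The top-bottom correction can be sketched as rotating the output still image by the multiple of 90 degrees that brings the detected face orientation upright; the direction encoding below is an illustrative assumption:

```python
import numpy as np

def correct_orientation(image, face_up_direction):
    """Rotate the still image so that its top matches the device's top.

    `face_up_direction` is the detected 'up' of the face within the
    image: one of 'up', 'left', 'down', 'right'. A face whose top points
    'left' means the image appears rotated 90 deg to the left, so it is
    rotated back 90 deg to the right (np.rot90 with k=-1 is clockwise)."""
    turns = {'up': 0, 'left': -1, 'down': 2, 'right': 1}[face_up_direction]
    return np.rot90(image, k=turns)
```

This mirrors the FIG. 3 example: an image rotated 90 degrees to the left is corrected by a 90-degree rotation to the right.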
 The present invention also provides an image processing method comprising the steps of: a still image data extraction unit extracting still image data of a plurality of frames from moving image data; a person-of-interest detection unit detecting a person of interest, who is a person to be processed, from each of a plurality of still images corresponding to the still image data of the plurality of frames; a motion trajectory detection unit tracking the movement of the person of interest in the moving image corresponding to the moving image data and detecting the motion trajectory of the person of interest, based on the detection results for the person of interest in the plurality of still images; a motion analysis unit analyzing the motion of the person of interest in the moving image based on the motion trajectory and calculating, for each of the plurality of still images, an evaluation value for the motion of the person of interest based on the analyzed motion; and a still image data output unit outputting, from among the still image data of the plurality of frames, still image data of a still image whose evaluation value for the motion of the person of interest is equal to or greater than a threshold.
 Preferably, the method further comprises the steps of: the person-of-interest detection unit detecting the position of the person of interest in the still image, the size of the person of interest in the still image, and the region of the person of interest in the still image; the motion trajectory detection unit detecting the length of the motion trajectory of the person of interest and the movement pattern of the person of interest; an importance determination unit determining the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the person of interest, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculating, for each of the plurality of still images, an evaluation value of importance based on the determined importance; a composition analysis unit analyzing the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculating, for each of the plurality of still images, an evaluation value of the composition based on the analyzed quality; an image quality determination unit determining the image quality of each of the plurality of still images based on the region of the person of interest in the still image, and calculating, for each of the plurality of still images, an evaluation value of the image quality based on the determined image quality; and the still image data output unit outputting, from among the plurality of still images, still image data of one or more still images whose comprehensive evaluation value, combining the evaluation value for the motion of the person of interest with at least one of the evaluation value of importance, the evaluation value of the composition, and the evaluation value of the image quality, is equal to or greater than a threshold.
 Preferably, the method further comprises the steps of: the person-of-interest detection unit detecting the orientation of the face of the person of interest in the still image; and a top-bottom correction unit correcting, based on the detected orientation of the face, the top and bottom of the still image corresponding to the still image data output from the still image data output unit so that they become the same as the top and bottom of the imaging device when the moving image was captured.
 The present invention also provides a program for causing a computer to execute each of the steps of the image processing method described above.
 The present invention also provides a computer-readable recording medium on which is recorded a program for causing a computer to execute each of the steps of the image processing method described above.
 According to the present invention, based on the evaluation value for the motion of the person of interest in the moving image, or on a comprehensive evaluation value combining that evaluation value with at least one of the evaluation value of the importance of the still image, the evaluation value of the composition, and the evaluation value of the image quality, the best-shot scene can be automatically detected in the moving image, and still image data of the still image corresponding to the best-shot scene can be output from among the still image data of the plurality of frames extracted from the moving image data.
FIG. 1 is a block diagram of an embodiment showing the configuration of the image processing apparatus of the present invention.
In FIGS. 2(A) to 2(C), the left side is a conceptual diagram of an example showing the motion trajectory of the person of interest, and the right side is a conceptual diagram of an example showing the motion history image of the person of interest.
FIG. 3(A) is a conceptual diagram of an example showing a still image rotated 90 degrees to the left, and FIG. 3(B) is a conceptual diagram of an example showing the still image of FIG. 3(A) rotated 90 degrees to the right so that its top and bottom are corrected.
FIG. 4 is a flowchart of an example showing the operation of the image processing apparatus shown in FIG. 1.
FIG. 5 is a conceptual diagram of an example showing still images of all frames extracted from a moving image.
FIG. 6 is a conceptual diagram of an example showing, enclosed in frames, the regions of persons detected from each of the still images of all the frames shown in FIG. 5.
FIG. 7 is a graph of an example showing the comprehensive evaluation value of each of the still images of all the frames extracted from the moving image.
FIG. 8 is a conceptual diagram of an example showing star marks given to the still images, among the still images of all the frames shown in FIG. 5, whose comprehensive evaluation is equal to or greater than a threshold.
 Hereinafter, the image processing apparatus, image processing method, program, and recording medium of the present invention will be described in detail based on the preferred embodiments shown in the accompanying drawings.
 FIG. 1 is a block diagram of an embodiment showing the configuration of the image processing apparatus of the present invention. The image processing apparatus 10 shown in the figure automatically detects a best-shot scene in a moving image and outputs still image data of the still image corresponding to the best-shot scene, and comprises a person-of-interest registration unit 12, a still image data extraction unit 14, a person-of-interest detection unit 16, a motion trajectory detection unit 18, a motion analysis unit 20, an importance determination unit 22, a composition analysis unit 24, an image quality determination unit 26, a still image data output unit 28, and a top-bottom correction unit 30.
 The person-of-interest registration unit 12 registers, as a registered person, the person of interest to be processed among the persons captured in the moving image corresponding to the moving image data.
 The person-of-interest registration unit 12 can, for example, register a person designated by the user among the persons captured in the moving image as the registered person. The person-of-interest registration unit 12 can also store images of the registered person (such as face images for identifying the person of interest).
 Subsequently, the still image data extraction unit 14 extracts still image data of a plurality of frames from the moving image data.
 The still image data extraction unit 14 can, for example, extract still image data of all the frames (each frame) of the moving image data. However, the present invention is not limited to this, and still image data may be extracted every fixed number of frames, for example one frame out of every two frames. Alternatively, only still image data of the frames in an arbitrary section of the moving image corresponding to the moving image data may be extracted.
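The sampling choices described here (all frames, every fixed number of frames, or only an arbitrary section of the moving image) can be sketched as frame-index selection; the function name and parameters are illustrative:

```python
def frame_indices(total_frames, stride=1, start=0, end=None):
    """Indices of the frames whose still image data should be
    extracted: every `stride`-th frame, optionally restricted to
    the section [start, end) of the moving image."""
    end = total_frames if end is None else min(end, total_frames)
    return list(range(start, end, stride))
```

With `stride=1` every frame is extracted; `stride=2` takes one frame out of every two; `start` and `end` restrict extraction to a section of the moving image.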
 Subsequently, the person-of-interest detection unit 16 detects the person of interest, who is the person to be processed, from each of the plurality of still images corresponding to the still image data of the plurality of frames extracted from the moving image data by the still image data extraction unit 14.
 The person-of-interest detection unit 16 can, for example, detect the presence or absence of persons in each of the plurality of still images and compare the images of the detected persons with, for example, the images of the registered person registered in the person-of-interest registration unit 12 (for example, by comparing face images), thereby identifying, as the person of interest, a person who matches or is similar to the registered person (a person whose similarity is equal to or greater than a threshold) among the detected persons.
Alternatively, the person-of-interest detection unit 16 can extract the faces of persons from each of the plurality of still images and perform a central-person determination on the face images of the extracted faces, identifying as the person of interest the person determined by the central-person determination to be the central person.
In the central-person determination, for example, a same-person determination process is performed on the plurality of face images, and the face images are classified into image groups each consisting of face images of the same person. One or more of the persons classified into the image groups is then determined to be the main character, and, among the persons other than the main character, one or more persons highly relevant to the main character are determined to be important persons. In addition, the person corresponding to each image group can be identified based on the face images of the registered persons stored in the person-of-interest registration unit 12.
For example, the person whose face is detected most often can be determined to be the main character, and a person other than the main character who appears in many still images together with the main character can be determined to be an important person. Alternatively, the distance between the face image of the main character and the face image of another person captured in the same still image may be calculated, and a person whose face is close to the main character's face may be determined to be an important person. An important person may also be determined based on one or both of the difference in shooting date and time, and the difference in shooting position, between still images in which the main character appears and still images in which a person other than the main character appears.
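The first of the heuristics above (detection counts and co-occurrence counts) can be sketched as follows; the person IDs and data layout are hypothetical, introduced only for illustration:

```python
from collections import Counter

def pick_main_and_important(appearances):
    """appearances: list of sets, one per still image, each holding the
    (hypothetical) IDs of the persons whose faces were detected in that
    image.  The most frequently detected person becomes the main
    character; the person most often co-photographed with the main
    character becomes the important person."""
    counts = Counter(pid for shot in appearances for pid in shot)
    main = counts.most_common(1)[0][0]
    # Count co-appearances with the main character only.
    co = Counter(pid for shot in appearances if main in shot
                 for pid in shot if pid != main)
    important = co.most_common(1)[0][0] if co else None
    return main, important
```

The distance-based and date/position-based variants would replace the co-appearance counter with a distance or timestamp comparison, but follow the same overall shape.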
The person-of-interest detection unit 16 can also detect, in each still image, the position of the person of interest, the size of the person of interest, the region of the person of interest, the region of the person's upper body, and the position, size, region, and orientation of the person's face.
Since methods for detecting a person, and a person's face, in a still image are well known, a detailed description is omitted here; the specific detection method is not limited in any way.
Next, the motion locus detection unit 18 tracks the movement of the person of interest in the moving image corresponding to the moving image data, based on the detection results of the person-of-interest detection unit 16 for the plurality of still images, and detects the motion locus of the person of interest. By detecting the motion locus, the motion locus detection unit 18 can also determine the length of the motion locus, the movement pattern of the person of interest, and the like.
Here, the motion locus of the person of interest can be represented as a line tracing the movement of a region of interest (ROI), for example the person's face region, as shown on the left side of FIGS. 2(A) to 2(C). Alternatively, a motion history image (MHI), as shown on the right side of FIGS. 2(A) to 2(C), may be used as the motion locus. A motion history image represents the history of the person's movement, for example by changing color at fixed time intervals. From a motion history image, the position of the person of interest, the size of the person of interest, the moving parts of the person of interest, the moving direction of the person of interest, and the like can be determined.
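A motion history image of the kind described above can be built from simple frame differencing: pixels that change between consecutive frames are stamped with the current timestamp, so recent motion is bright and older motion progressively darker. This is a minimal NumPy sketch under assumed parameters (difference threshold, history duration), not the patent's implementation:

```python
import numpy as np

def motion_history_image(frames, duration=5, diff_thresh=30):
    """Build a simple motion-history image from a list of grayscale
    frames (uint8 2-D arrays of equal shape)."""
    mhi = np.zeros(frames[0].shape, dtype=np.float32)
    for t in range(1, len(frames)):
        # Pixels whose intensity changed enough since the last frame.
        moving = np.abs(frames[t].astype(np.int16) -
                        frames[t - 1].astype(np.int16)) > diff_thresh
        mhi[moving] = t                             # stamp the timestamp
        mhi[~moving & (mhi < t - duration)] = 0     # forget old motion
    return mhi
```

Reading the stamped values back gives where and roughly when the subject moved, which is the information the motion analysis unit 20 later exploits.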
The motion locus detection unit 18 can track the movement of the person of interest in the moving image based on, for example, the person's face region: the face region in the still image of the current frame is compared with detection regions, corresponding to the face region, at arbitrary positions in the still image of the next frame, and the position of the detection region in the next frame whose similarity to the current frame's face region is equal to or greater than a threshold reveals to which detection-region position the face region has moved.
Detecting only the face region of the person of interest may make the person's movement difficult to track, because the position and size of the person in the still images change over time. In that case, in addition to the face region, the region of the person's upper body can be divided into a fixed number of regions, for example four, and the movement of each of the resulting five regions tracked in the same way, improving the tracking success rate.
When calculating the similarity between the face region of the person of interest in the current frame and a detection region in the next frame, the sum of the luminance values of all the pixels within a detection region at an arbitrary position must be computed, and this computation must be repeated in turn for detection regions at multiple positions in order to find the detection region corresponding to the face region. The amount of computation required to sum the luminance values for every frame is therefore enormous.
In this case, generating an integral image of the still image of the next frame (that is, of each frame) and using the integral image to compute the luminance sums reduces the amount of computation and speeds up the processing. In an integral image, assuming pixel coordinates increase from left to right and from top to bottom of the still image, each pixel holds the sum of the luminance values of all the pixels in the rectangle from the top-left pixel to that pixel.
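The integral image (summed-area table) described above reduces each rectangular luminance sum to four table lookups regardless of the region's size. A minimal sketch:

```python
import numpy as np

def integral_image(img):
    """Summed-area table: entry (y, x) holds the sum of all pixel
    values in the rectangle from (0, 0) through (y, x) inclusive."""
    return img.astype(np.int64).cumsum(axis=0).cumsum(axis=1)

def region_sum(ii, top, left, bottom, right):
    """Sum of img[top:bottom+1, left:right+1] in O(1) via four lookups."""
    total = ii[bottom, right]
    if top > 0:
        total -= ii[top - 1, right]
    if left > 0:
        total -= ii[bottom, left - 1]
    if top > 0 and left > 0:
        total += ii[top - 1, left - 1]
    return int(total)
```

Building the table costs one pass over the frame; afterwards, every candidate detection region's luminance sum is constant-time, which is exactly why the per-frame comparison against many region positions becomes tractable.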
Since the method of using an integral image to compute the sum of the luminance values of all the pixels within a region corresponding to the face region of the person of interest is well known, its detailed description is omitted here. Moreover, the use of integral images is not required for reducing computation or speeding up processing when tracking the person of interest; various other methods, such as the mean shift method, can be used. Since the mean shift method is also well known, its detailed description is likewise omitted.
Next, the motion analysis unit 20 analyzes the motion of the person of interest in the moving image based on the motion locus detected by the motion locus detection unit 18, for example the motion locus of a region of interest such as the face region, and calculates, for each of the plurality of still images, an evaluation value for the motion of the person of interest based on the analyzed motion.
For example, the motion analysis unit 20 can define in advance a motion locus corresponding to a particular action, such as the locus produced when the person of interest is running, and analyze the person's motion by detecting portions of the motion locus detected by the motion locus detection unit 18 that resemble the predefined locus. An evaluation value for the motion of the person of interest can then be calculated according to the type of action, for example assigning a particular evaluation value when the action is running.
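The template-matching idea above can be sketched with a simple point-to-point distance between a detected trajectory and predefined action templates. The distance measure, the resampling assumption, and the threshold are all illustrative, not details from the patent:

```python
import numpy as np

def trajectory_similarity(track, template):
    """Mean point-to-point distance between a detected trajectory and a
    predefined template of the same length (both (N, 2) arrays of x, y
    positions); smaller means more similar.  Resampling both to a
    common length is assumed to have been done beforehand."""
    track = np.asarray(track, dtype=float)
    template = np.asarray(template, dtype=float)
    return float(np.mean(np.linalg.norm(track - template, axis=1)))

def classify_action(track, templates, max_dist=10.0):
    """Return the name of the closest action template, or None if even
    the best match is farther than max_dist (illustrative threshold)."""
    best, best_d = None, max_dist
    for name, tpl in templates.items():
        d = trajectory_similarity(track, tpl)
        if d <= best_d:
            best, best_d = name, d
    return best
```

The recognized action name would then be mapped to a per-action evaluation value.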
The motion analysis unit 20 can also analyze the motion of the person of interest, and calculate the evaluation value for that motion, based on a motion history image of the kind shown on the right side of FIGS. 2(A) to 2(C).
By analyzing the motion history image, the motion analysis unit 20 can recognize, for example, that the person of interest is running from right to left as shown on the right of FIG. 2(A), that the person is standing still and moving only the right hand as shown on the right of FIG. 2(B), or that the person is picking something up off the ground as shown on the right of FIG. 2(C). The evaluation value for the motion of the person of interest can then be calculated based on whether the person is moving and, if so, at what position and in what direction.
Next, the importance determination unit 22 determines the importance of each of the plurality of still images based on at least one of the length of the motion locus of the person of interest, the position of the person of interest in the still image, and the size of the person of interest in the still image, and calculates an importance evaluation value for each of the plurality of still images based on the determined importance.
For example, when the motion locus of the person of interest is long (when its length is equal to or greater than a threshold), the photographer's interest in that person can be presumed to be high. The importance determination unit 22 therefore determines that still images corresponding to scenes in the moving image in which the person's motion locus is long have high importance. Likewise, still images in which the person of interest appears in the center, or in which the person appears large (with a size equal to or greater than a threshold), are determined to have high importance. The importance evaluation value is calculated so that it increases with the determined importance.
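The three cues above (track length, centrality, subject size) can be combined into a toy score. The thresholds, the 0-to-1 centrality term, and the equal weighting are illustrative assumptions, not values from the patent:

```python
def importance_score(track_length, center_dist, person_area,
                     len_thresh=100, area_thresh=0.1):
    """Toy importance score for one still image.
    track_length: length of the subject's motion locus (pixels).
    center_dist:  normalized distance of the subject from the frame
                  centre (0 = centred, 1 = at the edge).
    person_area:  fraction of the frame occupied by the subject."""
    score = 0.0
    if track_length >= len_thresh:        # long track: photographer interest
        score += 1.0
    score += max(0.0, 1.0 - center_dist)  # higher when near the centre
    if person_area >= area_thresh:        # subject large enough in frame
        score += 1.0
    return score
```

A centred, large subject with a long track scores the maximum 3.0; a small, off-centre subject with little motion scores near 0.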
Next, the composition analysis unit 24 analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the person of interest in the still image, the size of the person of interest in the still image, and the movement pattern of the person of interest, and calculates a composition evaluation value for each of the plurality of still images based on the analyzed quality.
For example, the composition analysis unit 24 judges the composition of a still image in which the person of interest appears in the center, or appears large (with a size equal to or greater than a threshold), to be better than that of a still image in which the person of interest is not centered or does not appear large. The composition evaluation value of a still image judged to have good composition is then calculated to be higher than that of a still image not judged to have good composition.
The composition analysis unit 24 can also define in advance a movement pattern of the person of interest, for example movement from the left edge to the right edge of the moving image, and detect portions of the motion locus detected by the motion locus detection unit 18 in which the person moves according to the predefined pattern. Still images corresponding to such portions are judged to have good composition, and their composition evaluation values are calculated to be higher than those of still images not judged to have good composition.
Next, the image quality determination unit 26 determines the image quality of each of the plurality of still images based on the region of the person of interest in the still image, for example a region of interest such as the face region, and calculates an image quality evaluation value for each of the plurality of still images based on the determined quality.
A still image extracted from a moving image may have good or poor image quality depending on the compression method of the moving image data. A still image may also be blurred due to defocus or camera shake, or its brightness, hue, or contrast may be inappropriate. However, even when the background or other areas are of poor quality, the image quality determination unit 26 judges a still image to have good quality if the region of interest, namely the face region or body region of the person of interest, has good quality. For still images judged to have good quality, the evaluation value is calculated so that better image quality yields a higher value.
Next, the still image data output unit 28 outputs, from among the still image data of the plurality of frames extracted from the moving image data by the still image data extraction unit 14, still image data of still images corresponding to best-shot scenes: still images whose total evaluation value is equal to or greater than a threshold, where the total combines the evaluation value for the motion of the person of interest, either alone or together with at least one of the importance evaluation value, the composition evaluation value, and the image quality evaluation value.
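The thresholding on a combined total can be sketched as follows; the use of a plain (optionally weighted) sum, the score keys, and the threshold value are illustrative assumptions:

```python
def select_best_shots(scores, weights=None, threshold=2.5):
    """scores: list of dicts, one per frame, with per-criterion
    evaluation values ('motion', 'importance', 'composition',
    'quality').  Frames whose weighted sum reaches the threshold are
    returned (by index) as best-shot candidates."""
    weights = weights or {}
    picked = []
    for i, s in enumerate(scores):
        total = sum(weights.get(k, 1.0) * v for k, v in s.items())
        if total >= threshold:
            picked.append(i)
    return picked
```

Dropping all criteria but `'motion'` from the score dicts reproduces the motion-only variant of the output condition.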
Finally, the top-bottom correction unit 30 corrects the orientation of the still images corresponding to the still image data output from the still image data output unit 28, based on the orientation of the face of the person of interest detected by the person-of-interest detection unit 16, so that the top and bottom of each still image match the top and bottom of the imaging device when the moving image was captured.
FIG. 3(A) is a conceptual diagram showing an example of a still image rotated 90° to the left. Such a still image is obtained when the moving image is captured with the imaging device rotated 90° to the right. The top-bottom correction unit 30 rotates the still image shown in FIG. 3(A) by 90° to the right so that its top and bottom match those of the imaging device at the time of capture, yielding the corrected still image shown in FIG. 3(B).
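The correction itself is a lossless multiple-of-90° rotation. A minimal sketch, assuming face detection has already estimated how far the image content is rotated to the left (the angle convention here is an assumption):

```python
import numpy as np

def correct_orientation(img, face_angle_deg):
    """Rotate a still image so its top matches the camera's top.
    face_angle_deg: how far the image content appears rotated to the
    left (counterclockwise), as estimated from the face orientation.
    np.rot90 with k=-1 rotates clockwise, undoing a 90° left rotation."""
    k = {0: 0, 90: -1, 180: 2, 270: 1}[face_angle_deg % 360]
    return np.rot90(img, k)
```

For the FIG. 3 example, a frame captured with the device turned 90° to the right appears rotated 90° to the left, so `face_angle_deg=90` applies the 90° clockwise correction.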
When two or more persons are registered in the person-of-interest registration unit 12, the person-of-interest detection unit 16 detects each of the two or more persons of interest in the plurality of still images and identifies in turn who each detected person is. In this case, the motion locus detection unit 18, the motion analysis unit 20, the importance determination unit 22, the composition analysis unit 24, the image quality determination unit 26, the still image data output unit 28, and the top-bottom correction unit 30 process each of the two or more persons of interest in turn.
Next, the operation of the image processing apparatus 10 shown in FIG. 1 will be described with reference to the flowchart shown in FIG. 4.
As shown in the flowchart of FIG. 4, first, the person-of-interest registration unit 12 registers, as the person of interest, a person captured in the moving image, for example a person designated by the user (step S1).
Next, the still image data extraction unit 14 extracts, for example, still image data of all the frames from the moving image data (step S2). That is, as shown in FIG. 5, still images of all the frames are extracted from the moving image.
The person of interest may instead be registered after the still image data has been extracted from the moving image data.
Next, the person-of-interest detection unit 16 detects the person of interest registered in the person-of-interest registration unit 12 in each of the still images of all the frames extracted by the still image data extraction unit 14 (step S3). The person of interest is thereby identified in each of the still images of all the frames, and, as indicated by the bounding boxes in FIG. 6, the position, size, region, and so on of the person of interest in each still image are detected.
Next, based on the detection results for the person of interest in the still images of all the frames, the motion locus detection unit 18 tracks the movement of the person of interest in the moving image, for example the movement of the region of interest indicated by the bounding boxes in FIG. 6, and detects the motion locus of the person of interest (step S4). This yields, for example, a motion locus represented as a line tracing the movement of a region of interest such as the face region, as shown on the left side of FIGS. 2(A) to 2(C), or a motion history image as shown on the right side of FIGS. 2(A) to 2(C).
Next, the motion analysis unit 20 analyzes the motion of the person of interest in the moving image based on the motion locus detected by the motion locus detection unit 18, and calculates, for each of the still images of all the frames, an evaluation value for the motion of the person of interest based on the analyzed motion (step S5-1).
The importance determination unit 22 determines the importance of each still image based on the length of the motion locus of the person of interest, the position of the person of interest in the still image, and the size of the person of interest, and calculates an importance evaluation value for each of the still images of all the frames based on the determined importance (step S5-2).
The composition analysis unit 24 analyzes the quality of the composition of each still image based on the position of the person of interest in the still image, the size of the person of interest, and the movement pattern of the person of interest, and calculates a composition evaluation value for each of the still images of all the frames based on the analyzed quality (step S5-3).
The image quality determination unit 26 determines the image quality of each of the still images of all the frames based on the region of the person of interest in the still image, and calculates an image quality evaluation value for each still image according to the determined quality, in this embodiment according to the degree of blurring (step S5-4). For example, the blurring of the region of interest indicated by the bounding boxes in FIG. 6 is evaluated, and the image quality evaluation value is calculated so that greater blurring yields a lower value.
The order in which the evaluation value for the motion of the person of interest, the importance evaluation value, the composition evaluation value, and the image quality evaluation value are calculated is not limited in any way; they can be calculated in any order, or in parallel, that is, simultaneously.
Next, as shown in FIG. 7, the still image data output unit 28 outputs, from among the still image data of all the frames extracted from the moving image data by the still image data extraction unit 14, still image data of one or more still images whose total evaluation value (for example, the sum of the evaluation value for the motion of the person of interest, the importance evaluation value, the composition evaluation value, and the image quality evaluation value) is equal to or greater than a threshold, as the still image data corresponding to best-shot scenes (step S6).
Here, FIG. 7 is an example graph of the total evaluation value of each of the still images of all the frames extracted from the moving image; the vertical axis represents the total evaluation value of each still image, and the horizontal axis represents time (frames). As shown in the figure, from among all the still images in which the person-of-interest detection unit 16 detected the person of interest and the motion locus detection unit 18 detected the person's motion locus, still image data of still images whose total evaluation value is equal to or greater than the threshold is output, as indicated by the stars in FIG. 8.
Finally, the top-bottom correction unit 30 corrects the top and bottom of each still image, based on the orientation of the face of the person of interest detected by the person-of-interest detection unit 16, so that they match the top and bottom of the imaging device when the moving image was captured (step S7).
As described above, the image processing apparatus 10 can automatically detect best-shot scenes in a moving image based on, for example, a total evaluation value combining the evaluation value for the motion of the person of interest in the moving image, the importance evaluation value, the composition evaluation value, and the image quality evaluation value of each still image, and can extract the still image data corresponding to those scenes from the still image data of all the frames extracted from the moving image data.
In the apparatus of the present invention, each component may be implemented as dedicated hardware or as a programmed computer. The method of the present invention can be implemented, for example, by a program that causes a computer to execute each of its steps, and a computer-readable recording medium on which the program is recorded can also be provided.
The present invention is basically as described above. Although the present invention has been described in detail, the invention is not limited to the above embodiment, and various improvements and modifications may of course be made without departing from the spirit of the invention.

Claims (17)

  1.  An image processing apparatus comprising:
     a still image data extraction unit that extracts still image data of a plurality of frames from moving image data;
     a noted person detection unit that detects, from each of a plurality of still images corresponding to the still image data of the plurality of frames, a noted person who is a person to be processed;
     a motion trajectory detection unit that tracks the movement of the noted person in the moving image corresponding to the moving image data based on the detection results for the noted person in the plurality of still images, and detects a motion trajectory of the noted person;
     a motion analysis unit that analyzes the motion of the noted person in the moving image based on the motion trajectory of the noted person, and calculates, for each of the plurality of still images, an evaluation value for the motion of the noted person based on the analyzed motion of the noted person; and
     a still image data output unit that outputs, from among the still image data of the plurality of frames, still image data of still images whose evaluation value for the motion of the noted person is equal to or greater than a threshold value.
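For illustration only, the selection step at the end of claim 1 can be sketched as follows. This is not the patented implementation: the frame representation and the motion-scoring callable are hypothetical stand-ins for the output of the motion analysis unit.

```python
# Illustrative sketch: output the still images whose evaluation value for
# the noted person's motion is at or above a threshold (claim 1).
# `evaluate_motion` stands in for the motion analysis unit's scoring.

def select_frames(frames, evaluate_motion, threshold):
    """Return the frames whose motion evaluation value meets the threshold.

    frames          -- sequence of still images extracted from the video
    evaluate_motion -- callable mapping a frame index to an evaluation value
    threshold       -- minimum evaluation value for a frame to be output
    """
    selected = []
    for i, frame in enumerate(frames):
        if evaluate_motion(i) >= threshold:
            selected.append(frame)
    return selected

# Toy usage: the scores stand in for per-frame motion-analysis results.
scores = [0.2, 0.9, 0.4, 0.95, 0.1]
frames = ["f0", "f1", "f2", "f3", "f4"]
best = select_frames(frames, lambda i: scores[i], 0.8)
print(best)  # ['f1', 'f3']
```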
  2.  The image processing apparatus according to claim 1, further comprising a person registration unit that registers, as a registered person, the person to be processed from among the persons photographed in the moving image,
     wherein the noted person detection unit detects, from each of the plurality of still images, a person who matches the registered person, or a person whose degree of similarity to the registered person is equal to or greater than a threshold value, as the noted person.
  3.  The image processing apparatus according to claim 1, wherein the noted person detection unit extracts the faces of persons from each of the plurality of still images, performs central person determination on the face images of the extracted faces, and detects, from among the persons whose faces have been extracted, a person determined to be a central person by the central person determination as the noted person.
  4.  The image processing apparatus according to any one of claims 1 to 3, wherein the noted person detection unit further detects a face region of the noted person in the still image, and
     the motion trajectory detection unit tracks the movement of the noted person in the moving image by comparing, based on the face region of the noted person, the face region of the noted person in the still image of the current frame with detection regions at arbitrary positions corresponding to the face region of the noted person in the still image of the frame following the current frame, and detecting, based on the position of a detection region in the still image of the next frame whose degree of similarity to the face region of the noted person in the still image of the current frame is equal to or greater than a threshold value, to which detection region position in the still image of the next frame the face region of the noted person in the still image of the current frame has moved.
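A minimal sketch of the comparison in claim 4, under toy assumptions: images are 2-D lists of luminance values, and the similarity measure (an inverse mean absolute difference) is illustrative, not specified by the claim.

```python
# Hedged sketch of claim 4's tracking idea: compare the noted person's face
# region in the current frame against detection regions at candidate
# positions in the next frame, and accept the best position whose
# similarity clears a threshold.

def region(img, x, y, w, h):
    """Extract the w-by-h sub-image at (x, y)."""
    return [row[x:x + w] for row in img[y:y + h]]

def similarity(a, b):
    """Inverse mean absolute difference; 1.0 for identical regions."""
    diff = sum(abs(pa - pb) for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return 1.0 / (1.0 + diff / (len(a) * len(a[0])))

def track_face(cur, nxt, face_xywh, candidates, thresh):
    """Return the candidate (x, y) in `nxt` best matching the face region."""
    x, y, w, h = face_xywh
    template = region(cur, x, y, w, h)
    best_pos, best_sim = None, thresh
    for cx, cy in candidates:
        s = similarity(template, region(nxt, cx, cy, w, h))
        if s >= best_sim:
            best_pos, best_sim = (cx, cy), s
    return best_pos

# Toy usage: a bright 2x2 "face" patch shifts one pixel to the right.
cur = [[0] * 6 for _ in range(6)]
nxt = [[0] * 6 for _ in range(6)]
for dy in (0, 1):
    for dx in (0, 1):
        cur[1 + dy][1 + dx] = 9      # patch at (1, 1) in the current frame
        nxt[1 + dy][2 + dx] = 9      # same patch at (2, 1) in the next frame
pos = track_face(cur, nxt, (1, 1, 2, 2), [(1, 1), (2, 1), (3, 3)], 0.5)
print(pos)  # (2, 1)
```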
  5.  The image processing apparatus according to claim 4, wherein the motion trajectory detection unit tracks the movement of the noted person for each of a fixed number of regions into which the upper-body region of the noted person is divided, in addition to the face region of the noted person.
  6.  The image processing apparatus according to claim 4 or 5, wherein the motion trajectory detection unit generates an integral image of the still image of the next frame and, using the generated integral image, sequentially repeats, for the detection regions at a plurality of positions, the calculation of the sum of the luminance values of all the pixels contained in the detection region at an arbitrary position in the still image of the next frame.
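The integral-image technique referenced in claim 6 can be sketched as follows: after one pass to build a summed-area table, the luminance sum inside any rectangular detection region costs only four table lookups, so the computation can be repeated cheaply over many candidate positions.

```python
# Sketch of the integral image (summed-area table) used in claim 6.

def integral_image(img):
    """Build a (h+1) x (w+1) summed-area table for a 2-D luminance image."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        for x in range(w):
            ii[y + 1][x + 1] = (img[y][x] + ii[y][x + 1]
                                + ii[y + 1][x] - ii[y][x])
    return ii

def region_sum(ii, x, y, w, h):
    """Sum of img[y:y+h][x:x+w] in O(1) using four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]

img = [[1, 2, 3],
       [4, 5, 6],
       [7, 8, 9]]
ii = integral_image(img)
print(region_sum(ii, 1, 1, 2, 2))  # 5 + 6 + 8 + 9 = 28
```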
  7.  The image processing apparatus according to claim 4 or 5, wherein the motion trajectory detection unit tracks the movement of the noted person using an average displacement (mean shift) method.
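One common reading of the "average displacement method" in claim 7 is mean-shift tracking: the search window repeatedly moves to the centroid of the pixel weights (for example, a skin-colour likelihood map) inside it until the displacement vanishes. The weight map, window shape, and iteration cap below are toy assumptions, not details from the claim.

```python
# Hedged sketch of mean-shift tracking over a 2-D weight map.

def mean_shift(weights, cx, cy, radius, iters=20):
    """Shift the window centre (cx, cy) to the local weight centroid."""
    h, w = len(weights), len(weights[0])
    for _ in range(iters):
        sw = sx = sy = 0.0
        for y in range(max(0, cy - radius), min(h, cy + radius + 1)):
            for x in range(max(0, cx - radius), min(w, cx + radius + 1)):
                sw += weights[y][x]
                sx += x * weights[y][x]
                sy += y * weights[y][x]
        if sw == 0:
            break                      # no weight in the window: give up
        nx, ny = round(sx / sw), round(sy / sw)
        if (nx, ny) == (cx, cy):
            break                      # converged: displacement is zero
        cx, cy = nx, ny
    return cx, cy

# Toy usage: all the weight sits at (5, 5); the window starts at (3, 3).
weights = [[0.0] * 8 for _ in range(8)]
weights[5][5] = 1.0
pos = mean_shift(weights, 3, 3, 2)
print(pos)  # (5, 5)
```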
  8.  The image processing apparatus according to any one of claims 1 to 7, wherein the motion analysis unit defines in advance motion trajectories corresponding to motions of the noted person, analyzes the motion of the noted person by detecting, from the motion trajectory of the noted person detected by the motion trajectory detection unit, portions similar to the predefined motion trajectories, and calculates the evaluation value for the motion of the noted person in accordance with the type of the motion of the noted person.
  9.  The image processing apparatus according to any one of claims 1 to 7, wherein the motion analysis unit analyzes the motion of the noted person based on a motion history image of the noted person, as the motion trajectory of the noted person, and calculates the evaluation value for the motion of the noted person.
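A motion history image (MHI) of the kind referenced in claim 9 can be sketched as follows: pixels where motion occurs in the current frame are set to a maximum timestamp value, and all other pixels decay, so recent motion appears bright and older motion progressively darker, encoding the trajectory in a single image. The timestamp and decay values are illustrative.

```python
# Sketch of a motion history image update from a binary motion mask.

def update_mhi(mhi, motion_mask, tau=255, decay=1):
    """Update an MHI in place: moving pixels get `tau`, others decay."""
    for y, row in enumerate(motion_mask):
        for x, moving in enumerate(row):
            if moving:
                mhi[y][x] = tau
            else:
                mhi[y][x] = max(0, mhi[y][x] - decay)
    return mhi

# Toy usage on a 2x2 image: motion moves from top-left to top-right.
mhi = [[0, 0], [0, 0]]
update_mhi(mhi, [[1, 0], [0, 0]])   # frame 1: motion at top-left
update_mhi(mhi, [[0, 1], [0, 0]])   # frame 2: motion moves right
print(mhi)  # [[254, 255], [0, 0]] -- older motion is slightly darker
```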
  10.  The image processing apparatus according to any one of claims 1 to 9, wherein the noted person detection unit further detects the position of the noted person in the still image, the size of the noted person in the still image, and the region of the noted person in the still image,
     the motion trajectory detection unit further detects the length of the motion trajectory of the noted person and the movement pattern of the noted person,
     the apparatus further comprises:
     an importance determination unit that determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the noted person, the position of the noted person in the still image, and the size of the noted person in the still image, and calculates, for each of the plurality of still images, an evaluation value of the importance based on the determined importance;
     a composition analysis unit that analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the noted person in the still image, the size of the noted person in the still image, and the movement pattern of the noted person, and calculates, for each of the plurality of still images, an evaluation value of the composition based on the analyzed quality of the composition; and
     an image quality determination unit that determines the image quality of each of the plurality of still images based on the region of the noted person in the still image, and calculates, for each of the plurality of still images, an evaluation value of the image quality based on the determined image quality, and
     the still image data output unit outputs, from among the plurality of still images, still image data of one or more still images whose overall evaluation value, combining the evaluation value for the motion of the noted person with at least one of the evaluation value of the importance, the evaluation value of the composition, and the evaluation value of the image quality, is equal to or greater than a threshold value.
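The combination step at the end of claim 10 can be sketched as a weighted sum. The equal weighting below is an assumption; the claim only requires combining the motion score with at least one of the other evaluation values.

```python
# Hedged sketch of the overall evaluation in claim 10: combine per-frame
# motion, importance, composition and image-quality scores into a total
# evaluation value, and output the frames at or above a threshold.

def total_scores(motion, importance, composition, quality,
                 weights=(1.0, 1.0, 1.0, 1.0)):
    """Per-frame weighted sum of the four evaluation values."""
    wm, wi, wc, wq = weights
    return [wm * m + wi * i + wc * c + wq * q
            for m, i, c, q in zip(motion, importance, composition, quality)]

def select_by_total(frames, totals, threshold):
    """Frames whose total evaluation value meets the threshold."""
    return [f for f, t in zip(frames, totals) if t >= threshold]

# Toy usage with two frames and equal weights.
totals = total_scores([0.9, 0.2], [0.8, 0.9], [0.7, 0.3], [0.9, 0.4])
print(select_by_total(["f0", "f1"], totals, 3.0))  # ['f0']
```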
  11.  The image processing apparatus according to claim 10, wherein the composition analysis unit defines in advance movement patterns of the noted person, detects, from the motion trajectory of the noted person detected by the motion trajectory detection unit, portions in which the noted person moves in a predefined movement pattern, analyzes the composition of the still images corresponding to the portions in which the noted person moves in the predefined movement pattern as good, and calculates the evaluation value of the composition of the still images analyzed as good so as to be higher than the evaluation value of the composition of the still images not analyzed as good.
  12.  The image processing apparatus according to any one of claims 1 to 11, wherein the noted person detection unit further detects the orientation of the face of the noted person in the still image, and
     the apparatus further comprises a top-and-bottom correction unit that corrects, based on the orientation of the face of the noted person detected by the noted person detection unit, the top-and-bottom orientation of the still images corresponding to the still image data output from the still image data output unit so that it is the same as the top-and-bottom orientation of the photographing device at the time the moving image was captured.
  13.  An image processing method comprising:
     a step in which a still image data extraction unit extracts still image data of a plurality of frames from moving image data;
     a step in which a noted person detection unit detects, from each of a plurality of still images corresponding to the still image data of the plurality of frames, a noted person who is a person to be processed;
     a step in which a motion trajectory detection unit tracks the movement of the noted person in the moving image corresponding to the moving image data based on the detection results for the noted person in the plurality of still images, and detects a motion trajectory of the noted person;
     a step in which a motion analysis unit analyzes the motion of the noted person in the moving image based on the motion trajectory of the noted person, and calculates, for each of the plurality of still images, an evaluation value for the motion of the noted person based on the analyzed motion of the noted person; and
     a step in which a still image data output unit outputs, from among the still image data of the plurality of frames, still image data of still images whose evaluation value for the motion of the noted person is equal to or greater than a threshold value.
  14.  The image processing method according to claim 13, further comprising:
     a step in which the noted person detection unit detects the position of the noted person in the still image, the size of the noted person in the still image, and the region of the noted person in the still image;
     a step in which the motion trajectory detection unit detects the length of the motion trajectory of the noted person and the movement pattern of the noted person;
     a step in which an importance determination unit determines the importance of each of the plurality of still images based on at least one of the length of the motion trajectory of the noted person, the position of the noted person in the still image, and the size of the noted person in the still image, and calculates, for each of the plurality of still images, an evaluation value of the importance based on the determined importance;
     a step in which a composition analysis unit analyzes the quality of the composition of each of the plurality of still images based on at least one of the position of the noted person in the still image, the size of the noted person in the still image, and the movement pattern of the noted person, and calculates, for each of the plurality of still images, an evaluation value of the composition based on the analyzed quality of the composition;
     a step in which an image quality determination unit determines the image quality of each of the plurality of still images based on the region of the noted person in the still image, and calculates, for each of the plurality of still images, an evaluation value of the image quality based on the determined image quality; and
     a step in which the still image data output unit outputs, from among the plurality of still images, still image data of one or more still images whose overall evaluation value, combining the evaluation value for the motion of the noted person with at least one of the evaluation value of the importance, the evaluation value of the composition, and the evaluation value of the image quality, is equal to or greater than a threshold value.
  15.  The image processing method according to claim 13 or 14, further comprising:
     a step in which the noted person detection unit detects the orientation of the face of the noted person in the still image; and
     a step in which a top-and-bottom correction unit corrects, based on the orientation of the face of the noted person detected by the noted person detection unit, the top-and-bottom orientation of the still images corresponding to the still image data output from the still image data output unit so that it is the same as the top-and-bottom orientation of the photographing device at the time the moving image was captured.
  16.  A program for causing a computer to execute each of the steps of the image processing method according to any one of claims 13 to 15.
  17.  A computer-readable recording medium on which is recorded a program for causing a computer to execute each of the steps of the image processing method according to any one of claims 13 to 15.
PCT/JP2015/072818 2014-08-29 2015-08-12 Image-processing device, image-processing method, program, and recording medium WO2016031573A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2014-176402 2014-08-29
JP2014176402A JP2016052013A (en) 2014-08-29 2014-08-29 Image processing device, image processing method, program and recording medium

Publications (1)

Publication Number Publication Date
WO2016031573A1 true WO2016031573A1 (en) 2016-03-03

Family

ID=55399470

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2015/072818 WO2016031573A1 (en) 2014-08-29 2015-08-12 Image-processing device, image-processing method, program, and recording medium

Country Status (2)

Country Link
JP (1) JP2016052013A (en)
WO (1) WO2016031573A1 (en)


Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106023894B * 2016-08-09 2019-01-22 深圳市华星光电技术有限公司 Driving method and driving system for reducing AMOLED display ghosting
JP6778398B2 (en) * 2017-07-20 2020-11-04 京セラドキュメントソリューションズ株式会社 Image forming apparatus, image forming method and image forming program
JP6737247B2 (en) * 2017-07-20 2020-08-05 京セラドキュメントソリューションズ株式会社 Image processing apparatus, image processing method, and image processing program
EP4047923A4 (en) 2020-12-22 2022-11-23 Samsung Electronics Co., Ltd. Electronic device comprising camera and method of same
KR20220090054A (en) * 2020-12-22 2022-06-29 삼성전자주식회사 Electronic device with camera and method thereof

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2009200713A (en) * 2008-02-20 2009-09-03 Sony Corp Image processing device, image processing method, and program
WO2010116715A1 (en) * 2009-04-07 2010-10-14 パナソニック株式会社 Image pickup device, image pickup method, program and integrated circuit
JP2011175599A (en) * 2010-02-25 2011-09-08 Canon Inc Image processor, and processing method and program thereof
JP2011239021A (en) * 2010-05-06 2011-11-24 Nikon Corp Moving image creation device, imaging device and moving image creation program


Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2020010974A1 (en) * 2018-07-12 2020-01-16 腾讯科技(深圳)有限公司 Image processing method and device, computer readable medium and electronic device
US11282182B2 (en) 2018-07-12 2022-03-22 Tencent Technology (Shenzhen) Company Limited Image processing method and apparatus, computer-readable medium, and electronic device

Also Published As

Publication number Publication date
JP2016052013A (en) 2016-04-11

Similar Documents

Publication Publication Date Title
WO2016031573A1 (en) Image-processing device, image-processing method, program, and recording medium
US9734612B2 (en) Region detection device, region detection method, image processing apparatus, image processing method, program, and recording medium
US10417773B2 (en) Method and apparatus for detecting object in moving image and storage medium storing program thereof
KR101071352B1 (en) Apparatus and method for tracking object based on PTZ camera using coordinate map
JP6482195B2 (en) Image recognition apparatus, image recognition method, and program
JP5484184B2 (en) Image processing apparatus, image processing method, and program
JP6389801B2 (en) Image processing apparatus, image processing method, program, and recording medium
JP4373840B2 (en) Moving object tracking method, moving object tracking program and recording medium thereof, and moving object tracking apparatus
JP6579950B2 (en) Image analysis apparatus, program, and method for detecting person appearing in captured image of camera
JP6362085B2 (en) Image recognition system, image recognition method and program
JP4764172B2 (en) Method for detecting moving object candidate by image processing, moving object detecting method for detecting moving object from moving object candidate, moving object detecting apparatus, and moving object detecting program
JP6389803B2 (en) Image processing apparatus, image processing method, program, and recording medium
US10574904B2 (en) Imaging method and electronic device thereof
JP2012212373A (en) Image processing device, image processing method and program
US9947106B2 (en) Method and electronic device for object tracking in a light-field capture
KR20170053807A (en) A method of detecting objects in the image with moving background
JP4913801B2 (en) Shielding object image identification apparatus and method
JP2008288684A (en) Person detection device and program
JP2008287594A (en) Specific movement determination device, reference data generation device, specific movement determination program and reference data generation program
González et al. Single object long-term tracker for smart control of a ptz camera
JP6798609B2 (en) Video analysis device, video analysis method and program
JP6548788B2 (en) IMAGE PROCESSING APPARATUS, IMAGE PROCESSING METHOD, PROGRAM, AND RECORDING MEDIUM
KR101468347B1 (en) Method and arrangement for identifying virtual visual information in images
JP5539565B2 (en) Imaging apparatus and subject tracking method
CN113033350B (en) Pedestrian re-identification method based on overlook image, storage medium and electronic equipment

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application (Ref document number: 15836773; Country of ref document: EP; Kind code of ref document: A1)
NENP Non-entry into the national phase (Ref country code: DE)
122 Ep: pct application non-entry in european phase (Ref document number: 15836773; Country of ref document: EP; Kind code of ref document: A1)