WO2011129059A1 - Information processing device and information processing method - Google Patents

Information processing device and information processing method Download PDF

Info

Publication number
WO2011129059A1
WO2011129059A1 PCT/JP2011/001725 JP2011001725W WO2011129059A1 WO 2011129059 A1 WO2011129059 A1 WO 2011129059A1 JP 2011001725 W JP2011001725 W JP 2011001725W WO 2011129059 A1 WO2011129059 A1 WO 2011129059A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
unit
shooting
code amount
clip
Prior art date
Application number
PCT/JP2011/001725
Other languages
French (fr)
Japanese (ja)
Inventor
武紀 米津
Original Assignee
パナソニック株式会社
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by パナソニック株式会社 filed Critical パナソニック株式会社
Publication of WO2011129059A1 publication Critical patent/WO2011129059A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream, rendering scenes according to MPEG-4 scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/80Generation or processing of content or additional data by content creator independently of the distribution process; Content per se
    • H04N21/81Monomedia components thereof
    • H04N21/8146Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics
    • H04N21/8153Monomedia components thereof involving graphical data, e.g. 3D object, 2D graphics comprising still images, e.g. texture, background image

Definitions

  • the present invention relates to an information processing apparatus and an information processing method, and more particularly to an information processing apparatus and an information processing method for realizing seamless reproduction without reproduction interruption.
  • a desired scene that is, a video is often shot (recorded) by repeating recording and pause (shooting start and shooting stop).
  • seamless playback when playing back images shot in such a manner, there is a demand for smoothly playing back joints between scenes (hereinafter referred to as seamless playback).
  • the video shot as described above is recorded on an information recording medium (recording medium) by image signal processing that performs data compression using, for example, the MPEG (Moving Picture Experts Group) technique, and when the recorded video is played back, seamless playback is performed. May not be possible.
  • a video composed of a plurality of shot images obtained from the start of shooting (recording) to the stop of shooting (recording) is referred to as a video clip.
  • FIG. 1 is a diagram showing a functional configuration of a conventional imaging system.
  • FIG. 1 shows a conventional imaging system including an imaging device 901, an information processing device 900, a recording medium 905, and a display device 908.
  • the photographing device 901 is, for example, a camera or a camcorder, and photographs a video clip composed of a plurality of images. Specifically, the imaging device 901 captures a video clip by repeating recording start and stop according to an instruction from the photographer, and the information processing device 900 records it.
  • the recording medium 905 is, for example, a recording medium capable of recording such as SD (Secure Digital) memory, HDD (Hard Disk Drive), CD-R (Compact Disc Recordable), DVD-R (Digital Versatile Disk Recordable), and the like.
  • SD Secure Digital
  • HDD Hard Disk Drive
  • CD-R Compact Disc Recordable
  • DVD-R Digital Versatile Disk Recordable
  • An image signal (video clip) photographed by the photographing device 901 is subjected to signal processing, stored in a frame buffer 902, and encoded into a format that can be recorded on a recording medium by an encoder 903.
  • the encoded image signal (video clip) is temporarily stored in the stream buffer 904, but is stored in the recording medium 905 at an appropriate timing according to the buffer amount managed by the information processing apparatus 900.
  • a stream (video clip) necessary for decoding is temporarily stored in the stream buffer 904 from the recording medium 905 at an appropriate timing according to the buffer amount managed by the information processing apparatus 900.
  • the temporarily accumulated stream (video clip) is decoded by the decoder 906 into a format that can be displayed by the display device 908, stored in the frame buffer 907, and then sent to the display device 908 such as a digital television.
  • the video imaged by the imaging device 901 is data-compressed by the encoder 903 that performs encoding and is decompressed by the decoder 906 that performs decoding.
  • the decoder 906 when decoding the encoded data (video clip) read from the recording medium 905, the decoder 906 stores the encoded data (video clip) in the internal buffer (stream buffer 904). Decode the encoded data (video clip).
  • the decoder 906 uses the internal buffer (stream buffer 904) for decoding, the internal buffer (stream buffer 904) may overflow.
  • Patent Documents 1 to 3 a technique for enabling seamless reproduction without causing interruption of continuity is disclosed (for example, Patent Documents 1 to 3).
  • seamless playback can be performed without interruption of continuity even when playback is performed across a plurality of video clips when a specific condition is satisfied.
  • a method of recording on a recording medium 905 after performing an encoding process for realizing seamless reproduction when recording an image using a photographing device 901 such as a camcorder is disclosed. That is, when video recording is performed at discontinuous timing, buffer status information or stream time stamp when controlling the amount of code managed by the encoder 903 at the recording stop time (shooting stop time) of the preceding video clip, etc.
  • a method of using so-called seamless information for encoding a subsequent video clip is disclosed.
  • the encoding is performed by the encoder 903 so that the internal buffer (stream buffer 904) used by the decoder 906 for decoding does not overflow when the video clip is reproduced, that is, the buffer model does not break down. Recording) can be performed. More specifically, the encoder 903 generates a stream that can be seamlessly reproduced and records the stream on the recording medium 905, and stores the seamless information in a memory such as a RAM provided separately in the information processing apparatus 900. Then, when the next video recording is performed (when the next video recording is started and the captured video clip is encoded), the seamless information is read to generate a video clip that can be seamlessly reproduced. In this way, encoding (seamless recording) processing by the encoder 903 can be performed so that the buffer model does not fail.
  • AVCHD Advanced Video Codec High Definition
  • the buffer status information when the code amount managed by the encoder 903 is controlled at the recording stop time (shooting stop time) of the preceding video clip is used. For this reason, when the subsequent video clip is shot, if the remaining amount of the buffer that can be used by the encoder 903 is small, the code amount that is originally intended to be allocated is not allocated, and there is a problem that the image quality is deteriorated.
  • a large number of rates are assigned to places where image quality degradation is easily perceived, and the rate distribution at places where image quality degradation is difficult to perceive is deleted.
  • the recording rate of the video clip is dynamically changed.
  • compression is performed so as not to deteriorate the image quality as a whole without assigning a rate more than necessary to a portion where it is difficult to perceive image quality deterioration.
  • the bit rate is changed in this way, the bit rate is controlled on the encoder 903 side so that the continuous video can be decoded without interruption on the decoder 906 side (so that seamless playback can be performed).
  • the data amount of the VBV buffer is controlled on the encoder 903 side so that a VBV (Video Buffering Verifier) buffer that temporarily holds data input to the decoder 906 does not overflow or underflow. .
  • VBV Video Buffering Verifier
  • the current picture is generated based on the generated code amount of the past picture by utilizing the fact that the correlation of the image contents between temporally adjacent pictures is high.
  • There is a method of assigning a code amount More specifically, in an image compression method that combines motion compensation represented by the MPEG standard and DCT (Discrete Cosine Transform), code amount control is performed so that the bit stream compressed by the encoder has a desired rate. Yes.
  • the code amount control is generally a method in which the quantization step is feedback-controlled based on the relationship between the previous quantization step and the code amount and the current average rate.
  • the quantization step is slightly roughened to lower the total code amount, and if the current average rate is lower, the quantization step is slightly finer to increase the total code amount. .
  • the total code amount will increase or decrease when viewed in fine time, but the average code amount will be set as the target value when viewed over a long time. Can do.
  • This code amount control is proposed as a test model in MPEG2. That is, the code amount control here is performed by performing feedback control using the relationship between the remaining amount of the virtual buffer and the generated code amount at the time of previous encoding.
  • the rate is instantaneously increased at a location such as a scene change where the correlation of the image contents between adjacent pictures is low. Therefore, depending on the application, the reproduced image may be broken, or a sufficient amount of code cannot be allocated, resulting in extreme image quality degradation.
  • the decoder of the playback device can pre-read and process the next video clip.
  • this requires an image holding buffer before decoding and before display, and a buffer memory for the time required for the decoder to decode one frame and for the number of reference images necessary for decoding. It will be a significant cost increase.
  • a sufficient code amount cannot be assigned to the video information at the scene junction point, and the image quality deteriorates.
  • the present invention has been made in view of the above-described circumstances, and an information processing apparatus and information processing capable of easily and appropriately controlling the amount of generated code in an encoding process even at a scene change location and easily seamlessly reproducing. It aims to provide a method.
  • an information processing apparatus encodes and records a plurality of video clips shot by repeating start and stop of shooting a plurality of times in a shooting apparatus.
  • a video holding unit that holds a video clip composed of a plurality of shot images shot by the shooting device, and one video clip obtained from the start to the stop of shooting of the shooting device,
  • An end image holding unit that retains an end shot image as a clip end image, the clip end image, and a shot image shot at the start of the next shooting, and the input clip end image and input
  • a blend unit that generates image data by blending the captured image captured at the start time of the next capturing, and the clip from the terminal image holding unit
  • a synchronization control unit that synchronizes input of an end image and input of a captured image captured at the start of the next capturing from the video holding unit, and a blend control unit that controls the blending unit by control information
  • an encoding processing unit that encodes the image data generated by the blending unit and controls the code amount of the image
  • the encoding processing unit includes an encoding unit that encodes the image data generated by the blend unit, and a code that controls a code amount of image data encoded by the encoding unit based on the control information.
  • An amount control unit and at the start of the next shooting, the code amount control unit controls the amount of code encoded by the encoding unit according to the control information, and the encoding unit includes the code
  • the image data generated by the blending unit is encoded with the code amount controlled by the amount control unit, and the code amount control unit depends on the control information from the start to the stop of the photographing of the photographing apparatus.
  • the encoding unit is input to the blend unit with the code amount controlled by the code amount control unit After the start of the next shooting
  • the image data generated by the blending unit may be encoded by blending only a plurality of photographed images.
  • the blend control unit may determine the control information based on a code amount preset in the information processing apparatus, and the encoding processing unit may be configured such that the end image holding unit is the clip end point. When holding the image, the code amount of the clip end image may be calculated, and the blend control unit may determine the control information based on the code amount calculated by the encoding processing unit.
  • the generated code amount in the encoding process can be controlled easily and appropriately by controlling the encoded code amount according to the control information.
  • the blend control unit uses the information on the current code amount calculated by the encoding processing unit to cause the blend unit to capture the clip end image and a captured image captured at the start of the next capturing.
  • the image data may be generated by changing the blending ratio.
  • the clip end image can be blended step by step with the captured image captured at the time when the next capturing is started.
  • the blend control unit is configured to control the control information based on the code amount calculated by the encoding processing unit, the clip end image, and the image correlation of the captured image immediately before the clip end image.
  • the blending unit is caused to generate the image data by changing a ratio of blending the clip end image and the captured image captured at the start of the next capturing. It is good.
  • the present invention can be realized not only as such an information processing apparatus, but also as an information processing method using steps characteristic of the information processing apparatus. It can also be realized as a program to be executed. Needless to say, such a program can be distributed via a recording medium such as a CD-R or a transmission medium such as the Internet.
  • the present invention it is possible to easily and appropriately control the amount of generated code in an encoding process even at a scene change location, and to realize an information processing apparatus that can be easily seamlessly reproduced.
  • a moving image is a video recording method that accompanies a time axis rather than a moment, and therefore, images from recording to stop are recorded while repeating this operation. For example, when a recorded image (video) is to be viewed, there is a demand to seamlessly reproduce the image. According to the present invention, a method for easily realizing this at an extremely low cost is provided in the AVCHD standard. be able to.
  • the data obtained up to the recording stop instruction is recorded on the information recording medium, and then one frame or 1 GOP (Group Of Pictures), etc.
  • a very short data processing unit is held in the buffer, and is superposed on the data scheduled to be recorded at the start of subsequent recording.
  • FIG. 1 is a diagram illustrating a functional configuration of a conventional imaging system.
  • FIG. 2 is a diagram showing a main block configuration of the imaging system according to the present invention.
  • FIG. 3 is a diagram showing a main functional configuration of the imaging system according to the present invention.
  • FIG. 4 is a diagram for explaining an information processing method for captured images that enables seamless reproduction of the information processing apparatus according to the present invention.
  • FIG. 5 is a diagram for explaining a method of recording a captured image that enables seamless reproduction of the information processing apparatus according to the present invention.
  • FIG. 6 is a flowchart for explaining the information processing method of the imaging system according to the present invention.
  • FIG. 7 is a diagram illustrating an example of an imaging apparatus equipped with the information processing apparatus according to the present invention.
  • FIG. 8 is a diagram illustrating an example of an imaging apparatus equipped with the information processing apparatus according to the present invention.
  • FIG. 2 is a diagram showing a main block configuration of the imaging system according to the present invention
  • FIG. 3 is a diagram showing a main functional configuration of the imaging system according to the present invention.
  • the image capturing apparatus 101 is a moving image recording apparatus such as a camera or a camcorder, for example, and captures a video clip (captured image group) by capturing a plurality of images. Specifically, the photographing apparatus 101 records a video clip (captured image group) by repeatedly starting and stopping recording in accordance with a photographer's instruction.
  • the recording medium 105 is a recordable medium such as an SD memory, HDD, CD-R, or DVD-R, for example, and data (here, image data) is recorded by the information processing apparatus 100.
  • the information processing apparatus 100 shown in FIG. 2 encodes and records a plurality of video clips shot by repeating the start and stop of shooting a plurality of times in the shooting apparatus 101. Specifically, the information processing apparatus 100 encodes an image input from the photographing apparatus 101 by a technique such as MPEG2 or H264, and records the encoded image on the recording medium 105, for example. Note that a video clip is composed of a plurality of captured images.
  • the information processing apparatus 100 illustrated in FIG. 2 includes a video clip holding unit 102, an encoding processing unit 103, a stream buffer 104, a clip end image holding unit 201, a blending unit 202, a synchronization control unit 203, a blend And a control unit 204.
  • the video clip holding unit 102 holds a video clip composed of the number of shot images shot by the shooting device 101.
  • the image signal photographed by the photographing apparatus 101 is subjected to signal processing and held as a photographed image signal.
  • the video clip holding unit 102 holds image information for a processing unit sufficient to encode the end image information at a timing when a stop instruction is given to the photographing apparatus 101 by the photographer.
  • the image information for the processing unit sufficient to encode the end image information is, for example, a plurality of times corresponding to a minimum unit (1 GOP) or more composed of pictures having a reference relationship with the clip end image. A plurality of captured images such as 1 GOP after a stop instruction.
  • the video clip holding unit 102 may hold one video clip obtained from the start to the stop of shooting of the shooting apparatus 101, and is shot by repeating the start and stop of shooting of the shooting apparatus 101 a plurality of times. A plurality of video clips may be held.
  • the clip end image holding unit 201 holds a terminal shot image as a clip end image among one video clip obtained from the start to the stop of shooting by the shooting apparatus 101.
  • the clip end image holding unit 201 corresponds to a processing unit sufficient for the video clip holding unit 102 to encode the end image information at a timing when a stop instruction is given to the photographing apparatus 101 by the photographer. After the image information is held, the next one frame of image information is held as a clip end image signal (clip end image).
  • the clip end image signal may be, for example, a plurality of captured images for a time equal to or longer than the minimum unit (1 GOP) composed of pictures having a reference relationship with the clip end image.
  • the clip end image signal (clip end image) is, for example, a final shot image of video clips (a plurality of shot images) obtained from the start to the stop of shooting by the shooting apparatus 101, and is equivalent to one frame. It may be a still image.
  • the synchronization control unit 203 controls the clip end image holding unit 201 and the video clip holding unit 102 to input the clip end image from the clip end image holding unit 201. Then, synchronization with the input of the photographed image taken at the start time of the next photography from the video clip holding unit 102 is performed.
  • the synchronization control unit 203 receives the captured image signal with the actual time input from the image capturing apparatus 101 to the video clip holding unit 102 and the clip end point in response to a recapture start instruction (capture resumption instruction) by the photographer.
  • the clip end image signal held in the image holding unit 201 is synchronized with the two inputs.
  • the two inputs are sent to the blend unit 202.
  • the blend unit 202 includes a blend execution unit 2021 and a frame buffer 2022, and blends (superimposes) the input image to generate image data. Specifically, when resuming shooting (starting the next shooting), the blending unit 202 uses the clip end image input in synchronization by the synchronization control unit 203 and the shot image shot at the shooting restart point, Image data is generated by blending according to the control information. That is, the blending unit 202 receives the clip end image and the shot image shot at the start of the next shooting, and inputs the clip end image and the shot image shot at the start of the next shooting. Blend to generate image data.
  • the blend execution unit 2021 is controlled by the control information output from the blend control unit 204, and blends the two inputs according to the weighted superimposition ratio (value ⁇ indicating the blend ratio) indicated in the control information to generate image data. Is generated.
  • the blend execution unit 2021 stores the generated image data in the frame buffer 2022. Specifically, as shown in FIG. 3, the blend execution unit 2021 converts the captured image held by the video clip holding unit 102 and the clip end image held by the clip end image holding unit 201 to ⁇ : (1- Mix at a ratio of ⁇ ), that is, blend (superimpose).
  • the value ⁇ indicating the blend ratio takes a value from 0 to 1.
  • the blend execution unit 2021 blends only the clip end image held in the clip end image holding unit 201 out of the two inputs, and blends the image data of only the clip end image. Generated as later video information.
  • the blend ratio indicates a value of 1
  • only the captured image held by the video clip holding unit 102 is blended, and image data of only the captured image is generated as the video information after blending.
  • the value ⁇ indicating the blend ratio is determined by the value of ⁇ that can be taken when starting the next shooting (resuming the shooting), the clip end image, and the shot image shot at the next shooting start time. The ratio of blending is determined.
  • the value ⁇ indicating the blend ratio may be initialized to 0 by a re-shooting start instruction from the photographer, and the value ⁇ indicating the blend ratio is initialized to 0 by a shooting stop instruction from the photographer. It may be done.
  • the value ⁇ indicating the blend ratio may be set to 1.
  • white data separately prepared in advance by the clip end image holding unit 201 using ROM data or the like And / or a fixed image such as black may be blended.
  • the clip end image can be blended in a stepwise manner with the shot image taken at the next start of shooting.
  • the value ⁇ indicating the blend ratio may be changed based on, for example, control information including a code amount. In that case, by controlling the rate of change in the blend ratio, it is possible to suppress the generation of an unnecessary code amount from the relevance of the image, and to further suppress the buffer failure during seamless reproduction.
  • the frame buffer 2022 stores the image data generated by the blend execution unit 2021.
  • the stored image data is read out by the encoding processing unit 103 during the encoding process.
  • the blend control unit 204 controls the blend unit 202 with control information. Specifically, the blend control unit 204 determines the control information based on the code amount calculated by the code amount control unit 1031 and the image correlation between the clip end image and the captured image immediately before the clip end image. decide. Then, according to the determined control information, the blend unit 202 changes the ratio of blending the clip end image and the photographed image taken at the next photographing start time to generate image data.
  • the blend control unit 204 may determine control information based on a code amount preset in the information processing apparatus 100, for example. Further, the blend control unit 204, for example, when the clip end image holding unit 201 holds the clip end image, the control information is obtained based on the code amount calculated by the code amount control unit 1031 of the clip end image. It may be decided. Also, the blend control unit 204 uses the information on the current code amount calculated by the code amount control unit 1031, for example, to the blend unit 202, and the captured image captured at the time when the next capturing starts. The image data may be generated by changing the blend ratio.
  • the encoding processing unit 103 includes a code amount control unit 1031, a quantization unit 1032, and an encoding unit 1033.
  • the encoding processing unit 103 encodes the image data generated by the blend unit 202 and encodes it based on the control information. Controls the code amount of image data.
  • the code amount control unit 1031 controls the code amount of the image data encoded by the encoding processing unit 103 based on the control information. Specifically, the code amount control unit 1031 controls the code amount encoded by the encoding unit 1033 according to the control information at the start of the next shooting, and from the start of shooting of the shooting apparatus 101 to the stop thereof. The amount of code encoded by the encoding unit 1033 is controlled for each encoding processing unit without depending on the control information.
  • the encoding unit 1033 encodes the image data generated by the blending unit 202. Specifically, the encoding unit 1033 encodes the image data generated by the blend unit 202 with the code amount controlled by the code amount control unit 1031 at the start of the next imaging, and the imaging device 101 performs imaging. From the start to the stop, the blend unit is obtained by blending only a plurality of photographed images taken after the start of the next photographing input to the blend unit 202 with the code amount controlled by the code amount control unit 1031. The image data generated in 202 is encoded.
  • the code amount may be a small code amount set in advance, or may be a code amount obtained by the code amount control unit 1031 calculating the code amount of the clip end image in advance. Also good.
  • the stream buffer 104 temporarily stores a plurality of video clips encoded in a format that can be recorded on the recording medium 105 by the encoding processing unit 103.
  • the plurality of encoded video clips temporarily stored are recorded on the recording medium 105.
  • the information processing apparatus 100 configured as described above includes a video clip holding unit 102 that holds a shot image signal input from the shooting apparatus 101, a clip end image holding unit 201 that holds a clip end image signal, and a clip end point.
  • Blend unit 202 having as input the clip end image signal of image holding unit 201 and the captured image signal of video clip holding unit 102, synchronization control unit 203 that synchronizes the two inputs, and blend that controls blend unit 202
  • the encoding unit 103 includes a control unit 204 and an encoding processing unit 103.
  • the encoding processing unit 103 includes a code amount control unit 1031 that controls the code amount based on control information of the blend control unit 204, and then starts shooting ( It is characterized by recording the image data (video) generated by blending the captured image and the clip end image at the time of shooting resumption) .
  • a code amount control unit 1031 that controls the code amount based on control information of the blend control unit 204, and then starts shooting ( It is characterized by recording the image data (video) generated by blending the captured image and the clip end image at the time of shooting resumption) .
  • video is a video recording method that is continuous with the time axis, not the moment, so the photographer can take multiple shot images (video clips) from the start of shooting (recording start) to the stop of shooting. Recording is performed while repeating the start and stop operations of shooting.
  • the clip end image is video information of one frame, 1 GOP or 1 GOP or more after the shooting stop instruction of the photographer, a plurality of shot images (video clips) up to the shooting stop instruction of the photographer are stored on the hard disk.
  • a storage medium such as a semiconductor memory.
  • FIG. 4 is a diagram for explaining a captured image information processing method that enables seamless playback of the information processing apparatus according to the present invention.
  • the photographed image (image data) until being encoded and stored in the recording medium 105 according to the photographing stop instruction by the photographer is the (n ⁇ 1) th (n is a natural number), the photographing start instruction (shooting restart instruction)
  • the clip end image to be superimposed is set to the nth (n is a natural number) of the captured image (image data).
  • the video clip holding unit 102 has a processing unit sufficient to encode the end image information at the timing when the photographing apparatus is instructed to stop photographing by the photographer.
  • Image information that is, a video clip up to a photographed image (n-1) photographed by the photographing apparatus 101 and a photographed image (n) which is image information of the next one frame are held.
  • the clip end image holding unit 201 is configured so that the video clip holding unit 102 takes the shot image (n ⁇ 1) at the timing when the shooting is instructed to the shooting apparatus 101 by the photographer. After that, the image information of the next one frame, that is, the captured image (n) is held as the clip end image.
  • (c) of FIG. 4 shows that the video clip holding unit 102 has a plurality of images captured by the imaging device 101 at the timing when the next imaging start instruction (imaging restart instruction) is given to the imaging device 101 by the photographer. A photographed image (video clip to be held from now on) is shown.
  • the blend unit 202 synchronizes and inputs the captured image (when the photographer next instructs the photographing apparatus 101 to perform photographing start (shooting restart instruction) when the photographing is started next (photographing restart instruction).
  • the clip end image of n) and the shot image shot at the shooting start time (shooting restart time) are blended according to the value ⁇ indicating the blend ratio indicated in the control information, as shown in FIG. Then, the encoding processing unit 103 generates image data as shown in FIG.
  • the information processing apparatus 100 can correlate images with respect to parts such as scene changes.
  • the clip end image (clip end image signal) is one frame of still image information immediately after the shooting stop instruction.
  • the present invention is not limited to this.
  • An integer multiple of 1 GOP which is a processing unit of a plurality of captured images for a time equal to or longer than a minimum unit (1 GOP) composed of a plurality of pictures in which a clip end image (clip end image signal) has a reference relationship with the clip end image Video information.
  • FIG. 5 is a diagram for explaining an information processing method for captured images that enables seamless reproduction of the information processing apparatus according to the present invention.
  • FIG. 5 shows a case where the clip end image (signal) is video information that is an integral multiple of 1 GOP. That is, a video in which a clip end image (clip end image signal) maintains a correlation between frames (captured images) in units of GOPs, which are a plurality of pictures as a minimum unit that establishes a reference relationship necessary for image compression processing. The case of information is shown.
  • n ⁇ 1th n is a natural number of GOP (GOP data).
  • the clip end image which is the moving image information of an integral multiple of 1 GOP, to be superimposed by the next shooting start instruction (shooting restart instruction) is set to the nth (n is a natural number) of the GOP (GOP data).
  • the video clip holding unit 102 has a processing unit sufficient to encode the end image information at the timing when the photographing apparatus is instructed to stop the photographing apparatus 101.
  • Image information that is, a plurality of photographed images (video clips) up to GOP (n ⁇ 1) photographed by the photographing apparatus 101 and GOP (n) which is the next GOP are held.
  • the clip end image holding unit 201 causes the video clip holding unit 102 to set GOP (n ⁇ 1) at the timing when the shooting is instructed to the shooting apparatus 101 by the photographer. After holding, the next GOP, that is, GOP (n) is held as the clip end image.
  • (c) of FIG. 5 shows that the video clip holding unit 102 has a plurality of images captured by the imaging apparatus 101 at the timing when the next imaging start instruction (imaging restart instruction) is given to the imaging apparatus 101 by the photographer. A photographed image (video clip to be held from now on) is shown.
  • the blending unit 202 synchronizes and inputs the GOP (n) when the next photographing is started, that is, at the timing when the photographer gives the photographing device 101 a next photographing start instruction (shooting restart instruction). ) And the shot end image taken at the next shooting start time (shooting restart time), as shown in FIG. 5D, are blended according to the value ⁇ indicating the blend ratio indicated in the control information. . Then, the encoding processing unit 103 generates GOP data (image data) as shown in FIG.
  • the information processing apparatus 100 can have image correlation with respect to a part such as a scene change by being blended from the nth GOP and encoded later.
  • FIG. 6 is a flowchart for explaining the information processing method of the imaging system according to the present invention.
  • the initial blend ratio is set.
  • the blending unit 202 starts blending
  • the encoding processing unit 103 starts encoding (S103).
  • changing the blend amount means changing the blend ratio by adding or subtracting the value ⁇ indicating the blend ratio in increments of 0.1, for example.
  • the photographing stop (recording stop) instruction is monitored, that is, whether or not the photographing is finished (S106). If a shooting stop instruction is issued by the photographer (YES in S106), processing sufficient for the video clip holding unit 102 to encode the end image information at the timing when the shooting instruction is given to the shooting apparatus 101 by the photographer. After holding the image information for the unit, the next one frame of image information is held as a clip end image.
  • the photographing stop instruction by the photographer is further monitored in the same manner as described above (S106).
  • the video clip holding unit 102 holds image information for a processing unit sufficient to encode the above-described end image information at the timing when the photographing apparatus 101 instructs the photographing apparatus 101 to stop. Then, the image information of the next one frame is held as a clip end image (S107).
  • the blending process can be performed by the above series of processing flows.
  • the correlation between image contents may be low between adjacent pictures, such as a scene change.
  • the amount of generated codes to be allocated increases unnecessarily, and the virtual buffer for code amount smoothing may overflow, which is one factor that makes seamless reproduction extremely difficult.
  • the CC 5 connection method of the AVCHD standard as means for solving this, that is, the code of the image after the scene change based on the coding amount of the image before the scene change and the buffer remaining amount at the time of decoding (decoding) Even if the amount of code is limited so that the buffer does not overflow during playback, the amount of code cannot be sufficiently allocated for the amount of code originally intended to be allocated, resulting in fatal image quality degradation. There was a risk of inviting.
  • the recording resumption position (the next shooting start position) after the recording is stopped (after the shooting is stopped)
  • the clip end image and the video clip at the time of the recording restart (the next shooting start).
  • a clip end image relative to the beginning of the video clip at the time of resuming recording that is, the shot image after the shooting is stopped and shot at the time when the shooting is resumed.
  • the encoding processing unit 103 includes a code amount control unit 1031 that controls the code amount based on the control information of the blend control unit 204.
  • the code amount control unit 1031 when encoding the image data that has been blended and generated immediately after the recording resumption instruction (shooting resumption instruction), specifies, for example, a small code amount set in advance.
  • the encoding unit 1033 performs encoding processing. Thereby, not only there is no buffer overflow at the time of reproduction, but also seamless reproduction can be realized without causing image quality degradation.
  • the code amount control unit 1031 may calculate the code amount of the clip end image in advance and use it as the code amount after the recording restart instruction. good.
  • the information processing apparatus and information processing method of the present invention it is possible to avoid the discontinuity of the video information by maintaining the correlation between the video clips. It becomes unnecessary to be aware of the processing on the code side. Accordingly, seamless reproduction can be realized easily at a low cost.
  • the information processing apparatus and the information processing method of the present invention in seamless playback between video clips, editing using an expensive editing device is unnecessary, and a video effect with a high crossfade is obtained at the same time as shooting is completed. Can be provided. Further, the effect can be obtained without changing the basic operation flow performed when the photographer shoots a moving image. Furthermore, it can be realized by a combination of the functions that are the basis of the moving image photographing device, and has an effect that it can be obtained relatively easily, inexpensively, and extremely high effects.
  • a management unit that manages the head position immediately after the image data of the blended part (joined part) as a playlist of the AVCHD standard And the management unit may select the head position by managing the time information of the head positions.
  • the information processing apparatus and the information processing method of the present invention have been described based on the embodiment, but the present invention is not limited to this embodiment. Unless it deviates from the meaning of this invention, the form which carried out the various deformation
  • This imaging device is, for example, a camera (digital still camera) as shown in FIG. 7 or a camcorder as shown in FIG. 8, and captures a desired scene by repeating recording and pause (shooting start and shooting stop). Can be recorded.
  • the present invention can be used for an information processing device and an information processing method, and is particularly used for home photographing using a photographing device such as a camera or a camcorder, and is desired by repeatedly recording and pausing (starting photographing and stopping photographing). It can be used in an information processing apparatus and an information processing method for photographing and recording a scene.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Computer Graphics (AREA)
  • Compression Or Coding Systems Of Tv Signals (AREA)

Abstract

Even at scene changes, the disclosed information processing device can easily and appropriately control the amount of data produced in an encoding process. Said information processing device (100), which also allows easy seamless playback, is provided with a video clip storage unit (102), a clip end-image storage unit (201), a blending unit (202), and an encoding process unit (103). The video clip storage unit (102) stores a video clip comprising a plurality of taken images. The clip end-image storage unit (201) stores a clip end image, which is the last taken image from a single video clip obtained from the beginning to the end of imaging by an imaging device. The clip end image and an image taken at the beginning of the next imaging are inputted to the blending unit (202), which blends said images to generate image data. The encoding process unit (103) encodes the image data generated by the blending unit (202), and on the basis of control information, controls the size of the encoded image data.

Description

情報処理装置及び情報処理方法Information processing apparatus and information processing method
 本発明は、情報処理装置及び情報処理方法に関し、特に、再生途切れのないシームレス再生を実現するための情報処理装置及び情報処理方法に関する。 The present invention relates to an information processing apparatus and an information processing method, and more particularly to an information processing apparatus and an information processing method for realizing seamless reproduction without reproduction interruption.
 例えば、カメラやカムコーダ等の撮影装置を用いた撮影では、録画及び一時停止(撮影開始及び撮影停止)の繰り返しにより所望のシーンすなわち映像を撮影(記録)することが多い。また、そのように撮影した映像を再生する際には、シーン間のつなぎ目をスムーズに再生(以下、シームレス再生と記載)したいという要求がある。 For example, in shooting using a shooting device such as a camera or a camcorder, a desired scene, that is, a video is often shot (recorded) by repeating recording and pause (shooting start and shooting stop). In addition, when playing back images shot in such a manner, there is a demand for smoothly playing back joints between scenes (hereinafter referred to as seamless playback).
 上記のように撮影した映像では、例えばMPEG(Moving Picture Experts Group)の手法によりデータ圧縮を行う画像信号処理により情報記録媒体(記録メディア)に記録し、記録された映像を再生する場合、シームレス再生ができない場合がある。なお以下では、映像を記録する際、撮影(記録)開始から撮影(記録)停止までに得た複数の撮影画像で構成される映像を映像クリップと称す。 The video shot as described above is recorded on an information recording medium (recording medium) by image signal processing that performs data compression using, for example, the MPEG (Moving Picture Experts Group) technique, and when the recorded video is played back, seamless playback is performed. May not be possible. Hereinafter, when recording a video, a video composed of a plurality of shot images obtained from the start of shooting (recording) to the stop of shooting (recording) is referred to as a video clip.
 図1は、従来の撮像システムの機能構成を示す図である。図1には、撮影装置901と、情報処理装置900と、記録メディア905と、表示機器908とを備える従来の撮像システムが示されている。 FIG. 1 is a diagram showing a functional configuration of a conventional imaging system. FIG. 1 shows a conventional imaging system including an imaging device 901, an information processing device 900, a recording medium 905, and a display device 908.
 撮影装置901は、例えば、カメラ、カムコーダであり、複数の画像で構成される映像クリップを撮影する。具体的には、撮影者の指示による記録開始及び停止の繰り返しにより撮影装置901は映像クリップを撮影し、情報処理装置900は、それを記録する。 The photographing device 901 is, for example, a camera or a camcorder, and photographs a video clip composed of a plurality of images. Specifically, the imaging device 901 captures a video clip by repeating recording start and stop according to an instruction from the photographer, and the information processing device 900 records it.
 記録メディア905は、例えば、SD(Secure Digital)メモリ、HDD(Hard Disk Drive)、CD-R(Compact Disc Recordable)、DVD-R(Digital Versatile Disk Recordable)等の記録可能なメディアであり、撮影装置901により撮影された映像クリップが情報処理装置900により記録される。 The recording medium 905 is, for example, a recording medium capable of recording such as SD (Secure Digital) memory, HDD (Hard Disk Drive), CD-R (Compact Disc Recordable), DVD-R (Digital Versatile Disk Recordable), and the like. The video clip photographed in 901 is recorded by the information processing apparatus 900.
 次に、図1に示す従来の撮像システムの記録処理と再生処理とについて説明する。 Next, recording processing and reproduction processing of the conventional imaging system shown in FIG. 1 will be described.
 まず、記録処理について説明する。撮影装置901で撮影された画像信号(映像クリップ)は、信号処理が施され、フレームバッファ902に蓄えられ、エンコーダ903によって記録メディアへ記録できる形式に符号化される。符号化された画像信号(映像クリップ)は、ストリームバッファ904に一時的に蓄積されるが、情報処理装置900で管理されるバッファ量に応じた適切なタイミングで記録メディア905へ蓄積される。 First, the recording process will be described. An image signal (video clip) photographed by the photographing device 901 is subjected to signal processing, stored in a frame buffer 902, and encoded into a format that can be recorded on a recording medium by an encoder 903. The encoded image signal (video clip) is temporarily stored in the stream buffer 904, but is stored in the recording medium 905 at an appropriate timing according to the buffer amount managed by the information processing apparatus 900.
 次に、再生処理について説明する。記録メディア905から情報処理装置900で管理されるバッファ量に応じた適切なタイミングで復号に必要なストリーム(映像クリップ)がストリームバッファ904に一時的に蓄積される。一時的に蓄積されたストリーム(映像クリップ)は、デコーダ906によって表示機器908が表示できる形式へ復号されてフレームバッファ907に蓄えられ、その後、デジタルテレビなどの表示機器908へ送り出される。 Next, the playback process will be described. A stream (video clip) necessary for decoding is temporarily stored in the stream buffer 904 from the recording medium 905 at an appropriate timing according to the buffer amount managed by the information processing apparatus 900. The temporarily accumulated stream (video clip) is decoded by the decoder 906 into a format that can be displayed by the display device 908, stored in the frame buffer 907, and then sent to the display device 908 such as a digital television.
 このようにして、撮影装置901で撮影された映像は、符号化を行なうエンコーダ903によりデータ圧縮され、復号を行なうデコーダ906によりデータ伸長される。具体的には、デコーダ906は、記録メディア905から読み取った符号化データ(映像クリップ)を復号する場合には、内部バッファ(ストリームバッファ904)へ符号化データ(映像クリップ)を蓄積し、蓄積した符号化データ(映像クリップ)に対して復号を行う。 In this way, the video imaged by the imaging device 901 is data-compressed by the encoder 903 that performs encoding and is decompressed by the decoder 906 that performs decoding. Specifically, when decoding the encoded data (video clip) read from the recording medium 905, the decoder 906 stores the encoded data (video clip) in the internal buffer (stream buffer 904). Decode the encoded data (video clip).
 しかし、デコーダ906が内部バッファ(ストリームバッファ904)を復号に使用すると、内部バッファ(ストリームバッファ904)は、溢れてしまう可能性がある。 However, if the decoder 906 uses the internal buffer (stream buffer 904) for decoding, the internal buffer (stream buffer 904) may overflow.
 それに対して、内部バッファ(ストリームバッファ904)が溢れてしまうことを抑制するため、前の映像クリップの再生を終えてから次の映像クリップを再生する制御を行う方法も考えられる。しかし、この制御によって、例えば映像クリップを跨ぐ再生を行う際には、後続の映像クリップに渡る直前の画像が間延びして表示されるなどの連続性の途切れが発生してしまう。 On the other hand, in order to prevent the internal buffer (stream buffer 904) from overflowing, a method of controlling the playback of the next video clip after the playback of the previous video clip is conceivable. However, with this control, for example, when playback is performed across video clips, a discontinuity of continuity occurs, for example, the image immediately before the subsequent video clip is displayed in an extended manner.
 そこで、連続性の途切れを発生させることなくするシームレス再生を可能とするための手法が開示されている(例えば、特許文献1~特許文献3)。 Therefore, a technique for enabling seamless reproduction without causing interruption of continuity is disclosed (for example, Patent Documents 1 to 3).
 これら特許文献1~特許文献3に開示されているように、ある特定の条件を満たす場合には複数の映像クリップをまたぐ再生においても連続性の途切れなくシームレス再生を行なうことが可能となる。 As disclosed in Patent Documents 1 to 3, seamless playback can be performed without interruption of continuity even when playback is performed across a plurality of video clips when a specific condition is satisfied.
 具体的には、カムコーダなどの撮影装置901を用いて映像を記録する場合、シームレス再生を実現するためのエンコード処理を行なった上で、記録メディア905に記録する方法が開示されている。つまり、不連続タイミングで映像の記録が行われる場合、先行する映像クリップの記録停止時点(撮影停止時点)で、エンコーダ903が管理する符号量を制御する際のバッファ状態情報またはストリームのタイムスタンプといったいわゆるシームレス情報を後続の映像クリップの符号化に用いる方法が開示されている。 Specifically, a method of recording on a recording medium 905 after performing an encoding process for realizing seamless reproduction when recording an image using a photographing device 901 such as a camcorder is disclosed. That is, when video recording is performed at discontinuous timing, buffer status information or stream time stamp when controlling the amount of code managed by the encoder 903 at the recording stop time (shooting stop time) of the preceding video clip, etc. A method of using so-called seamless information for encoding a subsequent video clip is disclosed.
 これによって、映像クリップの再生時にデコーダ906が復号に使用する内部バッファ(ストリームバッファ904)が溢れることがないように、すなわちバッファ・モデルが破綻することがないように、エンコーダ903による符合化(シームレス記録)の処理を行うことができる。より具体的には、エンコーダ903はシームレス再生が可能なストリームを生成して記録メディア905に記録するとともに、そのシームレス情報を、情報処理装置900が別途備えるRAMなどのメモリに格納する。そして、次の映像記録に際して(次に撮影開始され撮影された映像クリップが符号化される際)、そのシームレス情報を読み取りシームレス再生可能な映像クリップを生成する。このようにして、バッファ・モデルが破綻することがないように、エンコーダ903による符合化(シームレス記録)の処理を行うことができる。 Thus, the encoding (seamless) is performed by the encoder 903 so that the internal buffer (stream buffer 904) used by the decoder 906 for decoding does not overflow when the video clip is reproduced, that is, the buffer model does not break down. Recording) can be performed. More specifically, the encoder 903 generates a stream that can be seamlessly reproduced and records the stream on the recording medium 905, and stores the seamless information in a memory such as a RAM provided separately in the information processing apparatus 900. Then, when the next video recording is performed (when the next video recording is started and the captured video clip is encoded), the seamless information is read to generate a video clip that can be seamlessly reproduced. In this way, encoding (seamless recording) processing by the encoder 903 can be performed so that the buffer model does not fail.
 なお、AVCHD(Advanced Video Codec High Definition)規格では、上記シームレス情報を用いてシームレス再生可能な映像クリップを生成することについての定義がある。 In the AVCHD (Advanced Video Codec High Definition) standard, there is a definition for generating a video clip that can be seamlessly played using the seamless information.
特開2008-061032号公報JP 2008-061032 A 特開2005-136633号公報JP 2005-136633 A 特開2007-306258号公報JP 2007-306258 A
 しかしながら、上記のシームレス情報を適用するシームレス記録制御においては、先行する映像クリップの記録停止時点(撮影停止時点)で、エンコーダ903が管理する符号量を制御する際のバッファ状態情報を用いている。そのため、後続する映像クリップを撮影する際に、エンコーダ903が使用可能なバッファ残量が少ない場合、本来割り当てたい符号量が割り当てられず、画質劣化を招いてしまうという課題がある。 However, in the seamless recording control to which the above-described seamless information is applied, the buffer status information when the code amount managed by the encoder 903 is controlled at the recording stop time (shooting stop time) of the preceding video clip is used. For this reason, when the subsequent video clip is shot, if the remaining amount of the buffer that can be used by the encoder 903 is small, the code amount that is originally intended to be allocated is not allocated, and there is a problem that the image quality is deteriorated.
 例えば、MPEGに代表されるデータ圧縮技術を用いて作成される映像クリップは、画質劣化が知覚されやすい個所にはレートが多く割り当てられ、画質劣化が知覚されにくい個所のレート配分が削除されて、映像クリップの記録レートが動的に変化される。これにより、画質の劣化を知覚するのが困難な部分に必要以上にレートを割り当てることなく、全体として、画質を劣化させることがないように圧縮している。そして、このようにビットレートを変更する場合、デコーダ906側で、連続する映像を途切れることなくデコードするように(シームレス再生することができるように)、エンコーダ903側でビットレートを制御する。すなわち、エンコーダ903側では、デコーダ906へ入力するデータを一時的に保持する、VBV(Video Buffering Verifier)バッファが、オーバーフローまたはアンダーフローしないように、エンコーダ903側において、VBVバッファのデータ量を制御する。 For example, in a video clip created using a data compression technique typified by MPEG, a large number of rates are assigned to places where image quality degradation is easily perceived, and the rate distribution at places where image quality degradation is difficult to perceive is deleted. The recording rate of the video clip is dynamically changed. As a result, compression is performed so as not to deteriorate the image quality as a whole without assigning a rate more than necessary to a portion where it is difficult to perceive image quality deterioration. When the bit rate is changed in this way, the bit rate is controlled on the encoder 903 side so that the continuous video can be decoded without interruption on the decoder 906 side (so that seamless playback can be performed). That is, on the encoder 903 side, the data amount of the VBV buffer is controlled on the encoder 903 side so that a VBV (Video Buffering Verifier) buffer that temporarily holds data input to the decoder 906 does not overflow or underflow. .
 例えば通常の動画像で構成される映像クリップの場合、時間的に隣り合うピクチャ同士での画像の内容の相関が高いことを利用し、過去のピクチャの発生符号量に基づいて現在のピクチャの発生符号量の割り当てを行うなどの方法がある。より具体的には、MPEG規格に代表される動き補償とDCT(Discrete Cosine Transform)を組み合わせる画像圧縮方式では、エンコーダで圧縮されるビットストリームが所望のレートになるように、符号量制御を行なっている。ここで、符号量制御は、以前の量子化ステップと符号量の関係と現在の平均レートに基づいて、量子化ステップをフィードバック制御する方式が一般的である。すなわち、現在の平均レートが目標レートよりも高めならば量子化ステップを若干粗めにして総符号量を下げ、現在の平均レートが低めならば量子化ステップを若干細かくして総符号量を増やす。このようにして平均的に目標レートを実現するように制御を行うことで、細かい時間で見ると総符号量は増減するものの、長い時間で見ると平均的に総符号量を目標値とすることができる。また、この符号量制御は、MPEG2でテストモデルとして提案されている。すなわち、ここでの符号量制御は、仮想バッファの残量と、以前エンコードした際の発生符号量の関係を用いてフィードバック制御することによって、行われている。 For example, in the case of a video clip composed of normal moving images, the current picture is generated based on the generated code amount of the past picture by utilizing the fact that the correlation of the image contents between temporally adjacent pictures is high. There is a method of assigning a code amount. More specifically, in an image compression method that combines motion compensation represented by the MPEG standard and DCT (Discrete Cosine Transform), code amount control is performed so that the bit stream compressed by the encoder has a desired rate. Yes. Here, the code amount control is generally a method in which the quantization step is feedback-controlled based on the relationship between the previous quantization step and the code amount and the current average rate. That is, if the current average rate is higher than the target rate, the quantization step is slightly roughened to lower the total code amount, and if the current average rate is lower, the quantization step is slightly finer to increase the total code amount. . By controlling to achieve the target rate on average in this way, the total code amount will increase or decrease when viewed in fine time, but the average code amount will be set as the target value when viewed over a long time. Can do. This code amount control is proposed as a test model in MPEG2. That is, the code amount control here is performed by performing feedback control using the relationship between the remaining amount of the virtual buffer and the generated code amount at the time of previous encoding.
 しかし、上記のフィードバック制御による符号量制御では、例えば隣り合うピクチャ同士で画像の内容の相関が低くなるシーンチェンジのような箇所では瞬間的にレートが大きくなる。そのため、アプリケーションによっては再生画像に破綻を来たしてしまったり、十分な符号量を割り当てることができず極端な画質劣化が生じてしまったりする。 However, in the code amount control by the above feedback control, for example, the rate is instantaneously increased at a location such as a scene change where the correlation of the image contents between adjacent pictures is low. Therefore, depending on the application, the reproduced image may be broken, or a sufficient amount of code cannot be allocated, resulting in extreme image quality degradation.
 以上の例からもわかるように、後続する映像クリップを撮影する際に、エンコーダ903が使用可能なバッファ残量が少ない場合には、本来割り当てたい符号量が割り当てられず、画質劣化を招いてしまうという課題がある。 As can be seen from the above example, if the remaining amount of buffer that can be used by the encoder 903 is small when a subsequent video clip is shot, the code amount that is originally intended to be allocated is not allocated, resulting in image quality degradation. There is a problem.
 また、シームレスにつながらない(AVCHD規格ではCC=1)映像クリップの再生では、デコードバッファが破綻しないようにするため、先行の映像クリップのデコードを完了した後に、後続の映像クリップをデコードする必要がある。そのため、先行の映像クリップの最終フレームの映像が継続されることを回避するために、再生装置のデコーダが次の映像クリップを先読みして処理できるようにする。しかし、これは、デコード終了後、かつ、表示前の画像保持バッファと、デコーダが1フレーム分デコードするにかかる時間分、かつ、デコードに必要な参照画像枚数分のバッファメモリとが必要となるため、大幅なコスト高となってしまう。 In addition, when reproducing a video clip that is not seamlessly connected (CC = 1 in the AVCHD standard), it is necessary to decode the subsequent video clip after completing the decoding of the previous video clip in order to prevent the decoding buffer from failing. . Therefore, in order to prevent the video of the last frame of the preceding video clip from being continued, the decoder of the playback device can pre-read and process the next video clip. However, this requires an image holding buffer before decoding and before display, and a buffer memory for the time required for the decoder to decode one frame and for the number of reference images necessary for decoding. It will be a significant cost increase.
 それに対して、これを解決する手段として、AVCHD規格には映像クリップをシームレスにつなげるための規格(CC=5)がある。しかし、上記のような映像クリップの再生では、デコード時のバッファが破綻しないことをエンコード時に保障する必要がある。そのため、TVなどの再生装置のデコーダがシーン間の接合点をシームレス再生できるように、記録(符号化)時にあらかじめデコーダの処理を意識したシーン接合アルゴリズムを規格に則って実装する必要がある。特に各種シーンや、ユースケースに応じた対応を行う場合、シーン接合点の映像情報に十分な符号量が割り当てることができずに画質が劣化してしまう。さらに、デコード時のバッファ破綻、または、エンコードもしくはデコード処理時間性能破綻を起こさないようにする必要もあり、開発の難易度が極めて高くなってしまう。 On the other hand, as a means for solving this, the AVCHD standard includes a standard (CC = 5) for seamlessly connecting video clips. However, in the reproduction of the video clip as described above, it is necessary to ensure at the time of encoding that the buffer at the time of decoding does not fail. For this reason, it is necessary to implement a scene joining algorithm that is conscious of decoder processing in advance at the time of recording (encoding) so that a decoder of a playback device such as a TV can seamlessly reproduce the junction between scenes. In particular, when dealing with various scenes and use cases, a sufficient code amount cannot be assigned to the video information at the scene junction point, and the image quality deteriorates. Furthermore, it is necessary to prevent a buffer failure at the time of decoding or an encoding or decoding processing time performance failure, and the development difficulty becomes extremely high.
 さらに、上記のシームレス再生では、撮影再開前後の撮影画像を無機質に接続するのみで、あたかもイベントや放送でプロカメラマンが撮影する映像には程遠いという課題もある。編集機器を用いた加工手段、特殊効果を加えて再生する再生機はあるが、機器自体が極めて高価である。また、AVCHDなどH.264フォーマットの動画ストリームは、AV機器との親和性は高いものの、ストリームのフォルダ構成が複雑であると同時に、必要とするプロセッサの性能面でも高い性能を要し、PCとの親和性はあまり高いものではない。そのため、PCの編集ソフトを用いるとしても高い映像効果を得るためには必要性能を満たす環境が必要であり、事実上、普及しているとはいえない。つまり汎用性がない。また、編集は、撮影が終わった後で、撮影とは別の時間、別の場所で実施されるため、すぐに視聴し楽しむことができないという課題もある。 Furthermore, with the seamless playback described above, there is a problem that it is far from the images taken by professional photographers at events and broadcasts, simply by connecting the captured images before and after resuming shooting to the inorganic material. There are processing means using editing equipment and playback machines that reproduce by adding special effects, but the equipment itself is very expensive. Also, AVCHD and other H.264. Although the H.264 format video stream has high compatibility with AV equipment, the stream folder structure is complex, and at the same time, it requires high performance in terms of the required processor performance, and has high compatibility with PCs. It is not a thing. For this reason, even if PC editing software is used, an environment that satisfies the required performance is necessary in order to obtain a high video effect, and it cannot be said that it is practically used. That is, there is no versatility. In addition, since editing is performed after shooting is completed at a different location and time from shooting, there is a problem that viewing and enjoyment cannot be performed immediately.
 本発明は、上述の事情に鑑みてなされたものであり、シーンチェンジ箇所でも、符号化処理での発生符号量の制御を容易かつ適切に行い、容易にシームレス再生可能な情報処理装置及び情報処理方法を提供することを目的とする。 The present invention has been made in view of the above-described circumstances, and an information processing apparatus and information processing capable of easily and appropriately controlling the amount of generated code in an encoding process even at a scene change location and easily seamlessly reproducing. It aims to provide a method.
 上記目的を達成するために、本発明の一態様に係る情報処理装置は、撮影装置において撮影の開始及び停止を複数回繰り返すことにより撮影された複数の映像クリップを、符号化して記録する情報処理装置であって、前記撮影装置により撮影された複数の撮影画像で構成される映像クリップを保持する映像保持部と、前記撮影装置の撮影の開始から停止までに得た一つの映像クリップのうち、終端の撮影画像をクリップ終端画像として保持する終端画像保持部と、前記クリップ終端画像と、次の撮影の開始時点に撮影された撮影画像とが入力され、入力された前記クリップ終端画像と、入力された当該次の撮影の開始時点に撮影された撮影画像とをブレンドして画像データを生成するブレンド部と、前記終端画像保持部からの前記クリップ終端画像の入力と、前記映像保持部からの当該次の撮影の開始時点に撮影された撮影画像の入力との同期をとる同期制御部と、前記ブレンド部を制御情報により制御するブレンド制御部と、前記ブレンド部により生成された画像データを符号化するとともに、前記制御情報に基づき符号化する画像データの符号量を制御する符号化処理部とを備える。 In order to achieve the above object, an information processing apparatus according to an aspect of the present invention encodes and records a plurality of video clips shot by repeating start and stop of shooting a plurality of times in a shooting apparatus. A video holding unit that holds a video clip composed of a plurality of shot images shot by the shooting device, and one video clip obtained from the start to the stop of shooting of the shooting device, An end image holding unit that retains an end shot image as a clip end image, the clip end image, and a shot image shot at the start of the next shooting, and the input clip end image and input A blend unit that generates image data by blending the captured image captured at the start time of the next capturing, and the clip from the terminal image holding unit A synchronization control unit that synchronizes input of an end image and input of a captured image captured at the start of the next capturing from the video holding unit, and a blend control unit that controls the blending unit by control information And an encoding processing unit that encodes the image data generated by the blending unit and controls the code amount of the image data to be encoded based on the control information.
 この構成により、シーンチェンジ箇所でも、符号化処理での発生符号量の制御を容易かつ適切に行い、容易にシームレス再生可能な情報処理装置を実現することができる。それにより、シーンチェンジのような箇所に関して画像の相関を持たせることができるだけでなく、符号化処理における発生符号量の制御を容易かつ適切に行うことができ、容易にシームレス再生することができる。 With this configuration, it is possible to easily and appropriately control the amount of generated code in the encoding process at a scene change location, and to realize an information processing apparatus that can be easily seamlessly reproduced. Thus, not only can the image be correlated with respect to a location such as a scene change, the generated code amount in the encoding process can be controlled easily and appropriately, and seamless reproduction can be easily performed.
 また、前記符号化処理部は、前記ブレンド部により生成された画像データを符号化する符号化部と、前記制御情報に基づき前記符号化部で符号化される画像データの符号量を制御する符号量制御部とを備え、前記次の撮影の開始時点では、前記符号量制御部は、前記制御情報に従って前記符号化部で符号化される符号量を制御し、前記符号化部は、前記符号量制御部により制御された符号量で、前記ブレンド部により生成された画像データを符号化し、前記撮影装置の撮影の開始後から停止まででは、前記符号量制御部は、前記制御情報に拠らないで、符号化処理単位毎に前記符号化部で符号化される符号量を制御し、前記符号化部は、前記符号量制御部により制御された符号量で、前記ブレンド部に入力された前記次の撮影の開始後に撮影された複数の撮影画像のみをブレンドすることにより前記ブレンド部で生成された画像データを符号化するとしてもよい。ここで、前記ブレンド制御部は、前記情報処理装置に予め設定された符号量に基づき、前記制御情報を決定するとしてもよいし、前記符号化処理部は、前記終端画像保持部が前記クリップ終端画像を保持する際に、前記クリップ終端画像の符号量を算出し、前記ブレンド制御部は、前記符号化処理部により算出された符号量に基づき、前記制御情報を決定するとしてもよい。 The encoding processing unit includes an encoding unit that encodes the image data generated by the blend unit, and a code that controls a code amount of image data encoded by the encoding unit based on the control information. An amount control unit, and at the start of the next shooting, the code amount control unit controls the amount of code encoded by the encoding unit according to the control information, and the encoding unit includes the code The image data generated by the blending unit is encoded with the code amount controlled by the amount control unit, and the code amount control unit depends on the control information from the start to the stop of the photographing of the photographing apparatus. Without controlling the code amount encoded by the encoding unit for each encoding processing unit, the encoding unit is input to the blend unit with the code amount controlled by the code amount control unit After the start of the next shooting The image data generated by the blending unit may be encoded by blending only a plurality of photographed images. Here, the blend control unit may determine the control information based on a code amount preset in the information processing apparatus, and the encoding processing unit may be configured such that the end image holding unit is the clip end point. When holding the image, the code amount of the clip end image may be calculated, and the blend control unit may determine the control information based on the code amount calculated by the encoding processing unit.
 それにより、撮影装置の次の撮影開始時には、制御情報に従って、符号化される符号量を制御することで、符号化処理での発生符号量の制御を容易かつ適切に行うことができる。 Thereby, at the start of the next photographing of the photographing apparatus, the generated code amount in the encoding process can be controlled easily and appropriately by controlling the encoded code amount according to the control information.
 また、前記ブレンド制御部は、前記符号化処理部により算出された現符号量の情報を用いて、前記ブレンド部に、前記クリップ終端画像と前記次の撮影の開始時点に撮影された撮影画像とをブレンドする割合を変化させて、前記画像データを生成させるとしてもよい。 Further, the blend control unit uses the information on the current code amount calculated by the encoding processing unit to cause the blend unit to capture the clip end image and a captured image captured at the start of the next capturing. The image data may be generated by changing the blending ratio.
 この構成により、クリップ終端画像と、次に撮影を開始する時点で撮影された撮影画像とを段階的にブレンドすることができる。 With this configuration, the clip end image can be blended step by step with the captured image captured at the time when the next capturing is started.
 また、前記ブレンド制御部は、前記符号化処理部が算出した符号量と、前記クリップ終端画像、及び、当該クリップ終端画像の1つ前の撮影画像の画像相関関係とに基づいて、前記制御情報を決定し、決定した前記制御情報により、前記ブレンド部に、前記クリップ終端画像及び当該次の撮影の開始時点に撮影された撮影画像とをブレンドする割合を変化させて、前記画像データを生成させるとしてもよい。 Further, the blend control unit is configured to control the control information based on the code amount calculated by the encoding processing unit, the clip end image, and the image correlation of the captured image immediately before the clip end image. In accordance with the determined control information, the blending unit is caused to generate the image data by changing a ratio of blending the clip end image and the captured image captured at the start of the next capturing. It is good.
 この構成により、クリップ終端画像と、次に撮影を開始する時点で撮影された撮影画像とをブレンドする割合の変化量の緩急を制御することで、画像の関連性の観点から不要な符号量の発生を抑制し、シームレス再生時のバッファ破綻をより抑制することもできる。 With this configuration, by controlling the rate of change in the ratio of blending the clip end image and the shot image shot at the next shooting start time, an unnecessary code amount from the viewpoint of image relevance is controlled. Occurrence can be suppressed, and buffer failure during seamless playback can be further suppressed.
 なお、本発明は、このような情報処理装置として実現することができるだけでなく、このような情報処理装置が備える特徴的な手段をステップとする情報処理方法として実現したり、それらのステップをコンピュータに実行させるプログラムとして実現したりすることもできる。そして、そのようなプログラムは、CD-R等の記録媒体やインターネット等の伝送媒体を介して配信することができるのは言うまでもない。 The present invention can be realized not only as such an information processing apparatus, but also as an information processing method using steps characteristic of the information processing apparatus. It can also be realized as a program to be executed. Needless to say, such a program can be distributed via a recording medium such as a CD-R or a transmission medium such as the Internet.
 本発明によれば、シーンチェンジ箇所でも、符号化処理での発生符号量の制御を容易かつ適切に行い、容易にシームレス再生可能な情報処理装置を実現することができる。 According to the present invention, it is possible to easily and appropriately control the amount of generated code in an encoding process even at a scene change location, and to realize an information processing apparatus that can be easily seamlessly reproduced.
 具体的には、動画は静止画とは異なり、瞬間ではなく時間軸に伴う映像の記録方法であるため、記録から停止までの画像をこの操作を繰り返しながら記録するものである。例えば、記録された画像(映像)を鑑賞する場合には、これをシームレスに再生するという要求があり、本発明によればAVCHDの規格において、これを極めて安価で容易に実現する方法を提供することができる。 Specifically, unlike a still image, a moving image is a video recording method that accompanies a time axis rather than a moment, and therefore, images from recording to stop are recorded while repeating this operation. For example, when a recorded image (video) is to be viewed, there is a demand to seamlessly reproduce the image. According to the present invention, a method for easily realizing this at an extremely low cost is provided in the AVCHD standard. be able to.
 さらに具体的には、情報記録媒体(記録メディア)に対するデータ記録を停止する際に記録停止指示までに得たデータを情報記録媒体に記録し、その後の1フレームや、1GOP(Group Of Pictures)といったごく短時間のデータ処理単位をバッファに保持し、後続記録開始時に記録予定のデータに重畳する。それにより、映像クリップ間の映像情報に相関を持たせることができ、映像クリップ間を跨いだ後続クリップ再生時のバッファ破綻を防止するとともに、画質劣化を防ぐシームレス再生を実現することができる。 More specifically, when data recording on the information recording medium (recording medium) is stopped, the data obtained up to the recording stop instruction is recorded on the information recording medium, and then one frame or 1 GOP (Group Of Pictures), etc. A very short data processing unit is held in the buffer, and is superposed on the data scheduled to be recorded at the start of subsequent recording. Thereby, it is possible to correlate the video information between the video clips, and it is possible to realize the seamless reproduction that prevents the buffer failure at the time of reproducing the subsequent clip across the video clips and prevents the deterioration of the image quality.
図1は、従来の撮像システムの機能構成を示す図である。FIG. 1 is a diagram illustrating a functional configuration of a conventional imaging system. 図2は、本発明に係る撮像システムの主要なブロック構成を示す図である。FIG. 2 is a diagram showing a main block configuration of the imaging system according to the present invention. 図3は、本発明に係る撮像システムの主要な機能構成を示す図である。FIG. 3 is a diagram showing a main functional configuration of the imaging system according to the present invention. 図4は、本発明に係る情報処理装置のシームレス再生を可能とする撮影画像の情報処理方法を説明するための図である。FIG. 4 is a diagram for explaining an information processing method for captured images that enables seamless reproduction of the information processing apparatus according to the present invention. 図5は、本発明に係る情報処理装置のシームレス再生を可能とする撮影画像の記録方法を説明するための図である。FIG. 5 is a diagram for explaining a method of recording a captured image that enables seamless reproduction of the information processing apparatus according to the present invention. 図6は、本発明に係る撮像システムの情報処理方法を説明するためのフローチャートである。FIG. 6 is a flowchart for explaining the information processing method of the imaging system according to the present invention. 図7は、本発明に係る情報処理装置を搭載する撮像装置の1例を示す図である。FIG. 7 is a diagram illustrating an example of an imaging apparatus equipped with the information processing apparatus according to the present invention. 図8は、本発明に係る情報処理装置を搭載する撮像装置の1例を示す図である。FIG. 8 is a diagram illustrating an example of an imaging apparatus equipped with the information processing apparatus according to the present invention.
 以下、本発明の実施の形態につき図面を参照して詳細に説明する。 Hereinafter, embodiments of the present invention will be described in detail with reference to the drawings.
 (実施の形態)
 図2は、本発明に係る撮影システムの主要なブロック構成を示す図であり、図3は、本発明に係る撮影システムの主要な機能構成を示す図である。
(Embodiment)
FIG. 2 is a diagram showing a main block configuration of the imaging system according to the present invention, and FIG. 3 is a diagram showing a main functional configuration of the imaging system according to the present invention.
 撮影装置101は、例えば、カメラ、カムコーダ等の動画像記録装置であり、複数の画像を撮影することにより映像クリップ(撮像画像群)を撮影する。具体的には、撮影装置101は、撮影者の指示による記録開始及び停止の繰り返しにより映像クリップ(撮像画像群)の記録を行う。 The image capturing apparatus 101 is a moving image recording apparatus such as a camera or a camcorder, for example, and captures a video clip (captured image group) by capturing a plurality of images. Specifically, the photographing apparatus 101 records a video clip (captured image group) by repeatedly starting and stopping recording in accordance with a photographer's instruction.
 記録メディア105は、例えば、SDメモリ、HDD、CD-R、DVD-R等の記録可能なメディアであり、情報処理装置100により、データ(ここでは画像データ)が記録される。 The recording medium 105 is a recordable medium such as an SD memory, HDD, CD-R, or DVD-R, for example, and data (here, image data) is recorded by the information processing apparatus 100.
 図2に示す情報処理装置100は、撮影装置101において撮影の開始及び停止を複数回繰り返すことにより撮影された複数の映像クリップを、符号化して記録する。具体的には、この情報処理装置100は、撮影装置101から入力された画像を例えばMPEG2、H264等の手法により符号化し、符号化した画像を例えば記録メディア105に記録する。なお、映像クリップは、複数の撮影画像で構成されている。 The information processing apparatus 100 shown in FIG. 2 encodes and records a plurality of video clips shot by repeating the start and stop of shooting a plurality of times in the shooting apparatus 101. Specifically, the information processing apparatus 100 encodes an image input from the photographing apparatus 101 by a technique such as MPEG2 or H264, and records the encoded image on the recording medium 105, for example. Note that a video clip is composed of a plurality of captured images.
 図2に示すこの情報処理装置100は、映像クリップ保持部102と、符号化処理部103と、ストリームバッファ104と、クリップ終端画像保持部201と、ブレンド部202と、同期制御部203と、ブレンド制御部204とを備える。 The information processing apparatus 100 illustrated in FIG. 2 includes a video clip holding unit 102, an encoding processing unit 103, a stream buffer 104, a clip end image holding unit 201, a blending unit 202, a synchronization control unit 203, a blend And a control unit 204.
 映像クリップ保持部102は、撮影装置101により撮影された数の撮影画像で構成される映像クリップを保持する。 The video clip holding unit 102 holds a video clip composed of the number of shot images shot by the shooting device 101.
 具体的には、撮影装置101により撮影された画像信号は信号処理を施され、撮影画像信号として保持する。また、映像クリップ保持部102は、撮影者により撮影装置101に停止指示がなされたタイミングにおいて、終端の画像情報を符号化するに十分な処理単位分の画像情報を保持する。ここで、終端の画像情報を符号化するに十分な処理単位分の画像情報とは、例えば、当該クリップ終端画像と参照関係にあるピクチャから構成される最小単位(1GOP)以上の時間分の複数の撮影画像すなわち停止指示後の1GOPなどの複数の撮影画像である。 Specifically, the image signal photographed by the photographing apparatus 101 is subjected to signal processing and held as a photographed image signal. In addition, the video clip holding unit 102 holds image information for a processing unit sufficient to encode the end image information at a timing when a stop instruction is given to the photographing apparatus 101 by the photographer. Here, the image information for the processing unit sufficient to encode the end image information is, for example, a plurality of times corresponding to a minimum unit (1 GOP) or more composed of pictures having a reference relationship with the clip end image. A plurality of captured images such as 1 GOP after a stop instruction.
 なお、映像クリップ保持部102は、撮影装置101の撮影の開始から停止までに得た一つの映像クリップを保持するとしてもよく、撮影装置101の撮影の開始及び停止を複数回繰り返すことにより撮影された複数の映像クリップを保持するとしてもよい。 Note that the video clip holding unit 102 may hold one video clip obtained from the start to the stop of shooting of the shooting apparatus 101, and is shot by repeating the start and stop of shooting of the shooting apparatus 101 a plurality of times. A plurality of video clips may be held.
 クリップ終端画像保持部201は、撮影装置101の撮影の開始から停止までに得た一つの映像クリップのうち、終端の撮影画像をクリップ終端画像として保持する。 The clip end image holding unit 201 holds a terminal shot image as a clip end image among one video clip obtained from the start to the stop of shooting by the shooting apparatus 101.
 具体的には、クリップ終端画像保持部201は、撮影者により撮影装置101に停止指示がなされたタイミングにおいて、映像クリップ保持部102が上記終端の画像情報を符号化するに十分な処理単位分の画像情報を保持した後、次の1フレームの画像情報をクリップ終端画像信号(クリップ終端画像)として保持する。 Specifically, the clip end image holding unit 201 corresponds to a processing unit sufficient for the video clip holding unit 102 to encode the end image information at a timing when a stop instruction is given to the photographing apparatus 101 by the photographer. After the image information is held, the next one frame of image information is held as a clip end image signal (clip end image).
 なお、クリップ終端画像信号(クリップ終端画像)とは、例えば、当該クリップ終端画像と参照関係にあるピクチャで構成される最小単位(1GOP)以上の時間分の複数の撮影画像であってもよい。また、クリップ終端画像信号(クリップ終端画像)は、例えば撮影装置101の撮影の開始から停止までに得た映像クリップ(複数の撮影画像)のうち、最終の撮影画像であって、1フレーム分の静止画像であってもよい。 Note that the clip end image signal (clip end image) may be, for example, a plurality of captured images for a time equal to or longer than the minimum unit (1 GOP) composed of pictures having a reference relationship with the clip end image. Further, the clip end image signal (clip end image) is, for example, a final shot image of video clips (a plurality of shot images) obtained from the start to the stop of shooting by the shooting apparatus 101, and is equivalent to one frame. It may be a still image.
 同期制御部203は、撮影を再開(次の撮影の開始)する際、クリップ終端画像保持部201及び映像クリップ保持部102を制御して、クリップ終端画像保持部201からのクリップ終端画像の入力と、映像クリップ保持部102からの当該次の撮影の開始時点に撮影された撮影画像の入力との同期をとる。 When resuming shooting (starting the next shooting), the synchronization control unit 203 controls the clip end image holding unit 201 and the video clip holding unit 102 to input the clip end image from the clip end image holding unit 201. Then, synchronization with the input of the photographed image taken at the start time of the next photography from the video clip holding unit 102 is performed.
 具体的には、同期制御部203は、撮影者による再度の撮影開始指示(撮影再開指示)によって、撮影装置101から映像クリップ保持部102に入力された実時刻を伴う撮影画像信号と、クリップ終端画像保持部201に保持されたクリップ終端画像信号と2つの入力の同期をとる。2つの入力は、ブレンド部202に送り込まれる。 Specifically, the synchronization control unit 203 receives the captured image signal with the actual time input from the image capturing apparatus 101 to the video clip holding unit 102 and the clip end point in response to a recapture start instruction (capture resumption instruction) by the photographer. The clip end image signal held in the image holding unit 201 is synchronized with the two inputs. The two inputs are sent to the blend unit 202.
 ブレンド部202は、ブレンド実行部2021と、フレームバッファ2022とを備え、入力された画像をブレンド(重畳)して画像データを生成する。具体的には、ブレンド部202は、撮影を再開(次の撮影の開始)する際、同期制御部203により同期をとり入力されたクリップ終端画像と撮影再開時点で撮影された撮影画像とを、制御情報に従いブレンドして画像データを生成する。つまり、ブレンド部202は、クリップ終端画像と、次の撮影の開始時点に撮影された撮影画像とが入力され、入力されたクリップ終端画像と次の撮影の開始時点に撮影された撮影画像とをブレンドして画像データを生成する。 The blend unit 202 includes a blend execution unit 2021 and a frame buffer 2022, and blends (superimposes) the input image to generate image data. Specifically, when resuming shooting (starting the next shooting), the blending unit 202 uses the clip end image input in synchronization by the synchronization control unit 203 and the shot image shot at the shooting restart point, Image data is generated by blending according to the control information. That is, the blending unit 202 receives the clip end image and the shot image shot at the start of the next shooting, and inputs the clip end image and the shot image shot at the start of the next shooting. Blend to generate image data.
 ブレンド実行部2021は、ブレンド制御部204が出力する制御情報により制御され、制御情報に示される重み付けがなされた重畳の割合(ブレンド割合を示す値α)に従って、2つの入力をブレンドして画像データを生成する。ブレンド実行部2021は、生成した画像データをフレームバッファ2022に格納する。具体的には、ブレンド実行部2021は、図3に示すように、映像クリップ保持部102が保持する撮影画像と、クリップ終端画像保持部201に保持されたクリップ終端画像とをα:(1-α)の割合で混ぜ合わせるすなわちブレンド(重畳)する。 The blend execution unit 2021 is controlled by the control information output from the blend control unit 204, and blends the two inputs according to the weighted superimposition ratio (value α indicating the blend ratio) indicated in the control information to generate image data. Is generated. The blend execution unit 2021 stores the generated image data in the frame buffer 2022. Specifically, as shown in FIG. 3, the blend execution unit 2021 converts the captured image held by the video clip holding unit 102 and the clip end image held by the clip end image holding unit 201 to α: (1- Mix at a ratio of α), that is, blend (superimpose).
 ここで、例えばブレンド割合を示す値αは、0から1の値をとる。ブレンド実行部2021は、ブレンド割合が0の値を示す場合は、2つの入力のうち、クリップ終端画像保持部201に保持されたクリップ終端画像のみをブレンドし、クリップ終端画像のみの画像データをブレンド後の映像情報として生成する。一方、ブレンド割合が1の値を示す場合は、映像クリップ保持部102が保持する撮影画像のみをブレンドし、撮影画像のみの画像データをブレンド後の映像情報として生成する。このように、ブレンド割合を示す値αは、次の撮影を開始(撮影再開)する際、取りうるαの値によって、クリップ終端画像と、次に撮影を開始する時点で撮影された撮影画像とをブレンドする割合を決めるものである。 Here, for example, the value α indicating the blend ratio takes a value from 0 to 1. When the blend ratio indicates a value of 0, the blend execution unit 2021 blends only the clip end image held in the clip end image holding unit 201 out of the two inputs, and blends the image data of only the clip end image. Generated as later video information. On the other hand, when the blend ratio indicates a value of 1, only the captured image held by the video clip holding unit 102 is blended, and image data of only the captured image is generated as the video information after blending. In this way, the value α indicating the blend ratio is determined by the value of α that can be taken when starting the next shooting (resuming the shooting), the clip end image, and the shot image shot at the next shooting start time. The ratio of blending is determined.
 なお、ブレンド割合を示す値αは、撮影者による再度の撮影開始指示によって0に初期化されることとしてもよいし、ブレンド割合を示す値αは、撮影者による撮影停止指示によって0に初期化されるとしても良い。 It should be noted that the value α indicating the blend ratio may be initialized to 0 by a re-shooting start instruction from the photographer, and the value α indicating the blend ratio is initialized to 0 by a shooting stop instruction from the photographer. It may be done.
 また、撮影装置101の電源がオンされた後など、初めの撮影開始指示においては、ブレンドを実行しない、すなわちブレンド割合を示す値αの値を1とするとしてもよい。また、撮影装置101の電源がオンされるシステムブート時には、ブレンド割合を示す値αの値を1とする代わりに、ROMデータなどを用いてあらかじめクリップ終端画像保持部201で別途準備されている白色及び/または黒色などの固定画像をブレンドするなどを行っても良い。 Also, in the first shooting start instruction such as after the power of the shooting apparatus 101 is turned on, blending is not executed, that is, the value α indicating the blend ratio may be set to 1. Also, at the time of system boot when the power of the photographing apparatus 101 is turned on, instead of setting the value α indicating the blend ratio to 1, white data separately prepared in advance by the clip end image holding unit 201 using ROM data or the like And / or a fixed image such as black may be blended.
 このようにして、クリップ終端画像と、次に撮影を開始する時点で撮影された撮影画像とを段階的にブレンドすることができる。 In this way, the clip end image can be blended in a stepwise manner with the shot image taken at the next start of shooting.
 なお、ブレンド割合を示す値αの値は、例えば、符号量を含む制御情報に基づき、変化させてもよい。その場合、ブレンドする割合の変化量の緩急を制御することで、画像の関連性から不要な符号量の発生を抑制し、シームレス再生時のバッファ破綻をより抑制することもできる。 Note that the value α indicating the blend ratio may be changed based on, for example, control information including a code amount. In that case, by controlling the rate of change in the blend ratio, it is possible to suppress the generation of an unnecessary code amount from the relevance of the image, and to further suppress the buffer failure during seamless reproduction.
 フレームバッファ2022は、ブレンド実行部2021により生成された画像データを格納している。格納されている画像データは、符号化処理部103により符号化処理時に読み出される。 The frame buffer 2022 stores the image data generated by the blend execution unit 2021. The stored image data is read out by the encoding processing unit 103 during the encoding process.
 ブレンド制御部204は、ブレンド部202を制御情報により制御する。具体的には、ブレンド制御部204は、符号量制御部1031が算出した符号量と、クリップ終端画像及び当該クリップ終端画像の1つ前の撮影画像の画像相関関係とに基づいて、制御情報を決定する。そして、決定した制御情報により、ブレンド部202に、クリップ終端画像及び次に撮影を開始する時点で撮影された撮影画像とをブレンドする割合を変化させて、画像データを生成させる。 The blend control unit 204 controls the blend unit 202 with control information. Specifically, the blend control unit 204 determines the control information based on the code amount calculated by the code amount control unit 1031 and the image correlation between the clip end image and the captured image immediately before the clip end image. decide. Then, according to the determined control information, the blend unit 202 changes the ratio of blending the clip end image and the photographed image taken at the next photographing start time to generate image data.
 なお、ブレンド制御部204は、例えば情報処理装置100に予め設定された符号量に基づき、制御情報を決定するとしてもよい。また、ブレンド制御部204は、例えばクリップ終端画像保持部201がクリップ終端画像を保持する際に、符号量制御部1031によりそのクリップ終端画像の符号量が算出された符号量に基づき、制御情報を決定するとしてもよい。また、ブレンド制御部204は、例えば、符号量制御部1031により算出された現符号量の情報を用いて、ブレンド部202に、クリップ終端画像と次に撮影を開始する時点で撮影された撮影画像とをブレンドの割合を変化させて、画像データを生成させるとしてもよい。 Note that the blend control unit 204 may determine control information based on a code amount preset in the information processing apparatus 100, for example. Further, the blend control unit 204, for example, when the clip end image holding unit 201 holds the clip end image, the control information is obtained based on the code amount calculated by the code amount control unit 1031 of the clip end image. It may be decided. Also, the blend control unit 204 uses the information on the current code amount calculated by the code amount control unit 1031, for example, to the blend unit 202, and the captured image captured at the time when the next capturing starts. The image data may be generated by changing the blend ratio.
 符号化処理部103は、符号量制御部1031と、量子化部1032と、符号化部1033とを備え、ブレンド部202により生成された画像データを符号化するとともに、制御情報に基づき符号化する画像データの符号量を制御する。 The encoding processing unit 103 includes a code amount control unit 1031, a quantization unit 1032, and an encoding unit 1033. The encoding processing unit 103 encodes the image data generated by the blend unit 202 and encodes it based on the control information. Controls the code amount of image data.
 符号量制御部1031は、制御情報に基づき符号化処理部103で符号化される画像データの符号量を制御する。具体的には、符号量制御部1031は、次の撮影の開始時点では、制御情報に従って符号化部1033で符号化される符号量を制御し、撮影装置101の撮影の開始後から停止までは、制御情報に拠らないで、符号化処理単位毎に符号化部1033で符号化される符号量を制御する。 The code amount control unit 1031 controls the code amount of the image data encoded by the encoding processing unit 103 based on the control information. Specifically, the code amount control unit 1031 controls the code amount encoded by the encoding unit 1033 according to the control information at the start of the next shooting, and from the start of shooting of the shooting apparatus 101 to the stop thereof. The amount of code encoded by the encoding unit 1033 is controlled for each encoding processing unit without depending on the control information.
 符号化部1033は、ブレンド部202により生成された画像データを符号化する。具体的には、符号化部1033は、次の撮影の開始時点では、符号量制御部1031により制御された符号量で、ブレンド部202により生成された画像データを符号化し、撮影装置101の撮影の開始後から停止までは、符号量制御部1031により制御された符号量で、ブレンド部202に入力された前記次の撮影の開始後に撮影された複数の撮影画像のみをブレンドすることによりブレンド部202で生成された画像データを符号化する。 The encoding unit 1033 encodes the image data generated by the blending unit 202. Specifically, the encoding unit 1033 encodes the image data generated by the blend unit 202 with the code amount controlled by the code amount control unit 1031 at the start of the next imaging, and the imaging device 101 performs imaging. From the start to the stop, the blend unit is obtained by blending only a plurality of photographed images taken after the start of the next photographing input to the blend unit 202 with the code amount controlled by the code amount control unit 1031. The image data generated in 202 is encoded.
 ここで、符号量は、上述したように、予め設定した少ない符号量であってもよいし、符号量制御部1031がクリップ終端画像の符号量を予め算出することで得た符号量であってもよい。 Here, as described above, the code amount may be a small code amount set in advance, or may be a code amount obtained by the code amount control unit 1031 calculating the code amount of the clip end image in advance. Also good.
 ストリームバッファ104は、符号化処理部103によって記録メディア105へ記録できる形式に符号化された複数の映像クリップを、一時蓄積する。そして、一時蓄積された符号化された複数の映像クリップは、記録メディア105へ記録される。 The stream buffer 104 temporarily stores a plurality of video clips encoded in a format that can be recorded on the recording medium 105 by the encoding processing unit 103. The plurality of encoded video clips temporarily stored are recorded on the recording medium 105.
 以上のように構成された情報処理装置100は、撮影装置101から入力された撮影画像信号を保持する映像クリップ保持部102と、クリップ終端画像信号を保持するクリップ終端画像保持部201と、クリップ終端画像保持部201のクリップ終端画像信号と映像クリップ保持部102の撮影画像信号とを入力に持つブレンド部202と、その2つの入力の同期をとる同期制御部203と、ブレンド部202を制御するブレンド制御部204と、符号化処理部103とを備え、符号化処理部103には、ブレンド制御部204の制御情報により符号量を制御する符号量制御部1031を有し、次に撮影を開始(撮影再開)した時点の撮影画像とクリップ終端画像とをブレンドして生成した画像データ(映像)を記録することを特徴とする。この構成により、シーンチェンジのような画像の相関が低い点においても、シームレス再生を安価でかつ容易に実現でき、さらには、高い映像効果を得ることができる。 The information processing apparatus 100 configured as described above includes a video clip holding unit 102 that holds a shot image signal input from the shooting apparatus 101, a clip end image holding unit 201 that holds a clip end image signal, and a clip end point. Blend unit 202 having as input the clip end image signal of image holding unit 201 and the captured image signal of video clip holding unit 102, synchronization control unit 203 that synchronizes the two inputs, and blend that controls blend unit 202 The encoding unit 103 includes a control unit 204 and an encoding processing unit 103. The encoding processing unit 103 includes a code amount control unit 1031 that controls the code amount based on control information of the blend control unit 204, and then starts shooting ( It is characterized by recording the image data (video) generated by blending the captured image and the clip end image at the time of shooting resumption) . With this configuration, seamless reproduction can be easily realized at a low cost and a high video effect can be obtained even at a point where the correlation of images such as a scene change is low.
 なお、映像(動画像)は、瞬間ではなく時間軸に伴って連続した映像記録方法であるため、撮影者によって、撮影開始(記録開始)から撮影停止までの複数の撮影画像(映像クリップ)を、撮影の開始及び停止の操作を繰り返しながら記録するものである。ここでクリップ終端画像が、撮影者の撮影停止指示後、1フレーム、1GOPまたは1GOP以上の映像情報であるとすることで、撮影者の撮影停止指示までの複数の撮影画像(映像クリップ)をハードディスクや、半導体メモリなどの蓄積メディアへ保存するとしてもよい。それにより、撮影者の撮影意図通りの情報を十分に記録した上で、本発明の効果を得るためのブレンド用画像を保持することができる。 Note that video (moving images) is a video recording method that is continuous with the time axis, not the moment, so the photographer can take multiple shot images (video clips) from the start of shooting (recording start) to the stop of shooting. Recording is performed while repeating the start and stop operations of shooting. Here, by assuming that the clip end image is video information of one frame, 1 GOP or 1 GOP or more after the shooting stop instruction of the photographer, a plurality of shot images (video clips) up to the shooting stop instruction of the photographer are stored on the hard disk. Alternatively, it may be stored in a storage medium such as a semiconductor memory. Thereby, it is possible to hold an image for blending for obtaining the effect of the present invention after sufficiently recording information as intended by the photographer.
 図4は、本発明に係る情報処理装置のシームレス再生を可能とする撮像画像の情報処理方法を説明するための図である。 FIG. 4 is a diagram for explaining a captured image information processing method that enables seamless playback of the information processing apparatus according to the present invention.
 説明の便宜上、撮影者による撮影停止指示によって、符号化されかつ記録メディア105へ保存されるまでの撮影画像(画像データ)をn-1番目(nは自然数)、撮影開始指示(撮影再開指示)によって重畳したいクリップ終端画像を撮影画像(画像データ)のn番目(nは自然数)とする。 For convenience of explanation, the photographed image (image data) until being encoded and stored in the recording medium 105 according to the photographing stop instruction by the photographer is the (n−1) th (n is a natural number), the photographing start instruction (shooting restart instruction) The clip end image to be superimposed is set to the nth (n is a natural number) of the captured image (image data).
 まず、映像クリップ保持部102は、図4の(a)に示すように、撮影者により撮影装置101に撮影停止指示がなされたタイミングにおいて、終端の画像情報を符号化するに十分な処理単位分の画像情報、すなわち撮影装置101により撮影される撮影画像(n―1)までの映像クリップとその次の1フレームの画像情報である撮影画像(n)とを保持する。 First, as shown in FIG. 4A, the video clip holding unit 102 has a processing unit sufficient to encode the end image information at the timing when the photographing apparatus is instructed to stop photographing by the photographer. Image information, that is, a video clip up to a photographed image (n-1) photographed by the photographing apparatus 101 and a photographed image (n) which is image information of the next one frame are held.
 また、クリップ終端画像保持部201は、図4の(b)に示すように、撮影者により撮影装置101に撮影停止指示がなされたタイミングにおいて、映像クリップ保持部102が撮影画像(n-1)を保持した後、次の1フレームの画像情報すなわち撮影画像(n)をクリップ終端画像として保持する。 Further, as shown in FIG. 4B, the clip end image holding unit 201 is configured so that the video clip holding unit 102 takes the shot image (n−1) at the timing when the shooting is instructed to the shooting apparatus 101 by the photographer. After that, the image information of the next one frame, that is, the captured image (n) is held as the clip end image.
 ここで、図4の(c)は、映像クリップ保持部102は、撮影者により撮影装置101に次の撮影開始指示(撮影再開指示)がなされたタイミングにおいて、撮影装置101により撮影される複数の撮影画像(これから保持される映像クリップ)を示している。 Here, (c) of FIG. 4 shows that the video clip holding unit 102 has a plurality of images captured by the imaging device 101 at the timing when the next imaging start instruction (imaging restart instruction) is given to the imaging device 101 by the photographer. A photographed image (video clip to be held from now on) is shown.
 ブレンド部202は、次に撮影を開始する際すなわち撮影者により撮影装置101に次に撮影開始指示(撮影再開指示)がなされたタイミングにおいて、同期制御部203により同期をとり入力された撮影画像(n)のクリップ終端画像と撮影開始時点(撮影再開時点)で撮影された撮影画像とを、図4の(d)に示すように、制御情報に示されるブレンド割合を示す値αに従ってブレンドする。そして、符号化処理部103は、図4の(e)に示すように、画像データを生成する。 The blend unit 202 synchronizes and inputs the captured image (when the photographer next instructs the photographing apparatus 101 to perform photographing start (shooting restart instruction) when the photographing is started next (photographing restart instruction). The clip end image of n) and the shot image shot at the shooting start time (shooting restart time) are blended according to the value α indicating the blend ratio indicated in the control information, as shown in FIG. Then, the encoding processing unit 103 generates image data as shown in FIG.
 このようにして、情報処理装置100は、シーンチェンジのような箇所に関して画像の相関を持たせることができる。 In this way, the information processing apparatus 100 can correlate images with respect to parts such as scene changes.
 なお、説明を簡単にするため、クリップ終端画像(クリップ終端画像信号)が撮影停止指示直後の1フレームの静止画情報である場合について説明したが、それに限らない。クリップ終端画像(クリップ終端画像信号)がクリップ終端画像と参照関係にある複数のピクチャから構成される最小単位(1GOP)以上の時間分の複数の撮影画像すなわち動画の処理単位である1GOPの整数倍の映像情報でもよい。次に、その場合について説明する。 For the sake of simplicity, the case where the clip end image (clip end image signal) is one frame of still image information immediately after the shooting stop instruction has been described. However, the present invention is not limited to this. An integer multiple of 1 GOP, which is a processing unit of a plurality of captured images for a time equal to or longer than a minimum unit (1 GOP) composed of a plurality of pictures in which a clip end image (clip end image signal) has a reference relationship with the clip end image Video information. Next, the case will be described.
 図5は、本発明に係る情報処理装置のシームレス再生を可能とする撮像画像の情報処理方法を説明するための図である。図5は、クリップ終端画像(信号)が1GOPの整数倍の映像情報である場合について示している。つまり、クリップ終端画像(クリップ終端画像信号)が、画像圧縮処理に必要な参照関係を成立する最小単位である複数のピクチャであるGOPを単位とするフレーム(撮影画像)間の相関関係を保つ映像情報である場合について示している。なお、以下では、説明の便宜上、撮影者による撮影停止指示によって、符号化されかつ記録メディア105へ保存されるまでの複数の撮影画像をGOP(GOPデータ)のn-1番目(nは自然数)、次の撮影開始指示(撮影再開指示)によって重畳したい1GOPの整数倍の動画映像情報であるクリップ終端画像をGOP(GOPデータ)のn番目(nは自然数)とする。 FIG. 5 is a diagram for explaining an information processing method for captured images that enables seamless reproduction of the information processing apparatus according to the present invention. FIG. 5 shows a case where the clip end image (signal) is video information that is an integral multiple of 1 GOP. That is, a video in which a clip end image (clip end image signal) maintains a correlation between frames (captured images) in units of GOPs, which are a plurality of pictures as a minimum unit that establishes a reference relationship necessary for image compression processing. The case of information is shown. In the following, for convenience of explanation, a plurality of photographed images that have been encoded and stored in the recording medium 105 in accordance with a photographing stop instruction from a photographer are represented as n−1th (n is a natural number) of GOP (GOP data). Then, the clip end image, which is the moving image information of an integral multiple of 1 GOP, to be superimposed by the next shooting start instruction (shooting restart instruction) is set to the nth (n is a natural number) of the GOP (GOP data).
 まず、映像クリップ保持部102は、図5の(a)に示すように、撮影者により撮影装置101に撮影停止指示がなされたタイミングにおいて、終端の画像情報を符号化するに十分な処理単位分の画像情報、すなわち撮影装置101により撮影されるGOP(n―1)までの複数の撮影画像(映像クリップ)とその次のGOPであるGOP(n)とを保持する。 First, as shown in FIG. 5A, the video clip holding unit 102 has a processing unit sufficient to encode the end image information at the timing when the photographing apparatus is instructed to stop the photographing apparatus 101. Image information, that is, a plurality of photographed images (video clips) up to GOP (n−1) photographed by the photographing apparatus 101 and GOP (n) which is the next GOP are held.
 また、クリップ終端画像保持部201は、図5の(b)に示すように、撮影者により撮影装置101に撮影停止指示がなされたタイミングにおいて、映像クリップ保持部102がGOP(n-1)を保持した後、次のGOPすなわちGOP(n)をクリップ終端画像として保持する。 In addition, as shown in FIG. 5B, the clip end image holding unit 201 causes the video clip holding unit 102 to set GOP (n−1) at the timing when the shooting is instructed to the shooting apparatus 101 by the photographer. After holding, the next GOP, that is, GOP (n) is held as the clip end image.
 ここで、図5の(c)は、映像クリップ保持部102は、撮影者により撮影装置101に次の撮影開始指示(撮影再開指示)がなされたタイミングにおいて、撮影装置101により撮影される複数の撮影画像(これから保持される映像クリップ)を示している。 Here, (c) of FIG. 5 shows that the video clip holding unit 102 has a plurality of images captured by the imaging apparatus 101 at the timing when the next imaging start instruction (imaging restart instruction) is given to the imaging apparatus 101 by the photographer. A photographed image (video clip to be held from now on) is shown.
 ブレンド部202は、次の撮影を開始する際すなわち撮影者により撮影装置101に次の撮影開始指示(撮影再開指示)がなされたタイミングにおいて、同期制御部203により同期をとり入力されたGOP(n)のクリップ終端画像と次の撮影開始時点(撮影再開時点)で撮影された撮影画像とを、図5の(d)に示すように、制御情報に示されるブレンド割合を示す値αに従ってブレンドする。そして、符号化処理部103は、図5の(e)に示すように、GOPデータ(画像データ)を生成する。 The blending unit 202 synchronizes and inputs the GOP (n) when the next photographing is started, that is, at the timing when the photographer gives the photographing device 101 a next photographing start instruction (shooting restart instruction). ) And the shot end image taken at the next shooting start time (shooting restart time), as shown in FIG. 5D, are blended according to the value α indicating the blend ratio indicated in the control information. . Then, the encoding processing unit 103 generates GOP data (image data) as shown in FIG.
 このようにして、情報処理装置100は、GOPのn番目からブレンディングされ、後に符号化されることにより、シーンチェンジのような箇所に関して画像の相関を持たせることができる。 In this way, the information processing apparatus 100 can have image correlation with respect to a part such as a scene change by being blended from the nth GOP and encoded later.
 次に、以上のように構成された撮像システムの情報処理方法を説明する。図6は、本発明に係る撮像システムの情報処理方法を説明するためのフローチャートである。 Next, an information processing method of the imaging system configured as described above will be described. FIG. 6 is a flowchart for explaining the information processing method of the imaging system according to the present invention.
 まず、撮影開始時、上記で説明したように例えば次の撮影開始時(撮影再開時)のようにブレンド機能が有効となる条件である場合(S101のYES)、設定されたブレンドの割合の初期値に基づき(S102)、ブレンド部202は、ブレンドを開始し、符号化処理部103は符号化を開始する(S103)。ここで、例えば、設定されたブレンドの割合の初期値は、0(α=0)である。 First, at the start of shooting, as described above, for example, when the blending function is valid, such as at the start of the next shooting (when shooting is resumed) (YES in S101), the initial blend ratio is set. Based on the value (S102), the blending unit 202 starts blending, and the encoding processing unit 103 starts encoding (S103). Here, for example, the initial value of the set blend ratio is 0 (α = 0).
 次に、ブレンド処理が終了か否かを判定する(S104)。 Next, it is determined whether or not the blending process is finished (S104).
 ブレンド処理が終了でないすなわちブレンド処理中の場合(S104のNO)、ブレンドの割合の最大値(例えば、ブレンドの割合を示す値α=1)でブレンド処理が終了となるまで、ブレンド量を変更(例えば、ブレンドの割合αを加算)する(S105)。ここで、ブレンド量を変更とは、ブレンドの割合を示す値αを例えば0.1刻みなどで加算または減算することで、ブレンドの割合を変更することを意味する。 If the blending process is not finished, that is, the blending process is in progress (NO in S104), the blend amount is changed until the blending process is finished at the maximum value of the blending ratio (for example, the value α = 1 indicating the blending ratio) ( For example, the blending ratio α is added) (S105). Here, changing the blend amount means changing the blend ratio by adding or subtracting the value α indicating the blend ratio in increments of 0.1, for example.
 そして、撮影者により撮影停止指示がなされる可能性があるため、撮影の停止(記録の停止)指示を監視するすなわち撮影終了か否かを監視する(S106)。撮影者により撮影停止指示がなされれば(S106のYES)、撮影者により撮影装置101に停止指示がなされたタイミングにおいて、映像クリップ保持部102が上記終端の画像情報を符号化するに十分な処理単位分の画像情報を保持した後、次の1フレームの画像情報をクリップ終端画像として保持する。 Then, since there is a possibility that the photographing stop instruction is given by the photographer, the photographing stop (recording stop) instruction is monitored, that is, whether or not the photographing is finished (S106). If a shooting stop instruction is issued by the photographer (YES in S106), processing sufficient for the video clip holding unit 102 to encode the end image information at the timing when the shooting instruction is given to the shooting apparatus 101 by the photographer. After holding the image information for the unit, the next one frame of image information is held as a clip end image.
 一方、ブレンド処理が終了すなわち撮影が継続されてブレンド処理が完了となった場合(S104のYES)、さらに、上記同様に撮影者による撮影停止指示を監視し(S106)、撮影停止指示があれば(S106のYES)、撮影者により撮影装置101に停止指示がなされたタイミングにおいて、映像クリップ保持部102が上述した終端の画像情報を符号化するに十分な処理単位分の画像情報を保持した後、次の1フレームの画像情報をクリップ終端画像として保持する(S107)。ここで、ブレンド処理が完了となった場合とは、ブレンドの割合を示す値α=1になった場合を意味する。 On the other hand, when the blending process is completed, that is, the photographing is continued and the blending process is completed (YES in S104), the photographing stop instruction by the photographer is further monitored in the same manner as described above (S106). (YES in S106), after the video clip holding unit 102 holds image information for a processing unit sufficient to encode the above-described end image information at the timing when the photographing apparatus 101 instructs the photographing apparatus 101 to stop. Then, the image information of the next one frame is held as a clip end image (S107). Here, the case where the blending process is completed means a case where the value α = 1 indicating the blending ratio is reached.
 以上の一連の処理フローによって、ブレンド処理を実施することができる。 The blending process can be performed by the above series of processing flows.
 以上のように、本発明によれば、シーンチェンジ箇所でも、符号化処理での発生符号量の制御を容易かつ適切に行い、容易にシームレス再生可能な情報処理装置及び情報処理方法を実現することができる。 As described above, according to the present invention, it is possible to easily and appropriately control the amount of generated code in an encoding process even at a scene change location, and to realize an information processing apparatus and information processing method that can be easily seamlessly reproduced. Can do.
 具体的には、撮影者によるカメラワークにおいて、例えばシーンチェンジのように、隣り合うピクチャ同士で画像の内容の相関が低くなることもある。このような動画像に対して、ピクチャ間の相関関係を利用し、過去のピクチャの発生符号量に基づいて現在のピクチャの発生符号量の割り当てを行う従来の方法があった。しかし、割り当てる発生符号量が不要に多くなり、符号量平滑化のための仮想バッファが溢れてしまう恐れがあり、シームレス再生を極めて困難にする一因であった。さらに、これを解決する手段であるAVCHD規格のCC=5接続手法、すなわちシーンチェンジ前の画像の符号化量とデコード時(復号時)のバッファ残量に基づいて、シーンチェンジ後の画像の符号量に対して、再生時にバッファ溢れがおこさないように、符号量に制限をかける手法でも、本来割り当てたい符号量に対して、符号量を十分に割り当てることができず、致命的な画質の劣化を招く恐れがあった。 Specifically, in the camera work by the photographer, the correlation between image contents may be low between adjacent pictures, such as a scene change. There is a conventional method for assigning the generated code amount of the current picture based on the generated code amount of the past picture to such a moving image using the correlation between the pictures. However, the amount of generated codes to be allocated increases unnecessarily, and the virtual buffer for code amount smoothing may overflow, which is one factor that makes seamless reproduction extremely difficult. Further, the CC = 5 connection method of the AVCHD standard as means for solving this, that is, the code of the image after the scene change based on the coding amount of the image before the scene change and the buffer remaining amount at the time of decoding (decoding) Even if the amount of code is limited so that the buffer does not overflow during playback, the amount of code cannot be sufficiently allocated for the amount of code originally intended to be allocated, resulting in fatal image quality degradation. There was a risk of inviting.
 それに対して、本発明は、記録停止後(撮影停止後)の記録再開箇所(次の撮影開始箇所)では、クリップ終端画像と、記録再開時(次の撮影開始)の映像クリップとの間では相関が低いと予測し、記録再開時(次の撮影開始)の映像クリップの先頭すなわち撮影の停止後の撮影画像であって、撮影再開時点に撮影される撮影画像とに対して、クリップ終端画像と相関の高い映像(画像)を重畳(ブレンド)する。それにより、必要以上の符号量の割り当てを不要とできるだけでなく、少ない符号量で十分な画質を再現できる。 On the other hand, in the present invention, at the recording resumption position (the next shooting start position) after the recording is stopped (after the shooting is stopped), between the clip end image and the video clip at the time of the recording restart (the next shooting start). A clip end image relative to the beginning of the video clip at the time of resuming recording (starting the next shooting), that is, the shot image after the shooting is stopped and shot at the time when the shooting is resumed. And superimpose (blend) video (image) with high correlation. Thereby, not only the unnecessary code amount allocation is unnecessary, but also a sufficient image quality can be reproduced with a small code amount.
 また、符号化処理部103には、ブレンド制御部204の制御情報に基づき符号量を制御する符号量制御部1031を有している。符号化処理部103では、記録再開指示(撮影再開指示)直後にブレンドされ生成された後の画像データを符号化する際に、符号量制御部1031によって、例えば予め設定した少ない符号量を指定することで、符号化部1033で符号化処理を実施する。それにより、再生時のバッファあふれがないだけでなく、画質劣化を発生することなく、シームレス再生を実現することができる。なお、符号量制御部1031は、クリップ終端画像保持部201がクリップ終端画像を保持する際に、そのクリップ終端画像の符号量を予め算出しておき、記録再開指示後の符号量として用いても良い。 Also, the encoding processing unit 103 includes a code amount control unit 1031 that controls the code amount based on the control information of the blend control unit 204. In the encoding processing unit 103, when encoding the image data that has been blended and generated immediately after the recording resumption instruction (shooting resumption instruction), the code amount control unit 1031 specifies, for example, a small code amount set in advance. Thus, the encoding unit 1033 performs encoding processing. Thereby, not only there is no buffer overflow at the time of reproduction, but also seamless reproduction can be realized without causing image quality degradation. When the clip end image holding unit 201 holds the clip end image, the code amount control unit 1031 may calculate the code amount of the clip end image in advance and use it as the code amount after the recording restart instruction. good.
 また、本発明の情報処理装置及び情報処理方法によれば、映像クリップの相関を保つことによって、映像情報の不連続性を回避することができるので、記録時(符号化時)にあらかじめデコーダ(符号側)の処理を意識することが不要となる。それにより、安価で、容易にシームレス再生を実現することができる。 Also, according to the information processing apparatus and information processing method of the present invention, it is possible to avoid the discontinuity of the video information by maintaining the correlation between the video clips. It becomes unnecessary to be aware of the processing on the code side. Accordingly, seamless reproduction can be realized easily at a low cost.
 さらに、本発明の情報処理装置及び情報処理方法によれば、映像クリップ間のシームレス再生においては、高価な編集機器を用いた編集が不要で、かつ、撮影完了と同時にクロスフェードの高い映像効果を提供できる。また、撮影者が動画を撮影する際に行う基本動作フローを変更することなく、当該の効果を得ることができる。さらには、動画撮影機器の基本となる機能の組み合わせで実現でき、比較的容易に、安価で極めて高い効果を得ることができるという効果を奏する。 Furthermore, according to the information processing apparatus and the information processing method of the present invention, in seamless playback between video clips, editing using an expensive editing device is unnecessary, and a video effect with a high crossfade is obtained at the same time as shooting is completed. Can be provided. Further, the effect can be obtained without changing the basic operation flow performed when the photographer shoots a moving image. Furthermore, it can be realized by a combination of the functions that are the basis of the moving image photographing device, and has an effect that it can be obtained relatively easily, inexpensively, and extremely high effects.
 なお、本発明の情報処理装置及び情報処理方法によって得た映像クリップの画像データにおいて、ブレンド処理された部分の画像データ直後(接合部分)の先頭位置をAVCHD規格のプレイリストとして管理する管理部を備え、その管理部がそれら先頭位置の時間情報を管理することによって頭だし位置を選択できるとしてもよい。 In addition, in the image data of the video clip obtained by the information processing apparatus and the information processing method of the present invention, a management unit that manages the head position immediately after the image data of the blended part (joined part) as a playlist of the AVCHD standard And the management unit may select the head position by managing the time information of the head positions.
 以上、本発明の情報処理装置及び情報処理方法について、実施の形態に基づいて説明したが、本発明は、この実施の形態に限定されるものではない。本発明の趣旨を逸脱しない限り、当業者が思いつく各種変形を本実施の形態に施したものや、異なる実施の形態における構成要素を組み合わせて構築される形態も、本発明の範囲内に含まれる。また、上述した情報処理装置を搭載する撮像装置も本発明の範囲内に含まれる。 As described above, the information processing apparatus and the information processing method of the present invention have been described based on the embodiment, but the present invention is not limited to this embodiment. Unless it deviates from the meaning of this invention, the form which carried out the various deformation | transformation which those skilled in the art can think to this embodiment, and the structure constructed | assembled combining the component in different embodiment is also contained in the scope of the present invention. . In addition, an imaging apparatus equipped with the above-described information processing apparatus is also included in the scope of the present invention.
 なお、この撮像装置は、例えば図7に示すようなカメラ(デジタルスチルカメラ)または図8に示すカムコーダなどであり、録画及び一時停止(撮影開始及び撮影停止)の繰り返しにより所望のシーンを撮影して記録できる機器である。 This imaging device is, for example, a camera (digital still camera) as shown in FIG. 7 or a camcorder as shown in FIG. 8, and captures a desired scene by repeating recording and pause (shooting start and shooting stop). Can be recorded.
 本発明は、情報処理装置及び情報処理方法に利用でき、特に、カメラやカムコーダ等の撮影装置を用いた家庭での撮影に用いられ、録画及び一時停止(撮影開始及び撮影停止)の繰り返しにより所望のシーンを撮影して記録する情報処理装置及び情報処理方法に利用することができる。 INDUSTRIAL APPLICABILITY The present invention can be used for an information processing device and an information processing method, and is particularly used for home photographing using a photographing device such as a camera or a camcorder, and is desired by repeatedly recording and pausing (starting photographing and stopping photographing). It can be used in an information processing apparatus and an information processing method for photographing and recording a scene.
  100、900 情報処理装置
  101、901 撮影装置
  102 映像クリップ保持部
  103 符号化処理部
  104、904 ストリームバッファ
  105、905 記録メディア
  201 クリップ終端画像保持部
  202 ブレンド部
  203 同期制御部
  204 ブレンド制御部
  902、907、2022 フレームバッファ
  903 エンコーダ
  906 デコーダ
  908 表示機器
  1031 符号量制御部
  1032 量子化部
  1033 符号化部
  2021 ブレンド実行部
100, 900 Information processing device 101, 901 Image capturing device 102 Video clip holding unit 103 Encoding processing unit 104, 904 Stream buffer 105, 905 Recording medium 201 Clip end image holding unit 202 Blending unit 203 Synchronization control unit 204 Blend control unit 902, 907, 2022 Frame buffer 903 Encoder 906 Decoder 908 Display device 1031 Code amount control unit 1032 Quantization unit 1033 Encoding unit 2021 Blend execution unit

Claims (11)

  1.  撮影装置において撮影の開始及び停止を複数回繰り返すことにより撮影された複数の映像クリップを、符号化して記録する情報処理装置であって、
     前記撮影装置により撮影された複数の撮影画像で構成される映像クリップを保持する映像保持部と、
     前記撮影装置の撮影の開始から停止までに得た一つの映像クリップのうち、終端の撮影画像をクリップ終端画像として保持する終端画像保持部と、
     前記クリップ終端画像と、次の撮影の開始時点に撮影された撮影画像とが入力され、入力された前記クリップ終端画像と、入力された当該次の撮影の開始時点に撮影された撮影画像とをブレンドして画像データを生成するブレンド部と、
     前記終端画像保持部からの前記クリップ終端画像の入力と、前記映像保持部からの当該次の撮影の開始時点に撮影された撮影画像の入力との同期をとる同期制御部と、
     前記ブレンド部を制御情報により制御するブレンド制御部と、
     前記ブレンド部により生成された画像データを符号化するとともに、前記制御情報に基づき符号化する画像データの符号量を制御する符号化処理部とを備える
     情報処理装置。
    An information processing apparatus that encodes and records a plurality of video clips shot by repeating the start and stop of shooting a plurality of times in a shooting device,
    A video holding unit for holding a video clip composed of a plurality of shot images shot by the shooting device;
    Of one video clip obtained from the start to the stop of shooting of the shooting device, a terminal image holding unit that holds a terminal shot image as a clip terminal image;
    The clip end image and a shot image taken at the start of the next shooting are input, and the input clip end image and the input shot image shot at the start of the next shooting are input. A blending unit that blends to generate image data;
    A synchronization control unit that synchronizes the input of the clip end image from the end image holding unit and the input of the shot image shot at the start of the next shooting from the video holding unit;
    A blend control unit for controlling the blend unit by control information;
    An information processing apparatus comprising: an encoding processing unit that encodes image data generated by the blending unit and controls a code amount of image data to be encoded based on the control information.
  2.  前記符号化処理部は、
     前記ブレンド部により生成された画像データを符号化する符号化部と、
     前記制御情報に基づき前記符号化部で符号化される画像データの符号量を制御する符号量制御部とを備え、
     前記次の撮影の開始時点では、
     前記符号量制御部は、前記制御情報に従って前記符号化部で符号化される符号量を制御し、前記符号化部は、前記符号量制御部により制御された符号量で、前記ブレンド部により生成された画像データを符号化し、
     前記撮影装置の撮影の開始後から停止まででは、
     前記符号量制御部は、前記制御情報に拠らないで、符号化処理単位毎に前記符号化部で符号化される符号量を制御し、前記符号化部は、前記符号量制御部により制御された符号量で、前記ブレンド部に入力された前記次の撮影の開始後に撮影された複数の撮影画像のみをブレンドすることにより前記ブレンド部で生成された画像データを符号化する
     請求項1に記載の情報処理装置。
    The encoding processing unit includes:
    An encoding unit for encoding the image data generated by the blend unit;
    A code amount control unit that controls a code amount of image data encoded by the encoding unit based on the control information;
    At the start of the next shooting,
    The code amount control unit controls a code amount encoded by the encoding unit according to the control information, and the encoding unit is generated by the blend unit with a code amount controlled by the code amount control unit. Encoded image data,
    From the start to the stop of shooting of the shooting device,
    The code amount control unit controls the code amount encoded by the encoding unit for each encoding processing unit without relying on the control information, and the encoding unit is controlled by the code amount control unit. The image data generated by the blending unit is encoded by blending only a plurality of captured images captured after the start of the next imaging input to the blending unit with the code amount thus obtained. The information processing apparatus described.
  3.  前記ブレンド制御部は、前記情報処理装置に予め設定された符号量に基づき、前記制御情報を決定する
     請求項1または2に記載の情報処理装置。
    The information processing apparatus according to claim 1, wherein the blend control unit determines the control information based on a code amount preset in the information processing apparatus.
  4.  前記符号化処理部は、前記終端画像保持部が前記クリップ終端画像を保持する際に、前記クリップ終端画像の符号量を算出し、
     前記ブレンド制御部は、前記符号化処理部により算出された符号量に基づき、前記制御情報を決定する
     請求項1または2に記載の情報処理装置。
    The encoding processing unit calculates a code amount of the clip end image when the end image holding unit holds the clip end image,
    The information processing apparatus according to claim 1, wherein the blend control unit determines the control information based on a code amount calculated by the encoding processing unit.
  5.  前記ブレンド制御部は、前記符号化処理部により算出された現符号量の情報を用いて、前記ブレンド部に、前記クリップ終端画像と前記次の撮影の開始時点に撮影された撮影画像とをブレンドする割合を変化させて、前記画像データを生成させる
     請求項1~4のいずれか1項に記載の情報処理装置。
    The blend control unit uses the information on the current code amount calculated by the encoding processing unit to blend the clip end image and the shot image shot at the start of the next shooting in the blend unit. The information processing apparatus according to any one of claims 1 to 4, wherein the image data is generated by changing a ratio to be generated.
  6.  前記ブレンド制御部は、前記符号化処理部が算出した符号量と、前記クリップ終端画像、及び、当該クリップ終端画像の1つ前の撮影画像の画像相関関係とに基づいて、前記制御情報を決定し、
     決定した前記制御情報により、前記ブレンド部に、前記クリップ終端画像、及び、当該次の撮影の開始時点に撮影された撮影画像とをブレンドする割合を変化させて、前記画像データを生成させる
     請求項1または2に記載の情報処理装置。
    The blend control unit determines the control information based on the code amount calculated by the encoding processing unit, the clip end image, and the image correlation of the photographed image immediately before the clip end image. And
    The image data is generated by changing the ratio of blending the clip end image and the captured image captured at the start time of the next capturing according to the determined control information. The information processing apparatus according to 1 or 2.
  7.  前記クリップ終端画像は、前記撮影装置の撮影の開始から停止までに得た複数の撮影画像で構成される映像クリップのうち、最終の撮影画像であって、1フレーム分の静止画像である
     請求項1~6のいずれか1項に記載の情報処理装置。
    The clip end image is a final photographed image of video clips composed of a plurality of photographed images obtained from the start to the stop of photographing by the photographing device, and is a still image for one frame. 7. The information processing apparatus according to any one of 1 to 6.
  8.  前記クリップ終端画像は、前記撮影装置の撮影の開始から停止までに得た複数の撮影画像で構成される映像クリップのうち、当該クリップ終端画像と参照関係にあるピクチャから構成される最小単位(1GOP)以上の時間分の複数の撮影画像である
     請求項1~7のいずれか1項に記載の情報処理装置。
    The clip end image is a minimum unit (1 GOP) composed of pictures having a reference relationship with the clip end image, out of video clips composed of a plurality of shot images obtained from the start to the stop of shooting by the shooting apparatus. The information processing apparatus according to any one of claims 1 to 7, wherein the information is a plurality of photographed images for the above time.
  9.  前記情報処理装置は、さらに、
     前記ブレンド部により前記撮影の開始時点にブレンド処理されて生成された画像データと当該画像データ後の画像データとの接合部分である先頭位置を管理する管理部を備える
     請求項1~8のいずれか1項に記載の情報処理装置。
    The information processing apparatus further includes:
    9. A management unit that manages a head position that is a joint portion between the image data generated by the blending process at the start time of the photographing by the blending unit and the image data after the image data. The information processing apparatus according to item 1.
  10.  請求項1~9のいずれか1項に記載の情報処理装置を搭載する
     撮像装置。
    An imaging apparatus equipped with the information processing apparatus according to any one of claims 1 to 9.
  11.  撮影装置において撮影の開始及び停止を複数回繰り返すことにより撮影された複数の映像クリップを、符号化して記録する情報処理方法であって、
     前記撮影装置により撮影された複数の撮影画像で構成される映像クリップを保持する映像保持ステップと、
     前記撮影装置の撮影の開始から停止までに得た一つの映像クリップのうち、終端の撮影画像をクリップ終端画像として保持する終端画像保持ステップと、
     前記クリップ終端画像と、次の撮影の開始時点に撮影された撮影画像とが入力され、入力された前記クリップ終端画像と、当該次の撮影の開始時点に撮影された撮影画像とをブレンドして画像データを生成するブレンドステップと、
     前記クリップ終端画像の入力と、当該次の撮影の開始時点に撮影された撮影画像の入力との同期をとる同期制御ステップと、
     前記ブレンドステップにおいて生成された画像データを符号化するとともに、前記制御情報に基づき符号化される画像データの符号量を制御する符号化処理ステップとを含む
     情報処理方法。
     
    An information processing method for encoding and recording a plurality of video clips shot by repeating the start and stop of shooting a plurality of times in a shooting device,
    A video holding step for holding a video clip composed of a plurality of shot images shot by the shooting device;
    Of one video clip obtained from the start to the stop of shooting by the shooting device, a terminal image holding step for holding a terminal shot image as a clip terminal image;
    The clip end image and a shot image taken at the start of the next shooting are input, and the input clip end image and the shot image shot at the start of the next shooting are blended. A blending step for generating image data;
    A synchronization control step for synchronizing the input of the clip end image and the input of the photographed image photographed at the start of the next photographing;
    An encoding processing step that encodes the image data generated in the blending step and controls a code amount of the image data encoded based on the control information.
PCT/JP2011/001725 2010-04-16 2011-03-24 Information processing device and information processing method WO2011129059A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
JP2010-095525 2010-04-16
JP2010095525 2010-04-16

Publications (1)

Publication Number Publication Date
WO2011129059A1 true WO2011129059A1 (en) 2011-10-20

Family

ID=44798448

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/JP2011/001725 WO2011129059A1 (en) 2010-04-16 2011-03-24 Information processing device and information processing method

Country Status (1)

Country Link
WO (1) WO2011129059A1 (en)

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003319259A (en) * 2002-04-24 2003-11-07 Matsushita Electric Ind Co Ltd Video effect control apparatus and video effect control method
JP2004120544A (en) * 2002-09-27 2004-04-15 Fuji Photo Film Co Ltd Device, program and method for editing image
JP2004248139A (en) * 2003-02-17 2004-09-02 Matsushita Electric Ind Co Ltd Imaging and recording apparatus
JP2007088932A (en) * 2005-09-22 2007-04-05 Matsushita Electric Ind Co Ltd Video audio recording apparatus and method
JP2007306258A (en) * 2006-05-10 2007-11-22 Sony Corp Unit and method for processing information, and computer program

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP2003319259A (en) * 2002-04-24 2003-11-07 Matsushita Electric Ind Co Ltd Video effect control apparatus and video effect control method
JP2004120544A (en) * 2002-09-27 2004-04-15 Fuji Photo Film Co Ltd Device, program and method for editing image
JP2004248139A (en) * 2003-02-17 2004-09-02 Matsushita Electric Ind Co Ltd Imaging and recording apparatus
JP2007088932A (en) * 2005-09-22 2007-04-05 Matsushita Electric Ind Co Ltd Video audio recording apparatus and method
JP2007306258A (en) * 2006-05-10 2007-11-22 Sony Corp Unit and method for processing information, and computer program

Similar Documents

Publication Publication Date Title
KR100906957B1 (en) Adaptive video processing using sub-frame metadata
TWI400939B (en) Adaptive video processing circuitry & player using sub-frame metadata
JP4791129B2 (en) Image coding apparatus, image coding method, and image editing apparatus
JP3529599B2 (en) Method for inserting editable point in encoding device and encoding device
JP2010035133A (en) Moving image encoding apparatus and moving image encoding method
JP5284074B2 (en) Image processing apparatus and image processing method
WO2003067881A1 (en) Information processing apparatus and method
JP2007158432A (en) Video recording device
KR20040010155A (en) Image data reproducing device and method
WO2011129059A1 (en) Information processing device and information processing method
JP2003052040A (en) Mpeg data reproduction equipment
JP2003052010A (en) Mpeg data recording method
JP2011049927A (en) Image processing device, and imaging apparatus with the same mounted thereon
JP2008258858A (en) Moving image coding device
JP3897783B2 (en) Image processing apparatus, control method therefor, computer program, and computer-readable storage medium
WO2004006573A1 (en) Image data processing device and method
JPH10108200A (en) Image coding method and its device
JP2007097146A (en) Method of printing still image and apparatus corresponding to printing request timing
JP2008005520A (en) Mpeg data recording and reproducing method
JP5084586B2 (en) Image recording / reproducing apparatus and image reproducing apparatus
JP5825976B2 (en) Encoding apparatus and method
JP2007325304A (en) Mpeg data recording and reproducing method
JP2006135446A (en) Stream reproducing device
JP2007300443A (en) Thumbnail generating device
JP2011040827A (en) Video recording/reproducing device, and video clip processing method

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 11768586

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 11768586

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: JP