WO2022227040A1 - Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium - Google Patents

Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium Download PDF

Info

Publication number
WO2022227040A1
WO2022227040A1 PCT/CN2021/091620 CN2021091620W WO2022227040A1 WO 2022227040 A1 WO2022227040 A1 WO 2022227040A1 CN 2021091620 W CN2021091620 W CN 2021091620W WO 2022227040 A1 WO2022227040 A1 WO 2022227040A1
Authority
WO
WIPO (PCT)
Prior art keywords
image
offset
imaging device
frequency
key
Prior art date
Application number
PCT/CN2021/091620
Other languages
French (fr)
Chinese (zh)
Inventor
李路
唐克坦
李广
朱传杰
邹文
于雄飞
Original Assignee
深圳市大疆创新科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市大疆创新科技有限公司 filed Critical 深圳市大疆创新科技有限公司
Priority to PCT/CN2021/091620 priority Critical patent/WO2022227040A1/en
Publication of WO2022227040A1 publication Critical patent/WO2022227040A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00Image analysis
    • G06T7/20Analysis of motion
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules

Definitions

  • the present application relates to the technical field of image processing, and in particular, to a video stabilization method, an imaging device, a handheld PTZ, a movable platform, and a storage medium.
  • the electronic anti-shake algorithm mainly refers to the use of forcibly increasing the photosensitive parameters of the photosensitive element (Charge coupled Device, CCD) on the imaging device to speed up the shutter, and analyzes the image obtained on the CCD, and then uses the edge image to compensate for anti-shake. technology.
  • CCD Charge coupled Device
  • the video or image after electronic anti-shake processing still has the phenomenon of picture shaking.
  • one of the objectives of the present application is to provide a video stabilization method, an imaging device, a handheld pan/tilt, a movable platform and a storage medium.
  • an embodiment of the present application provides a video stabilization method, including:
  • Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
  • an embodiment of the present application provides an imaging device, including an image sensor and one or more processors;
  • the image sensor is used to acquire a sequence of images
  • the one or more processors are individually or collectively configured to:
  • Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
  • an embodiment of the present application provides a handheld pan/tilt head, including an attitude sensor and the imaging device according to the second aspect; wherein, the attitude sensor is used to collect attitude data of the imaging device.
  • an embodiment of the present application provides a movable platform, including:
  • a power system mounted on the body, for driving the movable platform to move;
  • attitude sensor which is installed on the body and used to collect attitude data of the imaging device.
  • an embodiment of the present application provides a computer-readable storage medium, where the computer-readable storage medium stores executable instructions, and when the executable instructions are executed by a processor, the method according to the first aspect is implemented .
  • the video stabilization method considering the picture jitter problem in the image sequence caused by the low-frequency jitter of the imaging device, in addition to using the identified high-frequency jitter to perform image stabilization , and further identify the low-frequency shaking based on the relative motion between the image sequences, so as to use the identified low-frequency shaking to perform image stabilization, and eliminate the high-frequency shaking and low-frequency shaking of the imaging device in the image sequence.
  • the picture shakes, and the complete stabilization of the image sequence is achieved.
  • Fig. 1 and Fig. 6 are different schematic diagrams of the pan/tilt provided by the embodiment of the present application.
  • FIGS. 2 and 7 are schematic diagrams of a movable platform provided by an embodiment of the present application.
  • FIG. 3 and FIG. 4 are different schematic flowcharts of the video stabilization method provided by the embodiment of the present application.
  • FIG. 5 is a schematic structural diagram of an imaging device provided by an embodiment of the present application.
  • EIS electronic anti-shake technology is based on the data collected by the attitude sensor to eliminate the picture shaking phenomenon;
  • the accuracy of the sensor is limited, and it is impossible to collect attitude data about various shaking conditions of the imaging device.
  • the attitude sensor can collect attitude data about high-frequency shaking of the imaging device, but cannot collect attitude data about low-frequency shaking of the imaging device.
  • the video or image after the electronic anti-shake processing still has the phenomenon of picture shaking;
  • the EIS algorithm collects the posture of the imaging device from the posture sensor for a period of time before the exposure time.
  • the data is subjected to low-pass filtering.
  • the attitude data related to the low-frequency shaking of the imaging device will also be filtered out, resulting in the video or image after the electronic anti-shake processing.
  • the high-frequency shaking can be considered as the imaging device moving back and forth around a certain position, and the frequency of shaking occurs in a unit time is high;
  • the low-frequency shaking can be considered as the imaging device shifting in a certain direction , the frequency of shaking that occurs per unit time is low, and the low-frequency shaking is usually generated by the imaging device involuntary (or unconsciously).
  • an embodiment of the present application provides a video stabilization method. After acquiring an image sequence collected by an imaging device, the image sequence caused by the high-frequency jitter of the imaging device can be identified based on an attitude sensor. image jitter in the image sequence, and identify the image jitter in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences, and finally according to the high-frequency jitter and the low-frequency shaking The image sequence is subjected to image stabilization, so as to eliminate the image shake caused by the high-frequency shake and the low-frequency shake of the imaging device in the image sequence.
  • the relative motion of the imaging device is used to identify the low-frequency shaking, so as to use the identified low-frequency shaking to perform image stabilization, eliminate the picture shaking caused by the high-frequency shaking and low-frequency shaking of the imaging device in the image sequence, and realize the complete stabilization of the image sequence.
  • the video stabilization method may be applied to an imaging device, and the imaging device may be a physical imaging device.
  • the imaging device may be configured to detect electromagnetic radiation (eg, visible light, infrared light, and/or ultraviolet light) and generate image data based on the detected electromagnetic radiation.
  • Imaging devices may include charge coupled device (CCD) sensors or complementary metal oxide semiconductor (CMOS) sensors that generate electrical signals in response to wavelengths of light. The resulting electrical signals can be processed to generate image data.
  • the image data generated by the imaging device may include one or more images, which may be still images (eg, photographs), moving images (eg, videos), or a suitable combination thereof.
  • Image data may be multi-color (eg, RGB, CMYK, HSV) or monochrome (eg, grayscale, black and white, sepia).
  • the imaging device may include a lens configured to direct light onto the image sensor.
  • the imaging device may be a camera.
  • the camera may be a motion camera or a video camera that captures motion image data (eg, video).
  • the camera may be a still camera that captures still images (eg, photographs).
  • the camera can capture both moving image data and still images.
  • the camera can switch between capturing moving image data and still images. While certain embodiments provided herein are described in the context of a camera, it is to be understood that the present application may apply to any suitable imaging device, and that any description of a camera herein may also apply to any suitable imaging device. imaging devices, and any descriptions herein regarding cameras may also apply to other types of imaging devices.
  • a camera may be used to generate multiple 2D images of a 3D scene (eg, an environment, one or more objects, etc.).
  • a camera may include optical elements (eg, lenses, mirrors, filters, etc.). Cameras can capture color images, grayscale images, infrared images, and more.
  • the camera may be a thermal imaging device when it is configured to capture infrared images.
  • the imaging device may capture an image or series of images with a particular image resolution.
  • the image resolution may be defined by the number of pixels in the image.
  • the image resolution may be greater than or equal to about 352x420 pixels, or 720x480 pixels, or the like.
  • the camera may be a 4K camera or a camera with a higher resolution.
  • the imaging device may capture a series of images at a particular capture rate.
  • the series of images can be captured at a standard video frame rate, such as about 24p, 30p, 48p, 72p, 120p, 50i, or 60i.
  • the series of images can be captured at a rate of less than or equal to about one image every 0.0001 seconds, 0.002 seconds, 0.01 seconds, 0.1 seconds, 5 seconds, or 10 seconds.
  • the capture rate may vary based on user input and/or external conditions (eg, rain, snow, wind, subtle surface textures of the environment).
  • the imaging device may have a number of adjustable parameters. Imaging devices may capture different images with different parameters when subjected to the same external conditions (e.g., location, lighting). Adjustable parameters may include exposure (eg, exposure time, shutter speed, aperture, film speed), gain, brightening factor (gamma), region of interest, binning/subsampling, pixel clock, offset, trigger, ISO, and the like. Exposure-related parameters can control the amount of light reaching an image sensor in an imaging device. For example, shutter speed controls the amount of time light hits the image sensor, while aperture controls the amount of light that reaches the image sensor in a given time. A gain-related parameter can control the amplification of the signal from the optical sensor. ISO controls the level of sensitivity of the camera to the available light.
  • the imaging device may be a handheld device or the imaging device may also be mounted on a movable platform.
  • the imaging device may be a handheld camera or the imaging device may be installed on a handheld pan/tilt, for example, the imaging device may be detachably connected to the handheld pan/tilt, or the imaging device may be detachably connected to the handheld pan/tilt.
  • the imaging device is integrally formed with the handheld pan/tilt, which is not limited in this embodiment.
  • the movable platform may be a self-propelled vehicle.
  • the vehicle may traverse the environment by means of one or more propulsion units.
  • the vehicle may be an air vehicle, a land vehicle, a water vehicle or a space vehicle.
  • the vehicle may be an unmanned vehicle.
  • the vehicle may be able to traverse the environment without a human occupant on it. Alternatively, the vehicle may carry a human occupant.
  • the movable platform includes, but is not limited to, an unmanned aerial vehicle (UAV), an unmanned vehicle, an unmanned vessel, or a mobile robot, and the like.
  • UAV unmanned aerial vehicle
  • the imaging device 10 may be a handheld device, for example, the imaging device 10 may be a handheld camera or (as shown in FIG. 1 ) the imaging device 10 may be mounted on a handheld platform 20 .
  • the hand-held camera or the hand-held pan/tilt 20 may be installed with an attitude sensor, and the attitude sensor may collect attitude data corresponding to the high-frequency shaking of the imaging device 10 in a short period of time, and the attitude data may be used for The picture shake caused by the high frequency shake of the imaging device 10 in the image sequence is eliminated.
  • the part holding the handheld device may shake unintentionally at a low frequency, causing the imaging device 10 to shift, thereby causing
  • the image sequence captured by the imaging device 10 has screen shake; therefore, after the imaging device 10 captures the image sequence, it can identify the image caused by the low-frequency shaking of the imaging device 10 according to the relative motion between the image sequences
  • the picture shaking in the sequence can further eliminate the picture shaking caused by the low-frequency shaking of the imaging device 10 in the image sequence.
  • the imaging device 10 may be installed on a movable platform 30 , for example, the imaging device 10 is fixedly installed on the The movable platform 30, or the imaging device 10 is mounted on the movable platform 30 through a pan/tilt head.
  • the movable platform 30 may be installed with an attitude sensor, and the attitude sensor may collect attitude data corresponding to the high-frequency shaking of the imaging device 10 in a short period of time, and the attitude data may be used to eliminate the image sequence The picture jitter caused by the high-frequency jitter of the imaging device 10.
  • the imaging device 10 may shake unintentionally at a low frequency due to the operation of its own power system or due to natural factors (such as wind or rain). or the imaging device 10 is shifted due to the unintentional low-frequency shaking of the pan/tilt head, which in turn causes the image sequence captured by the imaging device 10 to appear jittery; therefore, the imaging device 10 is collecting After the image sequence, the image shake in the image sequence caused by the low-frequency shaking of the imaging device 10 can be identified according to the relative motion between the image sequences, and then the image sequence caused by the low-frequency shaking of the imaging device 10 can be eliminated. caused the screen to shake.
  • FIG. 3 provides a schematic flowchart of a video stabilization method.
  • the method can be applied to an imaging device, and the method includes:
  • step S101 an image sequence acquired by an imaging device is acquired.
  • step S102 the image shake in the image sequence caused by the high-frequency shake of the imaging device is identified based on the attitude sensor.
  • step S103 screen shake in the image sequence caused by low-frequency shaking of the imaging device is identified based on the relative motion between the image sequences.
  • step S104 image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device in the image sequence.
  • the image sequence includes multiple images, and the image sequence may be a sequence composed of images that are being collected by the imaging device, or may be multiple images that have been collected by the imaging device.
  • the images in the image sequence may be images currently collected by the imaging device, or images collected by the imaging device at a certain point in time in the past.
  • the imaging device may identify, based on the gesture data about the imaging device collected by the gesture sensor, the picture shaking in the image sequence caused by the high-frequency shaking of the imaging device; and, based on The relative motion between the image sequences identifies picture shake in the image sequence due to low frequency shaking of the imaging device.
  • step S102 may be processed first and then step S103 may be processed, or step S103 may be processed first and then step S102 may be processed.
  • the high-frequency jitter may include jitter caused by high-frequency rotation and/or jitter caused by high-frequency translation, that is, the imaging device may rotate or translate with high frequency in a unit time, and this high-frequency jitter may Captured by the attitude sensor.
  • the attitude sensor can be any combination of accelerometer, gyroscope, gravity detection sensor, inertial measurement unit (IMU) and/or compass; an axis electronic compass; the attitude sensor is used to collect attitude data of the imaging device.
  • the low-frequency shaking includes shaking caused by low-frequency translation, that is, the imaging device may undergo low-frequency translation in a certain direction.
  • the gesture sensor captures or may be filtered out in the data processing stage, so this embodiment captures such low-frequency shaking through relative motion between image sequences.
  • the images in the image sequence have a first offset corresponding to the low-frequency shaking and a second offset corresponding to the high-frequency shaking
  • the imaging device can An offset and the second offset are used to perform stabilization processing on the images in the image sequence, so as to eliminate the image jitter caused by the high-frequency jitter and low-frequency jitter of the imaging device, so as to realize the Full stabilization of the image.
  • the zoom factor of the imaging device considering that the higher the zoom factor of the imaging device, the more pixels in the image sequence are shifted due to the low-frequency shaking of the imaging device; while the zoom factor of the imaging device is lower. In the case of the image sequence, there are fewer pixels shifted due to low-frequency shaking of the imaging device. Based on the principle of human eye vision, if only a small number of pixels are offset, the human eye may not be very sensitive to this. Therefore, in the case of limited processing resources of the imaging device, in low magnification scenarios, it may not be considered The picture jitter in the image sequence caused by the low-frequency shaking of the imaging device is only in a high-magnification scene, when the current zoom factor of the imaging device is higher than the preset factor, because there are more pixels.
  • the visual perception of the user is affected, so the imaging device recognizes the picture shake in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences, and further eliminates the image caused by the low-frequency shaking.
  • the screen shaking caused by shaking that is, the offset problem of pixels in the image is corrected, thereby improving the user's visual perception.
  • the picture shake in the image sequence caused by the low-frequency shaking of the imaging device is generated when the imaging device is operated autonomously, such as in the imaging device.
  • the image sequence caused by the low-frequency shaking of the imaging device in the above scenario
  • the picture jitter in the image is not generated unconsciously, but is caused by the imaging device being autonomously changing the shooting parameters (zoom, orientation, etc.), then the imaging device may not recognize the image sequence caused by the low-frequency shaking of the imaging device.
  • the screen shakes so as to ensure the correct presentation of the captured images.
  • the imaging device is in the target following mode, that is, when the image sequence collected by the imaging device is used for target tracking, the tracked target is usually a moving target. , usually the imaging device needs to change its orientation or zoom, etc. At this time, the low-frequency shaking and/or high-frequency shaking of the imaging device causes the pixel shift between the images because of the follow-up shooting of the target. In this case, you can The image sequence is not stabilized, thereby ensuring the correct presentation of the acquired images.
  • the imaging device may be controlled to correct its own posture according to the low-frequency shaking , for example, the pixels in the image mentioned above will be offset due to the low-frequency shaking, the imaging device can obtain the first offset corresponding to the low-frequency shaking, and then control the Imaging device movement.
  • the posture of the imaging device is corrected by the first offset caused by the low-frequency shaking, so that the imaging device is more stable, which is beneficial to ensure the stable acquisition of the image sequence.
  • the imaging device is mounted on a pan/tilt, for example, the imaging device is mounted on a handheld pan/tilt or the imaging device is mounted on a movable platform through the pan/tilt, in which case the imaging device can be adjusted by adjusting the
  • the posture of the gimbal can be used to correct the posture of the imaging device, that is, the gimbal can be controlled to correct its posture according to the low-frequency shaking. For example, the first offset corresponding to the low-frequency shaking can be obtained, and then the The first offset controls the movement of the gimbal.
  • the first offset corresponding to the low-frequency shaking can be converted into a data format (such as a quaternion) that the gimbal can read, and then the converted The first offset is sent to the gimbal, so that the gimbal can move its own gimbal axis according to the obtained converted first offset (such as a quaternion) to realize the attitude correction process, so that the cloud
  • the stage and the imaging device are more stable, which is beneficial to ensure the stable acquisition of the image sequence.
  • the pixels of the images in the image sequence have a first offset due to the low-frequency jitter and a second offset due to the high-frequency jitter; please refer to FIG. 4 , which is a schematic diagram of FIG. Another schematic flowchart of a video stabilization method provided by an application embodiment, the method can be executed by an imaging device, and the method includes:
  • step S201 an image sequence acquired by an imaging device is acquired.
  • step S202 for the images in the image sequence, perform motion estimation on the images according to preset key frames, and determine a first offset of the images relative to the key frames caused by the low-frequency shaking of the imaging device quantity.
  • step S203 for the images in the image sequence, electronic anti-shake processing is performed on the images based on the attitude data of the imaging device collected by the attitude sensor, and the second image is obtained due to the high-frequency jitter of the imaging device. Offset.
  • step S204 stabilization processing is performed on the images in the image sequence according to the first offset and the second offset.
  • the imaging device may perform motion estimation on the image according to a preset key frame, determine a first offset of the image relative to the key frame, and implement the use of
  • the motion offset of the image (that is, the first offset) is calculated based on the global motion vector (GMV) in motion estimation (Motion Estimation, ME), so that the image is increased according to the first offset.
  • the stabilization process improves the stability of the image and eliminates the image shake caused by the low-frequency shaking of the image.
  • the image in the case of no key frame, the image can be used as a key frame, and the first offset of the image relative to the key frame is set to 0; in the case of a key frame, all The imaging device determines a first offset of the image relative to the key frame through a motion estimation process.
  • the imaging device may first acquire at least one key point in the key frame, and the key point is located at a first position in the key frame , then determine the second position of the at least one key point in the image, and finally determine the first position of the image relative to the key frame according to the difference between the first position and the second position Offset.
  • the first offset may include an offset along a specified direction, for example, the specified direction is at least one of a horizontal direction or a vertical direction in an image coordinate system.
  • the first offset may be represented by a coordinate mapping table, and the imaging device may obtain a coordinate mapping table describing the coordinate transformation of the pixels in the horizontal direction in the image and coordinates describing the coordinate transformation of the pixels in the vertical direction in the image. mapping table.
  • At least one key point in the key frame can be obtained in the following ways:
  • the key frame in the process of acquiring the key points, can be divided into several non-overlapping image blocks; then for each of the image blocks, the center point of the image block and/ Or at least one feature point with the largest gradient in the image block determines the key point; for example, both the center point of the image block and at least one feature point with the largest gradient in the image block can be used as the key point, or One can be randomly selected from the center point and the at least one feature point as the key point, and the feature point with the largest gradient in the image block can also be used as the key point to improve the accuracy of the key point;
  • the imaging device records the position information of the first position of the key point in the key frame, for example, the position information includes the corresponding position of the key point in the key frame. Horizontal and vertical coordinates.
  • feature point extraction may be performed on the key frame, for example, using FAST, SIFT, SURF, SUSAN or Harris detection algorithm to obtain feature points, and the extracted feature points are used as the key points, and the imaging device The position information of the first position of the key point is recorded based on the position of the feature point in the key frame.
  • the key point may be obtained by uniform sampling in the key frame, and the imaging device records position information of the first position of the uniformly sampled key point in the key frame.
  • the imaging device may acquire first feature information of the at least one key point in the key frame respectively; it can be understood that the The first feature information is information that can characterize the key point.
  • the first feature information is information that can characterize the key point.
  • the first feature information may be gradient information or color information of the key point, or the like.
  • the first feature information may include a feature vector obtained by projecting the key point by the imaging device, such as performing a horizontal projection or a vertical projection on the key point; specifically, the first feature The information may be a feature vector obtained by projecting a preset area including the key point along a specified direction.
  • a one-dimensional feature vector obtained by projecting the preset area centered on the key point in the horizontal direction may be obtained and/or a one-dimensional feature vector obtained by projecting in the vertical direction; wherein, the shape and size of the preset area can be specifically set according to the actual application scenario, for example, the preset area is a rectangular area.
  • the Influence brought by, after obtaining the first feature information of the key points, the first feature information of the key points can be mapped to a preset range, and the preset range is used to make the The brightness variation range is the same, which is beneficial to remove the influence of illumination variation and is suitable for more complex scenes.
  • the key point can be projected to obtain The eigenvectors of are mapped to the preset range.
  • the mean value of the feature vectors obtained by the projection of all key points may be obtained, and the feature vector of the key point mapped to a preset range is obtained by the difference between the feature vector obtained by the projection of each key point and the mean value.
  • the feature vector obtained by the projection of the key points can be normalized, for example, the standard deviation of the feature vectors obtained by the projection of all key points can be obtained, and the feature vector obtained by the projection of each key point The ratio of the standard deviations is used to obtain the feature vectors that map the key points to a preset range.
  • the imaging device may first A second position in the image of at least one key point in the key frame is estimated using a preset image transformation relationship.
  • the second position of the key point in the image may be similar to the position of the key point in the image of the previous frame, so , the preset image conversion relationship may be determined according to the positional relationship between the first position of the key point in the key frame and the position of the key point in the previous frame of image.
  • the preset image conversion relationship is a unit matrix.
  • the imaging device may acquire second features of the at least one key point under different offsets based on the estimated second position information, and further for each key point, the second feature information under different offsets can be matched with the first feature information respectively, and then according to the second feature information matched with the first feature information The second position is adjusted by the corresponding offset, so as to obtain the accurate second position of the key point in the image.
  • the different offsets refer to different distances by which the key points are offset along a specified direction
  • the key points may be shifted by a specified number of unit lengths in turn along the horizontal direction and/or the vertical direction,
  • the key points can be shifted by 1 unit length, 2 unit lengths, 3 unit lengths...
  • the different offsets may be different distances by which the key points are offset along a specified direction within a preset offset range, and the preset offset range may be specifically set according to the actual application scenario, which is not done in this embodiment. any restrictions.
  • the imaging device may calculate the second feature information of the shifted key points, that is, the key points are at different offsets. characteristic information under the quantity.
  • the second feature information is obtained in the same manner as the first feature information.
  • the second feature information includes a feature vector obtained by projecting the shifted key points.
  • the The second feature information includes a feature vector obtained by projecting a preset area including the shifted key points along a specified direction, where the specified direction at least includes a horizontal direction and/or a vertical direction in image coordinates.
  • the second feature information may be mapped into a preset range, and the preset range is used to make the brightness change ranges of each of the key points the same.
  • the feature vector obtained by projecting the shifted key points can be mapped to a preset range.
  • the mean value of the feature vectors obtained by the projection of all the shifted key points can be obtained, and the difference between the feature vector obtained by the projection of each shifted key point and the mean value can be obtained .
  • the feature vector obtained by the projection of the shifted key points may be normalized, for example, the standard of the feature vector obtained by the projection of all the shifted key points under the same offset may be obtained. difference, obtain the ratio of the feature vector obtained by the projection of each shifted key point to the standard deviation.
  • the imaging device After acquiring the second feature information of the at least one key point under different offsets, for each of the key points, the imaging device compares the second feature information under the different offsets with the second feature information respectively.
  • the first feature information is matched, and the differences between the second feature information under the different offsets and the first feature information are compared, and the difference between the second feature information and the first feature information is the smallest according to the second feature information.
  • the estimated second position is adjusted by the corresponding offset, so as to obtain a relatively accurate second position of the key point in the image, and then the imaging device can be based on the first position of the key point and the adjusted first position.
  • the positional relationship between the two positions is determined by using the principle of coordinate transformation to determine the first offset of the image relative to the key frame.
  • the imaging device may perform the adjustment according to the first position of the at least one key point and the adjusted second position The positional relationship between them determines the image conversion relationship for the next frame of image.
  • the imaging device may filter valid key points from the at least one key point, and then select the valid key point according to the first position of the valid key point and The adjusted positional relationship between the second positions determines an image conversion relationship for the next frame of image.
  • the preset image conversion relationship may be used to determine the third position of the key point located at the adjusted second position in the key frame, and then determine each The displacement of the third position of the key point relative to the first position, the displacement of one of the key points (for example, a key point can be randomly selected) is determined as the target displacement, and the relative displacement of other key points is obtained.
  • conf represents the confidence level (for example, 0.99)
  • pix_size is the total number of keypoints
  • num is the number of valid keypoints.
  • the key frame when the number of valid key points is less than a preset number and/or when the first offset of the image relative to the key frame is greater than a preset offset threshold, it is indicated that the image If the difference from the key frame is large, the key frame may be replaced at this time, the image may be determined as the key frame, and the first offset of the image relative to the key frame is set to 0.
  • the imaging device may use the first offset of at least one stabilized image acquired before the key frame smooth the first offset of the image, and obtain the first offset after the image processing, so as to ensure the smooth transition of the video picture after switching the key frame, eliminate or reduce the above-mentioned change band The influence of the coming, makes the transition of the video picture more natural.
  • all images may not be smoothed, but only a small number of images after the key frame is replaced, for example, 30 frames of images after the key frame need to be smoothed. , the image starting from the 31st frame after the key frame does not need to be smoothed. Or it can be said that if the image is acquired within a preset time period after the key frame, the first offset of the at least one stabilized image acquired before the key frame can be used to determine the value of the image. Smoothing is performed on the first offset, and the first offset after image processing is obtained.
  • the difference between the acquisition time of the stabilized image and the acquisition time of the key frame should be as small as possible, that is, the acquisition time interval between the stabilized image and the key frame If it is less than the specified duration, the specified duration should be as small as possible, and the specified duration can be specifically set according to the actual application scenario; for example, the stabilized image includes the image of the previous frame of the key frame.
  • the smoothing processing may include linear filtering processing, that is, linear filtering may be performed on the first offset of the image by using the first offset of the at least one stabilized image acquired before the key frame. processing, to obtain a processed first offset, and further, the processed first offset may be used to eliminate picture shake of the image caused by low-frequency shaking of the imaging device.
  • the linear filtering process may be performed in a weighted average manner, that is, the first offset after the image processing is the weighted average of the first offset of the stabilized image and the first offset of the image where, in order to gradually reduce the influence of the first offset of the stabilized image, as the acquisition time interval between the image and the key frame gradually increases, the stabilized image
  • the weight coefficient corresponding to the first offset of the image may gradually decrease, and the weight coefficient corresponding to the first offset of the image may gradually increase, that is, the weight coefficient corresponding to the first offset of the stabilized image
  • the The key frame and the image are down-sampled, and the down-sampling rate is recorded, and then the offset of the down-sampled image relative to the down-sampled key frame is calculated, and then the pre-recorded down-sampling rate is used to restore the sample
  • the first offset is obtained from the offset of the post-image relative to the down-sampled key frame, so as to achieve the purpose of reducing the amount of calculation and improving the calculation efficiency, and can also meet the real-time requirements in some scenarios.
  • the low-frequency shaking or the first offset generated by the low-frequency shaking may also be used to align the image with other images.
  • the imaging device may perform electronic anti-shake processing on the images based on the attitude data about the imaging device collected by the attitude sensor, and obtain a result of the high-frequency jitter of the imaging device.
  • the generated second offset so that stabilization processing is performed on the image based on the second offset to improve the stability of the image.
  • performing electronic anti-shake processing on the image can be considered as aligning the image with its adjacent images, and then cropping off the uneven part of the image and the adjacent images; for example, after the image is exposed,
  • the imaging device acquires the attitude data collected by the attitude sensor for a period of time before the exposure time.
  • the attitude data is expressed in the form of quaternions, and the imaging device can perform the attitude data expressed in the form of quaternions. Interpolate and convert it into a rotation matrix; then use the positional relationship between the attitude sensor and the imaging device to obtain the second offset according to the rotation matrix.
  • the second offset includes an offset relative to each pixel in the image, for example, the second offset can be represented by a coordinate mapping table, and each element in the coordinate mapping table represents an offset relative to the pixel in the image.
  • the offset of each pixel in the image considering that the higher the zoom factor of the imaging device is, the more pixels in the image sequence are shaken due to the high-frequency dithering of the imaging device; stable, the imaging device can adjust the second offset according to its own current zoom factor, and the adjusted second offset is adapted to the number of pixels of the image under the current zoom factor, and the adjusted second offset is used.
  • the second offset is used for stabilization processing, which is beneficial to improve the stability of images captured in high-magnification scenes.
  • the second offset includes an offset along a specified direction, for example, the specified direction may be at least a horizontal direction and/or a vertical direction in image coordinates.
  • the second offset may be represented by a coordinate mapping table, and the imaging device may obtain a coordinate mapping table describing the coordinate transformation of the pixels in the horizontal direction in the image and coordinates describing the coordinate transformation of the pixels in the vertical direction in the image. mapping table.
  • the imaging device may, according to the first offset and the second offset, analyze the image Images in the sequence are stabilized.
  • a total offset of the image may be acquired according to the first offset and the second offset, and stabilization processing is performed on the image according to the total offset.
  • the first offset and the second offset may be transmitted to a distortion correction (GDC) module in the imaging device, and the image is stabilized by the distortion correction module.
  • GDC distortion correction
  • the image after stabilization processing eliminates the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device, and its stability is improved, which is beneficial to improve the user's visual perception.
  • an embodiment of the present application further provides an imaging device 10 , including an image sensor 11 and one or more processors 12 ;
  • the image sensor 11 is used to collect image sequences
  • the one or more processors 12 are individually or collectively configured to:
  • Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device 10 in the image sequence.
  • the processor 12 executes executable instructions included in the memory, and the executable instructions include instructions for executing the above-described video stabilization method.
  • the processor 12 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), Off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc.
  • a general purpose processor may be a microprocessor or the processor 12 may be any conventional processor or the like.
  • the high-frequency jitter includes jitter caused by high-frequency rotation and/or jitter caused by high-frequency translation; and the low-frequency jitter includes jitter caused by low-frequency translation.
  • the imaging device 10 is a handheld device, and the low-frequency shaking is caused by the shaking of the part where the user holds the handheld device; or, the imaging device 10 is mounted on a movable platform, and the low-frequency shaking is caused by The movable platform shakes during the movement.
  • the processor 12 is further configured to: in the case that the current zoom factor of the imaging device 10 is higher than a preset factor, identify the cause of the imaging device 10 based on the relative motion between the image sequences. Picture shaking in the image sequence caused by low frequency shaking.
  • the processor 12 is further configured to: during the process of zooming by the imaging device 10 or during the process of changing the orientation of the imaging device 10, not to identify the low frequency caused by the imaging device 10 Shaking in the image sequence caused by shaking.
  • the processor 12 is further configured to: in the case of using the image sequence for target tracking, not to perform stabilization processing on the image sequence.
  • the processor 12 is further configured to: control the imaging device 10 to correct its own posture according to the low-frequency shaking.
  • the processor 12 is further configured to: acquire a first offset corresponding to the low-frequency shaking, and control the imaging device 10 to move according to the first offset.
  • the imaging device 10 is installed on a pan/tilt head.
  • the processor 12 is further configured to: obtain a first offset according to the low-frequency shaking, and control the movement of the gimbal according to the first offset.
  • the images in the sequence of images correspond to a first offset due to the low-frequency shaking and a second offset corresponds to the high-frequency jitter.
  • the processor 12 is further configured to: perform stabilization processing on the images in the image sequence according to the first offset and the second offset.
  • the first offset includes an offset in a specified direction; and/or the second offset includes an offset in a specified direction.
  • the processor 12 is further configured to: for the images in the image sequence, perform electronic anti-shake processing on the images based on the attitude data of the imaging device 10 collected by the attitude sensor, and obtain the image factors.
  • the second offset is generated by the high frequency jitter of the imaging device 10 .
  • the second offset includes an offset for each pixel in the image.
  • the processor 12 is further configured to: adjust the second offset according to the current zoom factor of the imaging device 10; the adjusted second offset is adapted to the number of pixels of the image under the current zoom factor .
  • the processor 12 is further configured to: for the images in the image sequence, perform motion estimation on the images according to preset key frames, and determine whether the images are caused by low-frequency shaking of the imaging device 10 .
  • the image in the absence of a key frame, the image is used as a key frame, and the first offset is set to 0.
  • the processor 12 is further configured to: acquire at least one key point in the key frame, where the key point is located at a first position in the key frame; determine where the at least one key point is located; a second position in the image; and determining a first offset of the image relative to the key frame according to the difference between the first position and the second position.
  • the processor 12 is further configured to: divide the key frame into several image blocks; for each image block, according to the center point of the image block and/or the At least one feature point with the largest gradient determines the keypoint.
  • the processor 12 is further configured to: acquire first feature information of the at least one key point in the key frame respectively; use a preset image conversion relationship to estimate at least one of the key frames; The second position of the key point in the image; in the image, obtain the second feature information of the at least one key point under different offsets; for each of the key points, the different The second feature information under the offset is respectively matched with the first feature information, and the second position is adjusted according to the offset corresponding to the second feature information matched with the first feature information.
  • the different offset amounts are different distances offset along a specified direction within a preset offset range.
  • the matched second feature information has the smallest difference from the first feature information.
  • the first feature information includes: a feature vector obtained by projecting the key point; and/or the second feature information includes: a feature vector obtained by projecting the key point.
  • the first feature information includes: a feature vector obtained by projecting a preset area including the key point along a specified direction; and/or the second feature information includes: The feature vector obtained by projecting the preset area of the keypoint along the specified direction.
  • the specified direction includes at least a horizontal direction and/or a vertical direction in image coordinates.
  • the feature vector is a feature vector mapped to a preset range, and the preset range is used to make the luminance variation range of each of the key points the same.
  • the feature vector mapped to the preset range includes: the difference between the feature vector obtained by the projection and the mean value of the feature vector, or the result of normalization of the feature vector obtained by the projection.
  • the preset image conversion relationship is determined according to a positional relationship between a first position of the key point in the key frame and a position of the key point in a previous frame of image.
  • the preset image conversion relationship is an identity matrix
  • the processor 12 is further configured to: determine an image conversion relationship for the next frame of image according to the positional relationship between the first position of the at least one key point and the adjusted second position .
  • the processor 12 is further configured to: filter valid key points from the at least one key point; according to the positional relationship between the first position of the valid key point and the adjusted second position , to determine the image conversion relationship for the next frame of image.
  • the processor 12 is further configured to: use the preset image conversion relationship to determine the third position of the key point at the adjusted second position in the key frame; for each key point, determine the displacement of the third position of the key point relative to the first position; determine the displacement of one of the key points as the target displacement, and obtain the displacement of other key points relative to the The error value of the target displacement; from the at least one key point, select an effective key point whose error value is less than the preset value.
  • the processor 12 is further configured to: determine the image as a key frame when the number of valid key points is less than a preset number, and determine the first offset Set to 0.
  • the processor 12 is further configured to: when the first offset is greater than a preset offset threshold, determine the image as a key frame, and convert the first offset into a key frame. The amount is set to 0.
  • the processor 12 is further configured to: use at least one stabilized image acquired before the key frame
  • the first offset of the image is processed by smoothing the first offset of the image to obtain the processed first offset of the image.
  • smoothing is performed on the first offset of the image.
  • the acquisition time interval between the stabilized image and the key frame is less than a specified time period.
  • the stabilized image includes a previous frame image of the key frame.
  • the smoothing process includes a linear filtering process.
  • the first offset after image processing is a weighted average result of the first offset of the stabilized image and the first offset of the image.
  • the weight coefficient corresponding to the first offset of the stabilized image gradually decreases, and the The weight coefficient corresponding to the first offset gradually increases.
  • the processor 12 is further configured to: acquire a total offset of the image according to the first offset and the second offset, and pair the image according to the total offset. The image is stabilized.
  • the processor 12 is further configured to: use the first offset to perform a first stabilization process on the image, and then use the second offset to perform a first stabilization process on the image. The resulting image is subjected to a second stabilization process.
  • the image and the key frame are obtained after down-sampling.
  • the various embodiments described herein can be implemented using computer readable media such as computer software, hardware, or any combination thereof.
  • the embodiments described herein can be implemented using application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays ( FPGA), processors, controllers, microcontrollers, microprocessors, electronic units designed to perform the functions described herein are implemented.
  • ASICs application specific integrated circuits
  • DSPs digital signal processors
  • DSPDs digital signal processing devices
  • PLDs programmable logic devices
  • FPGA field programmable gate arrays
  • processors controllers, microcontrollers, microprocessors, electronic units designed to perform the functions described herein are implemented.
  • embodiments such as procedures or functions may be implemented with separate software modules that allow the performance of at least one function or operation.
  • the software codes may be implemented by a software application (or program) written in any suitable programming language, which may be stored in
  • an embodiment of the present application further provides a handheld pan/tilt head 20 , including an attitude sensor 21 and the above-mentioned imaging device 10 ; wherein, the attitude sensor 21 is used to collect attitude data of the imaging device .
  • the hand-held pan/tilt head includes a pan/tilt head shaft, and the pan/tilt head shaft is used to change the posture of the imaging device.
  • an embodiment of the present application further provides a movable platform 30, including:
  • a power system 32 mounted on the body 31, for driving the movable platform 30 to move;
  • the above-mentioned imaging device 10 installed on the body 31;
  • attitude sensor 33 which is installed on the body 31 and used to collect attitude data of the imaging device 10 .
  • the movable platform includes, but is not limited to, an unmanned aerial vehicle, an unmanned vehicle, an unmanned vessel, or a mobile robot.
  • non-transitory computer-readable storage medium such as a memory including instructions, executable by a processor of an apparatus to perform the above-described method.
  • the non-transitory computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
  • a non-transitory computer-readable storage medium when the instructions in the storage medium are executed by the processor of the terminal, enable the terminal to execute the above method.

Abstract

A video stability augmentation method, an imaging apparatus, a handheld gimbal, a movable platform and a storage medium. The method comprises: acquiring image sequences, which are collected by an imaging apparatus; identifying, on the basis of a pose sensor, an image jitter in the image sequences that is caused by the high-frequency jitter of the imaging apparatus; identifying, on the basis of the relative movement between the image sequences, the image jitter in the image sequences that is caused by the low-frequency shaking of the imaging apparatus; performing image stability augmentation on the image sequences according to the high-frequency jitter and the low-frequency shake, so as to eliminate the image jitter from the image sequences that is caused by the high-frequency jitter and the low-frequency shake of the imaging apparatus. By means of the present embodiment, the complete stability of an image sequence is realized.

Description

视频增稳方法、成像装置、手持云台、可移动平台及存储介质Video stabilization method, imaging device, handheld pan/tilt, movable platform and storage medium 技术领域technical field
本申请涉及图像处理技术领域,具体而言,涉及一种视频增稳方法、成像装置、手持云台、可移动平台及存储介质。The present application relates to the technical field of image processing, and in particular, to a video stabilization method, an imaging device, a handheld PTZ, a movable platform, and a storage medium.
背景技术Background technique
目前,大部分成像装置(例如运动相机)设置有电子防抖算法,由于电子防抖算法(Electric Image Stabilization,EIS)对成像装置获取的视频(或者图像)进行增稳处理,从而使用户观看到比较流畅的视频画面。其中,电子防抖算法主要是指在成像装置上采用强制提高感光元件(Charge coupled Device,CCD)感光参数同时加快快门,并针对CCD上取得的图像进行分析,然后利用边缘图像进行补偿的防抖技术。然而,经过电子防抖处理后的视频或者图像仍存在画面抖动现象。At present, most imaging devices (such as motion cameras) are equipped with electronic image stabilization algorithms. Since the Electronic Image Stabilization (EIS) performs stabilization processing on the videos (or images) obtained by the imaging devices, users can watch Relatively smooth video screen. Among them, the electronic anti-shake algorithm mainly refers to the use of forcibly increasing the photosensitive parameters of the photosensitive element (Charge coupled Device, CCD) on the imaging device to speed up the shutter, and analyzes the image obtained on the CCD, and then uses the edge image to compensate for anti-shake. technology. However, the video or image after electronic anti-shake processing still has the phenomenon of picture shaking.
发明内容SUMMARY OF THE INVENTION
有鉴于此,本申请的目的之一是提供一种视频增稳方法、成像装置、手持云台、可移动平台及存储介质。In view of this, one of the objectives of the present application is to provide a video stabilization method, an imaging device, a handheld pan/tilt, a movable platform and a storage medium.
第一方面,本申请实施例提供了一种视频增稳方法,包括:In a first aspect, an embodiment of the present application provides a video stabilization method, including:
获取成像装置采集的图像序列;acquiring a sequence of images acquired by the imaging device;
基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动;Identifying, based on an attitude sensor, image shake in the image sequence caused by high-frequency shake of the imaging device;
基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动;Identifying screen shake in the image sequence due to low-frequency shaking of the imaging device based on relative motion between the image sequences;
根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
第二方面,本申请实施例提供了一种成像装置,包括图像传感器以及一个或多个处理器;In a second aspect, an embodiment of the present application provides an imaging device, including an image sensor and one or more processors;
所述图像传感器用于采集图像序列;the image sensor is used to acquire a sequence of images;
所述一个或多个处理器被单独地或共同地配置成:The one or more processors are individually or collectively configured to:
基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动;Identifying, based on an attitude sensor, image shake in the image sequence caused by high-frequency shake of the imaging device;
基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动;Identifying screen shake in the image sequence due to low-frequency shaking of the imaging device based on relative motion between the image sequences;
根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
第三方面,本申请实施例提供了一种手持云台,包括姿态传感器和如第二方面的成像装置;其中,所述姿态传感器用于采集所述成像装置的姿态数据。In a third aspect, an embodiment of the present application provides a handheld pan/tilt head, including an attitude sensor and the imaging device according to the second aspect; wherein, the attitude sensor is used to collect attitude data of the imaging device.
第四方面,本申请实施例提供了一种可移动平台,包括:In a fourth aspect, an embodiment of the present application provides a movable platform, including:
机体;body;
动力系统,安装于所述机体,用于驱动所述可移动平台移动;a power system, mounted on the body, for driving the movable platform to move;
如第二方面所述的成像装置;The imaging device of the second aspect;
以及姿态传感器,安装于所述机体,用于采集所述成像装置的姿态数据。and an attitude sensor, which is installed on the body and used to collect attitude data of the imaging device.
第五方面,本申请实施例提供了一种计算机可读存储介质,所述计算机可读存储介质存储有可执行指令,所述可执行指令被处理器执行时实现如第一方面所述的方法。In a fifth aspect, an embodiment of the present application provides a computer-readable storage medium, where the computer-readable storage medium stores executable instructions, and when the executable instructions are executed by a processor, the method according to the first aspect is implemented .
本申请实施例所提供的一种视频增稳方法,考虑到因所述成像装置低频晃动造成的所述图像序列中的画面抖动问题,除了使用识别到的高频抖动来进行图像增稳之外,还进一步基于所述图像序列之间的相对运动来识别所述低频晃动,从而使用识别到的低频晃动进行图像增稳,消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动,实现图像序列的完全稳定。In the video stabilization method provided by the embodiment of the present application, considering the picture jitter problem in the image sequence caused by the low-frequency jitter of the imaging device, in addition to using the identified high-frequency jitter to perform image stabilization , and further identify the low-frequency shaking based on the relative motion between the image sequences, so as to use the identified low-frequency shaking to perform image stabilization, and eliminate the high-frequency shaking and low-frequency shaking of the imaging device in the image sequence. The picture shakes, and the complete stabilization of the image sequence is achieved.
附图说明Description of drawings
为了更清楚地说明本申请实施例中的技术方案,下面将对实施例描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本申请的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动性的前提下,还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions in the embodiments of the present application more clearly, the following briefly introduces the drawings that are used in the description of the embodiments. Obviously, the drawings in the following description are only some embodiments of the present application. For those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.
图1和图6是本申请实施例提供的云台的不同示意图;Fig. 1 and Fig. 6 are different schematic diagrams of the pan/tilt provided by the embodiment of the present application;
图2和图7是本申请实施例提供的可移动平台的示意图;2 and 7 are schematic diagrams of a movable platform provided by an embodiment of the present application;
图3和图4是本申请实施例提供的视频增稳方法的不同流程示意图;FIG. 3 and FIG. 4 are different schematic flowcharts of the video stabilization method provided by the embodiment of the present application;
图5是本申请实施例提供的成像装置的结构示意图。FIG. 5 is a schematic structural diagram of an imaging device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面将结合本申请实施例中的附图,对本申请实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本申请一部分实施例,而不是全部的实施例。The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments.
发明人发现,相关技术中经过电子防抖处理后的视频或者图像仍存在画面抖动现象,原因在于:EIS电子防抖技术是基于姿态传感器采集的数据来消除画面抖动现象的;第一、姿态传感器的精度有限,无法采集到关于所述成像装置各种抖动情况的姿态数据,比如所述姿态传感器可以采集到关于成像装置高频抖动的姿态数据,而无法采集关于成像装置低频晃动的姿态数据,从而造成经过电子防抖处理后的视频或者图像仍存在画面抖动现象;第二、为了消除噪声,在一帧图像曝光后,EIS算法对曝光时刻之前一段时间内姿态传感器采集的关于成像装置的姿态数据进行低通滤波,这时候也会把有关于成像装置低频晃动的姿态数据滤除,从而造成经过电子防抖处理后的视频或者图像仍存在画面抖动现象。The inventor found that the video or image after electronic anti-shake processing in the related art still has the phenomenon of picture shaking. The reason is: EIS electronic anti-shake technology is based on the data collected by the attitude sensor to eliminate the picture shaking phenomenon; The accuracy of the sensor is limited, and it is impossible to collect attitude data about various shaking conditions of the imaging device. For example, the attitude sensor can collect attitude data about high-frequency shaking of the imaging device, but cannot collect attitude data about low-frequency shaking of the imaging device. As a result, the video or image after the electronic anti-shake processing still has the phenomenon of picture shaking; second, in order to eliminate noise, after a frame of image is exposed, the EIS algorithm collects the posture of the imaging device from the posture sensor for a period of time before the exposure time. The data is subjected to low-pass filtering. At this time, the attitude data related to the low-frequency shaking of the imaging device will also be filtered out, resulting in the video or image after the electronic anti-shake processing.
其中,所述高频抖动可以认为是所述成像装置围绕某一位置来回运动,在单位时间内发生的抖动频次较高;所述低频晃动可以认为是所述成像装置朝某个方向发生偏移,在单位时间内发生的晃动频次较低,所述低频晃动通常是成像装置非自主(或者说无意识)产生的。Wherein, the high-frequency shaking can be considered as the imaging device moving back and forth around a certain position, and the frequency of shaking occurs in a unit time is high; the low-frequency shaking can be considered as the imaging device shifting in a certain direction , the frequency of shaking that occurs per unit time is low, and the low-frequency shaking is usually generated by the imaging device involuntary (or unconsciously).
针对于相关技术中的问题,本申请实施例提供了一种视频增稳方法,在获取成像装置采集的图像序列之后,可以基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动,以及基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动,最后根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。本实施例中考虑到因所述成像装置低频晃动造成的所述图像序列中的画面抖动问题,除了使用识别到的高频抖动来进行图像增稳之外,还进一步基于所述图像序列之间的相对运动来识别所述低频晃动,从而使用识别到的低频晃动进行图像增稳,消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动,实现图像序列的完全稳定。In view of the problems in the related art, an embodiment of the present application provides a video stabilization method. After acquiring an image sequence collected by an imaging device, the image sequence caused by the high-frequency jitter of the imaging device can be identified based on an attitude sensor. image jitter in the image sequence, and identify the image jitter in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences, and finally according to the high-frequency jitter and the low-frequency shaking The image sequence is subjected to image stabilization, so as to eliminate the image shake caused by the high-frequency shake and the low-frequency shake of the imaging device in the image sequence. In this embodiment, considering the problem of picture jitter in the image sequence caused by the low-frequency shaking of the imaging device, in addition to using the identified high-frequency jitter to perform image stabilization, it is further based on the difference between the image sequences. The relative motion of the imaging device is used to identify the low-frequency shaking, so as to use the identified low-frequency shaking to perform image stabilization, eliminate the picture shaking caused by the high-frequency shaking and low-frequency shaking of the imaging device in the image sequence, and realize the complete stabilization of the image sequence. .
其中,所述视频增稳方法可以应用于成像装置中,所述成像装置可以是物理成像装置。成像装置可以被配置成用于检测电磁辐射(例如,可见光、红外光和/或紫外光)并基于检测到的电磁辐射生成图像数据。成像装置可以包括响应于光的波长生成电信号的电荷耦合装置(CCD)传感器或互补型金属氧化物半导体(CMOS)传感器。可以处理所产生的电信号来产生图像数据。由成像装置生成的图像数据可以包括一张或多张图像,所述一张或多张图像可以是静态图像(例如,照片)、动态图像(例如,视频)、或其合适的组合。图像数据可以是多色的(例如,RGB、CMYK、HSV)或单色的(例如,灰度、黑白、棕褐)。所述成像装置可以包括被配置成将光引导到图像传感器上的镜头。Wherein, the video stabilization method may be applied to an imaging device, and the imaging device may be a physical imaging device. The imaging device may be configured to detect electromagnetic radiation (eg, visible light, infrared light, and/or ultraviolet light) and generate image data based on the detected electromagnetic radiation. Imaging devices may include charge coupled device (CCD) sensors or complementary metal oxide semiconductor (CMOS) sensors that generate electrical signals in response to wavelengths of light. The resulting electrical signals can be processed to generate image data. The image data generated by the imaging device may include one or more images, which may be still images (eg, photographs), moving images (eg, videos), or a suitable combination thereof. Image data may be multi-color (eg, RGB, CMYK, HSV) or monochrome (eg, grayscale, black and white, sepia). The imaging device may include a lens configured to direct light onto the image sensor.
所述成像装置可以是相机。相机可以是捕捉动态图像数据(例如,视频)的动态相机或视频摄相机。相机可以是捕捉静态图像(例如,照片)的静态相机。相机可以捕捉动态图像数据和静态图像两者。相机可以在捕捉动态图像数据与静态图像之间切换。虽然在此提供的某些实施方式是在相机的背景下描述的,但是应理解的是,本申请可以适用于任何适合的成像装置,并且在此关于相机的任何描述也可以适用于任何适合的成像装置,并且在此关于相机的任何描述也可以适用于其他类型的成像装置。可以用相机来生成3D场景(例如,环境、一个或多个物体等等)的多个2D图像。由相机生成的这些图像可以表示3D场景到2D图像平面上的投影。因此,2D图像中的每一点与场景中的一个3D空间坐标相对应。相机可以包括光学元件(例如,镜头、反射镜、滤镜等)。相机可以捕捉彩色图像、灰度图像、红外图像等等。相机在其被配置成捕捉红外线图像时可以是热成像装置。The imaging device may be a camera. The camera may be a motion camera or a video camera that captures motion image data (eg, video). The camera may be a still camera that captures still images (eg, photographs). The camera can capture both moving image data and still images. The camera can switch between capturing moving image data and still images. While certain embodiments provided herein are described in the context of a camera, it is to be understood that the present application may apply to any suitable imaging device, and that any description of a camera herein may also apply to any suitable imaging device. imaging devices, and any descriptions herein regarding cameras may also apply to other types of imaging devices. A camera may be used to generate multiple 2D images of a 3D scene (eg, an environment, one or more objects, etc.). These images generated by the camera can represent the projection of the 3D scene onto the 2D image plane. Therefore, each point in the 2D image corresponds to a coordinate in 3D space in the scene. A camera may include optical elements (eg, lenses, mirrors, filters, etc.). Cameras can capture color images, grayscale images, infrared images, and more. The camera may be a thermal imaging device when it is configured to capture infrared images.
所述成像装置可以用特定图像分辨率来捕捉图像或一系列图像。在一些实施方式中,图像分辨率可以是由图像中的像素数目限定的。在些实施方式中,图像分辨率可以大于或等于约352x420像素或者720x480像素等。在一些实施方式中,相机可以是4K相机或者具有更高分辨率的相机。The imaging device may capture an image or series of images with a particular image resolution. In some embodiments, the image resolution may be defined by the number of pixels in the image. In some embodiments, the image resolution may be greater than or equal to about 352x420 pixels, or 720x480 pixels, or the like. In some embodiments, the camera may be a 4K camera or a camera with a higher resolution.
所述成像装置可以用特定捕捉速率捕捉一系列图像。在一些实施方式中,能够以如约24p、30p、48p、72p、120p、50i或60i等标准视频帧速率捕捉此系列图像。在一些实施方式中,能够以小于或等于约每0.0001秒、0.002秒、0.01秒、0.1秒、5秒或10秒等一张图像的速率捕捉此系列图像。在一些实施方式中,捕捉速率可以根据用户输入和/或外部条件(例如,雨、雪、风、环境的不明显的表面纹理)而改变。The imaging device may capture a series of images at a particular capture rate. In some embodiments, the series of images can be captured at a standard video frame rate, such as about 24p, 30p, 48p, 72p, 120p, 50i, or 60i. In some embodiments, the series of images can be captured at a rate of less than or equal to about one image every 0.0001 seconds, 0.002 seconds, 0.01 seconds, 0.1 seconds, 5 seconds, or 10 seconds. In some embodiments, the capture rate may vary based on user input and/or external conditions (eg, rain, snow, wind, subtle surface textures of the environment).
所述成像装置可以具有多个可调参数。成像装置可以用不同的参数在经受相同的 外部条件(例如,位置、光照)时捕捉不同的图像。可调参数可以包括曝光(例如,曝光时间、快门速度、光圈、胶片速度)、增益、提亮系数(gamma)、兴趣区、像素合并/子采样、像素时钟、偏移、触发、ISO等。与曝光相关的参数可以控制到达成像装置中的图像传感器的光量。例如,快门速度可以控制光照到图像传感器的时间量,而光圈可以控制在给定时间内到达图像传感器的光量。与增益相关的参数可以控制对来自光学传感器的信号的放大。ISO可以控制相机对可用光的灵敏度水平。The imaging device may have a number of adjustable parameters. Imaging devices may capture different images with different parameters when subjected to the same external conditions (e.g., location, lighting). Adjustable parameters may include exposure (eg, exposure time, shutter speed, aperture, film speed), gain, brightening factor (gamma), region of interest, binning/subsampling, pixel clock, offset, trigger, ISO, and the like. Exposure-related parameters can control the amount of light reaching an image sensor in an imaging device. For example, shutter speed controls the amount of time light hits the image sensor, while aperture controls the amount of light that reaches the image sensor in a given time. A gain-related parameter can control the amplification of the signal from the optical sensor. ISO controls the level of sensitivity of the camera to the available light.
在一些实施例中,所述成像装置可以是手持设备或者所述成像装置也可以安装于可移动平台。示例性的,在手持设备场景下,所述成像装置可以是手持相机或者所述成像装置可以安装于手持云台上,例如所述成像装置可以与所述手持云台可拆卸连接,或者所述成像装置与所述手持云台一体成型,本实施例对此不做任何限制。In some embodiments, the imaging device may be a handheld device or the imaging device may also be mounted on a movable platform. Exemplarily, in a handheld device scenario, the imaging device may be a handheld camera or the imaging device may be installed on a handheld pan/tilt, for example, the imaging device may be detachably connected to the handheld pan/tilt, or the imaging device may be detachably connected to the handheld pan/tilt. The imaging device is integrally formed with the handheld pan/tilt, which is not limited in this embodiment.
其中,所述可移动平台可以是自推进式的载运工具。所述载运工具可以借助于一个或多个推进单元跨越环境。所述载运工具可以是空中载运工具、陆地载运工具、水上载运工具或太空载运工具。所述载运工具可以是无人载运工具。所述载运工具可能能够在其上没有人类乘员的情况下跨越环境。替代地,所述载运工具可以携带人类乘员。示例性的,所述可移动平台包括但不限于无人飞行器(UAV)、无人驾驶车辆、无人驾驶船只或者可移动机器人等。Wherein, the movable platform may be a self-propelled vehicle. The vehicle may traverse the environment by means of one or more propulsion units. The vehicle may be an air vehicle, a land vehicle, a water vehicle or a space vehicle. The vehicle may be an unmanned vehicle. The vehicle may be able to traverse the environment without a human occupant on it. Alternatively, the vehicle may carry a human occupant. Exemplarily, the movable platform includes, but is not limited to, an unmanned aerial vehicle (UAV), an unmanned vehicle, an unmanned vessel, or a mobile robot, and the like.
在一示例性的应用场景中,所述成像装置10可以是手持设备,比如所述成像装置10可以是手持相机或者(如图1所示)所述成像装置10可以搭载于手持云台20。其中,所述手持相机或者所述手持云台20可以安装有姿态传感器,所述姿态传感器可以采集所述成像装置10在短时间内高频次抖动对应的姿态数据,所述姿态数据可以用于消除所述图像序列中因所述成像装置10高频抖动造成的画面抖动。另外,用户在握持所述手持设备采集图像或者视频时,握持所述手持设备的部位(比如手部)可能会发生无意识地低频次地晃动从而使得所述成像装置10产生偏移,进而导致所述成像装置10采集的图像序列出现画面抖动;因此,所述成像装置10在采集图像序列之后,可以根据图像序列之间的相对运动来识别因所述成像装置10低频晃动造成的所述图像序列中的画面抖动,进而可以消除所述图像序列中因所述成像装置10低频晃动造成的画面抖动。In an exemplary application scenario, the imaging device 10 may be a handheld device, for example, the imaging device 10 may be a handheld camera or (as shown in FIG. 1 ) the imaging device 10 may be mounted on a handheld platform 20 . Wherein, the hand-held camera or the hand-held pan/tilt 20 may be installed with an attitude sensor, and the attitude sensor may collect attitude data corresponding to the high-frequency shaking of the imaging device 10 in a short period of time, and the attitude data may be used for The picture shake caused by the high frequency shake of the imaging device 10 in the image sequence is eliminated. In addition, when the user holds the handheld device to capture images or videos, the part holding the handheld device (such as the hand) may shake unintentionally at a low frequency, causing the imaging device 10 to shift, thereby causing The image sequence captured by the imaging device 10 has screen shake; therefore, after the imaging device 10 captures the image sequence, it can identify the image caused by the low-frequency shaking of the imaging device 10 according to the relative motion between the image sequences The picture shaking in the sequence can further eliminate the picture shaking caused by the low-frequency shaking of the imaging device 10 in the image sequence.
在另一示例性应用场景中,如图2所示(图2以无人机为例),所述成像装置10可以安装于可移动平台30上,例如所述成像装置10固定安装于所述可移动平台30,或者所述成像装置10通过云台安装于所述可移动平台30。其中,所述可移动平台30可以安装有姿态传感器,所述姿态传感器可以采集所述成像装置10在短时间内高频次抖动对应的姿态数据,所述姿态数据可以用于消除所述图像序列中因所述成像装置10高频抖动造成的画面抖动。另外,所述可移动云台在运动过程中,可能会因为自身的动力系统运行原因或者因为自然环境因素(比如起风或者下雨等)发生无意识地低频次地晃动从而使得所述成像装置10产生偏移;或者因云台发生无意识地低频次地晃动从而使得所述成像装置10产生偏移,进而导致所述成像装置10采集的图像序列出现画面抖动;因此,所述成像装置10在采集图像序列之后,可以根据图像序列之间的相对运动来识别因所述成像装置10低频晃动造成的所述图像序列中的画面抖动,进而可以消除所述图像序列中因所述成像装置10低频晃动造成的画面抖动。In another exemplary application scenario, as shown in FIG. 2 ( FIG. 2 takes a drone as an example), the imaging device 10 may be installed on a movable platform 30 , for example, the imaging device 10 is fixedly installed on the The movable platform 30, or the imaging device 10 is mounted on the movable platform 30 through a pan/tilt head. The movable platform 30 may be installed with an attitude sensor, and the attitude sensor may collect attitude data corresponding to the high-frequency shaking of the imaging device 10 in a short period of time, and the attitude data may be used to eliminate the image sequence The picture jitter caused by the high-frequency jitter of the imaging device 10. In addition, during the movement of the movable pan/tilt head, the imaging device 10 may shake unintentionally at a low frequency due to the operation of its own power system or due to natural factors (such as wind or rain). or the imaging device 10 is shifted due to the unintentional low-frequency shaking of the pan/tilt head, which in turn causes the image sequence captured by the imaging device 10 to appear jittery; therefore, the imaging device 10 is collecting After the image sequence, the image shake in the image sequence caused by the low-frequency shaking of the imaging device 10 can be identified according to the relative motion between the image sequences, and then the image sequence caused by the low-frequency shaking of the imaging device 10 can be eliminated. caused the screen to shake.
接下来对本申请实施例提供的视频增稳方法进行说明:请参阅图3,图3提供了 一种视频增稳方法的流程示意图,所述方法可以应用于成像装置,所述方法包括:Next, the video stabilization method provided by the embodiment of the present application will be described: please refer to FIG. 3 , which provides a schematic flowchart of a video stabilization method. The method can be applied to an imaging device, and the method includes:
在步骤S101中,获取成像装置采集的图像序列。In step S101, an image sequence acquired by an imaging device is acquired.
在步骤S102中,基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动。In step S102, the image shake in the image sequence caused by the high-frequency shake of the imaging device is identified based on the attitude sensor.
在步骤S103中,基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。In step S103, screen shake in the image sequence caused by low-frequency shaking of the imaging device is identified based on the relative motion between the image sequences.
在步骤S104中,根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。In step S104, image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device in the image sequence.
对于步骤S101,所述图像序列包括多张图像,所述图像序列可以是所述成像装置正在采集的图像所构成的序列,也可以是所述成像装置已经采集完的多张图像。For step S101, the image sequence includes multiple images, and the image sequence may be a sequence composed of images that are being collected by the imaging device, or may be multiple images that have been collected by the imaging device.
其中,所述图像序列中的相邻图像之间具有预设的重叠率。所述图像序列中的图像可以是所述成像装置当前采集到的图像,也可以是所述成像装置在过去某个时间点采集到的图像。在获取所述图像序列之后,所述成像装置可以基于姿态传感器采集的关于所述成像装置的姿态数据,识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动;以及,基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。可以理解的是,本实施例对于步骤S102和步骤S103的处理顺序不做任何限制;例如在处理资源足够的情况下,所述成像装置可以并行执行步骤S102和步骤S103;在处理资源不足的情况下,可以先处理步骤S102再处理步骤S103,或者先处理步骤S103再处理步骤S102。Wherein, there is a preset overlap ratio between adjacent images in the image sequence. The images in the image sequence may be images currently collected by the imaging device, or images collected by the imaging device at a certain point in time in the past. After acquiring the image sequence, the imaging device may identify, based on the gesture data about the imaging device collected by the gesture sensor, the picture shaking in the image sequence caused by the high-frequency shaking of the imaging device; and, based on The relative motion between the image sequences identifies picture shake in the image sequence due to low frequency shaking of the imaging device. It can be understood that this embodiment does not impose any restrictions on the processing sequence of step S102 and step S103; for example, in the case of sufficient processing resources, the imaging device may execute steps S102 and S103 in parallel; in the case of insufficient processing resources Next, step S102 may be processed first and then step S103 may be processed, or step S103 may be processed first and then step S102 may be processed.
其中,所述高频抖动可以包括高频旋转产生的抖动和/或高频平移产生的抖动,即所述成像装置在单位时间内可能发生高频次地旋转或者平移,这种高频抖动可以被姿态传感器捕捉到。所述姿态传感器可以是加速度计、陀螺仪、重力检测传感器,惯性测量单元(IMU)和/或罗盘的任意组合;示例性的,所述姿态传感器包含三轴陀螺仪、三轴加速度计,三轴电子罗盘;所述姿态传感器用于采集所述成像装置的姿态数据。Wherein, the high-frequency jitter may include jitter caused by high-frequency rotation and/or jitter caused by high-frequency translation, that is, the imaging device may rotate or translate with high frequency in a unit time, and this high-frequency jitter may Captured by the attitude sensor. The attitude sensor can be any combination of accelerometer, gyroscope, gravity detection sensor, inertial measurement unit (IMU) and/or compass; an axis electronic compass; the attitude sensor is used to collect attitude data of the imaging device.
而所述低频晃动包括低频平移产生的晃动,即所述成像装置可能朝某一方向发生低频次地平移,所述低频晃动通常是所述成像装置无意识产生,所述低频晃动无法由精度低的姿态传感器捕捉到或者会在数据处理阶段被滤除,因此,本实施例通过图像序列之间的相对运动来捕捉该种低频晃动。The low-frequency shaking includes shaking caused by low-frequency translation, that is, the imaging device may undergo low-frequency translation in a certain direction. The gesture sensor captures or may be filtered out in the data processing stage, so this embodiment captures such low-frequency shaking through relative motion between image sequences.
在一些实施例中,所述图像序列中的图像因所述低频晃动对应有第一偏移量以及因所述高频抖动对应有第二偏移量,进而所述成像装置可以根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理,以消除所述图像因所述成像装置高频抖动和低频晃动造成的画面抖动,实现所述图像的完全稳定。In some embodiments, the images in the image sequence have a first offset corresponding to the low-frequency shaking and a second offset corresponding to the high-frequency shaking, and the imaging device can An offset and the second offset are used to perform stabilization processing on the images in the image sequence, so as to eliminate the image jitter caused by the high-frequency jitter and low-frequency jitter of the imaging device, so as to realize the Full stabilization of the image.
在一些实施例中,考虑到所述成像装置的变焦倍数越高,所述图像序列的图像因所述成像装置低频晃动产生偏移的像素越多;而在所述成像装置的变焦倍数较低的情况下,所述图像序列的图像因所述成像装置低频晃动产生偏移的像素较少。基于人眼视觉原理,如果仅有较少数量的像素发生偏移,人眼可能对此不太敏感,因此在所述成像装置的处理资源有限的情况下,在低倍率场景下,可以不考虑因所述成像装置低频晃动造成的所述图像序列中的画面抖动,只有在高倍率场景下,在所述成像装置当前变焦倍数高于预设倍数的情况下,由于有较多像素产生偏移影响用户的视觉观感,因此所述成像装置基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造 成的所述图像序列中的画面抖动,并进一步消除所述图像因所述低频晃动导致的画面抖动,即校正了图像中像素的偏移问题,从而提高用户的视觉观感。In some embodiments, considering that the higher the zoom factor of the imaging device, the more pixels in the image sequence are shifted due to the low-frequency shaking of the imaging device; while the zoom factor of the imaging device is lower. In the case of the image sequence, there are fewer pixels shifted due to low-frequency shaking of the imaging device. Based on the principle of human eye vision, if only a small number of pixels are offset, the human eye may not be very sensitive to this. Therefore, in the case of limited processing resources of the imaging device, in low magnification scenarios, it may not be considered The picture jitter in the image sequence caused by the low-frequency shaking of the imaging device is only in a high-magnification scene, when the current zoom factor of the imaging device is higher than the preset factor, because there are more pixels. The visual perception of the user is affected, so the imaging device recognizes the picture shake in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences, and further eliminates the image caused by the low-frequency shaking. The screen shaking caused by shaking, that is, the offset problem of pixels in the image is corrected, thereby improving the user's visual perception.
在一些实施例中,考虑到在某些特殊场景下,因所述成像装置低频晃动造成的所述图像序列中的画面抖动是因为成像装置在被自主操作情况下产生的,比如在所述成像装置进行变焦的过程中,或者在所述成像装置被改变朝向(或者说所述成像装置被控制移动的情况)的过程中,在上述场景下因所述成像装置低频晃动造成的所述图像序列中的画面抖动不是无意识产生的,而是因成像装置被自主改变拍摄参数(变焦、朝向等)产生,则所述成像装置可以不识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动,从而保证采集的图像的正确呈现。In some embodiments, it is considered that in some special scenarios, the picture shake in the image sequence caused by the low-frequency shaking of the imaging device is generated when the imaging device is operated autonomously, such as in the imaging device. In the process of zooming by the device, or in the process of changing the orientation of the imaging device (or in the case of the imaging device being controlled to move), the image sequence caused by the low-frequency shaking of the imaging device in the above scenario The picture jitter in the image is not generated unconsciously, but is caused by the imaging device being autonomously changing the shooting parameters (zoom, orientation, etc.), then the imaging device may not recognize the image sequence caused by the low-frequency shaking of the imaging device. The screen shakes, so as to ensure the correct presentation of the captured images.
另外,如果所述成像装置处于目标跟随模式中,即所述成像装置所采集的图像序列被用于进行目标跟踪的情况下,通常跟踪的目标为运动的目标,为实现对跟踪的目标进行拍摄,通常需要成像装置改变自身的朝向或者变焦等,这时候成像装置发生的低频晃动和/或高频抖动导致的图像之间像素的偏移是因为对目标进行跟随拍摄,在这种情况下可以不对所述图像序列进行增稳处理,从而保证采集的图像的正确呈现。In addition, if the imaging device is in the target following mode, that is, when the image sequence collected by the imaging device is used for target tracking, the tracked target is usually a moving target. , usually the imaging device needs to change its orientation or zoom, etc. At this time, the low-frequency shaking and/or high-frequency shaking of the imaging device causes the pixel shift between the images because of the follow-up shooting of the target. In this case, you can The image sequence is not stabilized, thereby ensuring the correct presentation of the acquired images.
在一些实施例中,在基于图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动之后,可以控制所述成像装置根据所述低频晃动修正自身的姿态,例如上述提到图像中的像素会因所述低频晃动产生偏移,则所述成像装置可以获取所述低频晃动对应的第一偏移量,然后根据所述第一偏移量控制所述成像装置运动。本实施例中,通过因低频晃动产生的第一偏移量来修正所述成像装置的姿态,使得所述成像装置更加稳定,有利于保证图像序列的稳定采集。In some embodiments, after identifying the picture shake in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences, the imaging device may be controlled to correct its own posture according to the low-frequency shaking , for example, the pixels in the image mentioned above will be offset due to the low-frequency shaking, the imaging device can obtain the first offset corresponding to the low-frequency shaking, and then control the Imaging device movement. In this embodiment, the posture of the imaging device is corrected by the first offset caused by the low-frequency shaking, so that the imaging device is more stable, which is beneficial to ensure the stable acquisition of the image sequence.
在一些场景中,所述成像装置安装于云台,例如所述成像装置安装于手持云台或者所述成像装置通过所述云台安装于可移动平台,在这种情况下可以通过调整所述云台的姿势来实现修正所述成像装置的姿态,即可以控制所述云台根据所述低频晃动修正自身的姿态,例如可以获取所述低频晃动对应的第一偏移量,然后根据所述第一偏移量控制所述云台运动,更具体说,可以将所述低频晃动对应的第一偏移量转换成云台能够读取的数据格式(比如四元数),再将转换后的第一偏移量发送给所述云台,从而云台可以根据获取的转换后的第一偏移量(比如四元数)来移动自身的云台轴,实现姿态修正过程,从而使得云台和所述成像装置更加稳定,有利于保证图像序列的稳定采集。In some scenarios, the imaging device is mounted on a pan/tilt, for example, the imaging device is mounted on a handheld pan/tilt or the imaging device is mounted on a movable platform through the pan/tilt, in which case the imaging device can be adjusted by adjusting the The posture of the gimbal can be used to correct the posture of the imaging device, that is, the gimbal can be controlled to correct its posture according to the low-frequency shaking. For example, the first offset corresponding to the low-frequency shaking can be obtained, and then the The first offset controls the movement of the gimbal. More specifically, the first offset corresponding to the low-frequency shaking can be converted into a data format (such as a quaternion) that the gimbal can read, and then the converted The first offset is sent to the gimbal, so that the gimbal can move its own gimbal axis according to the obtained converted first offset (such as a quaternion) to realize the attitude correction process, so that the cloud The stage and the imaging device are more stable, which is beneficial to ensure the stable acquisition of the image sequence.
在一些实施例中,所述图像序列中图像的像素因所述低频晃动产生有第一偏移量以及因所述高频抖动产生有第二偏移量;请参阅图4,图4为本申请实施例提供的视频增稳方法的另一种流程示意图,所述方法可以由成像装置执行,所述方法包括:In some embodiments, the pixels of the images in the image sequence have a first offset due to the low-frequency jitter and a second offset due to the high-frequency jitter; please refer to FIG. 4 , which is a schematic diagram of FIG. Another schematic flowchart of a video stabilization method provided by an application embodiment, the method can be executed by an imaging device, and the method includes:
在步骤S201中,获取成像装置采集的图像序列。In step S201, an image sequence acquired by an imaging device is acquired.
在步骤S202中,对于图像序列中的图像,根据预置的关键帧对所述图像进行运动估计,确定所述图像因所述成像装置低频晃动产生的相对于所述关键帧的第一偏移量。In step S202, for the images in the image sequence, perform motion estimation on the images according to preset key frames, and determine a first offset of the images relative to the key frames caused by the low-frequency shaking of the imaging device quantity.
在步骤S203中,对于图像序列中的图像,基于姿态传感器采集的所述成像装置的姿态数据对所述图像进行电子防抖处理,获得所述图像因所述成像装置高频抖动产生的第二偏移量。In step S203, for the images in the image sequence, electronic anti-shake processing is performed on the images based on the attitude data of the imaging device collected by the attitude sensor, and the second image is obtained due to the high-frequency jitter of the imaging device. Offset.
在步骤S204中,根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理。In step S204, stabilization processing is performed on the images in the image sequence according to the first offset and the second offset.
对于步骤S202,在获取所述图像序列之后,所述成像装置可以根据预置的关键帧对所述图像进行运动估计,确定所述图像相对于所述关键帧的第一偏移量,实现采用基于运动估计(Motion Estimation,ME)中全局运动矢量(GMV)来计算图像的运动偏移量(即所述第一偏移量),从而根据所述第一偏移量对所述图像进行增稳处理,提升所述图像的稳定性,消除所述图像因所述低频晃动导致的画面抖动。For step S202, after acquiring the image sequence, the imaging device may perform motion estimation on the image according to a preset key frame, determine a first offset of the image relative to the key frame, and implement the use of The motion offset of the image (that is, the first offset) is calculated based on the global motion vector (GMV) in motion estimation (Motion Estimation, ME), so that the image is increased according to the first offset. The stabilization process improves the stability of the image and eliminates the image shake caused by the low-frequency shaking of the image.
其中,在没有关键帧的情况下,可以将所述图像作为关键帧使用,且所述图像相对于关键帧的所述第一偏移量被置为0;在有关键帧的情况下,所述成像装置通过运动估计过程确定所述图像相对于所述关键帧的第一偏移量。Wherein, in the case of no key frame, the image can be used as a key frame, and the first offset of the image relative to the key frame is set to 0; in the case of a key frame, all The imaging device determines a first offset of the image relative to the key frame through a motion estimation process.
在一些实施例中,在获取所述第一偏移量的过程中,所述成像装置可以先获取所述关键帧中至少一个关键点,所述关键点位于所述关键帧中的第一位置,然后确定所述至少一个关键点在所述图像中的第二位置,最后根据所述第一位置与所述第二位置之间的差异,确定所述图像相对于所述关键帧的第一偏移量。In some embodiments, in the process of acquiring the first offset, the imaging device may first acquire at least one key point in the key frame, and the key point is located at a first position in the key frame , then determine the second position of the at least one key point in the image, and finally determine the first position of the image relative to the key frame according to the difference between the first position and the second position Offset.
其中,所述第一偏移量可以包括沿指定方向的偏移量,比如所述指定方向为图像坐标系下的水平方向或者垂直方向中的至少一种。示例性的,所述第一偏移量可以通过坐标映射表来表示,所述成像装置可以获得描述图像中的像素水平方向坐标变换的坐标映射表和描述图像中的像素垂直方向坐标变换的坐标映射表。The first offset may include an offset along a specified direction, for example, the specified direction is at least one of a horizontal direction or a vertical direction in an image coordinate system. Exemplarily, the first offset may be represented by a coordinate mapping table, and the imaging device may obtain a coordinate mapping table describing the coordinate transformation of the pixels in the horizontal direction in the image and coordinates describing the coordinate transformation of the pixels in the vertical direction in the image. mapping table.
其中,所述关键帧中至少一个关键点可以通过以下方式获得:Wherein, at least one key point in the key frame can be obtained in the following ways:
示例性地,在获取所述关键点的过程中,可以将所述关键帧分割成若干互不重叠的图像块;然后对于每个所述图像块,可以根据所述图像块的中心点和/或所述图像块中梯度最大的至少一个特征点确定所述关键点;例如可以将所述图像块的中心点和所述图像块中梯度最大的至少一个特征点都作为所述关键点,也可以从中心点和所述至少一个特征点中随机选择一个作为所述关键点,还可以将所述图像块中梯度最大的特征点作为所述关键点以提高所述关键点的精度;在获得所述关键帧的至少一个关键点之后,所述成像装置记录所述关键点在所述关键帧中的第一位置的位置信息,比如所述位置信息包括所述关键点在关键帧中对应的水平坐标和垂直坐标。Exemplarily, in the process of acquiring the key points, the key frame can be divided into several non-overlapping image blocks; then for each of the image blocks, the center point of the image block and/ Or at least one feature point with the largest gradient in the image block determines the key point; for example, both the center point of the image block and at least one feature point with the largest gradient in the image block can be used as the key point, or One can be randomly selected from the center point and the at least one feature point as the key point, and the feature point with the largest gradient in the image block can also be used as the key point to improve the accuracy of the key point; After at least one key point of the key frame, the imaging device records the position information of the first position of the key point in the key frame, for example, the position information includes the corresponding position of the key point in the key frame. Horizontal and vertical coordinates.
示例性地,可以对所述关键帧进行特征点提取,比如使用为FAST、SIFT、SURF、SUSAN或Harris检测算法获取特征点,并将提取的特征点作为所述关键点,并且所述成像装置基于所述特征点在关键帧中所处位置记录所述关键点的第一位置的位置信息。Exemplarily, feature point extraction may be performed on the key frame, for example, using FAST, SIFT, SURF, SUSAN or Harris detection algorithm to obtain feature points, and the extracted feature points are used as the key points, and the imaging device The position information of the first position of the key point is recorded based on the position of the feature point in the key frame.
示例性地,可以在所述关键帧中通过均匀采样得到所述关键点,并且所述成像装置记录均匀采样的关键点在所述关键帧中的第一位置的位置信息。Exemplarily, the key point may be obtained by uniform sampling in the key frame, and the imaging device records position information of the first position of the uniformly sampled key point in the key frame.
在一些实施例中,在获取所述关键帧中至少一个关键点之后,所述成像装置可以获取所述至少一个关键点分别在所述关键帧中的第一特征信息;可以理解的是,所述第一特征信息为可以表征所述关键点的信息,本申请实施例对于所述第一特征信息的具体内容不做任何限制,可依据实际应用场景进行具体设置。In some embodiments, after acquiring at least one key point in the key frame, the imaging device may acquire first feature information of the at least one key point in the key frame respectively; it can be understood that the The first feature information is information that can characterize the key point. This embodiment of the present application does not impose any restrictions on the specific content of the first feature information, and can be specifically set according to actual application scenarios.
示例性的,所述第一特征信息可以是所述关键点的梯度信息或者颜色信息等。Exemplarily, the first feature information may be gradient information or color information of the key point, or the like.
示例性的,所述第一特征信息可以包括所述成像装置对所述关键点进行投影得到的特征向量,如对所述关键点进行水平投影或者垂直投影;具体来说,所述第一特征信息可以是将包括所述关键点的预设区域沿指定方向进行投影得到的特征向量。在一个例子中,以所述指定方向为图像坐标系中的水平方向和/或垂直方向为例,可以获取以所述关键点为中心的预设区域往水平方向投影后得到的一维特征向量和/或往垂直 方向投影后得到的一维特征向量;其中,所述预设区域的形状和尺寸可依据实际应用场景进行具体设置,例如所述预设区域为矩形区域。Exemplarily, the first feature information may include a feature vector obtained by projecting the key point by the imaging device, such as performing a horizontal projection or a vertical projection on the key point; specifically, the first feature The information may be a feature vector obtained by projecting a preset area including the key point along a specified direction. In an example, taking the specified direction as the horizontal direction and/or the vertical direction in the image coordinate system as an example, a one-dimensional feature vector obtained by projecting the preset area centered on the key point in the horizontal direction may be obtained and/or a one-dimensional feature vector obtained by projecting in the vertical direction; wherein, the shape and size of the preset area can be specifically set according to the actual application scenario, for example, the preset area is a rectangular area.
另外,考虑到如果所述图像序列是在光照变化较大的情况下采集的,可能导致同一关键点在不同的图像帧的像素值不一致导致匹配失败的问题,因此,为了避免或者降低光照变化所带来的影响,在获取所述关键点的第一特征信息之后,可以将所述关键点的第一特征信息映射到预设范围内,所述预设范围用于使各个所述关键点的亮度变化范围相同,从而有利于去除光照变化的影响,适用于更加复杂的场景。In addition, considering that if the image sequence is collected under the circumstance that the illumination changes greatly, the pixel value of the same key point in different image frames may be inconsistent, which may lead to the problem of matching failure. Therefore, in order to avoid or reduce the illumination changes, the Influence brought by, after obtaining the first feature information of the key points, the first feature information of the key points can be mapped to a preset range, and the preset range is used to make the The brightness variation range is the same, which is beneficial to remove the influence of illumination variation and is suitable for more complex scenes.
示例性的,以所述第一特征信息包括所述成像装置对所述关键点进行投影得到的特征向量为例:为了避免或者降低光照变化所带来的影响,可以将所述关键点投影得到的特征向量映射到预设范围内。在一个例子中,可以获取所有关键点投影得到的特征向量的均值,通过每个关键点投影得到的特征向量与所述均值之差获得所述关键点映射到预设范围内的特征向量。在另一个例子中,可以将所述关键点投影得到的特征向量进行归一化处理,比如可以获取所有关键点投影得到的特征向量的标准差,通过每个关键点投影得到的特征向量与所述标准差的比值获得所述关键点映射到预设范围内的特征向量。Exemplarily, take the first feature information including the feature vector obtained by the imaging device projecting the key point as an example: in order to avoid or reduce the influence of illumination changes, the key point can be projected to obtain The eigenvectors of are mapped to the preset range. In one example, the mean value of the feature vectors obtained by the projection of all key points may be obtained, and the feature vector of the key point mapped to a preset range is obtained by the difference between the feature vector obtained by the projection of each key point and the mean value. In another example, the feature vector obtained by the projection of the key points can be normalized, for example, the standard deviation of the feature vectors obtained by the projection of all key points can be obtained, and the feature vector obtained by the projection of each key point The ratio of the standard deviations is used to obtain the feature vectors that map the key points to a preset range.
在一些实施例中,在获取所述第一偏移量的过程中,需要确定所述关键帧的至少一个关键点在所述图像中的第二位置,具体来说,所述成像装置可以先使用预设的图像转换关系估计所述关键帧中至少一个关键点在所述图像中的第二位置。其中,考虑到所述图像与上一帧图像的差异通常比较小,所述关键点在所述图像中的第二位置与所述关键点在所述上一帧图像中的位置可能相近,因此,所述预设的图像转换关系可以是根据所述关键点在所述关键帧中的第一位置与所述关键点在上一帧图像中的位置之间的位置关系确定的。另外,在所述图像作为关键帧的情况下,所述预设的图像转换关系为单位矩阵。In some embodiments, in the process of acquiring the first offset, it is necessary to determine the second position of at least one key point of the key frame in the image. Specifically, the imaging device may first A second position in the image of at least one key point in the key frame is estimated using a preset image transformation relationship. Wherein, considering that the difference between the image and the image of the previous frame is usually relatively small, the second position of the key point in the image may be similar to the position of the key point in the image of the previous frame, so , the preset image conversion relationship may be determined according to the positional relationship between the first position of the key point in the key frame and the position of the key point in the previous frame of image. In addition, when the image is used as a key frame, the preset image conversion relationship is a unit matrix.
在获取所述至少一个关键点在所述图像中估计的第二位置之后,所述成像装置可以以估计的第二位置为基准获取所述至少一个关键点在不同偏移量下的第二特征信息,进而对于每一个关键点,可以将所述在不同偏移量下的第二特征信息分别与所述第一特征信息进行匹配,然后根据与所述第一特征信息匹配的第二特征信息对应的偏移量调整所述第二位置,从而获取所述关键点在所述图像中的准确的第二位置。After acquiring the estimated second position of the at least one key point in the image, the imaging device may acquire second features of the at least one key point under different offsets based on the estimated second position information, and further for each key point, the second feature information under different offsets can be matched with the first feature information respectively, and then according to the second feature information matched with the first feature information The second position is adjusted by the corresponding offset, so as to obtain the accurate second position of the key point in the image.
示例性的,所述不同偏移量指的是所述关键点沿指定方向偏移的不同距离,比如所述关键点可以沿水平方向和/或垂直方向依次偏移指定数量单位长度的距离,如所述关键点可以沿水平方向依次偏移1个单位长度、2个单位长度、3个单位长度……,从而获得在不同偏移量下的关键点;进一步地,为了降低计算量,所述不同偏移量可以是所述关键点在预设偏移范围内沿指定方向偏移的不同距离,所述预设偏移范围可依据实际应用场景进行具体设置,本实施例对此不做任何限制。Exemplarily, the different offsets refer to different distances by which the key points are offset along a specified direction, for example, the key points may be shifted by a specified number of unit lengths in turn along the horizontal direction and/or the vertical direction, For example, the key points can be shifted by 1 unit length, 2 unit lengths, 3 unit lengths... The different offsets may be different distances by which the key points are offset along a specified direction within a preset offset range, and the preset offset range may be specifically set according to the actual application scenario, which is not done in this embodiment. any restrictions.
在所述图像中,在沿指定方向对所述关键点进行不同距离的偏移之后,所述成像装置可以计算偏移后的关键点的第二特征信息,即所述关键点在不同偏移量下的特征信息。其中,所述第二特征信息的获取方式与所述第一特征信息的获取方式相同,例如所述第二特征信息包括对偏移后的关键点进行投影得到的特征向量,进一步说,所述第二特征信息包括对包含偏移后的关键点的预设区域沿指定方向进行投影得到的特征向量,所述指定方向至少包括在图像坐标中的水平方向和/或垂直方向。In the image, after the key points are shifted by different distances along a specified direction, the imaging device may calculate the second feature information of the shifted key points, that is, the key points are at different offsets. characteristic information under the quantity. The second feature information is obtained in the same manner as the first feature information. For example, the second feature information includes a feature vector obtained by projecting the shifted key points. Further, the The second feature information includes a feature vector obtained by projecting a preset area including the shifted key points along a specified direction, where the specified direction at least includes a horizontal direction and/or a vertical direction in image coordinates.
另外,考虑到光照变化的影响,可以将所述第二特征信息映射到预设范围内,所述预设范围用于使各个所述关键点的亮度变化范围相同。在一个例子中,可以将偏移后的关键点投影得到的特征向量映射到预设范围内。在一个例子中,可以获取在同一偏移量下,所有偏移后的关键点投影得到的特征向量的均值,并获取每个偏移后的关键点投影得到的特征向量与所述均值之差。在另一个例子中,可以将所述偏移后的关键点投影得到的特征向量进行归一化处理,比如可以获取在同一偏移量下所有偏移后的关键点投影得到的特征向量的标准差,获取每个偏移后的关键点投影得到的特征向量与所述标准差的比值。In addition, considering the influence of illumination changes, the second feature information may be mapped into a preset range, and the preset range is used to make the brightness change ranges of each of the key points the same. In one example, the feature vector obtained by projecting the shifted key points can be mapped to a preset range. In one example, under the same offset, the mean value of the feature vectors obtained by the projection of all the shifted key points can be obtained, and the difference between the feature vector obtained by the projection of each shifted key point and the mean value can be obtained . In another example, the feature vector obtained by the projection of the shifted key points may be normalized, for example, the standard of the feature vector obtained by the projection of all the shifted key points under the same offset may be obtained. difference, obtain the ratio of the feature vector obtained by the projection of each shifted key point to the standard deviation.
在获取所述至少一个关键点分别在不同偏移量下的第二特征信息之后,对于每个所述关键点,所述成像装置将所述不同偏移量下的第二特征信息分别与所述第一特征信息进行匹配,比较所述不同偏移量下的第二特征信息分别与所述第一特征信息之间的差异,并根据与所述第一特征信息差异最小的第二特征信息对应的偏移量调整估计的第二位置,从而获取所述关键点在所述图像中较为准确的第二位置,进而所述成像装置可以根据所述关键点的第一位置和调整后的第二位置之间的位置关系,使用坐标变换原理确定所述图像相对于所述关键帧的第一偏移量。After acquiring the second feature information of the at least one key point under different offsets, for each of the key points, the imaging device compares the second feature information under the different offsets with the second feature information respectively. The first feature information is matched, and the differences between the second feature information under the different offsets and the first feature information are compared, and the difference between the second feature information and the first feature information is the smallest according to the second feature information. The estimated second position is adjusted by the corresponding offset, so as to obtain a relatively accurate second position of the key point in the image, and then the imaging device can be based on the first position of the key point and the adjusted first position. The positional relationship between the two positions is determined by using the principle of coordinate transformation to determine the first offset of the image relative to the key frame.
在一些实施例中,在获取所述关键点在所述图像中较为准确的第二位置之后,所述成像装置可以根据所述至少一个关键点的第一位置与调整后的所述第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。In some embodiments, after acquiring a relatively accurate second position of the key point in the image, the imaging device may perform the adjustment according to the first position of the at least one key point and the adjusted second position The positional relationship between them determines the image conversion relationship for the next frame of image.
为了进一步提高所确定的用于下一帧图像的图像转换关系的准确性,所述成像装置可以从所述至少一个关键点中筛选有效关键点,然后根据所述有效关键点的第一位置与调整后的所述第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。In order to further improve the accuracy of the determined image conversion relationship for the next frame of image, the imaging device may filter valid key points from the at least one key point, and then select the valid key point according to the first position of the valid key point and The adjusted positional relationship between the second positions determines an image conversion relationship for the next frame of image.
示例性的,在确定所述有效关键点时,可以使用所述预设的图像转换关系确定位于调整后的第二位置的关键点在所述关键帧中的第三位置,然后确定每一个所述关键点的第三位置相对于所述第一位置的位移量,将其中一个关键点(比如可以随机取一个关键点)的位移量确定为目标位移量,并获取其他关键点的位移量相对于所述目标位移量的误差值,进而从所述至少一个关键点中选取误差值小于预设值的有效关键点;可以重复上述获取有效关键点的过程直到迭代次数达到预设次数,或者到达如下定义所需要迭代次数niters,
Figure PCTCN2021091620-appb-000001
其中,conf表示置信度(例如为0.99),pix_size为关键点总数,num为有效关键点的数量。
Exemplarily, when determining the effective key points, the preset image conversion relationship may be used to determine the third position of the key point located at the adjusted second position in the key frame, and then determine each The displacement of the third position of the key point relative to the first position, the displacement of one of the key points (for example, a key point can be randomly selected) is determined as the target displacement, and the relative displacement of other key points is obtained. based on the error value of the target displacement, and then select from the at least one key point an effective key point with an error value less than a preset value; the above process of obtaining effective key points can be repeated until the number of iterations reaches a preset number, or Define the required number of iterations niters as follows,
Figure PCTCN2021091620-appb-000001
Among them, conf represents the confidence level (for example, 0.99), pix_size is the total number of keypoints, and num is the number of valid keypoints.
当然,也可以使用其他方式来获取所述有效关键点,本实施例对此不做任何限制。Of course, the valid key points may also be acquired in other manners, which are not limited in this embodiment.
在一些实施例中,在所述有效关键点的数量少于预设数量和/或在所述图像相对于关键帧的第一偏移量大于预设偏移阈值的情况下,表明所述图像与所述关键帧的差异较大,此时可以考虑更换关键帧,则可以将所述图像确定为关键帧,并将所述图像相对于关键帧的第一偏移量置为0。In some embodiments, when the number of valid key points is less than a preset number and/or when the first offset of the image relative to the key frame is greater than a preset offset threshold, it is indicated that the image If the difference from the key frame is large, the key frame may be replaced at this time, the image may be determined as the key frame, and the first offset of the image relative to the key frame is set to 0.
在一些实施例中,考虑到在更换所述关键帧之后,由于第一偏移量被突然置为0,原本稳定维持的画面会发生明显的变化,用户会感受到从一个画面突然跳到另一个画面的变化,用户的视觉观感不佳。因此,为了消除或者降低这种变化带来的影响,在获取所述第一偏移量之后,所述成像装置可以使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对所述图像的第一偏移量进行平滑处理,获取所述图像处理后的第一偏移量,从而保证在切换关键帧之后视频画面的平滑过渡,消除或者降低上述提到的变化带来的影响,使得视频画面过渡更加自然。In some embodiments, considering that after the key frame is replaced, since the first offset is suddenly set to 0, the originally stable picture will change significantly, and the user will feel a sudden jump from one picture to another A screen change, the user's visual perception is not good. Therefore, in order to eliminate or reduce the influence of such changes, after acquiring the first offset, the imaging device may use the first offset of at least one stabilized image acquired before the key frame smooth the first offset of the image, and obtain the first offset after the image processing, so as to ensure the smooth transition of the video picture after switching the key frame, eliminate or reduce the above-mentioned change band The influence of the coming, makes the transition of the video picture more natural.
示例性的,为了降低工作量,可以不对所有图像都进行平滑处理,而只需在更换关键帧之后的少量图像进行平滑处理即可,比如在所述关键帧之后的30帧图像需要进行平滑处理,在所述关键帧之后的第31帧开始的图像无需进行平滑处理。或者可以说,如果所述图像是在关键帧之后的预设时长内采集得到的,则可以使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对所述图像的第一偏移量进行平滑处理,获取所述图像处理后的第一偏移量。Exemplarily, in order to reduce the workload, all images may not be smoothed, but only a small number of images after the key frame is replaced, for example, 30 frames of images after the key frame need to be smoothed. , the image starting from the 31st frame after the key frame does not need to be smoothed. Or it can be said that if the image is acquired within a preset time period after the key frame, the first offset of the at least one stabilized image acquired before the key frame can be used to determine the value of the image. Smoothing is performed on the first offset, and the first offset after image processing is obtained.
其中,为了进一步保证视频画面的自然过渡,所述已增稳图像的采集时间与所述关键帧的采集时间的差距应尽量小,即所述已增稳图像与所述关键帧的采集时间间隔小于指定时长,所述指定时长应尽可能小,所述指定时长可依据实际应用场景进行具体设置;例如所述已增稳图像包括所述关键帧的前一帧图像。Wherein, in order to further ensure the natural transition of the video picture, the difference between the acquisition time of the stabilized image and the acquisition time of the key frame should be as small as possible, that is, the acquisition time interval between the stabilized image and the key frame If it is less than the specified duration, the specified duration should be as small as possible, and the specified duration can be specifically set according to the actual application scenario; for example, the stabilized image includes the image of the previous frame of the key frame.
示例性的,所述平滑处理可以包括线性滤波处理,即可以使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对所述图像的第一偏移量进行线性滤波处理,获得处理后的第一偏移量,进一步地,可以使用所述处理后的第一偏移量来消除所述图像因成像装置低频晃动导致的画面抖动。Exemplarily, the smoothing processing may include linear filtering processing, that is, linear filtering may be performed on the first offset of the image by using the first offset of the at least one stabilized image acquired before the key frame. processing, to obtain a processed first offset, and further, the processed first offset may be used to eliminate picture shake of the image caused by low-frequency shaking of the imaging device.
作为例子,可以采用加权平均方式进行线性滤波处理,即所述图像处理后的第一偏移量为所述已增稳图像的第一偏移量和所述图像的第一偏移量加权平均后的结果;其中,为了逐渐降低所述已增稳图像的第一偏移量的影响,随着所述图像与所述关键帧之间的采集时间间隔逐渐增大,所述已增稳图像的第一偏移量对应的权重系数可以逐渐减小,所述图像的第一偏移量对应的权重系数可以逐渐增大,即所述已增稳图像的第一偏移量对应的权重系数与所述采集时间间隔成负相关关系,所述图像的第一偏移量对应的权重系数与所述采集时间间隔成正相关关系。As an example, the linear filtering process may be performed in a weighted average manner, that is, the first offset after the image processing is the weighted average of the first offset of the stabilized image and the first offset of the image where, in order to gradually reduce the influence of the first offset of the stabilized image, as the acquisition time interval between the image and the key frame gradually increases, the stabilized image The weight coefficient corresponding to the first offset of the image may gradually decrease, and the weight coefficient corresponding to the first offset of the image may gradually increase, that is, the weight coefficient corresponding to the first offset of the stabilized image There is a negative correlation with the acquisition time interval, and a weight coefficient corresponding to the first offset of the image is in a positive correlation with the acquisition time interval.
在一些实施例中,为了进一步减少计算量,在识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动之前(或者说获取所述第一偏移量之前),可以先对所述关键帧和所述图像进行下采样处理,并记录下采样率,然后计算下采样后的图像相对于下采样后的关键帧的偏移量,进而使用预先记录的下采样率还原所述采样后的图像相对于下采样后的关键帧的偏移量,得到所述第一偏移量,从而达到减少计算量并提高计算效率的目的,也可以满足某些场景下的实时性需求。In some embodiments, in order to further reduce the amount of calculation, before identifying the picture shake in the image sequence caused by the low-frequency shaking of the imaging device (or before obtaining the first offset), the The key frame and the image are down-sampled, and the down-sampling rate is recorded, and then the offset of the down-sampled image relative to the down-sampled key frame is calculated, and then the pre-recorded down-sampling rate is used to restore the sample The first offset is obtained from the offset of the post-image relative to the down-sampled key frame, so as to achieve the purpose of reducing the amount of calculation and improving the calculation efficiency, and can also meet the real-time requirements in some scenarios.
在一些实施例中,所述低频晃动或者说因所述低频晃动产生的所述第一偏移量还可以用于将所述图像与其他图像进行对齐处理。In some embodiments, the low-frequency shaking or the first offset generated by the low-frequency shaking may also be used to align the image with other images.
在步骤S102中,对于图像序列中的图像,所述成像装置可以基于姿态传感器采集的有关于所述成像装置的姿态数据对所述图像进行电子防抖处理,获得因所述成像装置高频抖动产生的第二偏移量,从而基于所述第二偏移量对所述图像进行增稳处理,提升所述图像的稳定性。In step S102 , for the images in the image sequence, the imaging device may perform electronic anti-shake processing on the images based on the attitude data about the imaging device collected by the attitude sensor, and obtain a result of the high-frequency jitter of the imaging device. The generated second offset, so that stabilization processing is performed on the image based on the second offset to improve the stability of the image.
其中,对所述图像进行电子防抖处理,可以认为是将所述图像与其相邻的图像对齐,然后将所述图像与相邻图像不齐的部分裁剪掉;例如在所述图像曝光后,所述成像装置获取曝光时刻之前一段时间内所述姿态传感器采集的姿态数据,例如所述姿态数据是以四元数形式表示的,所述成像装置可以将以四元数形式表示的姿态数据进行插值并将其转换成旋转矩阵;然后利用姿态传感器与所述成像装置之间的位置关系,根据所述旋转矩阵获得所述第二偏移量。Wherein, performing electronic anti-shake processing on the image can be considered as aligning the image with its adjacent images, and then cropping off the uneven part of the image and the adjacent images; for example, after the image is exposed, The imaging device acquires the attitude data collected by the attitude sensor for a period of time before the exposure time. For example, the attitude data is expressed in the form of quaternions, and the imaging device can perform the attitude data expressed in the form of quaternions. Interpolate and convert it into a rotation matrix; then use the positional relationship between the attitude sensor and the imaging device to obtain the second offset according to the rotation matrix.
所述第二偏移量包括针对于所述图像中各像素的偏移量,例如所述第二偏移量可 以通过坐标映射表来表示,该坐标映射表中的各元素表示针对于所述图像中各像素的偏移量。在一些实施例中,考虑到成像装置的变焦倍数越高,所述图像序列的图像因所述成像装置高频抖动产生抖动的像素越多;因此,为了可以实现对高倍率下的图像进行增稳,所述成像装置可以根据自身当前变焦倍数调整所述第二偏移量,调整后的第二偏移量适应于在所述当前变焦倍数下所述图像的像素数量,使用调整后的第二偏移量进行增稳处理,有利于提升高倍率场景采集的图像的稳定性。The second offset includes an offset relative to each pixel in the image, for example, the second offset can be represented by a coordinate mapping table, and each element in the coordinate mapping table represents an offset relative to the pixel in the image. The offset of each pixel in the image. In some embodiments, considering that the higher the zoom factor of the imaging device is, the more pixels in the image sequence are shaken due to the high-frequency dithering of the imaging device; stable, the imaging device can adjust the second offset according to its own current zoom factor, and the adjusted second offset is adapted to the number of pixels of the image under the current zoom factor, and the adjusted second offset is used. The second offset is used for stabilization processing, which is beneficial to improve the stability of images captured in high-magnification scenes.
在一些实施例中,所述第二偏移量包括沿指定方向的偏移量,例如所述指定方向至少可以是在图像坐标中的水平方向和/或垂直方向。示例性的,所述第二偏移量可以通过坐标映射表来表示,所述成像装置可以获得描述图像中的像素水平方向坐标变换的坐标映射表和描述图像中的像素垂直方向坐标变换的坐标映射表。In some embodiments, the second offset includes an offset along a specified direction, for example, the specified direction may be at least a horizontal direction and/or a vertical direction in image coordinates. Exemplarily, the second offset may be represented by a coordinate mapping table, and the imaging device may obtain a coordinate mapping table describing the coordinate transformation of the pixels in the horizontal direction in the image and coordinates describing the coordinate transformation of the pixels in the vertical direction in the image. mapping table.
在步骤S204中,在获取所述第一偏移量和所述第二偏移量之后,所述成像装置可以根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理。示例性的,可以根据所述第一偏移量和所述第二偏移量获取所述图像的总偏移量,并根据所述总偏移量对所述图像进行增稳处理。示例性,也可以先使用所述第一偏移量对所述图像进行第一增稳处理,再使用所述第二偏移量对经第一增稳处理后的图像进行第二增稳处理。示例性的,可以将所述第一偏移量和所述第二偏移量传输给所述成像装置中的畸变校正(GDC)模块,由所述畸变校正模块对所述图像进行增稳处理。本实施例中,增稳处理后的图像消除了因所述成像装置高频抖动和低频晃动造成的画面抖动,其稳定性有所提高,有利于提高用户的视觉观感。In step S204, after acquiring the first offset and the second offset, the imaging device may, according to the first offset and the second offset, analyze the image Images in the sequence are stabilized. Exemplarily, a total offset of the image may be acquired according to the first offset and the second offset, and stabilization processing is performed on the image according to the total offset. Exemplarily, it is also possible to use the first offset to perform a first stabilization process on the image, and then use the second offset to perform a second stabilization process on the image after the first stabilization process. . Exemplarily, the first offset and the second offset may be transmitted to a distortion correction (GDC) module in the imaging device, and the image is stabilized by the distortion correction module. . In this embodiment, the image after stabilization processing eliminates the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device, and its stability is improved, which is beneficial to improve the user's visual perception.
相应的,请参阅图5,本申请实施例还提供了一种成像装置10,包括图像传感器11以及一个或多个处理器12;Correspondingly, referring to FIG. 5 , an embodiment of the present application further provides an imaging device 10 , including an image sensor 11 and one or more processors 12 ;
所述图像传感器11用于采集图像序列;The image sensor 11 is used to collect image sequences;
所述一个或多个处理器12被单独地或共同地配置成:The one or more processors 12 are individually or collectively configured to:
基于姿态传感器识别因所述成像装置10高频抖动造成的所述图像序列中的画面抖动;Identifying the image shake in the image sequence caused by the high-frequency shake of the imaging device 10 based on the attitude sensor;
基于所述图像序列之间的相对运动识别因所述成像装置10低频晃动造成的所述图像序列中的画面抖动;Identifying screen shake in the image sequence caused by low-frequency shaking of the imaging device 10 based on relative motion between the image sequences;
根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置10高频抖动和低频晃动造成的画面抖动。Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate the image jitter caused by the high-frequency jitter and the low-frequency jitter of the imaging device 10 in the image sequence.
所述处理器12执行存储器中包括的可执行指令,所述可执行指令包括用于执行上述的视频增稳方法的指令。所述处理器12可以是中央处理单元(Central Processing Unit,CPU),还可以是其他通用处理器、数字信号处理器(Digital Signal Processor,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现成可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件等。通用处理器可以是微处理器或者该处理器12也可以是任何常规的处理器等。The processor 12 executes executable instructions included in the memory, and the executable instructions include instructions for executing the above-described video stabilization method. The processor 12 may be a central processing unit (Central Processing Unit, CPU), or other general-purpose processors, digital signal processors (Digital Signal Processor, DSP), application specific integrated circuits (Application Specific Integrated Circuit, ASIC), Off-the-shelf programmable gate array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components, etc. A general purpose processor may be a microprocessor or the processor 12 may be any conventional processor or the like.
在一实施例中,所述高频抖动包括高频旋转产生的抖动和/或高频平移产生的抖动;所述低频晃动包括低频平移产生的晃动。In one embodiment, the high-frequency jitter includes jitter caused by high-frequency rotation and/or jitter caused by high-frequency translation; and the low-frequency jitter includes jitter caused by low-frequency translation.
在一实施例中,所述成像装置10为手持设备,所述低频晃动因用户握持所述手持 设备的部位晃动产生;或者,所述成像装置10搭载于可移动平台,所述低频晃动因可移动平台在运动过程中晃动产生。In one embodiment, the imaging device 10 is a handheld device, and the low-frequency shaking is caused by the shaking of the part where the user holds the handheld device; or, the imaging device 10 is mounted on a movable platform, and the low-frequency shaking is caused by The movable platform shakes during the movement.
在一实施例中,所述成像装置10的变焦倍数越高,所述图像序列的图像因所述成像装置10低频晃动产生偏移的像素越多。In one embodiment, the higher the zoom factor of the imaging device 10 is, the more pixels in the image sequence are shifted due to the low-frequency shaking of the imaging device 10 .
在一实施例中,所述处理器12还用于:在所述成像装置10当前变焦倍数高于预设倍数的情况下,基于所述图像序列之间的相对运动识别因所述成像装置10低频晃动造成的所述图像序列中的画面抖动。In one embodiment, the processor 12 is further configured to: in the case that the current zoom factor of the imaging device 10 is higher than a preset factor, identify the cause of the imaging device 10 based on the relative motion between the image sequences. Picture shaking in the image sequence caused by low frequency shaking.
在一实施例中,所述处理器12还用于:在所述成像装置10进行变焦的过程中、或者在所述成像装置10被改变朝向的过程中,不识别因所述成像装置10低频晃动造成的所述图像序列中的画面抖动。In an embodiment, the processor 12 is further configured to: during the process of zooming by the imaging device 10 or during the process of changing the orientation of the imaging device 10, not to identify the low frequency caused by the imaging device 10 Shaking in the image sequence caused by shaking.
在一实施例中,所述处理器12还用于:在使用所述图像序列进行目标跟踪的情况下,不对所述图像序列进行增稳处理。In one embodiment, the processor 12 is further configured to: in the case of using the image sequence for target tracking, not to perform stabilization processing on the image sequence.
在一实施例中,所述处理器12还用于:控制所述成像装置10根据所述低频晃动修正自身的姿态。In an embodiment, the processor 12 is further configured to: control the imaging device 10 to correct its own posture according to the low-frequency shaking.
在一实施例中,所述处理器12还用于:获取所述低频晃动对应的第一偏移量,并根据所述第一偏移量控制所述成像装置10运动。In an embodiment, the processor 12 is further configured to: acquire a first offset corresponding to the low-frequency shaking, and control the imaging device 10 to move according to the first offset.
在一实施例中,所述成像装置10安装于云台。所述处理器12还用于:根据所述低频晃动获得第一偏移量,并根据所述第一偏移量控制所述云台运动。In one embodiment, the imaging device 10 is installed on a pan/tilt head. The processor 12 is further configured to: obtain a first offset according to the low-frequency shaking, and control the movement of the gimbal according to the first offset.
在一实施例中,所述图像序列中的图像因所述低频晃动对应有第一偏移量以及因所述高频抖动对应有第二偏移量。In one embodiment, the images in the sequence of images correspond to a first offset due to the low-frequency shaking and a second offset corresponds to the high-frequency jitter.
所述处理器12还用于:根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理。The processor 12 is further configured to: perform stabilization processing on the images in the image sequence according to the first offset and the second offset.
在一实施例中,所述第一偏移量包括沿指定方向的偏移量;和/或,所述第二偏移量包括沿指定方向的偏移量。In one embodiment, the first offset includes an offset in a specified direction; and/or the second offset includes an offset in a specified direction.
在一实施例中,所述处理器12还用于:对于图像序列中的图像,基于姿态传感器采集的所述成像装置10的姿态数据对所述图像进行电子防抖处理,获得所述图像因所述成像装置10高频抖动产生的第二偏移量。In an embodiment, the processor 12 is further configured to: for the images in the image sequence, perform electronic anti-shake processing on the images based on the attitude data of the imaging device 10 collected by the attitude sensor, and obtain the image factors. The second offset is generated by the high frequency jitter of the imaging device 10 .
在一实施例中,所述第二偏移量包括针对于所述图像中各像素的偏移量。In one embodiment, the second offset includes an offset for each pixel in the image.
所述处理器12还用于:根据所述成像装置10当前变焦倍数调整所述第二偏移量;调整后的第二偏移量适应于在所述当前变焦倍数下所述图像的像素数量。The processor 12 is further configured to: adjust the second offset according to the current zoom factor of the imaging device 10; the adjusted second offset is adapted to the number of pixels of the image under the current zoom factor .
在一实施例中,所述处理器12还用于:对于图像序列中的图像,根据预置的关键帧对所述图像进行运动估计,确定所述图像因所述成像装置10低频晃动产生的相对于所述关键帧的第一偏移量。In one embodiment, the processor 12 is further configured to: for the images in the image sequence, perform motion estimation on the images according to preset key frames, and determine whether the images are caused by low-frequency shaking of the imaging device 10 . The first offset relative to the keyframe.
在一实施例中,在没有关键帧的情况下,所述图像作为关键帧使用,且所述第一偏移量被置为0。In one embodiment, in the absence of a key frame, the image is used as a key frame, and the first offset is set to 0.
在一实施例中,所述处理器12还用于:获取所述关键帧中至少一个关键点,所述关键点位于所述关键帧中的第一位置;确定所述至少一个关键点在所述图像中的第二位置;根据所述第一位置与所述第二位置之间的差异,确定所述图像相对于所述关键帧的第一偏移量。In one embodiment, the processor 12 is further configured to: acquire at least one key point in the key frame, where the key point is located at a first position in the key frame; determine where the at least one key point is located; a second position in the image; and determining a first offset of the image relative to the key frame according to the difference between the first position and the second position.
在一实施例中,所述处理器12还用于:将所述关键帧分割成若干图像块;对于每 个所述图像块,根据所述图像块的中心点和/或所述图像块中梯度最大的至少一个特征点确定所述关键点。In an embodiment, the processor 12 is further configured to: divide the key frame into several image blocks; for each image block, according to the center point of the image block and/or the At least one feature point with the largest gradient determines the keypoint.
在一实施例中,所述处理器12还用于:获取所述至少一个关键点分别在所述关键帧中的第一特征信息;使用预设的图像转换关系估计所述关键帧中至少一个关键点在所述图像中的第二位置;在所述图像中,获取所述至少一个关键点分别在不同偏移量下的第二特征信息;对于每个所述关键点,将所述不同偏移量下的第二特征信息分别与所述第一特征信息进行匹配,并根据与所述第一特征信息匹配的第二特征信息对应的偏移量调整所述第二位置。In an embodiment, the processor 12 is further configured to: acquire first feature information of the at least one key point in the key frame respectively; use a preset image conversion relationship to estimate at least one of the key frames; The second position of the key point in the image; in the image, obtain the second feature information of the at least one key point under different offsets; for each of the key points, the different The second feature information under the offset is respectively matched with the first feature information, and the second position is adjusted according to the offset corresponding to the second feature information matched with the first feature information.
在一实施例中,所述不同偏移量是在预设偏移范围内沿指定方向偏移的不同距离。In one embodiment, the different offset amounts are different distances offset along a specified direction within a preset offset range.
所述匹配的第二特征信息与所述第一特征信息的差异最小。The matched second feature information has the smallest difference from the first feature information.
在一实施例中,所述第一特征信息包括:对所述关键点进行投影得到的特征向量;和/或,所述第二特征信息包括:对所述关键点进行投影得到的特征向量。In an embodiment, the first feature information includes: a feature vector obtained by projecting the key point; and/or the second feature information includes: a feature vector obtained by projecting the key point.
在一实施例中,所述第一特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量;和/或,所述第二特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量。In an embodiment, the first feature information includes: a feature vector obtained by projecting a preset area including the key point along a specified direction; and/or the second feature information includes: The feature vector obtained by projecting the preset area of the keypoint along the specified direction.
在一实施例中,所述指定方向至少包括在图像坐标中的水平方向和/或垂直方向。In one embodiment, the specified direction includes at least a horizontal direction and/or a vertical direction in image coordinates.
在一实施例中,所述特征向量为映射到预设范围内的特征向量,所述预设范围用于使各个所述关键点的亮度变化范围相同。In an embodiment, the feature vector is a feature vector mapped to a preset range, and the preset range is used to make the luminance variation range of each of the key points the same.
在一实施例中,所述映射到预设范围内的特征向量包括:所述投影得到的特征向量与特征向量均值之差、或者所述投影得到的特征向量归一化的结果。In an embodiment, the feature vector mapped to the preset range includes: the difference between the feature vector obtained by the projection and the mean value of the feature vector, or the result of normalization of the feature vector obtained by the projection.
在一实施例中,所述预设的图像转换关系根据所述关键点在所述关键帧中的第一位置与所述关键点在上一帧图像中的位置之间的位置关系确定。In one embodiment, the preset image conversion relationship is determined according to a positional relationship between a first position of the key point in the key frame and a position of the key point in a previous frame of image.
在一实施例中,在所述图像作为关键帧的情况下,所述预设的图像转换关系为单位矩阵。In an embodiment, when the image is used as a key frame, the preset image conversion relationship is an identity matrix.
在一实施例中,所述处理器12还用于:根据所述至少一个关键点的第一位置与调整后的第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。In one embodiment, the processor 12 is further configured to: determine an image conversion relationship for the next frame of image according to the positional relationship between the first position of the at least one key point and the adjusted second position .
在一实施例中,所述处理器12还用于:从所述至少一个关键点中筛选有效关键点;根据所述有效关键点的第一位置与调整后的第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。In one embodiment, the processor 12 is further configured to: filter valid key points from the at least one key point; according to the positional relationship between the first position of the valid key point and the adjusted second position , to determine the image conversion relationship for the next frame of image.
在一实施例中,所述处理器12还用于:使用所述预设的图像转换关系确定位于调整后的第二位置的关键点在所述关键帧中的第三位置;对于每一个关键点,确定所述关键点的所述第三位置相对于所述第一位置的位移量;将其中一个关键点的位移量确定为目标位移量,并获取其他关键点的位移量相对于所述目标位移量的误差值;从所述至少一个关键点中选取误差值小于预设值的有效关键点。In one embodiment, the processor 12 is further configured to: use the preset image conversion relationship to determine the third position of the key point at the adjusted second position in the key frame; for each key point, determine the displacement of the third position of the key point relative to the first position; determine the displacement of one of the key points as the target displacement, and obtain the displacement of other key points relative to the The error value of the target displacement; from the at least one key point, select an effective key point whose error value is less than the preset value.
在一实施例中,所述处理器12还用于:在所述有效关键点的数量少于预设数量的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。In one embodiment, the processor 12 is further configured to: determine the image as a key frame when the number of valid key points is less than a preset number, and determine the first offset Set to 0.
在一实施例中,所述处理器12还用于:在所述第一偏移量大于预设偏移阈值的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。In an embodiment, the processor 12 is further configured to: when the first offset is greater than a preset offset threshold, determine the image as a key frame, and convert the first offset into a key frame. The amount is set to 0.
在一实施例中,在所述确定所述图像相对于所述关键帧的第一偏移量之后,所述处理器12还用于:使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对 所述图像的第一偏移量进行平滑处理,获取所述图像处理后的第一偏移量。In one embodiment, after the determining of the first offset of the image relative to the key frame, the processor 12 is further configured to: use at least one stabilized image acquired before the key frame The first offset of the image is processed by smoothing the first offset of the image to obtain the processed first offset of the image.
在一实施例中,在所述图像是在采集所述关键帧之后的预设时长内采集得到的情况下,对所述图像的第一偏移量进行平滑处理。In an embodiment, when the image is acquired within a preset time period after acquiring the key frame, smoothing is performed on the first offset of the image.
在一实施例中,所述已增稳图像与所述关键帧的采集时间间隔小于指定时长。In one embodiment, the acquisition time interval between the stabilized image and the key frame is less than a specified time period.
在一实施例中,所述已增稳图像包括所述关键帧的前一帧图像。In one embodiment, the stabilized image includes a previous frame image of the key frame.
在一实施例中,所述平滑处理包括线性滤波处理。In one embodiment, the smoothing process includes a linear filtering process.
在一实施例中,所述图像处理后的第一偏移量为所述已增稳图像的第一偏移量和所述图像的第一偏移量加权平均后的结果。In one embodiment, the first offset after image processing is a weighted average result of the first offset of the stabilized image and the first offset of the image.
在一实施例中,随着所述图像与所述关键帧之间的采集时间间隔逐渐增大,所述已增稳图像的第一偏移量对应的权重系数逐渐减小,所述图像的第一偏移量对应的权重系数逐渐增大。In an embodiment, as the acquisition time interval between the image and the key frame gradually increases, the weight coefficient corresponding to the first offset of the stabilized image gradually decreases, and the The weight coefficient corresponding to the first offset gradually increases.
在一实施例中,所述处理器12还用于:根据所述第一偏移量和所述第二偏移量获取所述图像的总偏移量,并根据所述总偏移量对所述图像进行增稳处理。In an embodiment, the processor 12 is further configured to: acquire a total offset of the image according to the first offset and the second offset, and pair the image according to the total offset. The image is stabilized.
在一实施例中,所述处理器12还用于:使用所述第一偏移量对所述图像进行第一增稳处理,再使用所述第二偏移量对经第一增稳处理后的图像进行第二增稳处理。In one embodiment, the processor 12 is further configured to: use the first offset to perform a first stabilization process on the image, and then use the second offset to perform a first stabilization process on the image. The resulting image is subjected to a second stabilization process.
在一实施例中,所述图像和所述关键帧为经过下采样后获得的。In one embodiment, the image and the key frame are obtained after down-sampling.
对于装置实施例而言,由于其基本对应于方法实施例,所以相关之处参见方法实施例的部分说明即可。这里描述的各种实施方式可以使用例如计算机软件、硬件或其任何组合的计算机可读介质来实施。对于硬件实施,这里描述的实施方式可以通过使用特定用途集成电路(ASIC)、数字信号处理器(DSP)、数字信号处理装置(DSPD)、可编程逻辑装置(PLD)、现场可编程门阵列(FPGA)、处理器、控制器、微控制器、微处理器、被设计为执行这里描述的功能的电子单元中的至少一种来实施。对于软件实施,诸如过程或功能的实施方式可以与允许执行至少一种功能或操作的单独的软件模块来实施。软件代码可以由以任何适当的编程语言编写的软件应用程序(或程序)来实施,软件代码可以存储在存储器中并且由控制器执行。For the apparatus embodiments, since they basically correspond to the method embodiments, reference may be made to the partial descriptions of the method embodiments for related parts. The various embodiments described herein can be implemented using computer readable media such as computer software, hardware, or any combination thereof. For hardware implementation, the embodiments described herein can be implemented using application specific integrated circuits (ASICs), digital signal processors (DSPs), digital signal processing devices (DSPDs), programmable logic devices (PLDs), field programmable gate arrays ( FPGA), processors, controllers, microcontrollers, microprocessors, electronic units designed to perform the functions described herein are implemented. For software implementation, embodiments such as procedures or functions may be implemented with separate software modules that allow the performance of at least one function or operation. The software codes may be implemented by a software application (or program) written in any suitable programming language, which may be stored in memory and executed by a controller.
相应的,请参阅图6,本申请实施例还提供了一种手持云台20,包括姿态传感器21和上述的成像装置10;其中,所述姿态传感器21用于采集所述成像装置的姿态数据。Correspondingly, referring to FIG. 6 , an embodiment of the present application further provides a handheld pan/tilt head 20 , including an attitude sensor 21 and the above-mentioned imaging device 10 ; wherein, the attitude sensor 21 is used to collect attitude data of the imaging device .
在一实施例中,所述手持云台包括有云台轴,所述云台轴用于改变所述成像装置的姿态。In one embodiment, the hand-held pan/tilt head includes a pan/tilt head shaft, and the pan/tilt head shaft is used to change the posture of the imaging device.
相应的,请参阅图7,本申请实施例还提供了一种可移动平台30,包括:Correspondingly, referring to FIG. 7 , an embodiment of the present application further provides a movable platform 30, including:
机体31; body 31;
动力系统32,安装于所述机体31,用于驱动所述可移动平台30移动;a power system 32, mounted on the body 31, for driving the movable platform 30 to move;
上述的安装于所述机体31的成像装置10;The above-mentioned imaging device 10 installed on the body 31;
以及姿态传感器33,安装于所述机体31,用于采集所述成像装置10的姿态数据。and an attitude sensor 33 , which is installed on the body 31 and used to collect attitude data of the imaging device 10 .
在一实施例中,所述可移动平台包括但不限于无人机、无人驾驶车辆、无人驾驶船只或者可移动机器人等。In one embodiment, the movable platform includes, but is not limited to, an unmanned aerial vehicle, an unmanned vehicle, an unmanned vessel, or a mobile robot.
在示例性实施例中,还提供了一种包括指令的非临时性计算机可读存储介质,例如包括指令的存储器,上述指令可由装置的处理器执行以完成上述方法。例如,非临 时性计算机可读存储介质可以是ROM、随机存取存储器(RAM)、CD-ROM、磁带、软盘和光数据存储设备等。In an exemplary embodiment, there is also provided a non-transitory computer-readable storage medium, such as a memory including instructions, executable by a processor of an apparatus to perform the above-described method. For example, the non-transitory computer-readable storage medium may be ROM, random access memory (RAM), CD-ROM, magnetic tape, floppy disk, optical data storage device, and the like.
一种非临时性计算机可读存储介质,当存储介质中的指令由终端的处理器执行时,使得终端能够执行上述方法。A non-transitory computer-readable storage medium, when the instructions in the storage medium are executed by the processor of the terminal, enable the terminal to execute the above method.
需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that, in this document, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply any relationship between these entities or operations. any such actual relationship or sequence exists. The terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion such that a process, method, article or device comprising a list of elements includes not only those elements, but also other not expressly listed elements, or also include elements inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.
以上对本申请实施例所提供的方法和装置进行了详细介绍,本文中应用了具体个例对本申请的原理及实施方式进行了阐述,以上实施例的说明只是用于帮助理解本申请的方法及其核心思想;同时,对于本领域的一般技术人员,依据本申请的思想,在具体实施方式及应用范围上均会有改变之处,综上所述,本说明书内容不应理解为对本申请的限制。The methods and devices provided by the embodiments of the present application have been introduced in detail above, and specific examples are used to illustrate the principles and implementations of the present application. At the same time, for those of ordinary skill in the art, according to the idea of the application, there will be changes in the specific implementation and application scope. In summary, the content of this specification should not be construed as a limitation to the application. .

Claims (87)

  1. 一种视频增稳方法,其特征在于,包括:A video stabilization method, comprising:
    获取成像装置采集的图像序列;acquiring a sequence of images acquired by the imaging device;
    基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动;Identifying, based on an attitude sensor, image shake in the image sequence caused by high-frequency shake of the imaging device;
    基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动;Identifying screen shake in the image sequence due to low-frequency shaking of the imaging device based on relative motion between the image sequences;
    根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
  2. 根据权利要求1所述的方法,其特征在于,所述高频抖动包括高频旋转产生的抖动和/或高频平移产生的抖动;所述低频晃动包括低频平移产生的晃动。The method according to claim 1, wherein the high-frequency jitter includes jitter generated by high-frequency rotation and/or jitter generated by high-frequency translation; and the low-frequency jitter includes jitter generated by low-frequency translation.
  3. 根据权利要求1或2所述的方法,其特征在于,所述成像装置为手持设备,所述低频晃动因用户握持所述手持设备的部位晃动产生;The method according to claim 1 or 2, wherein the imaging device is a handheld device, and the low-frequency shaking is generated by the shaking of a part where a user holds the handheld device;
    或者,所述成像装置搭载于可移动平台,所述低频晃动因可移动平台在运动过程中晃动产生。Alternatively, the imaging device is mounted on a movable platform, and the low-frequency shaking is generated by the shaking of the movable platform during movement.
  4. 根据权利要求1所述的方法,其特征在于,所述成像装置的变焦倍数越高,所述图像序列的图像因所述成像装置低频晃动产生偏移的像素越多。The method according to claim 1, wherein the higher the zoom factor of the imaging device is, the more pixels in the image sequence are shifted due to low-frequency shaking of the imaging device.
  5. 根据权利要求4所述的方法,其特征在于,所述基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动,包括:The method according to claim 4, wherein the identifying the picture shake in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences comprises:
    在所述成像装置当前变焦倍数高于预设倍数的情况下,基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。In the case that the current zoom factor of the imaging device is higher than a preset factor, the image shake in the image sequence caused by the low-frequency shaking of the imaging device is identified based on the relative motion between the image sequences.
  6. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    在所述成像装置进行变焦的过程中、或者在所述成像装置被改变朝向的过程中,不识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。During the process of zooming by the imaging device, or during the process of changing the orientation of the imaging device, the picture shaking in the image sequence caused by the low-frequency shaking of the imaging device is not recognized.
  7. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method according to claim 1, wherein the method further comprises:
    在使用所述图像序列进行目标跟踪的情况下,不对所述图像序列进行增稳处理。In the case of using the image sequence for target tracking, no stabilization processing is performed on the image sequence.
  8. 根据权利要求1所述的方法,其特征在于,所述方法还包括:控制所述成像装置根据所述低频晃动修正自身的姿态。The method according to claim 1, wherein the method further comprises: controlling the imaging device to correct its own posture according to the low-frequency shaking.
  9. 根据权利要求8所述的方法,其特征在于,所述控制所述成像装置根据所述低频晃动修正自身的姿态,包括:The method according to claim 8, wherein the controlling the imaging device to correct its own posture according to the low-frequency shaking comprises:
    获取所述低频晃动对应的第一偏移量,并根据所述第一偏移量控制所述成像装置运动。A first offset corresponding to the low-frequency shaking is acquired, and the imaging device is controlled to move according to the first offset.
  10. 根据权利要求8所述的方法,其特征在于,所述成像装置安装于云台;The method according to claim 8, wherein the imaging device is installed on a pan/tilt;
    所述控制所述成像装置根据所述低频晃动修正自身的姿态,包括:The controlling the imaging device to correct its posture according to the low-frequency shaking includes:
    根据所述低频晃动获得第一偏移量,并根据所述第一偏移量控制所述云台运动。A first offset is obtained according to the low-frequency shaking, and the movement of the gimbal is controlled according to the first offset.
  11. 根据权利要求1所述的方法,其特征在于,所述图像序列中的图像因所述低频晃动对应有第一偏移量以及因所述高频抖动对应有第二偏移量;The method according to claim 1, wherein the images in the image sequence correspond to a first offset due to the low-frequency jitter and have a second offset due to the high-frequency jitter;
    所述根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,包括:The performing image stabilization on the image sequence according to the high-frequency jitter and the low-frequency jitter includes:
    根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理。Stabilization is performed on the images in the sequence of images based on the first offset and the second offset.
  12. 根据权利要求11所述的方法,其特征在于,所述第一偏移量包括沿指定方向的偏移量;和/或,所述第二偏移量包括沿指定方向的偏移量。The method according to claim 11, wherein the first offset includes an offset along a specified direction; and/or the second offset includes an offset along a specified direction.
  13. 根据权利要求11所述的方法,其特征在于,所述基于姿态传感器识别因所述成像 装置高频抖动造成的所述图像序列中的画面抖动,包括:The method according to claim 11, wherein the identifying the picture shake in the image sequence caused by the high-frequency shaking of the imaging device based on the gesture sensor comprises:
    对于图像序列中的图像,基于姿态传感器采集的所述成像装置的姿态数据对所述图像进行电子防抖处理,获得所述图像因所述成像装置高频抖动产生的第二偏移量。For the images in the image sequence, electronic anti-shake processing is performed on the images based on the attitude data of the imaging device collected by the attitude sensor, so as to obtain a second offset of the images caused by the high-frequency jitter of the imaging device.
  14. 根据权利要求11或13所述的方法,其特征在于,所述第二偏移量包括针对于所述图像中各像素的偏移量;The method according to claim 11 or 13, wherein the second offset comprises an offset for each pixel in the image;
    所述方法还包括:The method also includes:
    根据所述成像装置当前变焦倍数调整所述第二偏移量;调整后的第二偏移量适应于在所述当前变焦倍数下所述图像的像素数量。The second offset is adjusted according to the current zoom factor of the imaging device; the adjusted second offset is adapted to the number of pixels of the image at the current zoom factor.
  15. 根据权利要求11所述的方法,其特征在于,所述基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动,包括:The method according to claim 11, wherein the identifying the picture shake in the image sequence caused by the low-frequency shaking of the imaging device based on the relative motion between the image sequences comprises:
    对于图像序列中的图像,根据预置的关键帧对所述图像进行运动估计,确定所述图像因所述成像装置低频晃动产生的相对于所述关键帧的第一偏移量。For the images in the image sequence, motion estimation is performed on the images according to preset key frames, and a first offset of the images relative to the key frames caused by low-frequency shaking of the imaging device is determined.
  16. 根据权利要求15所述的方法,其特征在于,在没有关键帧的情况下,所述图像作为关键帧使用,且所述第一偏移量被置为0。16. The method according to claim 15, wherein in the absence of a key frame, the image is used as a key frame, and the first offset is set to 0.
  17. 根据权利要求15所述的方法,其特征在于,所述根据预置的关键帧对所述图像进行运动估计,确定所述图像因所述成像装置低频晃动产生的相对于所述关键帧的第一偏移量,包括:The method according to claim 15, wherein the motion estimation is performed on the image according to a preset key frame, and the first motion of the image relative to the key frame is determined due to low-frequency shaking of the imaging device. an offset, including:
    获取所述关键帧中至少一个关键点,所述关键点位于所述关键帧中的第一位置;acquiring at least one key point in the key frame, the key point is located at a first position in the key frame;
    确定所述至少一个关键点在所述图像中的第二位置;determining a second position of the at least one keypoint in the image;
    根据所述第一位置与所述第二位置之间的差异,确定所述图像相对于所述关键帧的第一偏移量。A first offset of the image relative to the key frame is determined based on the difference between the first position and the second position.
  18. 根据权利要求17所述的方法,其特征在于,所述获取所述关键帧中至少一个关键点,包括:The method according to claim 17, wherein the acquiring at least one key point in the key frame comprises:
    将所述关键帧分割成若干图像块;dividing the key frame into several image blocks;
    对于每个所述图像块,根据所述图像块的中心点和/或所述图像块中梯度最大的至少一个特征点确定所述关键点。For each of the image blocks, the key point is determined according to the center point of the image block and/or at least one feature point with the largest gradient in the image block.
  19. 根据权利要求17所述的方法,其特征在于,所述确定所述至少一个关键点在所述图像中的第二位置包括:The method of claim 17, wherein the determining the second position of the at least one key point in the image comprises:
    获取所述至少一个关键点分别在所述关键帧中的第一特征信息;acquiring first feature information of the at least one key point in the key frame respectively;
    使用预设的图像转换关系估计所述关键帧中至少一个关键点在所述图像中的第二位置;estimating a second position in the image of at least one key point in the key frame using a preset image conversion relationship;
    在所述图像中,获取所述至少一个关键点分别在不同偏移量下的第二特征信息;In the image, acquiring second feature information of the at least one key point under different offsets;
    对于每个所述关键点,将所述不同偏移量下的第二特征信息分别与所述第一特征信息进行匹配,并根据与所述第一特征信息匹配的第二特征信息对应的偏移量调整所述第二位置。For each of the key points, the second feature information under the different offsets is respectively matched with the first feature information, and according to the offset corresponding to the second feature information matched with the first feature information The shift amount adjusts the second position.
  20. 根据权利要求19所述的方法,其特征在于,所述不同偏移量是在预设偏移范围内沿指定方向偏移的不同距离;The method according to claim 19, wherein the different offsets are different distances offset along a specified direction within a preset offset range;
    所述匹配的第二特征信息与所述第一特征信息的差异最小。The matched second feature information has the smallest difference from the first feature information.
  21. 根据权利要求19所述的方法,其特征在于,所述第一特征信息包括:对所述关键点进行投影得到的特征向量;The method according to claim 19, wherein the first feature information comprises: a feature vector obtained by projecting the key point;
    和/或,所述第二特征信息包括:对所述关键点进行投影得到的特征向量。And/or, the second feature information includes: a feature vector obtained by projecting the key point.
  22. 根据权利要求19或21所述的方法,其特征在于,所述第一特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量;The method according to claim 19 or 21, wherein the first feature information comprises: a feature vector obtained by projecting a preset area including the key point along a specified direction;
    和/或,所述第二特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量。And/or, the second feature information includes: a feature vector obtained by projecting a preset area including the key point along a specified direction.
  23. 根据权利要求12、20或22任意一项所述的方法,其特征在于,所述指定方向至少包括在图像坐标中的水平方向和/或垂直方向。The method according to any one of claims 12, 20 or 22, wherein the specified direction includes at least a horizontal direction and/or a vertical direction in image coordinates.
  24. 根据权利要求21或22所述的方法,其特征在于,所述特征向量为映射到预设范围内的特征向量,所述预设范围用于使各个所述关键点的亮度变化范围相同。The method according to claim 21 or 22, wherein the feature vector is a feature vector mapped to a preset range, and the preset range is used to make the luminance variation range of each of the key points the same.
  25. 根据权利要求24所述的方法,其特征在于,所述映射到预设范围内的特征向量包括:所述投影得到的特征向量与特征向量均值之差、或者所述投影得到的特征向量归一化的结果。The method according to claim 24, wherein the feature vector mapped to the preset range comprises: the difference between the feature vector obtained by the projection and the mean value of the feature vector, or the normalization of the feature vector obtained by the projection result of .
  26. 根据权利要求19所述的方法,其特征在于,所述预设的图像转换关系根据所述关键点在所述关键帧中的第一位置与所述关键点在上一帧图像中的位置之间的位置关系确定。The method according to claim 19, wherein the preset image conversion relationship is based on a first position of the key point in the key frame and a position of the key point in a previous frame of image The positional relationship between them is determined.
  27. 根据权利要求19所述的方法,其特征在于,在所述图像作为关键帧的情况下,所述预设的图像转换关系为单位矩阵。The method according to claim 19, wherein when the image is used as a key frame, the preset image conversion relationship is a unit matrix.
  28. 根据权利要求19所述的方法,其特征在于,还包括:The method of claim 19, further comprising:
    根据所述至少一个关键点的第一位置与调整后的第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。An image conversion relationship for the next frame of image is determined according to the positional relationship between the first position of the at least one key point and the adjusted second position.
  29. 根据权利要求28所述的方法,其特征在于,所述确定用于下一帧图像的图像转换关系,包括:The method according to claim 28, wherein the determining the image conversion relationship for the next frame of image comprises:
    从所述至少一个关键点中筛选有效关键点;screening valid keypoints from the at least one keypoint;
    根据所述有效关键点的第一位置与调整后的第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。According to the positional relationship between the first position of the effective key point and the adjusted second position, an image conversion relationship for the next frame of image is determined.
  30. 根据权利要求29所述方法,其特征在于,所述从所述至少一个关键点中筛选有效关键点,包括:The method according to claim 29, wherein the screening of valid key points from the at least one key point comprises:
    使用所述预设的图像转换关系确定位于调整后的第二位置的关键点在所述关键帧中的第三位置;Using the preset image conversion relationship to determine the third position of the key point located at the adjusted second position in the key frame;
    对于每一个关键点,确定所述关键点的所述第三位置相对于所述第一位置的位移量;For each key point, determining the displacement amount of the third position of the key point relative to the first position;
    将其中一个关键点的位移量确定为目标位移量,并获取其他关键点的位移量相对于所述目标位移量的误差值;Determine the displacement of one of the key points as the target displacement, and obtain the error value of the displacement of other key points relative to the target displacement;
    从所述至少一个关键点中选取误差值小于预设值的有效关键点。Valid key points whose error value is less than a preset value are selected from the at least one key point.
  31. 根据权利要求29所述的方法,其特征在于,还包括:The method of claim 29, further comprising:
    在所述有效关键点的数量少于预设数量的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。When the number of valid key points is less than a preset number, the image is determined as a key frame, and the first offset is set to 0.
  32. 根据权利要求15所述的方法,其特征在于,还包括:The method of claim 15, further comprising:
    在所述第一偏移量大于预设偏移阈值的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。When the first offset is greater than a preset offset threshold, the image is determined as a key frame, and the first offset is set to 0.
  33. 根据权利要求15所述的方法,其特征在于,在所述确定所述图像相对于所述关 键帧的第一偏移量之后,还包括:The method of claim 15, wherein after the determining the first offset of the image relative to the key frame, further comprising:
    使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对所述图像的第一偏移量进行平滑处理,获取所述图像处理后的第一偏移量。Smoothing the first offset of the image by using the first offset of the at least one stabilized image collected before the key frame, to obtain the processed first offset of the image.
  34. 根据权利要求33所述的方法,其特征在于,在所述图像是在采集所述关键帧之后的预设时长内采集得到的情况下,对所述图像的第一偏移量进行平滑处理。The method according to claim 33, wherein when the image is acquired within a preset time period after acquiring the key frame, smoothing is performed on the first offset of the image.
  35. 根据权利要求33所述的方法,其特征在于,所述已增稳图像与所述关键帧的采集时间间隔小于指定时长。The method according to claim 33, wherein the acquisition time interval between the stabilized image and the key frame is less than a specified duration.
  36. 根据权利要求33所述的方法,其特征在于,所述已增稳图像包括所述关键帧的前一帧图像。34. The method of claim 33, wherein the stabilized image comprises an image of a previous frame of the key frame.
  37. 根据权利要求33所述的方法,其特征在于,所述平滑处理包括线性滤波处理。34. The method of claim 33, wherein the smoothing process comprises a linear filtering process.
  38. 根据权利要求33或37所述的方法,其特征在于,所述图像处理后的第一偏移量为所述已增稳图像的第一偏移量和所述图像的第一偏移量加权平均后的结果。The method according to claim 33 or 37, wherein the first offset after image processing is weighted by the first offset of the stabilized image and the first offset of the image Averaged results.
  39. 根据权利要求38所述的方法,其特征在于,随着所述图像与所述关键帧之间的采集时间间隔逐渐增大,所述已增稳图像的第一偏移量对应的权重系数逐渐减小,所述图像的第一偏移量对应的权重系数逐渐增大。The method according to claim 38, wherein as the acquisition time interval between the image and the key frame gradually increases, the weight coefficient corresponding to the first offset of the stabilized image gradually increases decreases, the weight coefficient corresponding to the first offset of the image gradually increases.
  40. 根据权利要求11所述的方法,其特征在于,所述根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理,包括:The method according to claim 11, wherein the performing stabilization processing on the images in the image sequence according to the first offset and the second offset comprises:
    根据所述第一偏移量和所述第二偏移量获取所述图像的总偏移量,并根据所述总偏移量对所述图像进行增稳处理。A total offset of the image is acquired according to the first offset and the second offset, and stabilization processing is performed on the image according to the total offset.
  41. 根据权利要求11所述的方法,其特征在于,所述根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理,包括:The method according to claim 11, wherein the performing stabilization processing on the images in the image sequence according to the first offset and the second offset comprises:
    使用所述第一偏移量对所述图像进行第一增稳处理,再使用所述第二偏移量对经第一增稳处理后的图像进行第二增稳处理。A first stabilization process is performed on the image by using the first offset, and a second stabilization process is performed on the image after the first stabilization process by using the second offset.
  42. 根据权利要求15所述的方法,其特征在于,所述图像和所述关键帧为经过下采样后获得的。The method according to claim 15, wherein the image and the key frame are obtained after down-sampling.
  43. 一种成像装置,其特征在于,包括图像传感器以及一个或多个处理器;An imaging device, comprising an image sensor and one or more processors;
    所述图像传感器用于采集图像序列;the image sensor is used to acquire a sequence of images;
    所述一个或多个处理器被单独地或共同地配置成:The one or more processors are individually or collectively configured to:
    基于姿态传感器识别因所述成像装置高频抖动造成的所述图像序列中的画面抖动;Identifying, based on an attitude sensor, image shake in the image sequence caused by high-frequency shake of the imaging device;
    基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动;Identifying screen shake in the image sequence due to low-frequency shaking of the imaging device based on relative motion between the image sequences;
    根据所述高频抖动和所述低频晃动对所述图像序列进行图像增稳,以消除所述图像序列中因所述成像装置高频抖动和低频晃动造成的画面抖动。Image stabilization is performed on the image sequence according to the high-frequency jitter and the low-frequency jitter, so as to eliminate picture jitter in the image sequence caused by the high-frequency jitter and the low-frequency jitter of the imaging device.
  44. 根据权利要求43所述的装置,其特征在于,所述高频抖动包括高频旋转产生的抖动和/或高频平移产生的抖动;所述低频晃动包括低频平移产生的晃动。The device according to claim 43, wherein the high-frequency jitter includes jitter caused by high-frequency rotation and/or jitter caused by high-frequency translation; and the low-frequency jitter includes jitter caused by low-frequency translation.
  45. 根据权利要求43或44所述的装置,其特征在于,所述成像装置为手持设备,所述低频晃动因用户握持所述手持设备的部位晃动产生;The device according to claim 43 or 44, wherein the imaging device is a handheld device, and the low-frequency shaking is generated by the shaking of the part where the user holds the handheld device;
    或者,所述成像装置搭载于可移动平台,所述低频晃动因可移动平台在运动过程中晃动产生。Alternatively, the imaging device is mounted on a movable platform, and the low-frequency shaking is generated by the shaking of the movable platform during movement.
  46. 根据权利要求43所述的装置,其特征在于,所述成像装置的变焦倍数越高,所述图像序列的图像因所述成像装置低频晃动产生偏移的像素越多。The device according to claim 43, wherein the higher the zoom factor of the imaging device is, the more pixels are shifted in the image of the image sequence due to the low-frequency shaking of the imaging device.
  47. 根据权利要求46所述的装置,其特征在于,所述处理器还用于:在所述成像装置当前变焦倍数高于预设倍数的情况下,基于所述图像序列之间的相对运动识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。The device according to claim 46, wherein the processor is further configured to: in the case that the current zoom factor of the imaging device is higher than a preset factor, identify the factor based on the relative motion between the image sequences Picture shaking in the image sequence caused by low-frequency shaking of the imaging device.
  48. 根据权利要求43所述的装置,其特征在于,所述处理器还用于:在所述成像装置进行变焦的过程中、或者在所述成像装置被改变朝向的过程中,不识别因所述成像装置低频晃动造成的所述图像序列中的画面抖动。The device according to claim 43, wherein the processor is further configured to: in a process of zooming by the imaging device or in a process of changing the orientation of the imaging device, not to identify the cause of the Picture shaking in the image sequence caused by low-frequency shaking of the imaging device.
  49. 根据权利要求43所述的装置,其特征在于,所述处理器还用于:在使用所述图像序列进行目标跟踪的情况下,不对所述图像序列进行增稳处理。The apparatus according to claim 43, wherein the processor is further configured to not perform stabilization processing on the image sequence when the image sequence is used for target tracking.
  50. 根据权利要求43所述的装置,其特征在于,所述处理器还用于:控制所述成像装置根据所述低频晃动修正自身的姿态。The device according to claim 43, wherein the processor is further configured to: control the imaging device to correct its posture according to the low-frequency shaking.
  51. 根据权利要求50所述的装置,其特征在于,所述处理器还用于:获取所述低频晃动对应的第一偏移量,并根据所述第一偏移量控制所述成像装置运动。The device according to claim 50, wherein the processor is further configured to: acquire a first offset corresponding to the low-frequency shaking, and control the movement of the imaging device according to the first offset.
  52. 根据权利要求50所述的装置,其特征在于,所述成像装置安装于云台;The device according to claim 50, wherein the imaging device is mounted on a pan/tilt;
    所述处理器还用于:根据所述低频晃动获得第一偏移量,并根据所述第一偏移量控制所述云台运动。The processor is further configured to: obtain a first offset according to the low-frequency shaking, and control the movement of the gimbal according to the first offset.
  53. 根据权利要求43所述的装置,其特征在于,所述图像序列中的图像因所述低频晃动对应有第一偏移量以及因所述高频抖动对应有第二偏移量;The apparatus according to claim 43, wherein the images in the image sequence correspond to a first offset due to the low frequency jitter and have a second offset due to the high frequency jitter;
    所述处理器还用于:根据所述第一偏移量和所述第二偏移量,对所述图像序列中的图像进行增稳处理。The processor is further configured to: perform stabilization processing on the images in the image sequence according to the first offset and the second offset.
  54. 根据权利要求53所述的装置,其特征在于,所述第一偏移量包括沿指定方向的偏移量;和/或,所述第二偏移量包括沿指定方向的偏移量。The apparatus of claim 53, wherein the first offset includes an offset in a specified direction; and/or the second offset includes an offset in a specified direction.
  55. 根据权利要求53所述的装置,其特征在于,所述处理器还用于:对于图像序列中的图像,基于姿态传感器采集的所述成像装置的姿态数据对所述图像进行电子防抖处理,获得所述图像因所述成像装置高频抖动产生的第二偏移量。The device according to claim 53, wherein the processor is further configured to: for the images in the image sequence, perform electronic anti-shake processing on the images based on the attitude data of the imaging device collected by the attitude sensor, A second offset of the image due to high frequency jitter of the imaging device is obtained.
  56. 根据权利要求53或55所述的装置,其特征在于,所述第二偏移量包括针对于所述图像中各像素的偏移量;The apparatus of claim 53 or 55, wherein the second offset comprises an offset for each pixel in the image;
    所述处理器还用于:根据所述成像装置当前变焦倍数调整所述第二偏移量;调整后的第二偏移量适应于在所述当前变焦倍数下所述图像的像素数量。The processor is further configured to: adjust the second offset according to the current zoom factor of the imaging device; the adjusted second offset is adapted to the number of pixels of the image at the current zoom factor.
  57. 根据权利要求53所述的装置,其特征在于,所述处理器还用于:对于图像序列中的图像,根据预置的关键帧对所述图像进行运动估计,确定所述图像因所述成像装置低频晃动产生的相对于所述关键帧的第一偏移量。The apparatus according to claim 53, wherein the processor is further configured to: for the images in the image sequence, perform motion estimation on the images according to preset key frames, and determine that the images are caused by the imaging The first offset relative to the key frame generated by the low-frequency shaking of the device.
  58. 根据权利要求57所述的装置,其特征在于,在没有关键帧的情况下,所述图像作为关键帧使用,且所述第一偏移量被置为0。The device according to claim 57, wherein in the case of no key frame, the image is used as a key frame, and the first offset is set to 0.
  59. 根据权利要求57所述的装置,其特征在于,所述处理器还用于:The apparatus of claim 57, wherein the processor is further configured to:
    获取所述关键帧中至少一个关键点,所述关键点位于所述关键帧中的第一位置;acquiring at least one key point in the key frame, the key point is located at a first position in the key frame;
    确定所述至少一个关键点在所述图像中的第二位置;determining a second position of the at least one keypoint in the image;
    根据所述第一位置与所述第二位置之间的差异,确定所述图像相对于所述关键帧的第一偏移量。A first offset of the image relative to the key frame is determined based on the difference between the first position and the second position.
  60. 根据权利要求59所述的装置,其特征在于,所述处理器还用于:The apparatus of claim 59, wherein the processor is further configured to:
    将所述关键帧分割成若干图像块;dividing the key frame into several image blocks;
    对于每个所述图像块,根据所述图像块的中心点和/或所述图像块中梯度最大的至少一个特征点确定所述关键点。For each of the image blocks, the key point is determined according to the center point of the image block and/or at least one feature point with the largest gradient in the image block.
  61. 根据权利要求59所述的装置,其特征在于,所述处理器还用于:The apparatus of claim 59, wherein the processor is further configured to:
    获取所述至少一个关键点分别在所述关键帧中的第一特征信息;acquiring first feature information of the at least one key point in the key frame respectively;
    使用预设的图像转换关系估计所述关键帧中至少一个关键点在所述图像中的第二位置;estimating a second position in the image of at least one key point in the key frame using a preset image conversion relationship;
    在所述图像中,获取所述至少一个关键点分别在不同偏移量下的第二特征信息;In the image, acquiring second feature information of the at least one key point under different offsets;
    对于每个所述关键点,将所述不同偏移量下的第二特征信息分别与所述第一特征信息进行匹配,并根据与所述第一特征信息匹配的第二特征信息对应的偏移量调整所述第二位置。For each of the key points, the second feature information under the different offsets is respectively matched with the first feature information, and according to the offset corresponding to the second feature information matched with the first feature information The shift amount adjusts the second position.
  62. 根据权利要求61所述的装置,其特征在于,所述不同偏移量是在预设偏移范围内沿指定方向偏移的不同距离;The device according to claim 61, wherein the different offsets are different distances offset along a specified direction within a preset offset range;
    所述匹配的第二特征信息与所述第一特征信息的差异最小。The matched second feature information has the smallest difference from the first feature information.
  63. 根据权利要求61所述的装置,其特征在于,所述第一特征信息包括:对所述关键点进行投影得到的特征向量;The device according to claim 61, wherein the first feature information comprises: a feature vector obtained by projecting the key point;
    和/或,所述第二特征信息包括:对所述关键点进行投影得到的特征向量。And/or, the second feature information includes: a feature vector obtained by projecting the key point.
  64. 根据权利要求61或63所述的装置,其特征在于,所述第一特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量;The device according to claim 61 or 63, wherein the first feature information comprises: a feature vector obtained by projecting a preset area including the key point along a specified direction;
    和/或,所述第二特征信息包括:对包含所述关键点的预设区域沿指定方向进行投影得到的特征向量。And/or, the second feature information includes: a feature vector obtained by projecting a preset area including the key point along a specified direction.
  65. 根据权利要求54、62或64任意一项所述的装置,其特征在于,所述指定方向至少包括在图像坐标中的水平方向和/或垂直方向。The apparatus according to any one of claims 54, 62 or 64, wherein the specified direction includes at least a horizontal direction and/or a vertical direction in image coordinates.
  66. 根据权利要求63或64所述的装置,其特征在于,所述特征向量为映射到预设范围内的特征向量,所述预设范围用于使各个所述关键点的亮度变化范围相同。The apparatus according to claim 63 or 64, wherein the feature vector is a feature vector mapped to a preset range, and the preset range is used to make the luminance variation range of each of the key points the same.
  67. 根据权利要求66所述的装置,其特征在于,所述映射到预设范围内的特征向量包括:所述投影得到的特征向量与特征向量均值之差、或者所述投影得到的特征向量归一化的结果。The device according to claim 66, wherein the feature vector mapped to the preset range comprises: the difference between the feature vector obtained by the projection and the mean value of the feature vector, or the normalization of the feature vector obtained by the projection result of .
  68. 根据权利要求61所述的装置,其特征在于,所述预设的图像转换关系根据所述关键点在所述关键帧中的第一位置与所述关键点在上一帧图像中的位置之间的位置关系确定。The device according to claim 61, wherein the preset image conversion relationship is based on a difference between a first position of the key point in the key frame and a position of the key point in a previous frame of image The positional relationship between them is determined.
  69. 根据权利要求61所述的装置,其特征在于,在所述图像作为关键帧的情况下,所述预设的图像转换关系为单位矩阵。The device according to claim 61, wherein, when the image is used as a key frame, the preset image conversion relationship is a unit matrix.
  70. 根据权利要求61所述的装置,其特征在于,所述处理器还用于:根据所述至少一个关键点的第一位置与调整后的第二位置之间的位置关系,确定用于下一帧图像的图像转换关系。The device according to claim 61, wherein the processor is further configured to: according to the positional relationship between the first position of the at least one key point and the adjusted second position, determine whether to use the next key point The image conversion relationship of the frame image.
  71. 根据权利要求70所述的装置,其特征在于,所述处理器还用于:The apparatus of claim 70, wherein the processor is further configured to:
    从所述至少一个关键点中筛选有效关键点;screening valid keypoints from the at least one keypoint;
    根据所述有效关键点的第一位置与调整后的第二位置之间的位置关系,确定用于 下一帧图像的图像转换关系。According to the positional relationship between the first position of the effective key point and the adjusted second position, an image conversion relationship for the next frame of image is determined.
  72. 根据权利要求71所述装置,其特征在于,所述处理器还用于:The apparatus of claim 71, wherein the processor is further configured to:
    使用所述预设的图像转换关系确定位于调整后的第二位置的关键点在所述关键帧中的第三位置;Using the preset image conversion relationship to determine the third position of the key point located at the adjusted second position in the key frame;
    对于每一个关键点,确定所述关键点的所述第三位置相对于所述第一位置的位移量;For each key point, determining the displacement amount of the third position of the key point relative to the first position;
    将其中一个关键点的位移量确定为目标位移量,并获取其他关键点的位移量相对于所述目标位移量的误差值;Determine the displacement of one of the key points as the target displacement, and obtain the error value of the displacement of other key points relative to the target displacement;
    从所述至少一个关键点中选取误差值小于预设值的有效关键点。Valid key points whose error value is less than a preset value are selected from the at least one key point.
  73. 根据权利要求71所述的装置,其特征在于,所述处理器还用于:在所述有效关键点的数量少于预设数量的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。The apparatus according to claim 71, wherein the processor is further configured to: determine the image as a key frame when the number of valid key points is less than a preset number, and set the The first offset is set to 0.
  74. 根据权利要求57所述的装置,其特征在于,所述处理器还用于:在所述第一偏移量大于预设偏移阈值的情况下,将所述图像确定为关键帧,并将所述第一偏移量置为0。The apparatus according to claim 57, wherein the processor is further configured to: in the case that the first offset is greater than a preset offset threshold, determine the image as a key frame, and convert the image to a key frame. The first offset is set to 0.
  75. 根据权利要求57所述的装置,其特征在于,在所述确定所述图像相对于所述关键帧的第一偏移量之后,所述处理器还用于:使用在所述关键帧之前采集的至少一个已增稳图像的第一偏移量对所述图像的第一偏移量进行平滑处理,获取所述图像处理后的第一偏移量。The apparatus according to claim 57, wherein, after the determining the first offset of the image relative to the key frame, the processor is further configured to: use acquisition before the key frame The first offset of the at least one stabilized image is smoothed on the first offset of the image to obtain the processed first offset of the image.
  76. 根据权利要求75所述的装置,其特征在于,在所述图像是在采集所述关键帧之后的预设时长内采集得到的情况下,对所述图像的第一偏移量进行平滑处理。The apparatus according to claim 75, characterized in that in the case that the image is acquired within a preset time period after acquiring the key frame, smoothing is performed on the first offset of the image.
  77. 根据权利要求75所述的装置,其特征在于,所述已增稳图像与所述关键帧的采集时间间隔小于指定时长。The device according to claim 75, wherein the acquisition time interval between the stabilized image and the key frame is less than a specified duration.
  78. 根据权利要求75所述的装置,其特征在于,所述已增稳图像包括所述关键帧的前一帧图像。76. The apparatus of claim 75, wherein the stabilized image comprises a previous frame of the key frame.
  79. 根据权利要求75所述的装置,其特征在于,所述平滑处理包括线性滤波处理。76. The apparatus of claim 75, wherein the smoothing process comprises a linear filtering process.
  80. 根据权利要求75或79所述的装置,其特征在于,所述图像处理后的第一偏移量为所述已增稳图像的第一偏移量和所述图像的第一偏移量加权平均后的结果。The apparatus according to claim 75 or 79, wherein the first offset after image processing is weighted by the first offset of the stabilized image and the first offset of the image Averaged results.
  81. 根据权利要求80所述的装置,其特征在于,随着所述图像与所述关键帧之间的采集时间间隔逐渐增大,所述已增稳图像的第一偏移量对应的权重系数逐渐减小,所述图像的第一偏移量对应的权重系数逐渐增大。The apparatus according to claim 80, wherein as the acquisition time interval between the image and the key frame gradually increases, the weight coefficient corresponding to the first offset of the stabilized image gradually increases decreases, the weight coefficient corresponding to the first offset of the image gradually increases.
  82. 根据权利要求53所述的装置,其特征在于,所述处理器还用于:根据所述第一偏移量和所述第二偏移量获取所述图像的总偏移量,并根据所述总偏移量对所述图像进行增稳处理。The apparatus according to claim 53, wherein the processor is further configured to: obtain the total offset of the image according to the first offset and the second offset, and obtain the total offset of the image according to the first offset and the second offset. The image is stabilized by the total offset.
  83. 根据权利要求53所述的装置,其特征在于,所述处理器还用于:使用所述第一偏移量对所述图像进行第一增稳处理,再使用所述第二偏移量对经第一增稳处理后的图像进行第二增稳处理。The apparatus according to claim 53, wherein the processor is further configured to: use the first offset to perform a first stabilization process on the image, and then use the second offset to perform a first stabilization process on the image. The image after the first stabilization process is subjected to the second stabilization process.
  84. 根据权利要求57所述的装置,其特征在于,所述图像和所述关键帧为经过下采样后获得的。The apparatus of claim 57, wherein the image and the key frame are obtained after down-sampling.
  85. 一种手持云台,其特征在于,包括姿态传感器和如权利要求43至84任意一项的成像装置;其中,所述姿态传感器用于采集所述成像装置的姿态数据。A handheld pan/tilt head, characterized by comprising an attitude sensor and the imaging device according to any one of claims 43 to 84; wherein the attitude sensor is used to collect attitude data of the imaging device.
  86. 一种可移动平台,其特征在于,包括:A movable platform, characterized in that, comprising:
    机体;body;
    动力系统,安装于所述机体,用于驱动所述可移动平台移动;a power system, mounted on the body, for driving the movable platform to move;
    如权利要求43至84任意一项的成像装置;The imaging device of any one of claims 43 to 84;
    以及姿态传感器,安装于所述机体,用于采集所述成像装置的姿态数据。and an attitude sensor, which is installed on the body and used to collect attitude data of the imaging device.
  87. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质存储有可执行指令,所述可执行指令被处理器执行时实现如权利要求1至42任一项所述的方法。A computer-readable storage medium, characterized in that, the computer-readable storage medium stores executable instructions, and when the executable instructions are executed by a processor, the method according to any one of claims 1 to 42 is implemented.
PCT/CN2021/091620 2021-04-30 2021-04-30 Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium WO2022227040A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/091620 WO2022227040A1 (en) 2021-04-30 2021-04-30 Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2021/091620 WO2022227040A1 (en) 2021-04-30 2021-04-30 Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium

Publications (1)

Publication Number Publication Date
WO2022227040A1 true WO2022227040A1 (en) 2022-11-03

Family

ID=83847520

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/091620 WO2022227040A1 (en) 2021-04-30 2021-04-30 Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium

Country Status (1)

Country Link
WO (1) WO2022227040A1 (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102780846A (en) * 2012-07-11 2012-11-14 清华大学 Electronic image stabilization method based on inertial navigation information
CN105681674A (en) * 2016-03-21 2016-06-15 南通大学 Image stabilizing method and compound image stabilizing system based on mechanical image stabilizing and electronic image stabilizing
CN105721776A (en) * 2016-03-02 2016-06-29 湖南优象科技有限公司 Sports camera device with digital image stabilization function and digital image stabilization method
CN106534692A (en) * 2016-11-24 2017-03-22 腾讯科技(深圳)有限公司 Video image stabilization method and device
CN107241544A (en) * 2016-03-28 2017-10-10 展讯通信(天津)有限公司 Video image stabilization method, device and camera shooting terminal
CN111405187A (en) * 2020-04-16 2020-07-10 青岛浩海网络科技股份有限公司 Image anti-shake method, system, device and storage medium for monitoring equipment

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN102780846A (en) * 2012-07-11 2012-11-14 清华大学 Electronic image stabilization method based on inertial navigation information
CN105721776A (en) * 2016-03-02 2016-06-29 湖南优象科技有限公司 Sports camera device with digital image stabilization function and digital image stabilization method
CN105681674A (en) * 2016-03-21 2016-06-15 南通大学 Image stabilizing method and compound image stabilizing system based on mechanical image stabilizing and electronic image stabilizing
CN107241544A (en) * 2016-03-28 2017-10-10 展讯通信(天津)有限公司 Video image stabilization method, device and camera shooting terminal
CN106534692A (en) * 2016-11-24 2017-03-22 腾讯科技(深圳)有限公司 Video image stabilization method and device
CN111405187A (en) * 2020-04-16 2020-07-10 青岛浩海网络科技股份有限公司 Image anti-shake method, system, device and storage medium for monitoring equipment

Similar Documents

Publication Publication Date Title
US8395694B2 (en) Apparatus and method for blurring image background in digital image processing device
JP5136669B2 (en) Image processing apparatus, image processing method, and program
US10277809B2 (en) Imaging device and imaging method
US9648229B2 (en) Image processing device and associated methodology for determining a main subject in an image
WO2017088678A1 (en) Long-exposure panoramic image shooting apparatus and method
KR101594300B1 (en) A apparatus and method for estimating PSF
WO2017045558A1 (en) Depth-of-field adjustment method and apparatus, and terminal
WO2014023231A1 (en) Wide-view-field ultrahigh-resolution optical imaging system and method
KR20180101466A (en) Depth information acquisition method and apparatus, and image acquisition device
US20150116517A1 (en) Image processing device, image processing method, and program
US8295609B2 (en) Image processing apparatus, image processing method and computer readable-medium
KR20150078275A (en) Digital Photographing Apparatus And Method For Capturing a Moving Subject
JP2010199694A (en) Image capturing apparatus, angle-of-view adjusting method, and program
JP2020053774A (en) Imaging apparatus and image recording method
CN113875219A (en) Image processing method and device, electronic equipment and computer readable storage medium
JP5370555B2 (en) Imaging apparatus, imaging method, and program
JP2015012493A (en) Image processing apparatus, imaging apparatus and program
JP6833483B2 (en) Subject tracking device, its control method, control program, and imaging device
KR101630307B1 (en) A digital photographing apparatus, a method for controlling the same, and a computer-readable storage medium
JP2018007272A (en) Image processing apparatus, imaging apparatus, and program
JP2011217334A (en) Imaging apparatus and method of controlling the same
JP2008236015A (en) Image processor, imaging apparatus and its program
KR20100046544A (en) Image distortion compensation method and apparatus
JP5515492B2 (en) Image processing apparatus and method
WO2022227040A1 (en) Video stability augmentation method, imaging apparatus, handheld gimbal, movable platform and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21938514

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE