WO2015018337A1 - Method and device for adsorbing line segments in an image, and method and device for constructing a polygon - Google Patents

Method and device for adsorbing line segments in an image, and method and device for constructing a polygon

Info

Publication number
WO2015018337A1
Authority
WO
WIPO (PCT)
Prior art keywords
line segment
image
polygon
preset
adsorbing
Prior art date
Application number
PCT/CN2014/083783
Other languages
English (en)
French (fr)
Inventor
龙腾
王跃
镇立新
Original Assignee
上海合合信息科技发展有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 上海合合信息科技发展有限公司
Publication of WO2015018337A1

Classifications

    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T11/00 2D [Two Dimensional] image generation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0484 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range
    • G06F3/04845 Interaction techniques based on graphical user interfaces [GUI] for the control of specific functions or operations, e.g. selecting or manipulating an object, an image or a displayed text element, setting a parameter value or selecting a range for image manipulation, e.g. dragging, rotation, expansion or change of colour
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06F ELECTRIC DIGITAL DATA PROCESSING
    • G06F3/00 Input arrangements for transferring data to be processed into a form capable of being handled by the computer; Output arrangements for transferring data from processing unit to output unit, e.g. interface arrangements
    • G06F3/01 Input arrangements or combined input and output arrangements for interaction between user and computer
    • G06F3/048 Interaction techniques based on graphical user interfaces [GUI]
    • G06F3/0487 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser
    • G06F3/0488 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures
    • G06F3/04883 Interaction techniques based on graphical user interfaces [GUI] using specific features provided by the input device, e.g. functions controlled by the rotation of a mouse with dual sensing arrangements, or of the nature of the input device, e.g. tap gestures based on pressure sensed by a digitiser using a touch-screen or digitiser, e.g. input of commands through traced gestures for inputting data by handwriting, e.g. gesture or text
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T7/00 Image analysis
    • G06T7/10 Segmentation; Edge detection
    • G06T7/12 Edge-based segmentation
    • G PHYSICS
    • G06 COMPUTING; CALCULATING OR COUNTING
    • G06T IMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T2207/00 Indexing scheme for image analysis or image enhancement
    • G06T2207/20 Special algorithmic details
    • G06T2207/20092 Interactive image processing based on input by user
    • G06T2207/20104 Interactive definition of region of interest [ROI]

Definitions

  • The present invention relates to an image recognition technique, and more particularly to a method of adsorbing line segments in an image.
  • The present invention also relates to an image recognition apparatus, and more particularly to an apparatus for adsorbing line segments in an image.
  • The invention further relates to an image processing method, and more particularly to a method of constructing a polygon in an image.
  • The invention further relates to an image processing apparatus, and more particularly to an apparatus for constructing a polygon in an image.
  • A whiteboard image is an image taken of a conference whiteboard, a bulletin board, or another text-bearing rectangular target.
  • The information of interest to the user in a whiteboard image generally lies within a rectangular area, but the captured picture first of all contains unwanted margin areas outside the target rectangular area.
  • Second, because of the shooting angle, the target rectangular area may appear as an irregular quadrilateral (Figure 1) or with a missing corner (Figure 2).
  • Image processing removes the part outside the target area, deforms the irregular quadrilateral into a regular rectangle (Figure 3) or fills in the missing corner (Figure 4), converts the color image into a black-and-white image, and applies image enhancement to make the text clearer.
  • A text image is an image taken of standard text, such as A4 paper documents, pages of books, or movie posters.
  • Text recognition is the processing of text image data by a text recognition engine to recognize the text in the image.
  • The text content of a text image is generally displayed in a rectangular area, and the captured image has unwanted margin areas outside the target rectangular area.
  • Because of the shooting angle, the target rectangular area may appear as an irregular quadrilateral (Figure 1) or with a missing corner (Figure 2).
  • The quadrilateral frame detection operation detects the captured image and extracts the target quadrilateral frame, so that subsequent operations can remove the area outside the frame and apply rectangle correction or corner completion to it.
  • The quadrilateral frame obtained by automatic detection may not be the desired result.
  • As shown in Figure 5, the intended target frame is the part within the dotted line, but the part within the solid line is what is actually detected; or, as shown in Figure 6, the source image contains several frames that could be captured, the target frame is the dotted-line part on the left, but the solid-line part on the right is what is actually detected. Therefore, when the image is first detected, an interactive interface is needed so that the user can adjust the target frame.
  • The technical problem to be solved by the present invention is to provide a method for adsorbing line segments in an image and a device for line segment adsorption using that method, as well as a method for constructing a polygon in an image and a device for constructing polygons using that method.
  • These make it easy to identify the position of the quadrilateral frame in the image and thereby construct a quadrilateral for subsequent image processing.
  • The technical solution of the method for adsorbing line segments in an image comprises: presetting a line segment; dragging the line segment by selecting a first portion of it; and, when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, setting the preset line segment to coincide with that line segment.
  • "Vicinity" means that the distance between the two conforms to a preset rule.
  • Another technical solution of the method comprises: presetting a line segment; dragging the line segment by selecting a first portion of it; and, if the second portion of the line segment lies in the vicinity of the corresponding second portion of a certain line segment in the image when the dragging action ends, setting the preset line segment to coincide with that line segment. Here too, "vicinity" means that the distance between the two conforms to a preset rule.
  • The invention also discloses a device for adsorbing line segments in an image, whose technical solution is to apply the above method on a mobile communication data terminal platform.
  • The invention further discloses a method for constructing a polygon in an image, whose technical solution is to preset a polygon frame and to determine the length and position of the line segment of at least one side of the frame using the above method for adsorbing line segments in an image, thereby constructing a polygon in the image.
  • The invention further discloses a device for constructing a polygon in an image, whose technical solution is to apply the above polygon construction method on a mobile communication data terminal platform.
  • The invention can conveniently assist the user in defining a polygon frame from line segments that already exist in the image, for use in subsequent image processing.
  • Figures 1 to 4 are schematic diagrams of whiteboard and text images as captured and after processing.
  • Figures 5 and 6 are schematic diagrams of cases in which the target frame needs further adjustment after frame capture.
  • The invention discloses a method for adsorbing line segments in an image: a line segment is preset and is dragged by selecting a first portion of it, and when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment.
  • "Vicinity" means that, as the two approach each other, the distance between them conforms to a preset rule.
  • In this embodiment the preset line segment is adsorbed onto that line segment, that is, made to coincide with it, as soon as this condition is met during the drag.
  • In another embodiment, a line segment is preset and is dragged by selecting a first portion of it; if, when the dragging action ends, the second portion of the line segment lies in the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment, with "vicinity" defined as above.
  • In this embodiment the check is made only when the drag ends: if the second portion of the line segment is then in the vicinity of the corresponding second portion of a line segment in the image, the preset line segment is adsorbed onto that line segment, that is, it coincides with it.
  • Detecting line segments in an image is a relatively mature technique with several available methods; for example, all line segments in the image can be obtained with the Hough transform.
  • A line segment in the image, as referred to in the invention, may be a straight line segment or a curved segment that approximates a straight line segment.
  • In a photograph, parts that should be straight may appear bent or wavy.
  • Curved segments whose bending or waviness is not severe can also be recognized as straight line segments in the invention; the specific recognition methods are likewise mature in the prior art.
  • The location tapped by the user may not be very precise, so the first portion selected when dragging the preset line segment may be on the preset line segment or near it.
  • The first portion selected when dragging the preset line segment and the second portion on that line segment may be the same portion or different portions. If they are the same, the user can drag the preset line segment directly to the intended position. If the image content is complex, however, the first and second portions can be different, so that the user's finger does not cover the second portion while dragging and the user can more easily judge the position of the preset line segment.
  • The second portion may be an endpoint of the preset line segment and of the line segment in the image, or the midpoint of each. It may equally be some other position on the preset line segment and on the line segment in the image, which can be configured at the time of use. The second portion can be chosen to avoid positions where the line segment intersects other line segments, which makes adsorption more accurate. These second portions can be marked with a special marker to draw the user's attention.
  • The preset rule may be that the distance is less than a preset value.
  • The preset rule may also be that the distance conforms to a preset formula. For example, a value is obtained by multiplying the length of a line segment in the image by a scale factor; when the distance is less than that value, the preset line segment is adsorbed, that is, it coincides with that line segment.
  • The invention also discloses a device for adsorbing line segments in an image, which applies the above method on a mobile communication data terminal platform.
  • The invention further discloses a method for constructing a polygon in an image: a polygon frame is preset, and the length and position of the line segment of at least one side of the frame are determined using the above method for adsorbing line segments in an image, thereby constructing a polygon in the image.
  • In a preferred embodiment, a polygon is first preset on the image, and then the length and position of the line segment of at least one side of the polygon frame are determined by the above adsorption method, thereby constructing a polygon in the image.
  • At least one side of the preset polygon is set to a certain line segment in the image.
  • When one side of the frame is dragged, the vertices of the polygon other than the two endpoints of that side remain fixed. In this way the user can adjust as few sides as possible and still obtain the desired quadrilateral frame.
  • Taking a quadrilateral as an example, the user can generally obtain a quadrilateral frame by adjusting two opposite sides.
  • The invention further discloses a device for constructing a polygon in an image, which applies the above polygon construction method on a mobile communication data terminal platform.
  • In the prior art, the user must determine the endpoints of each side, so each line segment needs to be set at least twice.
  • By adopting the above technical solution, the invention can determine the position of a line segment in a single setting, based on line segments that already exist in the image, which conveniently assists the user in defining the polygon frame for subsequent image processing.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • General Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Human Computer Interaction (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Image Analysis (AREA)
  • Processing Or Creating Images (AREA)

Abstract

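A method for adsorbing line segments in an image comprises presetting a line segment and dragging it by selecting a first portion of the line segment; when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment, where "vicinity" means that the distance between the two conforms to a preset rule. With this technical solution, the user can be conveniently assisted in defining a polygon frame from line segments that already exist in the image, for use in subsequent image processing.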

Description

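Method and device for adsorbing line segments in an image, and method and device for constructing a polygon
Field of the Invention
The present invention relates to an image recognition technique, and in particular to a method for adsorbing line segments in an image. It also relates to an image recognition device, in particular a device for adsorbing line segments in an image. It further relates to an image processing method, in particular a method for constructing a polygon in an image, and to an image processing device, in particular a device for constructing a polygon in an image.
Background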
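At present, business devices such as scanners and business mobile phones increasingly offer functions that process and extract information from captured pictures. These functions generally fall into two categories:
1) Correction and enhancement of whiteboard images
A whiteboard image is an image taken of a conference whiteboard, a bulletin board, or another text-bearing rectangular target. The information of interest to the user in a whiteboard image generally lies within a rectangular area. In the captured picture, however, there are first of all unwanted margin areas outside the target rectangular area; second, because of the shooting angle, the target rectangular area may appear as an irregular quadrilateral (Figure 1) or with a missing corner (Figure 2). Image processing removes the part outside the target area, deforms the irregular quadrilateral into a regular rectangle (Figure 3) or fills in the missing corner (Figure 4), converts the color image into a black-and-white image, and applies image enhancement to make the text clearer.
2) Recognition of text in text images
A text image is an image taken of standard text, such as A4 paper documents, pages of books, or movie posters. Text recognition uses a text recognition engine to process the text image data and recognize the text in the image. The text content of a text image is generally displayed within a rectangular area, but the captured picture has unwanted margin areas outside the target rectangular area, and, because of the shooting angle, the target rectangular area may appear as an irregular quadrilateral (Figure 1) or with a missing corner (Figure 2). To make recognition by the text recognition engine faster and more accurate, image processing is needed to remove the part outside the target area, deform the irregular quadrilateral into a regular rectangle (Figure 3) or fill in the missing corner (Figure 4), convert the color text image into a black-and-white image, and apply image enhancement so that the contrast between text and background is more pronounced.
Both kinds of operation require quadrilateral frame detection. Quadrilateral frame detection means detecting the captured image and extracting the target quadrilateral frame, so that subsequent operations can remove the area outside the frame and apply rectangle correction or corner completion to it. In practice, however, the quadrilateral frame obtained by automatic detection may not be the desired result. In Figure 5, the intended target frame is the part within the dotted line, but what is actually detected is the part within the solid line. In Figure 6, the source image contains several frames that could be captured; the target frame is the dotted-line part on the left, but what is actually detected is the solid-line part on the right. Therefore, when the image is first detected, an interactive interface must be provided so that the user can adjust the target frame.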
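Chinese invention patent publication CN101807293A describes a method of constructing a quadrilateral frame by vertex adsorption. In practice, however, a cluttered photographic background may make the vertex positions hard to identify, or the vertices of the quadrilateral to be recognized may be rounded corners. In either case the vertex adsorption function cannot be used, and the user cannot construct the quadrilateral frame from its vertices.
Summary of the Invention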
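The technical problem to be solved by the present invention is to provide a method for adsorbing line segments in an image and a device for line segment adsorption that uses this method, as well as a method for constructing a polygon in an image and a device for constructing polygons that uses that method, so that the position of a quadrilateral frame in the image can be identified conveniently and a quadrilateral can be constructed for subsequent image processing.
To solve the above technical problem, the technical solution of the method for adsorbing line segments in an image is as follows: a line segment is preset and is dragged by selecting a first portion of it; when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment. "Vicinity" means that the distance between the two conforms to a preset rule.
In another technical solution of the method, a line segment is preset and is dragged by selecting a first portion of it; if, when the dragging action ends, the second portion of the line segment lies in the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment. "Vicinity" again means that the distance between the two conforms to a preset rule.
The invention also discloses a device for adsorbing line segments in an image; its technical solution is to apply the above method for adsorbing line segments in an image on a mobile communication data terminal platform.
The invention further discloses a method for constructing a polygon in an image; its technical solution is to preset a polygon frame and to determine the length and position of the line segment of at least one side of the polygon frame using the above method for adsorbing line segments in an image, thereby constructing a polygon in the image.
The invention further discloses a device for constructing a polygon in an image; its technical solution is to apply the above method for constructing a polygon in an image on a mobile communication data terminal platform.
By adopting the above technical solutions, the invention can conveniently assist the user in defining a polygon frame from line segments that already exist in the image, for use in subsequent image processing.
Brief Description of the Drawings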
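The invention is described in further detail below with reference to the drawings and embodiments:
Figures 1 to 4 are schematic diagrams of whiteboard and text images as captured and after processing;
Figures 5 and 6 are schematic diagrams of cases in which the target frame needs further adjustment after frame capture.
Detailed Description of the Invention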
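The invention discloses a method for adsorbing line segments in an image. A line segment is preset and is dragged by selecting a first portion of it; when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment. "Vicinity" means that, as the two approach each other, the distance between them conforms to a preset rule. In this embodiment, as soon as the second portion of the preset line segment is dragged into the vicinity of the corresponding second portion of a line segment in the image during the drag, the preset line segment is adsorbed onto that line segment, that is, it coincides with it.
In another disclosed embodiment of the invention, a line segment is preset and is dragged by selecting a first portion of it; if, when the dragging action ends, the second portion of the line segment lies in the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment. "Vicinity" means that, as the two approach each other, the distance between them conforms to a preset rule. In this embodiment, adsorption happens only when the drag ends, for example when the user's finger leaves the touchscreen of a smartphone after dragging the preset line segment: if at that moment the second portion of the line segment is in the vicinity of the corresponding second portion of a line segment in the image, the preset line segment is adsorbed onto that line segment, that is, it coincides with it.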
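The two embodiments differ only in when the proximity check runs: during the drag, or once when the drag ends. The following is a minimal Python sketch of that difference, not the patent's implementation. The names `Segment`, `midpoint`, `try_adsorb`, and `DragSession` are hypothetical; the sketch assumes the midpoint as the second portion and a fixed distance threshold as the preset rule, neither of which the text prescribes.

```python
import math
from dataclasses import dataclass
from typing import Sequence, Tuple

Point = Tuple[float, float]


@dataclass(frozen=True)
class Segment:
    start: Point
    end: Point


def midpoint(seg: Segment) -> Point:
    """Assumed 'second portion': the midpoint of a segment."""
    return ((seg.start[0] + seg.end[0]) / 2, (seg.start[1] + seg.end[1]) / 2)


def try_adsorb(preset: Segment, detected: Sequence[Segment], threshold: float) -> Segment:
    """Return the closest detected segment within the threshold, else the preset unchanged."""
    best, best_dist = preset, threshold
    for candidate in detected:
        (px, py), (cx, cy) = midpoint(preset), midpoint(candidate)
        dist = math.hypot(px - cx, py - cy)
        if dist < best_dist:
            best, best_dist = candidate, dist
    return best


class DragSession:
    """Tracks one drag; adsorb_on_move=True models the first embodiment, False the second."""

    def __init__(self, preset: Segment, detected: Sequence[Segment],
                 threshold: float, adsorb_on_move: bool) -> None:
        self.segment = preset
        self.detected = detected
        self.threshold = threshold
        self.adsorb_on_move = adsorb_on_move

    def move(self, dx: float, dy: float) -> None:
        """Translate the preset segment by (dx, dy)."""
        s, e = self.segment.start, self.segment.end
        self.segment = Segment((s[0] + dx, s[1] + dy), (e[0] + dx, e[1] + dy))
        if self.adsorb_on_move:  # first embodiment: check during the drag
            self.segment = try_adsorb(self.segment, self.detected, self.threshold)

    def release(self) -> Segment:
        """End the drag and return the final segment."""
        if not self.adsorb_on_move:  # second embodiment: check only when the drag ends
            self.segment = try_adsorb(self.segment, self.detected, self.threshold)
        return self.segment
```

A real touch interface would probably track the pointer position separately from the displayed segment, so that the user can still pull the segment away after it has been adsorbed during a drag; the sketch leaves that out for brevity.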
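Detecting line segments in an image is a relatively mature technique, and several methods are available; for example, all line segments in the image can be obtained with the Hough transform.
The line segment in the image referred to in the invention may be a straight line segment or a curved segment that approximates a straight line segment. In some images, photographs in particular, lens distortion or an uneven edge on the photographed object can cause parts that should be straight to appear bent or wavy. Curved segments whose bending or waviness is not severe can also be recognized as straight line segments in the invention; the specific recognition methods are likewise mature in the prior art.
On a touchscreen smartphone, the position the user taps may not be very precise, so the first portion selected when dragging the preset line segment may lie on the preset line segment or near it.
The first portion selected when dragging the preset line segment and the second portion on that line segment may be the same portion or different portions. If they are the same, the user can drag the preset line segment more directly to the intended position. If the image content is complex, however, the first and second portions can be different, so that the user's finger does not cover the second portion while dragging and the user can more easily judge the position of the preset line segment.
The second portion may be an endpoint of the preset line segment and of the line segment in the image, or the midpoint of each. It may equally be some other position on the preset line segment and on the line segment in the image, which can be configured at the time of use. The second portion can be chosen to avoid positions where the line segment intersects other line segments, which makes adsorption more accurate. These second portions can be marked with a special marker to draw the user's attention.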
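The preset rule may be that the distance is less than a preset value.
The preset rule may also be that the distance conforms to a preset formula. For example, a value is obtained by multiplying the length of a line segment in the image by a scale factor; when the distance is less than that value, the preset line segment is adsorbed, that is, it coincides with that line segment. Many similar rules are possible, and a person skilled in the art can set the rule flexibly, so they are not elaborated here.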
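The two example rules can be written as interchangeable predicates. This is a small Python sketch under assumptions: the names `Rule`, `fixed_value_rule`, and `scaled_length_rule` are hypothetical, and the numeric values in the usage note are illustrative rather than taken from the patent.

```python
from typing import Callable

# A rule takes (distance, length of the detected segment) and says whether to adsorb.
Rule = Callable[[float, float], bool]


def fixed_value_rule(max_distance: float) -> Rule:
    """Adsorb when the distance is less than a preset value."""
    return lambda distance, target_length: distance < max_distance


def scaled_length_rule(scale_factor: float) -> Rule:
    """Adsorb when the distance is less than scale_factor * length of the detected segment."""
    return lambda distance, target_length: distance < scale_factor * target_length
```

For instance, with `scaled_length_rule(0.25)` a distance of 8 pixels triggers adsorption onto a 100-pixel segment (limit 25) but not onto a 20-pixel segment (limit 5), so longer detected segments attract the preset segment from farther away.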
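The invention also discloses a device for adsorbing line segments in an image, which applies the above method for adsorbing line segments in an image on a mobile communication data terminal platform.
The invention further discloses a method for constructing a polygon in an image: a polygon frame is preset, and the length and position of the line segment of at least one side of the polygon frame are determined using the above method for adsorbing line segments in an image, thereby constructing a polygon in the image.
In practice, the polygon is most commonly a quadrilateral, which is convenient for various kinds of image processing.
In a preferred embodiment, a polygon is first preset on the image, and then the length and position of the line segment of at least one side of the polygon frame are determined using the above method for adsorbing line segments in an image, thereby constructing a polygon in the image.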
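Furthermore, at least one side of the preset polygon is set to a certain line segment in the image.
When one side of the frame is dragged, the vertices of the polygon other than the two endpoints of that side remain fixed. In this way the user can adjust as few sides as possible and still obtain the required quadrilateral frame. Taking a quadrilateral as an example, the user generally only needs to adjust two opposite sides to obtain a quadrilateral frame.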
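The "other vertices stay fixed" behaviour can be illustrated with a short Python sketch. The function name `replace_side` and the convention that side `i` joins `vertices[i]` and `vertices[(i + 1) % n]` are assumptions made for this example only.

```python
from typing import List, Tuple

Point = Tuple[float, float]


def replace_side(vertices: List[Point], side: int,
                 new_start: Point, new_end: Point) -> List[Point]:
    """Return a copy of the polygon in which only the two endpoints of one side change.

    Side i joins vertices[i] and vertices[(i + 1) % n]; all other vertices stay fixed.
    """
    n = len(vertices)
    updated = list(vertices)
    updated[side % n] = new_start
    updated[(side + 1) % n] = new_end
    return updated
```

Adjusting two opposite sides of a quadrilateral in this way (sides 0 and 2, say) touches all four vertices, which is consistent with the remark that two adjustments are generally enough.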
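When constructing the polygon, if two adjacent sides still have no intersection after their positions have been determined by the above method for adsorbing line segments in an image, the two sides are extended in the direction in which they can meet until they intersect, and the intersection point and the two extended sides all become part of the constructed polygon. When the vertices of a polygon in the image are rounded corners rather than corners formed by intersecting straight lines, or when the vertex positions cannot be clearly identified in the image for other reasons, this approach effectively locates the vertices and so constructs the polygon.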
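Extending two adjacent sides until they meet amounts to intersecting the infinite lines through them. The following Python sketch shows one way to do this with a standard 2D cross-product formula; `line_intersection` and `polygon_from_sides` are hypothetical names, and the handling of parallel sides (returning `None`) is an assumption, since the text does not address that case.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float]
Side = Tuple[Point, Point]


def line_intersection(a: Side, b: Side) -> Optional[Point]:
    """Intersection of the infinite lines through sides a and b, or None if parallel."""
    (a1, a2), (b1, b2) = a, b
    dax, day = a2[0] - a1[0], a2[1] - a1[1]
    dbx, dby = b2[0] - b1[0], b2[1] - b1[1]
    denom = dax * dby - day * dbx  # cross product of the two direction vectors
    if abs(denom) < 1e-12:
        return None
    # Solve a1 + t * da = b1 + s * db for t.
    t = ((b1[0] - a1[0]) * dby - (b1[1] - a1[1]) * dbx) / denom
    return (a1[0] + t * dax, a1[1] + t * day)


def polygon_from_sides(sides: List[Side]) -> List[Optional[Point]]:
    """Vertex i is where side i meets the next side, extending both if necessary."""
    n = len(sides)
    return [line_intersection(sides[i], sides[(i + 1) % n]) for i in range(n)]
```

For a frame with rounded corners whose four detected sides stop one unit short of each corner of a 10 by 10 square, `polygon_from_sides` recovers the four square corners even though none of the detected sides touch.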
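The invention further discloses a device for constructing a polygon in an image, which applies the above method for constructing a polygon in an image on a mobile communication data terminal platform.
In the prior art, recognizing a polygon in an image requires the user to determine the endpoints of each side, so each line segment has to be set at least twice. By adopting the above technical solutions, the invention can determine the position of a line segment in a single setting, based on line segments that already exist in the image, and so conveniently assists the user in defining a polygon frame for subsequent image processing.
The above is merely a preferred embodiment of the present invention and is not intended to limit the scope of its substantive technical content. The substantive technical content of the invention is broadly defined by the claims of the application; any technical entity or method completed by others that is identical to what the claims define, or is an equivalent variation of it, shall be regarded as covered by the claims.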

Claims

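1. A method for adsorbing line segments in an image, characterized in that it comprises presetting a line segment and dragging the line segment by selecting a first portion of it; when a second portion of the line segment is dragged to the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment, "vicinity" meaning that the distance between the two conforms to a preset rule.
2. The method for adsorbing line segments in an image according to claim 1, characterized in that the first portion selected when dragging the preset line segment is either on the preset line segment or near the preset line segment.
3. The method for adsorbing line segments in an image according to claim 1, characterized in that the first portion selected when dragging the preset line segment and the second portion on that line segment are the same portion or different portions.
4. The method for adsorbing line segments in an image according to claim 1, 2, or 3, characterized in that the second portion is an endpoint, the midpoint, or a point at any other specific position of the preset line segment and of the line segment in the image.
5. The method for adsorbing line segments in an image according to claim 1, characterized in that the preset rule is that the distance is less than a preset value, or that the distance conforms to a preset formula.
6. A method for adsorbing line segments in an image, characterized in that it comprises presetting a line segment and dragging the line segment by selecting a first portion of it; if, when the dragging action ends, the second portion of the line segment lies in the vicinity of the corresponding second portion of a certain line segment in the image, the preset line segment is set to coincide with that line segment, "vicinity" meaning that the distance between the two conforms to a preset rule.
7. The method for adsorbing line segments in an image according to claim 6, characterized in that the first portion selected when dragging the preset line segment is either on the preset line segment or near the preset line segment.
8. The method for adsorbing line segments in an image according to claim 6, characterized in that the first portion selected when dragging the preset line segment and the second portion on that line segment are the same portion.
9. The method for adsorbing line segments in an image according to claim 6, 7, or 8, characterized in that the second portion is an endpoint, the midpoint, or a point at any other specific position of the preset line segment and of the line segment in the image.
10. The method for adsorbing line segments in an image according to claim 6, characterized in that the preset rule is that the distance is less than a preset value, or that the distance conforms to a preset formula.
11. A device for adsorbing line segments in an image, characterized in that it is based on a mobile communication data terminal platform and applies the method for adsorbing line segments in an image according to any one of claims 1-10.
12. A method for constructing a polygon in an image, characterized in that a polygon frame is preset, and the position of the line segment of at least one side of the polygon frame is determined using the method for adsorbing line segments in an image according to any one of claims 1-10, thereby constructing a polygon in the image.
13. The method for constructing a polygon in an image according to claim 12, characterized in that the polygon is a quadrilateral.
14. The method for constructing a polygon in an image according to claim 12, characterized in that a polygon is first preset on the image, and then the length and position of the line segment of at least one side of the polygon frame are determined using the method for adsorbing line segments in an image according to any one of claims 1-10, thereby constructing a polygon in the image.
15. The method for constructing a polygon in an image according to claim 14, characterized in that at least one side of the preset polygon is set to a certain line segment in the image.
16. The method for constructing a polygon in an image according to claim 12, characterized in that, when one side of the frame is dragged, the vertices of the polygon other than the two endpoints of that side remain fixed.
17. The method for constructing a polygon in an image according to claim 12, characterized in that, when constructing the polygon, if two adjacent sides still have no intersection after their positions have been determined using the method for adsorbing line segments in an image according to any one of claims 1-10, the two adjacent sides are extended in the direction in which they can meet until they intersect, and the intersection point and the two extended adjacent sides all become part of the constructed polygon.
18. A device for constructing a polygon in an image, characterized in that it is based on a mobile communication data terminal platform and applies the method for constructing a polygon in an image according to any one of claims 12-17.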
PCT/CN2014/083783 2013-08-06 2014-08-06 Method and device for adsorbing line segments in an image, and method and device for constructing a polygon WO2015018337A1 (zh)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201310338246.5 2013-08-06
CN201310338246.5A CN103400387B (zh) 2013-08-06 2013-08-06 Method and device for adsorbing line segments in an image, and method and device for constructing a polygon

Publications (1)

Publication Number Publication Date
WO2015018337A1 (zh) 2015-02-12

Family

ID=49563999

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2014/083783 WO2015018337A1 (zh) 2013-08-06 2014-08-06 Method and device for adsorbing line segments in an image, and method and device for constructing a polygon

Country Status (2)

Country Link
CN (1) CN103400387B (zh)
WO (1) WO2015018337A1 (zh)

Families Citing this family (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
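CN103400387B (zh) * 2013-08-06 2018-10-09 上海合合信息科技发展有限公司 Method and device for adsorbing line segments in an image, and method and device for constructing a polygon
CN103399696B (zh) * 2013-08-16 2018-10-09 上海合合信息科技发展有限公司 Method and device for adsorbing straight lines/line segments, and method and device for constructing a polygon
CN104835184B (zh) * 2014-02-10 2018-03-20 成都理想境界科技有限公司 Method for extracting quadrilateral regions in an image
CN109979020B (zh) * 2019-04-04 2020-12-22 广州视源电子科技股份有限公司 Line segment adsorption method, apparatus, device, and storage medium
CN111352545A (zh) * 2019-05-07 2020-06-30 鸿合科技股份有限公司 Automatic adsorption method for interactive services and related device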

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
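CN101137011A (zh) * 2006-08-29 2008-03-05 索尼株式会社 Image processing device, image processing method, and computer program
CN101634795A (zh) * 2009-08-20 2010-01-27 上海合合信息科技发展有限公司 Camera autofocus method and system
CN101807293A (zh) * 2010-03-19 2010-08-18 上海合合信息科技发展有限公司 Method for adjusting the detection result of a quadrilateral frame in an image
CN102714692A (zh) * 2009-09-23 2012-10-03 微软公司 Camera-based scanning
CN103176687A (zh) * 2011-12-26 2013-06-26 腾讯科技(深圳)有限公司 Partition management method and system for desktop icons
CN103399696A (zh) * 2013-08-16 2013-11-20 上海合合信息科技发展有限公司 Method and device for adsorbing straight lines/line segments, and method and device for constructing a polygon
CN103400387A (zh) * 2013-08-06 2013-11-20 上海合合信息科技发展有限公司 Method and device for adsorbing line segments in an image, and method and device for constructing a polygon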

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
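JP2005078032A (ja) * 2003-09-03 2005-03-24 Olympus Corp Image display program, image display device, and image display method
CN101853519A (zh) * 2009-04-01 2010-10-06 英业达股份有限公司 Deformation control system and method for vector graphics
CN101859224B (zh) * 2010-04-30 2012-04-18 陈铸 Method and system for extracting a target object from a digital picture image
CN102646258B (zh) * 2011-02-21 2016-01-06 腾讯科技(深圳)有限公司 Image cropping method and system
CN102360395B (zh) * 2011-09-29 2014-03-26 沈阳远大铝业工程有限公司 Method for drawing batch unfolded views of irregular quadrilateral decorative panels for curtain walls
CN102938064B (zh) * 2012-11-23 2015-06-17 南京大学 Parking lot structure extraction method based on LiDAR data and orthoimages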

Also Published As

Publication number Publication date
CN103400387B (zh) 2018-10-09
CN103400387A (zh) 2013-11-20

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 14833819

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 14833819

Country of ref document: EP

Kind code of ref document: A1