CN102456139B - Image processing equipment and image processing method - Google Patents
Image processing equipment and image processing method Download PDFInfo
- Publication number
- CN102456139B CN102456139B CN201110165801.XA CN201110165801A CN102456139B CN 102456139 B CN102456139 B CN 102456139B CN 201110165801 A CN201110165801 A CN 201110165801A CN 102456139 B CN102456139 B CN 102456139B
- Authority
- CN
- China
- Prior art keywords
- line
- image
- module
- information
- image processing
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Active
Links
- 238000012545 processing Methods 0.000 title claims abstract description 72
- 238000003672 processing method Methods 0.000 title claims abstract description 7
- 238000004364 calculation method Methods 0.000 claims abstract description 36
- 238000000605 extraction Methods 0.000 claims description 41
- 239000000284 extract Substances 0.000 claims description 18
- 230000013011 mating Effects 0.000 claims 1
- 238000004458 analytical method Methods 0.000 description 34
- 238000010586 diagram Methods 0.000 description 17
- 238000012937 correction Methods 0.000 description 14
- 238000000034 method Methods 0.000 description 14
- 238000010276 construction Methods 0.000 description 10
- 238000004891 communication Methods 0.000 description 9
- 230000015654 memory Effects 0.000 description 9
- 238000004590 computer program Methods 0.000 description 8
- 230000006870 function Effects 0.000 description 5
- 230000005540 biological transmission Effects 0.000 description 3
- 238000007792 addition Methods 0.000 description 2
- 238000001514 detection method Methods 0.000 description 2
- 238000012986 modification Methods 0.000 description 2
- 230000004048 modification Effects 0.000 description 2
- 230000035508 accumulation Effects 0.000 description 1
- 238000009825 accumulation Methods 0.000 description 1
- 230000006835 compression Effects 0.000 description 1
- 238000007906 compression Methods 0.000 description 1
- 238000013075 data extraction Methods 0.000 description 1
- 230000002950 deficient Effects 0.000 description 1
- 238000005516 engineering process Methods 0.000 description 1
- 230000010365 information processing Effects 0.000 description 1
- 238000009434 installation Methods 0.000 description 1
- 239000004973 liquid crystal related substance Substances 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 230000002093 peripheral effect Effects 0.000 description 1
- 239000004065 semiconductor Substances 0.000 description 1
- 230000001502 supplementing effect Effects 0.000 description 1
Abstract
本发明提供一种图像处理设备和一种图像处理方法。图像处理设备包括线信息接收单元、预测确定单元、特征量计算单元和线确定单元。线信息接收单元接收指示以下项目的一组信息:(i)可能为线的图像的有关信息;以及(ii)作为构成线的矩形像素块的线元素。预测确定单元基于所接收到的信息来确定目标线元素与预测值是否匹配。预测值指示目标线元素构成线时预测的线元素。特征量计算单元在预测确定单元确定了目标线元素与预测值不匹配时计算图像的特征量。线确定单元基于所计算的特征量来确定该图像是否为线。
The present invention provides an image processing device and an image processing method. An image processing device includes a line information receiving unit, a prediction determining unit, a feature amount calculating unit, and a line determining unit. The line information receiving unit receives a set of information indicating (i) information about an image which may be a line; and (ii) line elements which are rectangular blocks of pixels constituting the line. The prediction determination unit determines whether the target line element matches the predicted value based on the received information. The predicted value indicates the predicted line element when the target line element constitutes the line. The feature quantity calculation unit calculates the feature quantity of the image when the prediction determination unit determines that the target line element does not match the predicted value. The line determining unit determines whether the image is a line based on the calculated feature amount.
Description
技术领域technical field
本发明涉及图像处理设备和图像处理方法。The present invention relates to an image processing device and an image processing method.
背景技术Background technique
本领域中已知从图像中提取线段的技术。Techniques for extracting line segments from images are known in the art.
作为这些技术中的一种,JP-A-08-016782公开了一种线段提取系统,甚至在线段倾斜时、中间稍微弯曲时、或者中间断开时,其也可以从二进制绘图图像中提取线段。走向(run)数据提取单元通过纵向和横向扫描二进制绘图图像来提取走向数据组。指示点检测单元检测开始指示点和结束指示点的位置。走向数据选择单元根据开始指示点和结束指示点之间的倾斜度来选择纵向和横向走向数据组中的一个。开始走向检测单元从所选的走向数据组中检测其走向长度比指定值短并且距离开始指示点的距离最小的走向数据,并且将所检测到的走向数据定义为开始走向。走向跟踪单元跟踪从开始走向链接至结束指示点的走向数据,直到结束指示点的位置为止,并且将所获得的走向数据的链接定义为线段。As one of these techniques, JP-A-08-016782 discloses a line segment extraction system that can extract a line segment from a binary drawing image even when the line segment is inclined, slightly curved in the middle, or disconnected in the middle . A run data extraction unit extracts a run data group by scanning the binary drawing image longitudinally and laterally. The indication point detection unit detects the positions of the start indication point and the end indication point. The direction data selection unit selects one of the longitudinal and lateral direction data sets according to the inclination between the start indication point and the end indication point. The starting trend detection unit detects trend data whose trend length is shorter than a specified value and whose distance from the start indication point is the smallest from the selected trend data group, and defines the detected trend data as the starting trend. The trend tracking unit tracks the trend data linked from the start trend to the end indication point until the position of the end indication point, and defines the link of the obtained trend data as a line segment.
发明内容Contents of the invention
本发明的目的是提供一种图像处理设备和一种图像处理方法,其可以防止不是线的图像被错误地判定为线。An object of the present invention is to provide an image processing apparatus and an image processing method that can prevent an image that is not a line from being erroneously determined as a line.
[1]根据本发明的一方面,图像处理设备包括线信息接收单元、预测确定单元、特征量计算单元和线确定单元。线信息接收单元接收指示以下项目的一组信息:(i)可能为线的图像的有关信息;以及(ii)作为构成线的矩形像素块的线元素。预测确定单元基于线信息接收单元所接收到的指示线元素的信息来确定目标线元素与目标线元素的预测值是否匹配。预测值指示目标线元素在目标线元素的位置处构成线时所预测的线元素。特征量计算单元在预测确定单元确定了目标线元素与预测值不匹配时计算该图像的特征量。线确定单元基于特征量计算单元所计算的特征量来确定该图像是否为线。[1] According to an aspect of the present invention, an image processing apparatus includes a line information receiving unit, a prediction determining unit, a feature amount calculating unit, and a line determining unit. The line information receiving unit receives a set of information indicating (i) information about an image which may be a line; and (ii) line elements which are rectangular blocks of pixels constituting the line. The prediction determining unit determines whether the target line element matches the predicted value of the target line element based on the information indicating the line element received by the line information receiving unit. The prediction value indicates a line element predicted when the target line element constitutes a line at the position of the target line element. The feature amount calculation unit calculates the feature amount of the image when the prediction determination unit determines that the target line element does not match the predicted value. The line determination unit determines whether the image is a line based on the feature quantity calculated by the feature quantity calculation unit.
[2]在根据[1]的图像处理设备中,还包括图像接收单元和线提取单元。图像接收单元接收图像。线提取单元从图像接收单元所接收到的图像中提取可能为线的图像,并且提取指示线的线元素的一组信息。线信息接收单元接收线提取单元所提取的指示线元素的一组信息。[2] In the image processing device according to [1], an image receiving unit and a line extracting unit are further included. The image receiving unit receives an image. The line extracting unit extracts images that may be lines from the images received by the image receiving unit, and extracts a set of information indicating line elements of the lines. The line information receiving unit receives a set of information indicating line elements extracted by the line extracting unit.
[3]在根据[1]或[2]的图像处理设备中,预测确定单元具有多个条件来确定目标线元素与预测值是否匹配。特征量计算单元基于预测确定单元确定目标线元素与预测值不匹配时的条件类型来提取特征量。[3] In the image processing device according to [1] or [2], the prediction determination unit has a plurality of conditions to determine whether the target line element matches the predicted value. The feature quantity calculation unit extracts the feature quantity based on a condition type when the prediction determination unit determines that the target line element does not match the predicted value.
[4]在根据[2]或[3]的图像处理设备中,图像接收单元接收包括字符串的图像。线提取单元基于字符串的方向从图像接收单元所接收到的图像中提取可能为线的图像。[4] In the image processing device according to [2] or [3], the image receiving unit receives an image including a character string. The line extracting unit extracts images that may be lines from the images received by the image receiving unit based on the direction of the character string.
[5]在根据[2]或[3]的图像处理设备中,图像接收单元接收包括框线的图像。线提取单元基于框线的方向从图像接收单元所接收到的图像中提取可能为线的图像。[5] In the image processing apparatus according to [2] or [3], the image receiving unit receives the image including the frame line. The line extracting unit extracts an image that may be a line from the image received by the image receiving unit based on the direction of the frame line.
[6]根据本发明的另一方面,图像处理方法包括:接收指示以下项目的一组信息,(i)可能为线的图像的有关信息,(ii)作为构成线的矩形像素块的线元素;基于所接收到的指示线元素的信息来确定目标线元素与目标线元素的预测值是否匹配,预测值指示目标线元素在目标线元素的位置处构成线时所预测的线元素;当确定目标线元素与预测值不匹配时,计算该图像的特征量;以及基于计算出的特征量来确定该图像是否为线。[6] According to another aspect of the present invention, an image processing method includes receiving a set of information indicating (i) information about an image that may be a line, (ii) line elements that are rectangular blocks of pixels constituting the line ; Determine whether the target line element matches the predicted value of the target line element based on the received information indicating the line element, and the predicted value indicates the predicted line element when the target line element forms a line at the position of the target line element; when determining When the target line element does not match the predicted value, calculating a feature quantity of the image; and determining whether the image is a line based on the calculated feature quantity.
根据[1]的图像处理设备,可以防止不是线的图像被错误地判定为线。According to the image processing apparatus of [1], it is possible to prevent an image that is not a line from being erroneously determined as a line.
根据[2]的图像处理设备,可以防止被提取为可能为线的图像中不是线的图像被错误地判定为线。According to the image processing apparatus of [2], it is possible to prevent an image that is not a line from among images extracted as possible lines from being erroneously determined as a line.
根据[3]的图像处理设备,可以根据基于确定了与预测值不匹配时的条件类型的特征来确定图像是否相当于线。According to the image processing apparatus of [3], whether an image corresponds to a line can be determined according to a feature based on a condition type when a mismatch with a predicted value is determined.
根据[4]的图像处理设备,可以提取与字符串一起使用的线。According to the image processing device of [4], a line used with a character string can be extracted.
根据[5]的图像处理设备,可以从包含框线的图像中提取框线。According to the image processing apparatus of [5], a frame line can be extracted from an image containing the frame line.
根据[6]的图像处理方法,可以防止不是线的图像被错误地判定为线。According to the image processing method of [6], it is possible to prevent an image that is not a line from being erroneously determined as a line.
附图说明Description of drawings
以下将基于附图来详细描述本发明的示例实施例,附图中:Exemplary embodiments of the present invention will be described in detail below based on the accompanying drawings, in which:
图1是根据第一实施例的示例构造的概念模块构造视图;Fig. 1 is a conceptual module construction view according to an example construction of the first embodiment;
图2是示出示例目标图像的说明图;FIG. 2 is an explanatory diagram showing an example target image;
图3是示出从扫描仪等读取的示例图像的说明图;FIG. 3 is an explanatory diagram showing an example image read from a scanner or the like;
图4是示出线提取的示例的说明图;FIG. 4 is an explanatory diagram showing an example of line extraction;
图5是示出示例接收图像的说明图;FIG. 5 is an explanatory diagram showing an example received image;
图6是示出从所接收到的图像中提取的示例线元素的说明图;FIG. 6 is an explanatory diagram showing example line elements extracted from a received image;
图7是示出示例线元素的说明图;FIG. 7 is an explanatory diagram showing an example line element;
图8是示出示例校正和补充后的线元素的说明图;FIG. 8 is an explanatory diagram showing example corrected and supplemented line elements;
图9是示出跨越多个像素的示例线元素的说明图;FIG. 9 is an explanatory diagram showing an example line element spanning a plurality of pixels;
图10是示出确定和观察失败类型的示例的说明图;FIG. 10 is an explanatory diagram showing an example of determining and observing failure types;
图11是示出由采用确定模块执行的示例处理的流程图;11 is a flowchart illustrating example processing performed by an employment determination module;
图12是根据第二实施例的示例构造的概念模块构造视图;Fig. 12 is a conceptual module construction view according to an example construction of the second embodiment;
图13是根据第三实施例的示例构造的概念模块构造视图;Fig. 13 is a conceptual module construction view according to an example construction of the third embodiment;
图14是根据第四实施例的示例构造的概念模块构造视图;Fig. 14 is a conceptual module construction view according to an example construction of the fourth embodiment;
图15是根据第五实施例的示例构造的概念模块构造视图;Fig. 15 is a conceptual module construction view according to an example construction of the fifth embodiment;
图16是示出计算实现该实施例的示例硬件构造的方框图。FIG. 16 is a block diagram illustrating an example hardware configuration for computationally implementing this embodiment.
具体实施方式detailed description
以下将参考附图来描述适用于实现本发明的各种示例实施例。Various exemplary embodiments suitable for realizing the present invention will be described below with reference to the accompanying drawings.
图1是根据第一实施例的示例构造的概念模块构造视图。FIG. 1 is a conceptual block configuration view of an example configuration according to the first embodiment.
本文中所使用的“模块”通常指诸如逻辑可分离的软件(计算机程序)、硬件等之类的部件。因此,该实施例中的模块不仅包括计算机程序中的模块还包括硬件构造中的模块。因此,该实施例描述了使该实施例作为模块、系统、和方法的所有计算机程序(包括使计算机执行步骤的程序、使计算机作为装置的程序、以及使计算机实现功能的程序)。为了方便描述,当实施例涉及计算机程序时,本文中所使用的“存储”、“被存储”或其等价表述是指计算机程序存储在存储装置中或计算机程序被控制为存储在存储装置中。尽管模块与功能是一一对应关系,但是安装时,一个模块可以被构造为一个程序,多个模块可以被构造为一个程序,或者一个模块可以被构造为多个程序。多个模块可以由一个计算机执行,或者在分布式或并行环境中一个模块可以由多个计算机执行。一个模块可以包含其他模块。如本文中所使用的,术语“连接”除了表示物理连接之外还包括逻辑连接(数据传输、指令、数据间的引用关系等)。如本文中所使用的,术语“预定”是指在目标处理之前的确定,不仅包括实施例开始处理之前的确定,还包括根据当此确定为实施例开始处理之后到目标处理之前的确定时该时刻的情况和条件,或者直到该时刻的情况和条件。A "module" as used herein generally refers to a component such as logically separable software (computer program), hardware, and the like. Therefore, the modules in this embodiment include not only modules in computer programs but also modules in hardware configurations. Therefore, the embodiments describe all computer programs (including programs that cause a computer to perform steps, programs that make a computer an apparatus, and programs that cause a computer to realize functions) that make the embodiments a module, system, and method. For the convenience of description, when the embodiment refers to a computer program, "store", "stored" or its equivalent expression used herein means that the computer program is stored in the storage device or the computer program is controlled to be stored in the storage device . Although there is a one-to-one correspondence between modules and functions, when installed, one module can be constructed as one program, multiple modules can be constructed as one program, or one module can be constructed as multiple programs. Multiple modules can be executed by one computer, or one module can be executed by multiple computers in a distributed or parallel environment. A module can contain other modules. As used herein, the term "connection" includes not only a physical connection but also a logical connection (data transmission, instruction, reference relationship between data, etc.). As used herein, the term "predetermined" refers to a determination before the target process, including not only the determination before the embodiment starts the process, but also the The circumstances and conditions at the moment, or up to that moment.
如本文中所使用的,术语“系统”或“设备”除了表示通过通信装置(诸如网络)互连(包括一一对应的通信连接)的多个计算机、硬件、设备等,还包括一个计算机、硬件、设备等。在该说明书中,“设备”与“系统”同义。当然,“系统”不包括仅作为人为判定的社会“结构”(社会体制)的任何方面。As used herein, the term "system" or "equipment" not only refers to a plurality of computers, hardware, equipment, etc. interconnected (including one-to-one communication connections) through communication means (such as a network), but also includes a computer, hardware, equipment, etc. In this specification, "apparatus" is synonymous with "system". Of course, "system" does not include any aspect of a society's "structure" (social institution) that is merely a human judgment.
当不同模块执行不同处理或者一个模块执行不同处理时,从存储单元读取用于处理的信息,并且将处理结果写入存储单元中。因此,关于处理之前从存储单元读出信息以及处理之后将信息写入存储单元的说明可能被省略。本文中所使用的存储单元可以包括硬盘、随机存取存储器(RAM)、外部存储介质、经由通信线路的存储单元、中央处理单元(CPU)内的寄存器等。When different modules perform different processing or one module performs different processing, information for the processing is read from the storage unit, and the processing result is written in the storage unit. Therefore, descriptions about reading information from memory cells before processing and writing information into memory cells after processing may be omitted. The storage unit used herein may include a hard disk, a random access memory (RAM), an external storage medium, a storage unit via a communication line, a register within a central processing unit (CPU), and the like.
第一实施例的图像处理设备确定目标图像是否对应于线,并且其包括线信息接收模块110、观察失败分析模块120、特征量计算模块130和采用确定模块140,如图1的示例所示。The image processing apparatus of the first embodiment determines whether a target image corresponds to a line, and includes a line information receiving module 110, an observation failure analysis module 120, a feature quantity calculation module 130, and an adoption determination module 140, as shown in the example of FIG.
首先,将描述第一实施例中的目标图像。First, the target image in the first embodiment will be described.
图2是示出示例目标图像的说明图。所示图像包括线图像(图案)和非线图像。FIG. 2 is an explanatory diagram showing an example target image. The shown images include line images (patterns) and non-line images.
图3是示出扫描仪等读取图2中的图像时所获得的示例图像(像素数据)的说明图。在该示例中,目标图像中要被提取的线不一定被表示为被看作直线的单区域,这是由于噪声的影响,也可能被扫描仪等读取为点划线、短划线、非直线图案等。更具体地,可能出现起皱的(crushed)或不完整的(deficient)线等。FIG. 3 is an explanatory diagram showing an example image (pixel data) obtained when a scanner or the like reads the image in FIG. 2 . In this example, the line to be extracted in the target image is not necessarily represented as a single area that is regarded as a straight line, which is due to the influence of noise, and may also be read by a scanner, etc. as a dotted line, dashed line, Non-linear patterns, etc. More specifically, crushed or deficient threads and the like may occur.
图4是示出线提取的示例的说明图。FIG. 4 is an explanatory diagram showing an example of line extraction.
在该示例中,线被识别为与沿与线方向相交(例如,垂直)的方向延伸的像素串(pixel runs)相对应的一组区域。下文中,将这些区域称为“线元素”。In this example, a line is identified as a set of regions corresponding to pixel runs extending in a direction intersecting (eg, perpendicular to) the line direction. Hereinafter, these areas are referred to as "line elements".
第一实施例涉及检测并合并对应于要提取的线的像素串(即,所观察到的线元素),如图4的示例所示。像素串的合并结果对应于线图像。A first embodiment involves detecting and merging pixel strings corresponding to lines to be extracted (ie observed line elements), as shown in the example of FIG. 4 . The combined result of the pixel strings corresponds to a line image.
另外,可以基于这些像素串产生实线元素(即,校正线元素)。例如,在图4中,由点划线所指示的观察线元素411至428对应于在图像内观察到的线元素,而由灰色矩形所指示的校正线元素451至468对应于基于观察线元素411等校正的线元素。例如,当观察线元素411等于校正线元素451时,观察线元素412向下移动以与校正线元素452相对应,并且观察线元素414向上移动以与校正线元素454对应。另外,当在观察线元素422与观察线元素425之间没有观察到线元素时,添加校正线元素463和校正线元素464。In addition, solid line elements (ie, correction line elements) may be generated based on these pixel strings. For example, in FIG. 4 , observation line elements 411 to 428 indicated by dotted lines correspond to line elements observed within the image, while correction line elements 451 to 468 indicated by gray rectangles correspond to line elements based on observation. 411 and other corrected line elements. For example, when the observation line element 411 is equal to the correction line element 451 , the observation line element 412 moves down to correspond to the correction line element 452 , and the observation line element 414 moves up to correspond to the correction line element 454 . Also, when no line element is observed between the observed line element 422 and the observed line element 425 , a correction line element 463 and a correction line element 464 are added.
接下来,将描述线元素。Next, line elements will be described.
图5是示出示例接收图像的说明图。所接收的图像500被分成背景像素510和图案像素520,图案像素520由像素串530等构成。以下将描述从该图像500中提取水平线的示例。FIG. 5 is an explanatory diagram showing an example received image. The received image 500 is divided into background pixels 510 and pattern pixels 520 composed of pixel strings 530 and the like. An example of extracting a horizontal line from this image 500 will be described below.
图6是示出从所接收到的图像中提取的示例线元素的说明图。图案像素520由一组线元素640等构成。在该示例中,线元素是指对应于可以构成线的矩形像素块或上述像素串的区域。“指示线元素的信息”的示例可以包括用来绘制线元素的信息,具体地,指线元素绘制位置、线元素尺寸等。FIG. 6 is an explanatory diagram showing example line elements extracted from a received image. The pattern pixel 520 is made up of a set of line elements 640 and the like. In this example, a line element refers to an area corresponding to a rectangular pixel block that can constitute a line or the aforementioned pixel string. Examples of "information indicating line elements" may include information used to draw line elements, specifically, line element drawing positions, line element sizes, and the like.
图7是示出示例线元素710的说明图。在该示例中,线元素是指对应于沿与线方向(图7中的行进方向)相交的方向(图7中的位置方向)延伸的像素串的一个区域。FIG. 7 is an explanatory diagram showing an example line element 710 . In this example, a line element refers to an area corresponding to a pixel string extending in a direction (position direction in FIG. 7 ) intersecting the line direction (travel direction in FIG. 7 ).
假设线由多个线元素构成,例如,第k个线元素具有以下信息:Suppose a line is composed of multiple line elements, for example, the kth line element has the following information:
tk:线元素厚度t k : Line element thickness
Pk:线元素位置P k : line element position
tk表示关于线元素形状的信息,并且其对应于与线方向相交方向上的长度。即,tk是对应于线厚度的信息。t k represents information on the shape of the line element, and it corresponds to the length in the direction intersecting with the line direction. That is, t k is information corresponding to the thickness of the line.
Pk表示位置信息。例如,对于水平线,Pk是对应于图像高度的信息。P k represents position information. For example, for a horizontal line, P k is information corresponding to the image height.
如图7中的示例所示,实际从图像中观察到的线元素具有为一个像素的最小单位。但是,线元素的最小信息单位不一定是一个像素,还可能是小于一个像素的单位。例如,可以基于所获得的线元素以小于一个像素的单位获得实线元素。可替换地,可以从没有观察到线元素的状态产生线元素,诸如如图8所示的包括用于在中途切断部分中补充线元素的校正处理的线提取。As shown in the example in FIG. 7 , a line element actually observed from an image has a minimum unit of one pixel. However, the smallest information unit of a line element is not necessarily a pixel, and may be a unit smaller than a pixel. For example, solid line elements may be obtained in units smaller than one pixel based on the obtained line elements. Alternatively, a line element may be generated from a state where no line element is observed, such as line extraction including correction processing for supplementing a line element in a halfway cut portion as shown in FIG. 8 .
图8是示出示例校正和补充后的线元素的说明图。在该图中,由阴影矩形所指示的观察线元素811至828对应于在图像内所观察到的线元素,而由灰色矩形所指示的校正线元素851至868对应于基于观察线元素811等校正的线元素。例如,当在观察线元素820与观察线元素822之间没有观察到线元素时,添加校正线元素861。FIG. 8 is an explanatory diagram showing an example corrected and supplemented line element. In this figure, the observation line elements 811 to 828 indicated by the hatched rectangles correspond to the observed line elements within the image, while the correction line elements 851 to 868 indicated by the gray rectangles correspond to the line elements based on the observation line element 811 etc. Corrected line elements. For example, when no line element is observed between observed line element 820 and observed line element 822 , correction line element 861 is added.
另外,不一定使用所有上述信息。例如,如果线提取(稍后描述)使用了线位置,则关于各线元素的位置信息是不必要的。Additionally, not all of the above information is necessarily used. For example, if the line position is used for line extraction (described later), position information on each line element is unnecessary.
另外,如果对多值图像进行处理,则可以提供像素浓度信息。例如,其颜色信息突然改变的线元素可能被确定为观察失败。Additionally, if processing a multivalued image, pixel density information can be provided. For example, a line element whose color information changes suddenly may be determined as an observation failure.
另外,线元素可以连接至多个像素,如图9所示。例如,观察线元素可以对应于一个像素,而校正线元素可以对应于多个像素。在图9中,由阴影矩形所指示的观察线元素811至828对应于在图像内所观察到的线元素,而由灰色矩形所指示的校正线元素951至959对应于基于观察线元素811等校正的线元素。例如,由观察线元素811和观察线元素812组成的观察线元素对向上移动以与校正线元素951相对应,而在观察线元素820与观察线元素822之间没有观察到线元素时,额外产生校正线元素956。Additionally, a line element can be connected to multiple pixels, as shown in Figure 9. For example, an observation line element may correspond to one pixel, while a correction line element may correspond to multiple pixels. In FIG. 9 , observation line elements 811 to 828 indicated by hatched rectangles correspond to observed line elements within the image, while correction line elements 951 to 959 indicated by gray rectangles correspond to lines based on observation line element 811 etc. Corrected line elements. For example, the observation line element pair consisting of observation line element 811 and observation line element 812 is moved up to correspond to correction line element 951, and when there is no observation line element between observation line element 820 and observation line element 822, an additional A correction line element 956 is generated.
线信息接收模块110连接到观察失败分析模块120和特征量计算模块130。线信息接收模块110接收线信息108并将线信息112传输至观察失败分析模块120和特征量计算模块130。线信息108是指可能为线的图像的相关信息,并且对应于指示线元素的一组信息。第一实施例涉及确定线信息108是否构成线。The line information receiving module 110 is connected to the observation failure analysis module 120 and the feature quantity calculation module 130 . The line information receiving module 110 receives the line information 108 and transmits the line information 112 to the observation failure analysis module 120 and the feature quantity calculation module 130 . The line information 108 refers to information related to an image that may be a line, and corresponds to a set of information indicating line elements. The first embodiment involves determining whether the line information 108 constitutes a line.
观察失败分析模块120(其为预测确定单元)连接到线信息接收模块110和特征量计算模块130。观察失败分析模块120确定线信息112被成功观察到或未被成功观察到,并且将观察失败信息122传输至特征量计算模块130。如果线元素被预测在线元素的位置处构成线,则观察失败分析模块120确定目标线元素与线元素的预测值是否匹配。这种如果线元素被预测在线元素的位置处构成线而对目标线元素与线元素的预测值是否匹配进行的确定也被称为“观察”。“观察成功”是指确定为与预测值匹配的确定,而“观察失败”是指确定为与预测值不匹配的确定。The observation failure analysis module 120 , which is a prediction determination unit, is connected to the line information reception module 110 and the feature quantity calculation module 130 . The observation failure analysis module 120 determines whether the line information 112 is successfully observed or not, and transmits the observation failure information 122 to the feature amount calculation module 130 . If the line element is predicted to constitute a line at the line element's location, the observation failure analysis module 120 determines whether the target line element matches the predicted value of the line element. This determination of whether the target line element matches the predicted value of the line element if the line element is predicted to constitute a line at the location of the line element is also referred to as an "observation". "Observation success" refers to a determination that matches the predicted value, and "observation failure" refers to a determination that does not match the predicted value.
另外,对于观察失败,观察失败分析模块120还可以输出失败的类型作为观察失败信息122。In addition, for the observation failure, the observation failure analysis module 120 may also output the failure type as the observation failure information 122 .
以下将详细描述观察失败分析模块120执行的处理。The processing performed by the observation failure analysis module 120 will be described in detail below.
观察失败分析模块120接收线信息112,并且输出观察失败信息122(其为具有类型的观察失败的信息)。The observation failure analysis module 120 receives the line information 112, and outputs observation failure information 122 (which is information having a type of observation failure).
如图10所示,考虑由在行进方向上延伸的线元素构成的线。在该示例中,从线信息108获得以下信息:As shown in FIG. 10 , consider a line composed of line elements extending in the direction of travel. In this example, the following information is obtained from line information 108:
ft(k):位置k处的线厚度f t (k): line thickness at position k
fp(k):位置k处的线位置f p (k): line position at position k
ft(k)和fp(k)对应于位置k处的线元素的理想值。f t (k) and f p (k) correspond to ideal values for the line element at position k.
在该情况下,位置k处的线元素厚度tk不必等于ft(k)。In this case, the line element thickness t k at position k does not have to be equal to f t (k).
另外,位置k处的线元素位置pk不必等于fp(k)。Additionally, line element position p k at position k does not have to be equal to f p (k).
另外,在某些情况下,位置k处没有线元素。Also, in some cases there is no line element at position k.
因此,位置k处的线元素可以具有以下类型的观察失败:Thus, a line element at position k can have observation failures of the following types:
A)不存在线元素;A) There is no line element;
B)pk与fp(k)之间的差大于预设阈值;B) The difference between p k and f p (k) is greater than a preset threshold;
C)tk与ft(k)之间的差大于预设阈值。C) The difference between t k and f t (k) is greater than a preset threshold.
以下将使用图10的示例来详细描述这些类型的观察失败。在图10中,灰色矩形表示由线信息112所指示的线元素(观察到的线元素)。由顶部和底部的点线所包围的区域(线)表示基于线信息112计算出的理想线,而一点划线表示理想线的中心。These types of observation failures will be described in detail below using the example of FIG. 10 . In FIG. 10 , gray rectangles represent line elements indicated by line information 112 (observed line elements). A region (line) surrounded by dotted lines at the top and bottom represents an ideal line calculated based on the line information 112, and a dotted line represents the center of the ideal line.
在图10的示例中,在从最左边开始的第三个线元素的下一位置(由A指示的位置)处没有线元素。这是观察失败(A)的示例。In the example of FIG. 10 , there is no line element at the position next to the third line element from the leftmost (the position indicated by A). Here is an example of observation failure (A).
在图10的示例中,从最左边开始的第七个线元素(由B指示的位置)向上突出,这意味着pk与fp(k)之间的差大于预设阈值。这是观察失败(B)的示例。In the example of FIG. 10 , the seventh line element from the far left (position indicated by B) protrudes upwards, which means that the difference between p k and f p (k) is greater than a preset threshold. This is an example of observation failure (B).
观察失败的类型根据tk与ft(k)之间的差为正或为负而不同。在图10的示例中,从最左边开始的第十一个线元素(由C-1指示的位置)与理想线的线元素相比太厚。相反,在图10的示例中,从最左边开始的第十五个线元素(由C-2指示的位置)与理想线的线元素相比太薄。The type of observation failure differs depending on whether the difference between t k and f t (k) is positive or negative. In the example of FIG. 10 , the eleventh line element from the leftmost (position indicated by C-1 ) is too thick compared to that of the ideal line. In contrast, in the example of FIG. 10 , the fifteenth line element from the leftmost (position indicated by C-2 ) is too thin compared to that of the ideal line.
为了妥善处理这些类型的观察失败,观察失败分析模块120具有多个条件来确定与预测值是否匹配。In order to gracefully handle these types of observation failures, the observation failure analysis module 120 has a number of conditions to determine if there is a match with the predicted value.
第一个条件是存在紧邻目标线元素的线元素。这对应于以上类型A)。该条件的不满足导致第一类型的观察失败。The first condition is that there is a line element immediately adjacent to the target line element. This corresponds to type A) above. Failure of this condition causes the first type of observation to fail.
第二个条件是观察线元素的位置与预测线元素的位置之间的差小于阈值。这对应于以上类型B)。如果该差等于或大于阈值,则导致第二类型的观察失败。The second condition is that the difference between the position of the observed line element and the position of the predicted line element is less than a threshold. This corresponds to type B) above. If the difference is equal to or greater than the threshold, it causes the second type of observation to fail.
第三个条件是观察线元素的宽度与预测线元素的宽度之间的差小于阈值。这对应于以上类型C)。如果该差等于或大于阈值,则导致第三类型的观察失败。如果该差小于阈值,则导致第四类型的观察失败。The third condition is that the difference between the width of the observed line element and the width of the predicted line element is less than a threshold. This corresponds to type C) above. If the difference is equal to or greater than the threshold, it causes the third type of observation to fail. If the difference is less than the threshold, it results in a failure of the fourth type of observation.
上述阈值可以设置为常量、fp(k)的整数倍、ft(k)的整数倍、或线元素的变化(variance)的整数倍。The threshold above can be set as a constant, an integer multiple of f p (k), an integer multiple of f t (k), or an integer multiple of the variation (variance) of line elements.
观察失败分析模块120可以输出具有类型加成的观察,诸如A)、B)、C)-1和C)-2。The observation failure analysis module 120 may output observations with type additions, such as A), B), C)-1, and C)-2.
可以以现有方法来获得ft(k)和fp(k)。例如,可以将ft(k)获得为观察到的线信息112的平均值、中间值或最大频率。可以根据观察到的线信息112的位置通过最小平方误差函数来获得fp(k)。f t (k) and f p (k) can be obtained in existing methods. For example, f t (k) may be obtained as the mean, median or maximum frequency of the observed line information 112 . f p (k) can be obtained from the position of the observed line information 112 by a least square error function.
另外,不必使用所有的线元素来获得ft(k)和fp(k)。例如,要确定ft(k)和fp(k)针对的线元素可以指邻近的线元素。本文中所使用的邻近是指距离目标线元素预定距离内存在的邻近线元素。另外,也可以指除了导致明显的观察失败的线元素之外的其余线元素。Also, it is not necessary to use all line elements to obtain ft( k ) and fp( k ). For example, the line elements for which f t (k) and f p (k) are to be determined may refer to neighboring line elements. Adjacent as used herein refers to adjacent line elements existing within a predetermined distance from the target line element. In addition, it may also refer to the remaining line elements other than the line element causing obvious observation failure.
此外,除了上述类型的观察失败之外,可以提供关于线元素的浓度信息的其他类型的观察失败。Furthermore, other types of observation failures may provide information on the concentration of line elements in addition to the above-mentioned types of observation failures.
观察失败分析模块120可以被分成两个模块。The observation failure analysis module 120 can be divided into two modules.
第一模块为预测值计算模块,如果线元素从目标线元素位置开始构成线,其基于线信息接收模块110所接收到的线元素的相关信息来计算目标位置处线元素的有关信息的预测值。The first module is the prediction value calculation module. If the line elements form a line from the position of the target line element, it calculates the prediction value of the relevant information of the line elements at the target position based on the relevant information of the line elements received by the line information receiving module 110 .
第二模块为预测确定处理模块,其确定预测值计算模块计算出的预测值与线信息接收模块110所接收到的线元素的相关信息是否匹配。The second module is a prediction determination processing module, which determines whether the predicted value calculated by the predicted value calculation module matches the related information of the line elements received by the line information receiving module 110 .
另外,观察失败分析模块120还根据类型对与预测值不匹配的情况进行分类。In addition, the observed failure analysis module 120 also classifies cases that do not match the predicted values according to type.
特征量计算模块130连接到线信息接收模块110、观察失败分析模块120和采用确定模块140。特征量计算模块130计算对应于观察失败信息122的线信息112的特征量,并且将特征量132传输至采用确定模块140。特征量计算模块130计算关于观察失败分析模块120确定与预测值不匹配的情况出现时的图像特征。另外,特征量计算模块130可以基于观察失败分析模块120确定的与预测值不匹配的条件类型来提取特征。具体地,所述特征可以是与预测值不匹配(观察失败)的发生率。The feature quantity calculation module 130 is connected to the line information reception module 110 , the observation failure analysis module 120 and the adoption determination module 140 . The feature amount calculation module 130 calculates the feature amount of the line information 112 corresponding to the observation failure information 122 , and transmits the feature amount 132 to the adoption determination module 140 . The feature amount calculation module 130 calculates the image feature when the failure-to-observation analysis module 120 determines that a mismatch with the predicted value occurs. In addition, the feature amount calculation module 130 may extract features based on the type of condition determined by the observation failure analysis module 120 that does not match the predicted value. Specifically, the feature may be an occurrence rate that does not match the predicted value (observation failure).
特征量计算模块130参照线信息112和观察失败信息122来获得线特征量。针对每种类型的观察失败,或者通过安排几种类型的观察失败,获得观察失败的发生率作为特征量。The feature amount calculation module 130 refers to the line information 112 and the observation failure information 122 to obtain line feature amounts. For each type of observation failure, or by arranging several types of observation failures, the occurrence rate of observation failure is obtained as a feature quantity.
具体地,获得如下:Specifically, it is obtained as follows:
首先,根据以下等式(1)和(2)来安排观察失败的类型。在等式中,n{X}表示观察失败的类型X的发生次数。安排观察失败类型的方法不限于等式(1)和(2)。例如,可以将n{C2}添加到thick_skip。另外,可以不必对观察失败的类型进行安排。First, the types of observation failures are arranged according to the following equations (1) and (2). In the equation, n{X} represents the number of occurrences of type X of observation failures. The method of arranging observation failure types is not limited to equations (1) and (2). For example, n{C 2 } can be added to thick_skip. In addition, it is not necessary to schedule the type of observation failure.
[等式1][equation 1]
blank_skip=n{A}+n{B}+n{C2} (1)blank_skip=n{A}+n{B}+n{C 2 } (1)
A:没有线元素A: no line element
B:线元素的厚度太薄B: The thickness of the line element is too thin
C2:线元素的位置距离预测线元素的位置太远C 2 : The position of the line element is too far from the predicted position of the line element
[等式2][equation 2]
thick_skip=n{C1} (2)thick_skip=n{C 1 } (2)
C1:线元素的厚度太厚C 1 : The thickness of the line element is too thick
假设根据以下等式(3)和(4)来获得构成线的线元素的总数、观察失败类型的发生率,即特征量。It is assumed that the total number of line elements constituting a line, the occurrence rate of observation failure types, that is, feature quantities are obtained according to the following equations (3) and (4).
[等式3][equation 3]
ratio_blank_skip=blank_skip/N (3)ratio_blank_skip=blank_skip/N (3)
[等式4][equation 4]
ratio_thick_skip=thick_skip/N (4)ratio_thick_skip=thick_skip/N (4)
此处,可以不使用观察失败的类型,来获得不同于等式(3)和(4)的特征。例如,使用长度、厚度、位置、梯度、失真、ft(k)与tk(即,厚度)的误差和fp(k)与pk(即,位置)的误差的累计。该情况下,除了导致观察失败的线元素之外,还可以获得上述特征量。Here, the type of observation failure may not be used to obtain characteristics different from equations (3) and (4). For example, accumulations of length, thickness, position, gradient, distortion, error of ft( k ) from tk (ie, thickness) and error of fp( k ) from pk (ie, position) are used. In this case, it is possible to obtain the above-mentioned feature amount in addition to the line element causing observation failure.
采用确定模块140(其为线确定单元)连接到特征量计算模块130。采用确定模块140基于特征量132确定是否在用线元素作为线,并且输出采用/不采用的信息142。此处,输出的示例可以包括存储在诸如存储卡之类的存储介质上、传送至另一信息处理设备等。The adoption determination module 140 (which is a line determination unit) is connected to the feature quantity calculation module 130 . The adoption determination module 140 determines whether or not a line element is being used as a line based on the feature quantity 132 , and outputs adoption/non-adoption information 142 . Here, examples of output may include storage on a storage medium such as a memory card, transmission to another information processing device, and the like.
采用确定模块140基于特征量计算模块130计算出的特征量132来确定由线信息接收模块110所接收到的线信息108构成的图像是否为线。The employing determining module 140 determines whether the image constituted by the line information 108 received by the line information receiving module 110 is a line based on the feature quantity 132 calculated by the feature quantity calculating module 130 .
采用确定模块140基于线特征量132输出线采用/不采用的信息142。该情况下,可以确定每个特征是否满足预定条件。例如,执行图11的示例中所示的确定处理。在图11中,θratio_blank_skip和θratio_thick_skip为预定值。The adoption determination module 140 outputs line adoption/non-adoption information 142 based on the line feature quantity 132 . In this case, it may be determined whether each feature satisfies a predetermined condition. For example, the determination processing shown in the example of FIG. 11 is performed. In FIG. 11 , θ ratio_blank_skip and θ ratio_thick_skip are predetermined values.
另外,可以通过从特征量学习来产生决定因素。该情况下,可以使用不同线的特征量。即,通过至少将关于观察失败类型的线特征量作为输入的二级分类器来配置该处理。In addition, determinants can be generated by learning from feature quantities. In this case, feature quantities of different lines can be used. That is, the processing is configured by a secondary classifier that takes at least the line feature quantity on the observation failure type as input.
图11是示出采用确定模块140所执行的示例处理的流程图。FIG. 11 is a flowchart illustrating example processing performed by employing the determination module 140 .
在步骤S1102,确定是否建立了以下等式(5)或(6)。如果是,则处理继续进行步骤S1104,否则,处理继续前进到步骤S1106。In step S1102, it is determined whether the following equation (5) or (6) is established. If yes, the process proceeds to step S1104, otherwise, the process proceeds to step S1106.
[等式5][equation 5]
ratio_blank_skip<θratio_blank_skip (5)ratio_blank_skip<θ ratio_blank_skip (5)
[等式][equation]
ratio_thick_skip<θratio_thick_skip (6)ratio_thick_skip<θ ratio_thick_skip (6)
在步骤S1104,输出“采用”。In step S1104, "adopt" is output.
在步骤S1106,输出“不采用”。In step S1106, "not adopted" is output.
对于观察失败的类型,其特征在于实线具有较少的blank_skip和较多的thick_skip,而虚线具有较少的thick_skip和较多的blank_skip。另外,其特征在于,非线图案具有很多thick_skip和blank_skip。因此,通过使用观察失败的类型,当从其中混合有实线、不完整线、起皱线、线交叉、非线图案等的图像中提取线时,可以提取线而不会错误地提取非线图案。The type of observed failure is characterized by solid lines having less blank_skip and more thick_skip, while dashed lines have less thick_skip and more blank_skip. In addition, it is characterized in that the nonlinear pattern has many thick_skips and blank_skips. Therefore, when extracting lines from an image in which solid lines, incomplete lines, wrinkled lines, line intersections, non-line patterns, etc. pattern.
另外,如果观察失败的发生率等于或小于预定值,则可以不考虑随后确定来确定线是否被采用,或者更新用于随后确定的阈值。如果成功观察到提取线,则这可以导致高概率的线采用。In addition, if the occurrence rate of observation failure is equal to or less than a predetermined value, it may be determined whether or not a line is taken regardless of subsequent determination, or a threshold value for subsequent determination may be updated. This can result in a high probability of line adoption if the extraction line is successfully observed.
图12是根据第二实施例的示例构造的概念模块构造视图。Fig. 12 is a conceptual block configuration view according to an example configuration of the second embodiment.
第二实施例的构造包括图像接收模块1210、线提取模块1220、观察失败分析模块120、特征量计算模块130和采用确定模块1240。与第一实施例中相同的模块由相同的参考标号标示,并且不重复对其进行说明。对于由相同参考标号标示的模块,以下的描述涉及对对应于第一实施例的模块的性能或功能进行附加或替换。这同样适用于第三和随后的实施例。The configuration of the second embodiment includes an image reception module 1210 , a line extraction module 1220 , an observation failure analysis module 120 , a feature amount calculation module 130 , and an adoption determination module 1240 . The same blocks as those in the first embodiment are denoted by the same reference numerals, and description thereof will not be repeated. For blocks denoted by the same reference numerals, the following description involves addition or replacement of the performance or function of the blocks corresponding to the first embodiment. The same applies to the third and subsequent embodiments.
图像接收模块1210连接到线提取模块1220。图像接收模块1210接收图像1208并且将图像1212传输到线提取模块1220。The image receiving module 1210 is connected to the line extracting module 1220 . Image receiving module 1210 receives image 1208 and transmits image 1212 to line extraction module 1220 .
线提取模块1220连接到图像接收模块1210、观察失败分析模块120、特征量计算模块130和采用确定模块1240。线提取模块1220通过对齐来自图像1212的线元素来提取线,将线信息1222传输至观察失败分析模块120、特征量计算模块130和采用确定模块1240,并且在图像1212中不存在线的情况下输出无效信号1224。线提取模块1220从图像接收模块1210所接收到的图像1212中提取可能为线的图像,并且提取线信息1222(其为表示线的线元素的一组信息)。该线提取处理可以使用JP-A-08-016782中所公开的现有技术等。The line extraction module 1220 is connected to the image reception module 1210 , the observation failure analysis module 120 , the feature quantity calculation module 130 , and the adoption determination module 1240 . The line extraction module 1220 extracts a line by aligning line elements from the image 1212, transmits the line information 1222 to the observation failure analysis module 120, the feature quantity calculation module 130, and the adoption determination module 1240, and in the case where there is no line in the image 1212 An invalid signal 1224 is output. The line extracting module 1220 extracts an image that may be a line from the image 1212 received by the image receiving module 1210, and extracts line information 1222 (which is a set of information representing line elements of a line). The prior art disclosed in JP-A-08-016782 and the like can be used for this line extraction processing.
观察失败分析模块120连接到线提取模块1220和特征量计算模块130。观察失败分析模块120接收线信息1222(其为表示线的线元素的一组信息),做出线信息1222的观察成功/失败的确定,并且将观察失败信息122传输至特征量计算模块130。该情况下,对于观察失败,可以一起输出观察失败的类型。The observation failure analysis module 120 is connected to the line extraction module 1220 and the feature amount calculation module 130 . The observation failure analysis module 120 receives line information 1222 (which is a set of information representing line elements of a line), makes observation success/failure determination of the line information 1222 , and transmits the observation failure information 122 to the feature quantity calculation module 130 . In this case, for observation failure, the type of observation failure may be output together.
特征量计算模块130连接到线提取模块1220、观察失败分析模块120和采用确定模块1240。特征量计算模块130线提取模块1220所提取的接收线信息1222(其为表示线元素的一组信息)和来自观察失败分析模块120的观察失败信息122,计算对应于观察失败信息122的线信息1222的特征量,并且将特征量132传输到采用确定模块1240。此处,还可以计算其他特征量。The feature quantity calculation module 130 is connected to the line extraction module 1220 , the observation failure analysis module 120 , and the adoption determination module 1240 . Feature calculation module 130 receives line information 1222 extracted by line extraction module 1220 (it is a set of information representing line elements) and observation failure information 122 from observation failure analysis module 120, calculates line information corresponding to observation failure information 122 1222, and transmit the feature 132 to the adoption determination module 1240. Here, other feature quantities may also be calculated.
采用确定模块1240连接到线提取模块1220和特征量计算模块130。采用确定模块1240基于特征量132来确定图像是否被采用为线,并且输出线/无效信号1242。即,输出表示线的信息来表示采用,而输出无效信号⊥来表示不采用。The adoption determination module 1240 is connected to the line extraction module 1220 and the feature quantity calculation module 130 . The adoption determination module 1240 determines whether an image is adopted as a line based on the feature amount 132 , and outputs a line/invalid signal 1242 . That is, the information representing the line is output to indicate adoption, and the invalid signal ⊥ is output to indicate not to adopt.
图13是根据第三实施例的示例构造的概念模块构造视图。Fig. 13 is a conceptual block configuration view according to an example configuration of the third embodiment.
第三实施例的构造包括图像接收模块1210、线提取模块1320、观察失败分析模块1330、特征量计算模块130和采用确定模块1240。观察失败分析模块1330可以与线提取模块1320集成。即,对于线提取,在合并线元素的过程中,每当获得线元素时,都可以按顺序执行观察失败分析。与第二实施例的图12的示例相比较,不需要重复的观察失败分析,其可以导致处理速度的增加。此外,由于不包括导致观察失败的不必要的线元素,因此可以提高存储器效率。The configuration of the third embodiment includes an image reception module 1210 , a line extraction module 1320 , an observation failure analysis module 1330 , a feature amount calculation module 130 , and an adoption determination module 1240 . The observation failure analysis module 1330 may be integrated with the line extraction module 1320 . That is, for line extraction, in the process of merging line elements, observation failure analysis can be performed sequentially whenever a line element is obtained. Compared with the example of FIG. 12 of the second embodiment, repeated observation failure analysis is not required, which can lead to an increase in processing speed. In addition, memory efficiency can be improved since unnecessary line elements that cause observation failures are not included.
图像接收模块1210连接到线提取模块1320。图像接收模块1210接收图像1208并且将图像1212传输到线提取模块1320。The image receiving module 1210 is connected to the line extracting module 1320 . Image receiving module 1210 receives image 1208 and transmits image 1212 to line extraction module 1320 .
线提取模块1320连接到图像接收模块1210、观察失败分析模块1330、特征量计算模块130和采用确定模块1240。观察失败分析模块1330连接到线提取模块1320。在线提取模块1320中合并线元素的过程中,每当从图像1212获得线元素1322时,观察失败分析模块1330按顺序执行上述观察失败分析处理。另外,线提取模块1320从观察失败分析模块1330接收观察失败信息1332。最后,如果图像1212中没有线,则线提取模块1320输出无效信号1326,并且如果存在可能为线的图像,则将线信息和观察失败信息1324传输至特征量计算模块130和采用确定模块1240。The line extraction module 1320 is connected to the image reception module 1210 , the observation failure analysis module 1330 , the feature amount calculation module 130 , and the adoption determination module 1240 . The observation failure analysis module 1330 is connected to the line extraction module 1320 . In the process of merging line elements in the line extraction module 1320 , the observation failure analysis module 1330 sequentially executes the above-described observation failure analysis processing every time a line element 1322 is obtained from the image 1212 . In addition, the line extraction module 1320 receives observation failure information 1332 from the observation failure analysis module 1330 . Finally, if there is no line in the image 1212, the line extraction module 1320 outputs an invalid signal 1326, and if there is an image that may be a line, the line information and observation failure information 1324 are transmitted to the feature calculation module 130 and the adoption determination module 1240.
特征量计算模块130连接到线提取模块1320和采用确定模块1240。特征量计算模块130从线提取模块1320接收线信息和观察失败信息1324,并且计算该图像的特征量。The feature amount calculation module 130 is connected to the line extraction module 1320 and the adoption determination module 1240 . The feature quantity calculation module 130 receives the line information and the observation failure information 1324 from the line extraction module 1320, and calculates the feature quantity of the image.
采用确定模块1240连接到线提取模块1320和特征量计算模块130。采用确定模块1240从线提取模块1320接收线信息和观察失败信息1324,以及从特征量计算模块130接收特征量132,并且输出线/无效信号1242。The adoption determination module 1240 is connected to the line extraction module 1320 and the feature quantity calculation module 130 . The adoption determination module 1240 receives line information and observation failure information 1324 from the line extraction module 1320 and the feature quantity 132 from the feature quantity calculation module 130 , and outputs a line/invalid signal 1242 .
图14是根据第四实施例的示例构造的概念模块构造视图。Fig. 14 is a conceptual block configuration view according to an example configuration of the fourth embodiment.
第四实施例的构造包括图像接收模块1410、字符串提取模块1420、图像处理模块1430、字符识别模块1440和输出模块1450。The configuration of the fourth embodiment includes an image reception module 1410 , a character string extraction module 1420 , an image processing module 1430 , a character recognition module 1440 and an output module 1450 .
图像接收模块1410连接到字符串提取模块1420。图像接收模块1410接收包括字符串的图像。该图像可以包括下划线、删除线等。The image receiving module 1410 is connected to a character string extracting module 1420 . The image receiving module 1410 receives an image including a character string. The image may include underlining, strikethrough, etc.
字符串提取模块1420连接到图像接收模块1410和图像处理模块1430。字符串提取模块1420从图像接收模块1410所接收到的图像中提取字符串,并且基于字符串的方向来提取可能为线的图像。字符串提取可以使用现有技术。例如,可以获得竖直和水平方向上黑色像素的直方图,并且可以从直方图的分布中提取竖写和横写的方向。将字符串的方向作为线方向来提取可能为线的图像。The character string extraction module 1420 is connected to the image receiving module 1410 and the image processing module 1430 . The character string extracting module 1420 extracts a character string from the image received by the image receiving module 1410, and extracts an image that may be a line based on the direction of the character string. String extraction can use existing techniques. For example, a histogram of black pixels in vertical and horizontal directions can be obtained, and the directions of vertical and horizontal writing can be extracted from the distribution of the histograms. Extract possible line images with the orientation of the string as the line direction.
图像处理模块1430连接到字符串提取模块1420和字符识别模块1440,并且对应于第一实施例的图像处理设备、第二实施例的图像处理设备、和第三实施例的图像处理设备中的一个。即,确定字符串提取模块1420所提取的图像(可能为线)是否为线。The image processing module 1430 is connected to the character string extraction module 1420 and the character recognition module 1440, and corresponds to one of the image processing device of the first embodiment, the image processing device of the second embodiment, and the image processing device of the third embodiment . That is, it is determined whether the image (possibly a line) extracted by the character string extraction module 1420 is a line.
字符识别模块1440连接到图像处理模块1430和输出模块1450。字符识别模块1440擦除图像中被图像处理模块1430确定为线的图像,并且仅识别字符图像。The character recognition module 1440 is connected to the image processing module 1430 and the output module 1450 . The character recognition module 1440 erases images determined to be lines by the image processing module 1430 in the image, and recognizes only character images.
输出模块1450连接到字符识别模块1440,并且输出字符识别模块1440的字符识别结果。The output module 1450 is connected to the character recognition module 1440 and outputs a character recognition result of the character recognition module 1440 .
图15是根据第五实施例的示例构造的概念模块构造视图。Fig. 15 is a conceptual block configuration view according to an example configuration of the fifth embodiment.
第五实施例的构造包括图像接收模块1510、框线方向指定模块1520、图像处理模块1530、业务表格处理模块1540和输出模块1550。The configuration of the fifth embodiment includes an image receiving module 1510 , a frame line direction specifying module 1520 , an image processing module 1530 , a business form processing module 1540 and an output module 1550 .
图像接收模块1510连接到框线方向指定模块1520,并且接收包括框线的图像,例如,业务表格图像。The image receiving module 1510 is connected to the frame line direction specifying module 1520, and receives an image including a frame line, for example, a business form image.
框线方向指定模块1520连接到图像接收模块1510和图像处理模块1530,并且基于框线的方向从图像接收模块1510所接收到的图像中提取可能为线的图像。例如,框线方向指定模块1520指定并提取竖直和水平方向的线(主要用于业务表格中),并且提取可能为线的图像。The frame line direction specifying module 1520 is connected to the image receiving module 1510 and the image processing module 1530, and extracts images that may be lines from images received by the image receiving module 1510 based on the direction of the frame line. For example, the frame line direction specifying module 1520 specifies and extracts vertical and horizontal lines (mainly used in business tables), and extracts images that may be lines.
图像处理模块1530连接到框线方向指定模块1520和业务表格处理模块1540,并且对应于第一实施例的图像处理设备、第二实施例的图像处理设备和第三实施例的图像处理设备中的一个。即,确定框线方向指定模块1520所提取的可能为线的图像是否为线。The image processing module 1530 is connected to the frame line direction specifying module 1520 and the business form processing module 1540, and corresponds to the image processing device of the first embodiment, the image processing device of the second embodiment, and the image processing device of the third embodiment One. That is, it is determined whether the image that may be a line extracted by the frame line direction specifying module 1520 is a line.
业务表格处理模块1540连接到图像处理模块1530和输出模块1550。业务表格处理模块1540根据图像中被图像处理模块1530确定为线的线图像的位置来确定业务表格中的列,并且识别列中的字符图像,从而使所确定的列与字符识别结果相关联。The business form processing module 1540 is connected to the image processing module 1530 and the output module 1550 . The business form processing module 1540 determines the columns in the business form according to the positions of the line images determined as lines in the image by the image processing module 1530 , and recognizes the character images in the columns, thereby associating the determined columns with the character recognition results.
输出模块1550连接到业务表格处理模块1540,并且输出业务表格处理模块1540的业务表格处理结果。The output module 1550 is connected to the business form processing module 1540 and outputs the business form processing result of the business form processing module 1540 .
现在将参考图16来描述该实施例的图像处理设备的硬件构造的示例。图16中所示的硬件构造由例如个人计算机(PC)等构造,包括诸如扫描仪等的数据读取单元1617、诸如打印机等的数据输出单元1618。An example of the hardware configuration of the image processing apparatus of this embodiment will now be described with reference to FIG. 16 . The hardware configuration shown in FIG. 16 is configured by, for example, a personal computer (PC) or the like, and includes a data reading unit 1617 such as a scanner or the like, a data output unit 1618 such as a printer or the like.
中央处理单元(CPU)1601是根据计算机程序执行处理的控制单元,计算机程序描述上述实施例中所描述的各种模块(例如,线信息接收模块110、观察失败分析模块120、特征量计算模块130、采用确定模块140、图像接收模块1210、线提取模块1220、采用确定模块1240、线提取模块1320、观察失败分析模块1330、字符串提取模块1420、图像处理模块1430、字符识别模块1440、框线方向指定模块1520、图像处理模块1530、业务表格处理模块1540等)的执行顺序。A central processing unit (CPU) 1601 is a control unit that executes processing according to a computer program that describes various modules (for example, the line information receiving module 110, the observation failure analysis module 120, the feature amount calculation module 130) described in the above embodiments. , use determination module 140, image receiving module 1210, line extraction module 1220, use determination module 1240, line extraction module 1320, observation failure analysis module 1330, character string extraction module 1420, image processing module 1430, character recognition module 1440, frame line Direction designation module 1520, image processing module 1530, business form processing module 1540, etc.) execution sequence.
只读存储器(ROM)1602存储CPU 1601所使用的程序、操作参数等。随机存取存储器(RAM)1603存储用于由CPU 1601执行的程序、为了执行而适当改变的参数等。这些存储器通过主机总线(诸如CPU总线等)1604互联。A read only memory (ROM) 1602 stores programs used by the CPU 1601 , operating parameters, and the like. A random access memory (RAM) 1603 stores programs for execution by the CPU 1601 , parameters appropriately changed for execution, and the like. These memories are interconnected through a host bus (such as a CPU bus, etc.) 1604 .
主机总线1604通过桥接器1605连接到诸如外部设备互联/接口(PCI)总线等的外部总线1606。The host bus 1604 is connected to an external bus 1606 such as a Peripheral Component Interconnect/Interface (PCI) bus or the like through a bridge 1605 .
诸如键盘1608、鼠标等之类的指示(point)装置1609为由操作员操纵的输入装置。诸如液晶显示设备、阴极射线管(CRT)等的显示器1610将各种信息显示为文本或图像信息。A pointing (point) device 1609 such as a keyboard 1608, a mouse, etc. is an input device manipulated by an operator. A display 1610 such as a liquid crystal display device, a cathode ray tube (CRT), or the like displays various information as text or image information.
硬盘驱动器(HDD)1611包括硬盘,并且驱动硬盘以记录或再现由CPU 1601执行的程序或信息。硬盘存储目标图像、指示线元素的信息、确定结果等。另外,硬盘存储诸如数据处理程序之类的各种计算机程序。A hard disk drive (HDD) 1611 includes a hard disk, and drives the hard disk to record or reproduce programs or information executed by the CPU 1601 . The hard disk stores target images, information indicating line elements, determination results, and the like. In addition, the hard disk stores various computer programs such as data processing programs.
驱动器1612读取安装在其上的可移动记录介质(诸如磁盘、光盘、磁光盘、半导体存储器等)1613中所记录的数据或程序,并且通过接口1607、外部总线1606、桥接器1605和主机总线1604将所读取的数据或程序提供给RAM 1603。可移动记录介质1613还可以像硬盘一样被用作数据记录区域。The drive 1612 reads data or programs recorded in a removable recording medium (such as a magnetic disk, an optical disk, a magneto-optical disk, a semiconductor memory, etc.) 1613 mounted thereon, and transmits data through an interface 1607, an external bus 1606, a bridge 1605, and a host bus. 1604 provides the read data or program to RAM 1603 . The removable recording medium 1613 can also be used as a data recording area like a hard disk.
连接端口1614是连接到外部连接装置1615的端口,并且包括诸如USB、IEEE 1394等的连接单元。连接端口1614还通过接口1607、外部总线1606、桥接器1605、主机总线1604等连接到CPU 1601等。通信单元1616连接到网络,用于与外部进行数据通信。数据读取单元1617例如为扫描仪,用于读取文档。数据输出单元1618例如为打印机,用于输出文档数据。The connection port 1614 is a port connected to an external connection device 1615, and includes connection units such as USB, IEEE 1394, and the like. The connection port 1614 is also connected to the CPU 1601 and the like through the interface 1607, the external bus 1606, the bridge 1605, the host bus 1604, and the like. The communication unit 1616 is connected to a network for data communication with the outside. The data reading unit 1617 is, for example, a scanner for reading documents. The data output unit 1618 is, for example, a printer for outputting document data.
图16中所示的图像处理设备的硬件构造为构造的示例,并且该实施例不限于图16中所示的硬件构造,还可以具有任意其他构造,只要其可以执行本实施例中所描述的模块。例如,一些模块可以被构造为专用硬件(例如,ASIC(专用集成电路)等),一些模块可以在外部系统中,并且通过通信链路连接,此外,图16中所示的多个系统可以通过通信链路互联以进行合作。另外,所述硬件结构可以装配在复印机、传真机、扫描仪、打印机、多功能复印机(具有扫描仪、打印机、复印机和传真机等功能中的两种或更多种的图像处理设备)等中。The hardware configuration of the image processing apparatus shown in FIG. 16 is an example of configuration, and this embodiment is not limited to the hardware configuration shown in FIG. 16 , and may have any other configuration as long as it can execute the method described in this embodiment. module. For example, some modules can be constructed as dedicated hardware (for example, ASIC (Application Specific Integrated Circuit) etc.), some modules can be in external systems, and connected by communication links, in addition, a plurality of systems shown in Figure 16 can be connected by Communication links are interconnected for cooperation. In addition, the hardware configuration may be incorporated in a copier, a facsimile, a scanner, a printer, a multifunction copier (an image processing apparatus having two or more functions of a scanner, a printer, a copier, and a facsimile), etc. .
上述各种实施例可以组合(例如,包括将一个实施例中的模块添加到另一个实施例或者代替另一个实施例中的模块),或者可以将[背景技术]中所描述的技术用作各模块的处理内容。The various embodiments described above may be combined (for example, including adding a module in one embodiment to another embodiment or replacing a module in another embodiment), or the technology described in [Background Art] may be used as each The processing content of the module.
上述程序可以存储在记录介质中,或者可以通过通信方式来提供。该情况下,例如,上述程序可以被理解为“其上记录有程序的计算机可读记录介质”的发明。The above-mentioned program may be stored in a recording medium, or may be provided by means of communication. In this case, for example, the above-mentioned program can be understood as an invention of "the computer-readable recording medium on which the program is recorded".
“其上记录有程序的计算机可读记录介质”是指用于程序的安装、执行、分配等的其上记录有程序的计算机可读记录介质。The "computer-readable recording medium on which the program is recorded" refers to a computer-readable recording medium on which the program is recorded for installation, execution, distribution, and the like of the program.
例如,记录介质可以包括诸如“DVR-R、DVD-RW、DVD-RAM等”(其为DVD论坛规定的标准)以及“DVD+R、DVD+RW等”(其为DVD+RW规定的标准)之类的数字多功能光盘(DVD)、诸如只读存储器(CD-ROM)、可记录CD(CD-R)、可重写CD(CD-RW)等之类的压缩盘(CD)、蓝光光盘(注册商标)、磁光盘(MO)、软磁盘(FD)、磁带、硬盘、只读存储器(ROM)、电可擦除可编程只读存储器(EEPROM(注册商标))、闪存、随机访问存储器(RAM)等。For example, recording media may include media such as "DVR-R, DVD-RW, DVD-RAM, etc." (which are standards specified by the DVD Forum) and "DVD+R, DVD+RW, etc." ) such as Digital Versatile Disc (DVD), Compact Disc (CD) such as Read Only Memory (CD-ROM), Recordable CD (CD-R), Rewritable CD (CD-RW), etc., Blu-ray disc (registered trademark), magneto-optical disk (MO), floppy disk (FD), magnetic tape, hard disk, read-only memory (ROM), electrically erasable programmable read-only memory (EEPROM (registered trademark)), flash memory, random access Memory (RAM), etc.
程序或程序的一部分可以记录在用于存储和分配的记录介质中。另外,程序或程序的一部分可以通过通信的方式来传输,例如,通过诸如用于局域网(LAN)、城域网(MAN)、广域网(WAN)、因特网、内网、外网等或者其组合的有线网或无线网之类的传输介质,或者可以使用载波来承载。The program or a part of the program can be recorded in a recording medium for storage and distribution. In addition, the program or a part of the program may be transmitted by means of communication, for example, through a network such as a network for a local area network (LAN), a metropolitan area network (MAN), a wide area network (WAN), the Internet, an intranet, an extranet, etc., or a combination thereof. A transmission medium such as a wired network or a wireless network, or a carrier may be used to carry the data.
所述程序可以是另一程序的一部分,或者可以与不同的程序一起记录在记录介质中。另外,所述程序可以被分开并记录在多个记录介质中。另外,所述程序可以以任意形式(包括压缩、加密等)记录,只要其可以被再现。The program may be a part of another program, or may be recorded in a recording medium together with a different program. In addition, the program may be divided and recorded in a plurality of recording media. In addition, the program may be recorded in any form (including compression, encryption, etc.) as long as it can be reproduced.
以上本发明的示例实施例的描述仅用于说明和描述的目的,而不是穷尽或将本发明限制为所公开的精确形式。许多改变和变形对于本领域技术人员来说是显而易见的。实施例的选择和描述是为了能够更好地说明本发明的原理及其实际应用,从而使得本领域技术人员可以理解各种实施例的本发明以及适用于特定用途的各种变形。本发明的范围由所附权利要求及其等价物限定。The foregoing description of example embodiments of the present invention has been presented for purposes of illustration and description only, and is not intended to be exhaustive or to limit the invention to the precise forms disclosed. Many changes and modifications will be apparent to those skilled in the art. The selection and description of the embodiments are to better illustrate the principle of the present invention and its practical application, so that those skilled in the art can understand the present invention of various embodiments and various modifications suitable for specific purposes. The scope of the invention is defined by the appended claims and their equivalents.
Claims (6)
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP2010239389A JP5640645B2 (en) | 2010-10-26 | 2010-10-26 | Image processing apparatus and image processing program |
JP2010-239389 | 2010-10-26 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN102456139A CN102456139A (en) | 2012-05-16 |
CN102456139B true CN102456139B (en) | 2016-12-14 |
Family
ID=
Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101299240A (en) * | 2007-05-01 | 2008-11-05 | 夏普株式会社 | Image processing apparatus, image forming apparatus, image processing system, and image processing method |
Patent Citations (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN101299240A (en) * | 2007-05-01 | 2008-11-05 | 夏普株式会社 | Image processing apparatus, image forming apparatus, image processing system, and image processing method |
Non-Patent Citations (2)
Title |
---|
Kalman Filter Contributions Towards Document Segmentation;Ivan Leplumey等;《IEEE Proceedings of the Third International Conference on Document Analysis and Recognition》;19950816;第2卷;第765-769页 * |
Text Line Extraction in Handwritten Document with Kalman Filter Applied on Low Resolution Image;Aurélie Lemaitre等;《IEEE Proceedings of the Second International Conference on Document Image Analysis for Libraries》;20060428;第38-45页 * |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN102479332B (en) | Image processing apparatus and image processing method | |
US20100008585A1 (en) | Image processing apparatus, image processing method, computer-readable medium and computer data signal | |
US8391607B2 (en) | Image processor and computer readable medium | |
US20150213332A1 (en) | Image processing apparatus, non-transitory computer readable medium, and image processing method | |
US20100007912A1 (en) | Image processing apparatus, image processing method, computer-readable medium and computer data signal | |
KR101248449B1 (en) | Information processor, information processing method, and computer readable medium | |
JP5365440B2 (en) | Image processing apparatus and image processing program | |
US8805076B2 (en) | Image processing apparatus, image processing method and computer readable medium | |
US8391606B2 (en) | Image processing device, image processing method, and computer readable medium | |
US8977044B2 (en) | Image processing apparatus for area separation of images, image processing method, and computer readable medium | |
CN102456139B (en) | Image processing equipment and image processing method | |
JP4780184B2 (en) | Image processing apparatus and image processing program | |
JP2010079507A (en) | Image processor and image processing program | |
US8620090B2 (en) | Line determination apparatus, line determination method, and computer-readable medium | |
CN102486828B (en) | Image processing equipment and image processing method | |
JP4793429B2 (en) | Image processing apparatus and image processing program | |
JP5489894B2 (en) | Image processing apparatus and image processing program | |
JP5262778B2 (en) | Image processing apparatus and image processing program | |
CN103198291B (en) | Information processor | |
US20150043832A1 (en) | Information processing apparatus, information processing method, and computer readable medium | |
JP5821648B2 (en) | Information processing apparatus and information processing program | |
US8923635B2 (en) | Image processing apparatus and image processing program | |
JP2016009235A (en) | Information processing apparatus and information processing program | |
JP6676955B2 (en) | Image processing apparatus and image processing program | |
JP6281309B2 (en) | Image processing apparatus and image processing program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
CP01 | Change in the name or title of a patent holder |
Address after: Tokyo Patentee after: Fuji film business innovation Co.,Ltd. Address before: Tokyo Patentee before: Fuji Xerox Co.,Ltd. |