JP2022009474A

JP2022009474A - System and method for detecting lines in vision system

Info

Publication number: JP2022009474A
Application number: JP2021174197A
Authority: JP
Inventors: ワン，レイ; Lei Wang; アナンド，ヴィヴェク; Anand Vivek; ディー．ヤコブソン，ロウェル; D Jacobson Lowell
Original assignee: Cognex Corp
Current assignee: Cognex Corp
Priority date: 2018-06-06
Filing date: 2021-10-25
Publication date: 2022-01-14
Anticipated expiration: 2039-06-03
Also published as: CN110570439A; JP2024016287A; KR20190138755A; JP7393106B2; KR20210032374A; CN118521604A; CN110570439B; DE102019115224A1; JP2019220163A; KR20240058827A

Abstract

PROBLEM TO BE SOLVED: To provide a vision system tool for detecting line features in an acquired image, which efficiently and accurately identifies multiple lines.

SOLUTION: Provided are a system and a method for extracting edge points from an acquired image and detecting lines. Once lines are identified, it is trained in such a manner that a user associates predetermined (e.g., text) labels with the lines. These labels are used to define neural net classifiers. The neural net operates at runtime to identify and score lines in a runtime image detected by using a line detection process. The detected lines are displayed to the user with labels and associated probability score maps based on the neural net results. Unlabeled lines are generally regarded as having low scores, and are identified by an interface as un-flagged or irrelevant.

SELECTED DRAWING: Figure 2

Description

関連出願
本出願は２０１５年２月１１日に出願された同時係属米国特許仮出願第６２／２４９９１８号「ビジョンシステムでラインを検出するためのシステム及び方法」の利益を主張するものであり、その教示内容は参照により本明細書に組み込まれる。 Related Applications This application claims the benefits of Simultaneously Pending U.S. Patent Application No. 62/24918, "Systems and Methods for Detecting Lines in Vision Systems," filed February 11, 2015. The teachings are incorporated herein by reference.

記述分野
本発明はマシンビジョンシステム、より具体的には取得した画像内のライン特徴を検出するビジョンシステムツールに関する。 Description Field The present invention relates to a machine vision system, more specifically a vision system tool for detecting line features in an acquired image.

マシンビジョンシステム（本明細書中では単に「ビジョンシステム」とも呼ぶ）は、製造、物流及び産業における多様なタスクに使用されている。そのようなタスクは表面及び部品の検査、組立工程における対象物のアライメント、パターン及びＩＤコードの読取り、及び視覚データが取得されてその後のプロセスに使用するために解釈されるその他の操作を含んでいてもよい。ビジョンシステムは典型的には１台以上のカメラを用いて興味のあるオブジェクト又は対象を含むシーンの画像を取得する。オブジェクト／対象は静止し、又は相対運動をしてよい。運動はロボットで部品を操作する場合のように、ビジョンシステムによって得られる情報で制御することもできる。 Machine vision systems (also referred to herein simply as "vision systems") are used for a variety of tasks in manufacturing, logistics and industry. Such tasks include inspection of surfaces and parts, alignment of objects in the assembly process, reading of patterns and ID codes, and other operations in which visual data is acquired and interpreted for use in subsequent processes. You may. A vision system typically uses one or more cameras to capture an image of a scene containing an object or object of interest. The object / object may be stationary or in relative motion. Motion can also be controlled by the information obtained by the vision system, as in the case of manipulating parts with a robot.

ビジョンシステムの共通のタスクは、画像内のライン特徴を検出して特徴付けることである。そのようなライン特徴を特定して解析するために多様なツールが用いられる。典型的には、これらのツールは画像の部分で生じる明確なコントラスト差に依拠している。このコントラスト差、例えばキャリパツールを用いて解析して、コントラスト差のある画像内の個々のポイントがライン状の特徴に集合できるか決定する。できると決定した場合は、画像内でラインが特定される。特に、エッジポイントを検出するツールと、ラインをポイントにフィッティングするツールは互いに独立に動作する。これにより処理オーバーヘッドが増えて信頼性が低下する。画像が複数のラインを含む場合、そのようなツールはそれらを正確に特定する能力が制限されよう。更に、画像内のシングルラインを検出するために設計されたツールは、画像内で類似の向きと極性をもつ複数のラインが密集している場合は使用するのに問題があろう。 A common task of vision systems is to detect and characterize line features in images. Various tools are used to identify and analyze such line features. Typically, these tools rely on the clear contrast differences that occur in the parts of the image. This contrast difference, eg, a caliper tool, is analyzed to determine if individual points in the image with the contrast difference can be aggregated into line-like features. If it is determined that it can be done, the line is identified in the image. In particular, the tool for detecting edge points and the tool for fitting lines to points operate independently of each other. This increases the processing overhead and reduces reliability. If the image contains multiple lines, such tools will have limited ability to pinpoint them. In addition, tools designed to detect single lines in an image may be problematic to use if the image is densely populated with multiple lines of similar orientation and polarity.

別の難点は、取得した画像内でオブジェクトのラインが時々隠れているか不明瞭であり得ることである。ユーザは検出されたラインの特定に関して確信が持てず、そのようなラインを個別に識別できるメカニズムは洗練されたルールとスクリプトの作成を含む可能性があり、それはビジョンシステムのセットアップとトレーニングタスクの時間とコストを増す。 Another difficulty is that the lines of the object can sometimes be hidden or obscured in the captured image. Users are uncertain about identifying the detected lines, and mechanisms that can identify such lines individually may include sophisticated rules and scripting, which is the time for vision system setup and training tasks. And increase the cost.

本発明は先行技術の短所を、複数のラインを効率的且つ正確に特定して特徴付けることを可能にする、画像内のライン特徴を検出するためのシステム及び方法を提供することによって克服する。ラインが識別されると、ユーザは、そのようなラインに所定の（例えばテキスト）ラベルを関連付けるようにシステムをトレーニングすることができる。これらのラベル（本明細書では「タグ」とも呼ばれる）は、ニューラルネット分類子を定義するために使用できる。ニューラルネットはランタイムに動作して、ライン検出プロセスを使用して検出されたランタイム画像内のラインを識別してスコア付けする。検出されたラインは、ラベルと、ニューラルネット結果に基づく関連する確率スコアマップを付けてユーザに表示できる。ラベルが付いていないラインは、一般的に低スコアと見なされ、インタフェースによってフラグが立てられていないか又は関連性がないと識別される。 The present invention overcomes the shortcomings of the prior art by providing systems and methods for detecting line features in images that allow multiple lines to be efficiently and accurately identified and characterized. Once the lines are identified, the user can train the system to associate a given (eg, text) label with such lines. These labels (also referred to herein as "tags") can be used to define neural net classifiers. The neural network operates at runtime to identify and score lines in the detected runtime image using the line detection process. The detected lines can be displayed to the user with labels and associated probability score maps based on the neural net results. Unlabeled lines are generally considered low scores and are identified by the interface as unflagged or irrelevant.

例示的な実施形態では、取得された画像内でラインを検出するためのシステム及び方法が提供される。システム及び方法は、ビジョンシステムプロセッサと、このビジョンシステムプロセッサに関連するインタフェースとを含んでおり、ビジョンシステムプロセッサはオブジェクトのトレーニング画像内でライン検出プロセスによって特定された関連性のあるラインに関する個別ラベルの作成を可能にする。ランタイムライン検出プロセスは、取得した画像内のラインを特定し、ニューラルネットプロセスは、ラベルに基づいて分類子を使用して、ラベルに対するライン特徴の確率マップを決定する。ランタイム結果生成プロセスは、少なくとも１本の関連性のあるラインに対してラベルと確率スコアを提供する。例示的に、ランタイム結果生成プロセスは関連性のないラインに対して確率スコアを提供し、及び／又はラインを強調表示するインタフェースを含み、強調表示されたラインに関連付けられた確率スコアを提供する。確率スコアマップは、サイズが取得した画像と類似している可能性がある。ライン検出プロセスは、ライン特徴を含むシーンの画像データを受け取るプロセッサを有し、エッジポイント抽出装置を備えており、このエッジポイント抽出装置は、（ａ）画像データから勾配ベクトル場を計算し、（ｂ）勾配ベクトル場を複数の勾配投影サブ領域に投影し、及び（ｃ）投影された勾配データに基づき勾配投影サブ領域のそれぞれにおいて複数のエッジポイントを検出する。プロセッサは、画像から抽出されたエッジポイントと一致する複数のラインを生成するラインファインダも含む。例示的に、ラインファインダはＲＡＮＳＡＣに基づくプロセスを操作してインライアエッジポイントを新しいラインにフィッティングし、事前に定義されたラインを基準にしてアウトライアエッジポイントから反復的にラインを定義することを含む。勾配場の投影は、１以上のライン特徴の予想される向きに応じて設定された方向に沿って方向付けることができ、及び／又はガウスカーネルに基づいて粒度を定義できる。エッジポイント抽出装置は、勾配投影サブ領域の各々で複数の勾配強度最大値を検出するように構成できる。勾配強度最大値はそれぞれ複数のエッジポイントの幾つかとして識別され、位置ベクトルと勾配ベクトルによって記述される。ラインファインダは、少なくとも１つのエッジポイントの少なくとも１本の候補ラインからの距離、及び少なくとも１つのエッジポイントの勾配方向と少なくとも１本の候補ラインの法線方向との間の角度差に基づくメトリックを計算することによって、複数の抽出されたエッジポイントの少なくとも１つのエッジポイントと、検出された複数のラインの少なくとも１本の候補ラインとの間の一致を決定するように構成されてもよい。例示的に、画像データは複数のカメラから取得された複数の画像からのデータを含む。これにより画像は共通の座標空間に変換される。 In an exemplary embodiment, a system and method for detecting a line in an acquired image is provided. The system and method includes a vision system processor and an interface associated with this vision system processor, which is a separate label for the relevant line identified by the line detection process in the training image of the object. Allows creation. The run-time line detection process identifies the lines in the acquired image, and the neural net process uses a classifier based on the label to determine a probability map of the line features for the label. The run-time result generation process provides labels and probability scores for at least one relevant line. Illustratively, the runtime result generation process provides a probability score for irrelevant lines and / or includes an interface for highlighting the line, providing the probability score associated with the highlighted line. The probability score map may be similar in size to the acquired image. The line detection process has a processor that receives image data of the scene including line features and is equipped with an edge point extractor, which (a) calculates a gradient vector field from the image data and (a) b) the gradient vector field is projected onto a plurality of gradient projection subregions, and (c) multiple edge points are detected in each of the gradient projection subregions based on the projected gradient data. The processor also includes a line finder that produces multiple lines that match the edge points extracted from the image. Illustratively, the linefinder operates a RANSAC-based process to fit an inlier edge point to a new line and iteratively define a line from the outlier edge point relative to a predefined line. include. The projection of the gradient field can be oriented along the direction set according to the expected orientation of one or more line features and / or the grain size can be defined based on the Gauss kernel. The edge point extractor can be configured to detect multiple gradient intensity maximums in each of the gradient projection subregions. The maximum gradient intensity is identified as some of a plurality of edge points, respectively, and is described by a position vector and a gradient vector. The line finder provides a metric based on the distance of at least one edge point from at least one candidate line and the angle difference between the gradient direction of at least one edge point and the normal direction of at least one candidate line. By calculation, it may be configured to determine a match between at least one edge point of the plurality of extracted edge points and at least one candidate line of the plurality of detected lines. Illustratively, image data includes data from multiple images acquired from multiple cameras. This transforms the image into a common coordinate space.

例示的な実施態様によれば、１以上のカメラに基づいて取得された画像内のライン特徴を検出するシステムが提供される。該システム及び方法はビジョンシステムプロセッサ及び該ビジョンシステムプロセッサに関連付けられたインターフェイスを備え、それによって、オブジェクトのトレーニング画像内でライン検出プロセスによって特定された関連性のあるラインに関する個別ラベルの作成を可能になる。実行中ライン検出プロセスによって取得された画像内のラインを位置付け、該ライン検出プロセスによって位置付けられたラインに基づくインターフェイスへのラベルを生成する統計的分類子やＫ－ＮＮ分類子の位置を特定する。 According to an exemplary embodiment, there is provided a system for detecting line features in an image acquired based on one or more cameras. The system and method comprises a vision system processor and an interface associated with the vision system processor, which allows the creation of individual labels for the relevant lines identified by the line detection process in the training image of the object. Become. Positions the lines in the image acquired by the running line detection process and locates the statistical or K-NN classifiers that generate labels for the interface based on the lines positioned by the line detection process.

以下に本発明を図面を参照して説明する。 The present invention will be described below with reference to the drawings.

例示的な実施形態による、複数のエッジ特徴を含むオブジェクトの画像を取得する模範的なビジョンシステム構成及びエッジ検出ツール／モジュール及び関連するラベルインタフェースプロセッサ／モジュールを含むビジョンシステムプロセッサの図である。FIG. 6 is a diagram of an exemplary vision system configuration and vision system processor including an edge detection tool / module and associated label interface processor / module to acquire an image of an object containing a plurality of edge features, according to an exemplary embodiment.

例示的な実施形態による、取得した画像からエッジポイントを抽出してラインを検出するためのシステム及び方法の全体を示す図である。It is a figure which shows the whole system and method for extracting an edge point from the acquired image and detecting a line by an exemplary embodiment.

図２のシステム及び方法による、エッジポイント抽出手順のフローチャートである。It is a flowchart of the edge point extraction procedure by the system and method of FIG.

図３のエッジポイント抽出手順の一部である、エッジ特徴を含む画像の区域に場を投影する図である。FIG. 3 is a diagram for projecting a field onto an area of an image including edge features, which is part of the edge point extraction procedure of FIG.

図３のエッジポイント抽出手順で使用する画像を平滑化するために、ガウスカーネルを画像に適用することを示す図である。It is a figure which shows that the Gauss kernel is applied to the image in order to smooth the image used in the edge point extraction procedure of FIG.

図３のエッジポイント抽出手順で使用する投影を平滑化するためのガウスカーネルの適用を含む場の投影の図である。FIG. 3 is a diagram of a field projection that includes the application of a Gaussian kernel to smooth the projection used in the edge point extraction procedure of FIG.

ガウスカーネルの適用とエッジポイントに対する絶対及び正規化したコントラスト閾値の計算を含む、図３のエッジポイント抽出手順の概観を示す図である。FIG. 3 is an overview of the edge point extraction procedure of FIG. 3, including application of the Gaussian kernel and calculation of absolute and normalized contrast thresholds for edge points.

十分な絶対コントラスト閾値と正規化したコントラスト閾値を有するエッジポイントに対して適格化されたコントラストの区域を示す図である。It is a figure which shows the area of the contrast qualified for the edge point which has a sufficient absolute contrast threshold and the normalized contrast threshold.

例示的な実施形態による、模範的なＲＡＮＳＡＣを用いる、図３で検出されたエッジポイントに基づくライン検出手順のフローチャートである。FIG. 3 is a flow chart of a line detection procedure based on the edge points detected in FIG. 3 using an exemplary RANSAC according to an exemplary embodiment.

密集した平行線特徴に対するエッジポイントの不正確なアライメントと正確なアライメントを示す図である。It is a figure which shows the inaccurate alignment and the accurate alignment of an edge point with respect to a dense parallel line feature. 密集した平行線特徴に対するエッジポイントの不正確なアライメントと正確なアライメントを示す図である。It is a figure which shows the inaccurate alignment and the accurate alignment of an edge point with respect to a dense parallel line feature.

それぞれ例示的なシステム及び方法のラインファインダによって解像され得る、交差線特徴に対するエッジポイントの正確なアライメントと不正確なアライメントエッジポイントを示す図である。It is a figure which shows the accurate alignment and the inaccurate alignment edge point of the edge point with respect to the crossing line feature which can be resolved by the line finder of an exemplary system and method respectively. それぞれ例示的なシステム及び方法のラインファインダによって解像され得る、交差線特徴に対するエッジポイントの正確なアライメントと不正確なアライメントエッジポイントを示す図である。It is a figure which shows the accurate alignment and the inaccurate alignment edge point of the edge point with respect to the crossing line feature which can be resolved by the line finder of an exemplary system and method respectively.

例示的なシステム及び方法のラインファインダによって解像され得る、明から暗極性、暗から明極性、明から暗若しくは暗から明極性、又は混合極性を示す図である。FIG. 6 shows light to dark polarities, dark to bright polarities, bright to dark or dark to bright polarities, or mixed polarities that can be resolved by linefinders of exemplary systems and methods. 例示的なシステム及び方法のラインファインダによって解像され得る、明から暗極性、暗から明極性、明から暗若しくは暗から明極性、又は混合極性を示す図である。FIG. 6 shows light to dark polarities, dark to bright polarities, bright to dark or dark to bright polarities, or mixed polarities that can be resolved by linefinders of exemplary systems and methods. 例示的なシステム及び方法のラインファインダによって解像され得る、明から暗極性、暗から明極性、明から暗若しくは暗から明極性、又は混合極性を示す図である。FIG. 6 shows light to dark polarities, dark to bright polarities, bright to dark or dark to bright polarities, or mixed polarities that can be resolved by linefinders of exemplary systems and methods. 例示的なシステム及び方法のラインファインダによって解像され得る、明から暗極性、暗から明極性、明から暗若しくは暗から明極性、又は混合極性を示す図である。FIG. 6 shows light to dark polarities, dark to bright polarities, bright to dark or dark to bright polarities, or mixed polarities that can be resolved by linefinders of exemplary systems and methods.

ユーザ定義マスクを考慮して検出されたラインに対するカバレージスコアの修正を示す図である。It is a figure which shows the correction of the coverage score for the line detected in consideration of a user-defined mask.

オブジェクト画像内の関心のあるラインを参照するラベル／タグを含むインタフェースを使用してラインファインダをトレーニングするための手順を示すフローチャートである。It is a flowchart showing a procedure for training a line finder using an interface including a label / tag that refers to a line of interest in an object image.

図１９のトレーニング手順を実行するために使用されるダイアログボックスと、関心のあるラインを有するトレーニングオブジェクトの画像を含む、模範的ユーザインタフェースディスプレイを示す図である。FIG. 19 illustrates an exemplary user interface display including a dialog box used to perform the training procedure of FIG. 19 and an image of a training object having a line of interest.

ラベルの関心のあるラインへの割り当てと、検出されたラインに対するニューラルネット分類子ベースの確率スコアを含む、図１９に従ってトレーニングされたラインファインダを使用してラインを検出するためのランタイム手順のフローチャートである。In the flow chart of the run-time procedure for detecting a line using a line finder trained according to Figure 19, including assigning the label to the line of interest and a neural net classifier-based probability score for the detected line. be.

関連性のないラインに対するラベルと確率スコアを示す、図２１によるラインファインダランタイム手順に基づいて検出されたラインを有するオブジェクトの画像を含む、模範的ユーザインタフェースディスプレイを示す図である。FIG. 6 shows an exemplary user interface display containing an image of an object with lines detected based on the line finder runtime procedure according to FIG. 21, showing labels and probability scores for irrelevant lines.

関心のあるラインに対するラベルと確率スコアを示す、図２１によるラインファインダランタイム手順に基づいて検出されたラインを有するオブジェクトの画像を含む、模範的ユーザインタフェースディスプレイを示す図である。FIG. 6 shows an exemplary user interface display containing an image of an object with a line detected based on the line finder runtime procedure according to FIG. 21, showing labels and probability scores for the line of interest.

Ｉ．システムの概要 I. System overview

例示的な実施形態に従って使用できる模範的なビジョンシステム構成１００が、図１に示されている。システム１００は少なくとも１台のビジョンシステムカメラ１１０を含み、１台以上の追加の随意のカメラ１１２（仮想線で示す）。例示的なカメラ１１０、１１２は、画像センサ（又はイメージャ）Ｓと、画像フレームを取得してビジョンシステムプロセス（プロセッサ）１３０に伝送する関連エレクトロニクスを含み、ビジョンシステムプロセス（プロセッサ）１３０はスタンドアローンプロセッサ及び／又はコンピューティングデバイス１４０として例示され得る。カメラ１１０（及び１１２）は検査中のオブジェクト１５０を包含するシーンに焦点を当てた適当なレンズ／光学系１１６を含んでいてもよい。カメラ１１０（及び１１２）は、画像取得プロセスに従って作動する内部及び／又は外部照明器（図示せず）を含んでいてもよい。コンピューティングデバイス１４０は、例示的な実施形態に従って画像データを保存及び操作できる任意の容認可能なプロセッサベースのシステムであってよい。例えばコンピューティングデバイス１４０は、ＰＣ（図示の例）、サーバー、ラップトップ、タブレット、スマートフォン又は他の類似のデバイスを含んでいてもよい。コンピューティングデバイス１４０は、適当な端末装置、例えばカメラと相互接続するバスベースの画像キャプチャカードを含んでいてもよい。代替的な実施形態において、ビジョンプロセッサは部分的又は完全にカメラ本体それ自体の中に含まれることができ、画像データを共有及び処理する他のＰＣ、サーバー及び／又はカメラベースのプロセッサとネットワーキングできる。コンピューティングデバイス１４０は随意に適当なディスプレイ１４２を含み、これはビジョンシステムプロセス（プロセッサ）１３０内に設けられたビジョンシステムツール及びプロセッサ１３２に従って作動できる適当なグラフィカルユーザインタフェース（ＧＵＩ）をサポートすることができる。種々の実施形態でディスプレイは省くことができ及び／又はセットアップとサービス機能のためにのみ提供できることに留意されたい。ビジョンシステムツールは、オブジェクトの検査において使用することが認められる任意の容認可能なソフトウェア及び／又はハードウェアパッケージ、例えば米国マサチューセッツ州ネイティック市のコグネックス株式会社から市販されているものの一部であってよい。コンピューティングデバイスは、例えばキーボード１４４及びマウス１４６、並びにディスプレイ１４２内部のタッチスクリーンを含め関連するユーザインタフェース（ＵＩ）コンポーネントも含んでいてもよい。 An exemplary vision system configuration 100 that can be used according to an exemplary embodiment is shown in FIG. The system 100 includes at least one vision system camera 110 and one or more additional optional cameras 112 (indicated by virtual lines). Exemplary cameras 110, 112 include an image sensor (or imager) S and related electronics that acquire an image frame and transmit it to the vision system process (processor) 130, where the vision system process (processor) 130 is a stand-alone processor. And / or may be exemplified as a computing device 140. The camera 110 (and 112) may include a suitable lens / optical system 116 focused on the scene containing the object 150 under examination. Cameras 110 (and 112) may include internal and / or external illuminators (not shown) that operate according to the image acquisition process. The computing device 140 may be any acceptable processor-based system capable of storing and manipulating image data according to exemplary embodiments. For example, the computing device 140 may include a PC (illustrated example), a server, a laptop, a tablet, a smartphone or other similar device. The computing device 140 may include a suitable terminal device, such as a bus-based image capture card interconnected with a camera. In an alternative embodiment, the vision processor can be partially or completely contained within the camera body itself and can be networked with other PCs, servers and / or camera-based processors that share and process image data. .. The computing device 140 optionally includes a suitable display 142, which may support a suitable graphical user interface (GUI) that can operate according to the vision system tools and processor 132 provided within the vision system process (processor) 130. can. Note that in various embodiments the display can be omitted and / or provided only for setup and service functions. Vision system tools are part of any acceptable software and / or hardware package approved for use in the inspection of objects, such as those commercially available from Cognex Corporation, Natick, Mass., USA. good. The computing device may also include related user interface (UI) components, including, for example, a keyboard 144 and a mouse 146, as well as a touch screen inside the display 142.

カメラ１１０（及び１１２）は、シーン内部に位置するオブジェクト１５０の一部又は全部を撮像する。各カメラは光学軸ＯＡを定義し、それを中心に光学系１１６、焦点距離などに基づいて視野が確定される。オブジェクト１５０はそれぞれ異なる方向に配置された複数のエッジ１５２、１５４を含む。例えばオブジェクトのエッジはスマートフォン本体内に組み付けられたカバーガラスのエッジを含んでよい。例示的に、カメラはオブジェクト全体又は特定の箇所（例えばガラスが本体と交わるコーナ）を撮像できる。（共通）座標空間は、オブジェクト、１台のカメラ又は他の基準点（例えばオブジェクト１５０を載せている可動ステージ）を基準に確定できる。図示されているように、座標空間は軸１５８によって表される。これらの軸は例示的に直交するｘ軸、ｙ軸及びｚ軸、並びにｘ－ｙ面上のｚ軸を中心とする回Θｚを定義する。 The camera 110 (and 112) captures part or all of the object 150 located inside the scene. Each camera defines an optical axis OA, and the field of view is determined based on the optical system 116, the focal length, and the like around the optical axis OA. The object 150 includes a plurality of edges 152, 154, each arranged in a different direction. For example, the edge of the object may include the edge of the cover glass assembled inside the smartphone body. Illustratively, a camera can capture an entire object or a specific location (eg, a corner where glass intersects the body). The (common) coordinate space can be determined with reference to an object, one camera or another reference point (eg, a movable stage on which the object 150 is mounted). As shown, the coordinate space is represented by the axis 158. These axes exemplify the x-axis, y-axis and z-axis that are orthogonal to each other, and the times Θz centered on the z-axis on the xy plane.

例示的な実施形態に従い、ビジョンシステムプロセス１３０は集合的にビジョンシステムツール／プロセス１３２のセットを含む１以上のアプリケーション／プロセス（コンピューティングデバイス１４０上で実行）と相互作用する。これらのツールは、画像データを解像するために使用される多様な慣用的及び特殊なアプリケーション－例えば取得した画像データを所定の（例えば共通）座標系に変換するために使用できる多様な較正ツール及びアフィン変換ツールを含んでいてもよい。画像グレースケール強度データを所定の閾値に基づいて二値画像に転換するツールも含めることができる。同様に、隣接した画像ピクセル（及びサブピクセル）の間の強度の勾配（コントラスト）を解析するツールを設けることができる。 According to an exemplary embodiment, the vision system process 130 collectively interacts with one or more applications / processes (running on a computing device 140) that include a set of vision system tools / processes 132. These tools are a variety of idiomatic and specialized applications used to resolve image data-for example, a variety of calibration tools that can be used to transform acquired image data into a given (eg, common) coordinate system. And affine transformation tools may be included. A tool for converting image grayscale intensity data into a binary image based on a predetermined threshold can also be included. Similarly, tools can be provided to analyze the intensity gradient (contrast) between adjacent image pixels (and subpixels).

ビジョンシステムプロセス（プロセッサ）１３０は、例示的な実施形態に従い取得した画像内で複数のラインの位置を特定するライン検出プロセス、ツール又はモジュール１３を含む。ここで図２を参照する例示的な実施形態に従うライン検出手順２００の概観を図示する。手順２００は２つの主要部分からなる。入力画像２１０がプロセッサに提供される。図示されているように、画像は１対の交差するエッジ２１２及び２１４を含む。これらは上述したオブジェクト１５０のコーナ区域を表すことができる。エッジポイント抽出装置２２０は、入力画像２１０を処理して、それぞれエッジ２１２及び２１４に沿って存在するエッジポイント２３２及び２３４を含む候補エッジポイントのセット２３０を得る。エッジポイント２３２、２３４、及びそれらに関連するデータ（例えば以下に説明する強度勾配情報）が再帰的ラインファインダ２４０に提供され、これは選択されたエッジポイントで一連の反復プロセスを実行する。反復プロセスの目標は、他の検出されたエッジポイントを候補ライン特徴にフィッティングすることを試みることである。ライン検出プロセス２４０の結果、図示されているようにライン２５２及び２５４が検出される。これらの結果は情報を使用する他の下流のプロセス２６０－例えばアライメントプロセス、ロボット操作、検査、ＩＤ読取り、部品／表面検査などに提供され得る。 The vision system process (processor) 130 includes a line detection process, tool or module 13 that locates a plurality of lines in an image acquired according to an exemplary embodiment. Here, an overview of the line detection procedure 200 according to an exemplary embodiment with reference to FIG. 2 is illustrated. Procedure 200 consists of two main parts. The input image 210 is provided to the processor. As shown, the image contains a pair of intersecting edges 212 and 214. These can represent the corner areas of the object 150 described above. The edge point extractor 220 processes the input image 210 to obtain a set of candidate edge points 230 including edge points 232 and 234 existing along the edges 212 and 214, respectively. Edge points 232, 234 and related data (eg, intensity gradient information described below) are provided to the recursive line finder 240, which performs a series of iterative processes at the selected edge points. The goal of the iterative process is to try to fit other detected edge points to the candidate line features. As a result of the line detection process 240, lines 252 and 254 are detected as shown. These results may be provided for other downstream processes using information 260-eg alignment processes, robotic operations, inspections, ID reads, parts / surface inspections, etc.

ＩＩ．ライン検出プロセス（プロセッサ） II. Line detection process (processor)

一実施形態に従う、エッジポイントを抽出するための手順を説明した図３を参照する。検出されるべきエッジ特徴を有するオブジェクト又は表面を含んでいるシーンの１以上の画像が取得される（ステップ３１０）。画像は単一のカメラ又は複数のカメラによって抽出できる。いずれの場合も、ステップ３２０で画像ピクセルは、（随意に）適当な較正パラメータによって新しい座標空間及び／又は共通座標空間に変換され得る。このステップは以下に説明するように画像の平滑化も含んでいてもよい。複数のカメラがシーンの不連続区域を撮像する－例えばより大きいオブジェクトのコーナ区域に焦点を当てる－実施形態において、共通座標空間はカメラ視野の間の空の区域を占めることができる。以下に説明するように、そのような視野の間に延びるライン（例えば２つの検出されたコーナ区域を接続するオブジェクトエッジ）は、例示的な実施形態のシステム及び方法によって例示的な実施形態のシステム及び方法によって外挿できる。ステップ３３０で、ラインを検出するために必要とされるエッジポイントは、適当な座標空間においてエッジポイント抽出装置によって勾配場の投影を用いて画像から抽出される。勾配値は最初に各ピクセルについて計算されて、ｘ勾配成分とｙ勾配成分に対して画像を生成する。画像は更に勾配場を多数のキャリパ状区域に投影することによって処理される。強度値を投影する慣用的キャリパツールとは異なり、実施形態に従い勾配場を投影することによって、勾配の向きを保存でき、以下に説明するようにこれが後続のライン検出プロセスを容易にする。 Refer to FIG. 3, which describes a procedure for extracting edge points according to one embodiment. One or more images of the scene containing an object or surface with edge features to be detected are acquired (step 310). Images can be extracted by a single camera or multiple cameras. In either case, in step 320 the image pixels can be (optionally) transformed into a new coordinate space and / or a common coordinate space with the appropriate calibration parameters. This step may also include image smoothing as described below. In embodiments, a plurality of cameras capture a discontinuous area of the scene-for example focusing on a corner area of a larger object-in which a common coordinate space can occupy an empty area between the camera fields of view. As described below, a line extending between such fields of view (eg, an object edge connecting two detected corner areas) is an exemplary embodiment system and method. And can be extrapolated by method. In step 330, the edge points required to detect the line are extracted from the image using gradient field projection by an edge point extractor in a suitable coordinate space. The gradient value is first calculated for each pixel to produce an image for the x-gradient component and the y-gradient component. The image is further processed by projecting a gradient field onto a number of caliper-like areas. Unlike conventional caliper tools that project intensity values, by projecting a gradient field according to an embodiment, the orientation of the gradient can be preserved, which facilitates subsequent line detection processes as described below.

ステップ３４０で、図４の線図も参照して、候補エッジ特徴を含んでいる画像の部分（キャリパ状区域）４００は（複数の投影４１０、４２０、４３０によって表された）勾配場の投影を受け、サーチ方向（矢印ＳＤ）で（近似的に）予想されるエッジの向きにわたってサーチされ、投影は直交投影方向（矢印ＰＤ）で区域４００にわたって繰り返される。各投影（例えば投影４２０）に対してエッジは投影と関連した勾配場４４０における極大値として現れる。概して、エッジと関連した投影の内部の一連のエッジポイントは、エッジの延在方向に対して直角に強度勾配（ベクトル５５２、５５４）を示す。以下に説明するように、ユーザは予想されるラインの向きに基づいて投影方向を定義できる。代替として、これは既定のメカニズム又は他のメカニズム－例えば画像内の特徴の解析によって提供されてよい。 In step 340, also referring to the diagram of FIG. 4, the portion of the image (caliper-like area) 400 containing the candidate edge features is a projection of the gradient field (represented by multiple projections 410, 420, 430). It receives and is searched over the (approximately) expected edge orientation in the search direction (arrow SD), and the projection is repeated across the area 400 in the orthogonal projection direction (arrow PD). For each projection (eg, projection 420) the edges appear as maxima in the gradient field 440 associated with the projection. In general, the set of edge points inside the projection associated with the edge show an intensity gradient (vectors 552, 554) perpendicular to the extending direction of the edge. As described below, the user can define the projection direction based on the expected line orientation. Alternatively, this may be provided by a predetermined mechanism or other mechanism-eg, analysis of features in the image.

上述した勾配投影ステップには２つの粒度パラメータが包含されている。勾配場計算の前に、ユーザは異方性ガウスカーネルを用いて画像を平滑化することを選択できる。第１の粒度はこのガウス平滑化カーネルのサイズを決定する。図５の線図５００に示されているように、近似的なサイズのガウスカーネル（例えば大きい５１２、中位５１４、小さい５１６）を適用して画像２１０を平滑化する。それゆえ第１の粒度パラメータは、場計算の前に等方性ガウス平滑化カーネルのサイズを決定する。 The gradient projection step described above includes two particle size parameters. Prior to the gradient field calculation, the user can choose to smooth the image using an anisotropic Gaussian kernel. The first particle size determines the size of this Gaussian smoothing kernel. As shown in FIG. 500, a Gaussian kernel of approximately size (eg, large 512, medium 514, small 516) is applied to smooth the image 210. Therefore, the first particle size parameter determines the size of the isotropic Gaussian smoothing kernel prior to the field calculation.

これにより勾配場計算の後で、慣用的なキャリパツールにおける統一的な重み付けではなく、ガウス重み付けされた投影がプロセスによって実行される。従って第２の粒度パラメータは、場の投影中に使用される１次元（１Ｄ）ガウスカーネルのサイズを決定し、図６に示されているように区域６００はガウス平滑化カーネル６１０、６２０、６３０を受ける。典型的な操作の間、ユーザは（ＧＵＩを用いて）画像に重ね合わされたすべての抽出されたエッジを検証し、次いで検出されるべきラインに沿って抽出されたエッジの数が十分と思われるまで粒度とコントラスト閾値を調整する一方で、画像内のバックグラウンドノイズに起因するエッジの過度な数を回避する。換言すれば、このステップにより信号雑音比を画像特徴に対して最適化することが可能になる。この調整は、種々の実施形態でシステムにより既定値を用いて自動的に実行することもできる。ガウス重み付け関数は、（例えば）一様な重み付けを含む、投影を重み付けするための多様な方策の１つであることに留意されたい。 This causes the process to perform a Gauss-weighted projection after the gradient field calculation, rather than a uniform weighting in the conventional caliper tool. The second particle size parameter therefore determines the size of the one-dimensional (1D) Gaussian kernel used during field projection, where the area 600 is the Gaussian smoothing kernel 610, 620, 630 as shown in FIG. Receive. During a typical operation, the user validates all the extracted edges superimposed on the image (using the GUI), and then the number of extracted edges along the line to be detected seems to be sufficient. Adjust the grain size and contrast thresholds up to, while avoiding an excessive number of edges due to background noise in the image. In other words, this step makes it possible to optimize the signal-to-noise ratio for image features. This adjustment can also be performed automatically by the system using default values in various embodiments. Note that the Gauss weighting function is one of a variety of measures for weighting projections, including (eg) uniform weighting.

勾配場の抽出及び投影の全体の流れが、図７の線図７００に図示されている。２つの粒度パラメータ、等方性ガウスカーネル７１０及び１Ｄガウスカーネル７２０が、全体図７００の各半分に示されている。図示されているように、各々の取得された画像２１０は平滑化及びデシメーション７３０を受ける。次いで結果として生じる画像７４０は、上述したように勾配場の計算７５０を経て、２つの勾配画像７５２及び７５４を生成する。これらの勾配画像はｇ_ｘ及びｇ_ｙとも表現され、それぞれ共通座標空間における２本の直交軸を表す。２つの勾配画像に加えて、強度画像７５６は典型的には平滑化、デシメーション及びに投影プロセス７６０（１Ｄガウスカーネル７２０に基づいてガウス重み付けされた投影７７０を用いる）を受けることに留意されたい。なぜなら、処理された強度情報は、以下に説明する実施形態に従って正規化したコントラストを計算するためにも使用されるからである。その結果は、勾配画像７７２（ｇ_ｘ）、７７４（ｇ_ｙ）及び強度画像７７６の投影プロフィルである。 The overall flow of gradient field extraction and projection is illustrated in Figure 700 of FIG. Two particle size parameters, an isotropic Gaussian kernel 710 and a 1D Gaussian kernel 720, are shown in each half of Figure 700. As shown, each acquired image 210 undergoes smoothing and decimation 730. The resulting image 740 then produces two gradient images 752 and 754 through gradient field calculation 750 as described above. These gradient images are also represented as g _x and _gy , each representing two orthogonal axes in a common coordinate space. Note that in addition to the two gradient images, the intensity image 756 typically undergoes a smoothing, decimation and projection process 760 (using a Gauss-weighted projection 770 based on the 1D Gaussian kernel 720). This is because the processed intensity information is also used to calculate the normalized contrast according to the embodiments described below. The result is a projection profile of the gradient images 772 (g _x ), 774 ( _gy ) and the intensity image 776.

手順３００（図３）のステップ３５０も参照して、次にｘ勾配画像とｙ勾配画像の１Ｄ投影プロフィルを結合することによって適格化されたエッジポイントが抽出される。このことは生のコントラスト計算７８０と、強度画像に基づいて正規化したコントラスト計算７９０を用いて達成される。より具体的には、それぞれ閾値を超える生の投影された勾配の大きさと正規化した投影された勾配の大きさのいずれも有する局所的なピークは、次の例示的な式に従い後続のライン検出のための候補エッジポイントと見なされる。
（ｇ_ｘ ^２＋ｇ_ｙ ^２）^１／２＞Ｔ_ＡＢＳ
（ｇ_ｘ ^２＋ｇ_ｙ ^２）^１／２／Ｉ＞Ｔ_ＮＯＲＭ
ここで、ｇ_ｘ及びｇ_ｙはそれぞれピクセル箇所におけるｘ勾配投影とｙ勾配投影の値、Ｉは強度、Ｔ_ＡＢは生の投影された勾配の大きさに対する絶対コントラスト閾値、及びＴ_ＮＯＲＭは強度正規化した投影された勾配の大きさに対する正規化したコントラスト閾値である。 Also with reference to step 350 in step 300 (FIG. 3), qualified edge points are extracted by combining the 1D projection profiles of the x-gradient image and the y-gradient image. This is achieved using a raw contrast calculation 780 and a contrast calculation 790 normalized based on the intensity image. More specifically, local peaks, each with both a raw projected gradient magnitude above the threshold and a normalized projected gradient magnitude, are detected by subsequent lines according to the following exemplary equation: Is considered a candidate edge point for.
(G _x ² + _gy ² ) ^1/2 > T _ABS
(G _x ² + _gy ² ) ^1/2 / I> T _NORM
Here, g _x and gy are the values of the x-gradient projection and the _y -gradient projection at the pixel location, I is the intensity, _{TAB is the absolute contrast threshold for the magnitude of the raw projected gradient, and TNORM} _is the intensity normal. It is a normalized contrast threshold for the magnitude of the projected projected gradient.

特に、ポイントはその絶対コントラストと正規化したコントラストがいずれもそれぞれの閾値を超える場合のみ候補エッジポイントと見なされる。このことは正規化したコントラスト閾値Ｔ_ＮＯＲＭに対するコントラスト閾値Ｔ_ＡＢＳの模範的なグラフ８００において、右上の象限８１０によって示されている。二重の閾値（絶対及び正規化）を用いることは、典型的に絶対コントラスト閾値を使用する既存の方策とは全体として異なる。二重コントラスト閾値の利点は、例を挙げるとある画像がいずれも興味のあるエッジを含む暗い強度区域と明るい強度区域の両方を含む場合に明確である。画像の暗い区域でエッジを検出するためには、低いコントラスト閾値を設定することが望ましい。しかしながらそのような低いコントラストを設定すると、その結果として画像の明るい部分で誤ったエッジを検出する可能性がある。反対に、画像の明るい区域で誤ったエッジを検出するのを避けるためには、高いコントラスト閾値を設定することが望ましい。しかしながら高いコントラストを設定すると、システムは画像の暗い区域でエッジを適切に検出できないかもしれない。伝統的な絶対コントラスト閾値に加えて、第２の正規化したコントラスト閾値を用いることによって、システムは暗い区域と明るい区域の両方で適切にエッジを検出することができ、画像の明るい区域で誤ったエッジを検出するのを避けることができる。それゆえ該当するエッジの検出を可能にする一方で誤ったエッジを回避することにより、避けながら、二重コントラスト閾値の使用は全体プロセスの後続のライン検出段階の速度と堅牢性を最大化するのに役立つ。 In particular, a point is considered a candidate edge point only if its absolute contrast and normalized contrast both exceed their respective thresholds. This is shown by the upper right quadrant 810 in the exemplary graph 800 of the contrast threshold T _ABS for the normalized contrast threshold T _NORM . Using double thresholds (absolute and normalized) is generally different from existing strategies that typically use absolute contrast thresholds. The advantage of the dual contrast threshold is evident when, for example, an image contains both dark and light intensity areas, all of which contain edges of interest. It is desirable to set a low contrast threshold to detect edges in dark areas of the image. However, setting such a low contrast can result in the detection of false edges in bright areas of the image. Conversely, it is desirable to set a high contrast threshold to avoid detecting false edges in bright areas of the image. However, with high contrast settings, the system may not be able to properly detect edges in dark areas of the image. By using a second normalized contrast threshold in addition to the traditional absolute contrast threshold, the system can properly detect edges in both dark and bright areas, which is incorrect in the bright areas of the image. You can avoid detecting edges. Therefore, while avoiding false edges while allowing detection of applicable edges, the use of dual contrast thresholds maximizes the speed and robustness of subsequent line detection stages of the overall process. Useful for.

更に手順ステップ３５０（図３）を参照すると、すべてのエッジポイントが抽出されたら、後続のラインファインダが作動するために都合の良いデータ構造で再現され保存される。例えば次のタプルに留意されたい。
ｐ＝（ｘ，ｙ，ｇｘ，ｇｙ，ｇｍ，ｇｏ，Ｉ，ｇｍ／Ｉ，ｍ，ｎ）
ここで、（ｘ，ｙ）はエッジポイントの箇所、（ｇ_ｘ，ｇ_ｙ）はそれぞれｘ勾配投影及びｙ勾配投影の値、（ｇ_ｍ，ｇ_ｏ）は（ｇ_ｘ，ｇ_ｙ）から計算された勾配の大きさと向き、Ｉはエッジポイント箇所における強度、ｇ_ｍ／Ｉは勾配の大きさｇ_ｍを強度Ｉで除算することによって得られる強度正規化したコントラスト、ｍは画像インデックス、及びｎは投影区域インデックスである。標準キャリパツールにおけるようなエッジポイントの箇所は、精度を向上させるために補間できる。 Further referring to step 350 (FIG. 3), once all the edge points have been extracted, they are reproduced and stored in a convenient data structure for subsequent linefinders to operate. Note, for example, the following tuple.
p = (x, y, gx, gy, gm, go, I, gm / I, m, n)
Here, (x, _y ) is the location of the edge point, (g _x , gy) is the value of the _x -gradient projection and the y-gradient projection, respectively, and (g _m , go) is calculated from (g _x , g _y ). The magnitude and orientation of the gradient, I is the intensity at the edge point, g _m / I is the intensity normalized contrast obtained by dividing the gradient magnitude g _m by the intensity I, m is the image index, and n. Is the projected area index. Edge point locations, such as those in standard caliper tools, can be interpolated to improve accuracy.

エッジポイント抽出プロセスは一般的に場の投影を、予想されるライン角度と実質的に一致する単一方向で行うように作動することに留意されたい。それゆえツールはこの角度にあるエッジに対して最も敏感であり、その感度は他の角度にあるエッジに対して徐々に減少し、減少率は場の投影の長さを間接的に決定する粒度の設定に依存する。結果として角度範囲はユーザによって指定されることを条件に、プロセスは角度が予想されるライン角度「に近い」ラインを検出することに制限される。プロセスは直交していないラインを検出するように適合されているが、種々の実施形態において直交方向を含む複数の方向（全方向ライン検出）で投影を実行することによって３６０度全体の任意の角度のラインを検出できるように一般化されることが想定されている。 It should be noted that the edge point extraction process generally operates to project the field in a single direction that substantially matches the expected line angle. Therefore, the tool is most sensitive to edges at this angle, its sensitivity gradually decreases with respect to edges at other angles, and the rate of decrease indirectly determines the length of the field projection. Depends on the settings of. As a result, the process is limited to detecting lines that are "close" to the expected line angle, provided that the angle range is specified by the user. The process is adapted to detect non-orthogonal lines, but in various embodiments any angle across 360 degrees by performing projections in multiple directions, including orthogonal directions (omnidirectional line detection). It is supposed to be generalized so that the line of can be detected.

手順３００（図３）のステップ３６０を参照すると、例示的な実施形態に従い閾値を超えるエッジポイント候補がラインファインダに提供される。例を挙げると、ラインファインダは再帰的に作動し、（例えば）ランダムサンプルコンセンサス（ＲＡＮＳＡＣ）ベースの技術を採用する。図９におけるライン検出手順９００も参照する。ステップ９１０において、ユーザは１つの画像内で予想されるラインの最大数を、予想される角度、角度誤差、距離誤差及び（例示的に）最小カバレージスコア（以下に一般に定義）と共に（例えば）ＧＵＩによって指定する。これらのパラメータは、次のプロセスを操作するためにラインファインダによって使用される。画像の各サブ領域に対するラインはＲＡＮＳＡＣラインファインダを再帰的に実行して、ある段階からのエッジポイントアウトライアが次の段階に対する入力ポイントとなることによって検出される。こうしてステップ９２０において、手順９００は、エッジ検出プロセスにおいて極値として特定されたエッジポイントのグループ部分である１対のエッジポイントを選択する。手順９００は、モデルラインと（選択された誤差範囲内で）一致している勾配値のマッチングに基づいて、モデルラインを選択されたエッジポイントにフィッティングすることを試みる。ステップ９２４ではステップ９２２からの１以上のライン候補が戻される。各ライン検出段階は候補ライン、そのインライア及びアウトライアを戻す。戻されたラインは、ライン候補と一致する位置と勾配を有するインライアエッジポイントが計算される（ステップ９２６）。ステップ９２８では、最大のインライアカウントを有する候補ラインが特定される。上述したライン検出段階（ステップ９２０－９２８）は、ＲＡＮＳＡＣ反復の許容された最大数に達すると終了する（決定ステップ９３０）。各ライン検出段階内部の反復の最大数は、内部計算されたアウトライアの最悪比率とユーザによって指定された保証レベルを用いて自動的に計算される。各ライン検出段階は、ユーザ指定のフィッティング誤差、幾何学的制限及び極性を条件に、キャンプチャーされたエッジポイントの最大数を有するラインをそのすべての反復から戻す。各エッジポイントは１本のラインのインライアリストのみに割り当てることができ、各ラインは各投影区域からたかだか１つのエッジポイントのみ含むことが許される。エッジポイントの勾配の向きはその位置と共に、候補ラインのインライアリストに含まれるべきか決定するために使用される。具体的には、エッジポイントは候補ラインの角度と一致する勾配の向きを有するべきである。 Referring to step 360 of step 300 (FIG. 3), edge point candidates exceeding the threshold are provided to the line finder according to an exemplary embodiment. For example, the linefinder works recursively and employs (eg) random sample consensus (RANSAC) based technology. The line detection procedure 900 in FIG. 9 is also referred to. In step 910, the user (eg) GUI with the expected maximum number of lines in one image, along with the expected angle, angle error, distance error and (exemplarily) minimum coverage score (generally defined below). Specified by. These parameters are used by the line finder to operate on the next process. Lines for each subregion of the image are detected by recursively running the RANSAC Line Finder, where the edge point outline from one stage becomes the input point for the next stage. Thus, in step 920, step 900 selects a pair of edge points that are a group portion of the edge points identified as extrema in the edge detection process. Step 900 attempts to fit the model line to the selected edge points based on the matching of the gradient values that match the model line (within the selected error range). In step 924, one or more line candidates from step 922 are returned. Each line detection stage returns candidate lines, their inliers and outliers. For the returned line, an inlier edge point with a position and slope that matches the line candidate is calculated (step 926). In step 928, the candidate line with the largest in-line account is identified. The line detection step described above (steps 920-928) ends when the maximum number of RANSAC iterations allowed is reached (decision step 930). The maximum number of iterations inside each line detection stage is automatically calculated using the internally calculated worst outlier ratio and the guarantee level specified by the user. Each line detection step returns the line with the maximum number of camped edge points from all its iterations, subject to user-specified fitting errors, geometric limits and polarities. Each edge point can be assigned to only one line inlier list, and each line is allowed to contain at most one edge point from each projection area. The direction of the slope of the edge point, along with its position, is used to determine if it should be included in the candidate line's inliarist. Specifically, the edge points should have a gradient orientation that matches the angle of the candidate line.

決定ステップ９３０では、より多くの反復が許可されて、最良インライア候補からのアウトライア（ステップ９４０）を、ＲＡＮＳＡＣプロセス（ステップ９２０）がライン候補の検出に使用するために戻すことを決定する。 In the determination step 930, more iterations are allowed to determine that the outliers from the best inlier candidates (step 940) are returned for use by the RANSAC process (step 920) in the detection of the line candidates.

各ＲＡＮＳＡＣ反復において、異なる投影区域に属する２つのエッジポイントがランダムに選択され、それらの２つのポイントにラインがフィッティングされる。結果として生じる候補ラインは更に、その角度がポイント対の両エッジの勾配角度と一致するか、及びラインの角度がユーザによって指定された不確かさの範囲と一致するかについてのみ検討される。概して、エッジポイント勾配方向は通常直角であるが、ユーザ設定の角度誤差だけ異なることが許容される。候補ラインがこれらの初期テストに合格したら、次いでインライアエッジポイントの数が評価され、さもなければ新しいＲＡＮＳＡＣ反復が開始される。エッジポイントは、ユーザによって指定された勾配角度と距離誤差に基づいて、その勾配の方向と位置がラインと一致する場合のみ候補ラインのインライアと見なされる。 At each RANSAC iteration, two edge points belonging to different projection areas are randomly selected and a line is fitted to those two points. The resulting candidate line is further examined only to see if its angle matches the slope angle of both edges of the point pair and if the angle of the line matches the range of uncertainty specified by the user. In general, the edge point gradient direction is usually a right angle, but it is permissible to differ by a user-configured angular error. If the candidate line passes these initial tests, then the number of aligner edge points is evaluated and otherwise a new RANSAC iteration is started. An edge point is considered an aligner of candidate lines only if its slope direction and position match the line, based on the slope angle and distance error specified by the user.

ＲＡＮＳＡＣ反復が最大数に達したら（決定ステップ９３０）、検出された最良のライン候補のインライアは、（例えば）最小二乗回帰又は他の受け入れ可能な近似法を用いて改善されたラインのフィッティングが行われ、インライアエッジポイントのセットが再評価され、これらのステップを最大Ｎ回（例えば３回以上）、インライアの数がそれ以上増加又は減少しなくなるまで繰り返す（ステップ９６０）。これはステップ９７０で検出されたラインとして出力されるラインである。 Once the maximum number of RANSAC iterations has been reached (decision step 930), the best line candidate inliar detected is (eg) improved line fitting using least squares regression or other acceptable approximation method. The set of inliar edge points is re-evaluated and these steps are repeated up to N times (eg, 3 or more) until the number of inliar does not increase or decrease any further (step 960). This is a line output as the line detected in step 970.

決定ステップ９８０は、（（例えば）更にサブ領域又は他の基準をサーチすることによって）より多くのラインが検出されるべきか決定し、そうであればプロセスはステップ９２０にループバックして新しいセットのエッジポイントで作動する（ステップ９８２）。ポイントが使い尽くされるか、又は最大数反復カウントに達したら、手順９００はステップ９９０で（複数の画像内で）検出されたラインのセットを戻す。 Decision step 980 determines if more lines should be detected (by searching further (eg) further subregions or other criteria), if so the process loops back to step 920 and sets a new set. It operates at the edge point of (step 982). When the points are exhausted or the maximum number of iteration counts is reached, step 900 returns the set of lines detected in step 990 (in multiple images).

マルチラインファインダは、２本のラインが検査区域内で互いに交差する場合に、既存の結果の最終調整を実行するように適合されている。図１０及び図１１に一般的に示されているように、密集した平行線１０１０及び１０２０に対しては、ＲＡＮＳＡＣ手順の統計的正確に起因して時々誤ったライン結果（即ち図１０）が得られることがある。しかしながらそのようなエラーが発生したら、インライアポイントグループの交換（図１１のグループ１１１０における矢印１１２０）は時々カバレージスコアが増加してフィッティング残余が減少した正しいラインの位置を特定することができる。ポイント交換は、図示されているように画像が密集した平行線を含んでいる場合は極めて効果的である。反対に、画像が図１２及び図１３に図示されているように実際に互いに交差するライン１２１０及び１２２０を包含する場合、カバレージスコアはポイント交換の後で減らされ（図１２のグループ１２４０における矢印１２３０）、交換の前に得られた最初の結果は、交差線を首尾よく検出するプロセスにより保存される。 The multi-line finder is adapted to perform final adjustments to existing results when two lines intersect each other within the inspection area. As is commonly shown in FIGS. 10 and 11, for dense parallel lines 1010 and 1020, sometimes incorrect line results (ie, FIG. 10) are obtained due to the statistical accuracy of the RANSAC procedure. May be However, if such an error occurs, the exchange of align point groups (arrow 1120 in group 1110 in FIG. 11) can sometimes locate the correct line with increased coverage score and reduced fitting residue. Point exchange is extremely effective when the image contains dense parallel lines as shown. Conversely, if the images include lines 1210 and 1220 that actually intersect each other as illustrated in FIGS. 12 and 13, the coverage score is reduced after the point exchange (arrow 1230 in group 1240 of FIG. 12). ), The first results obtained prior to the exchange are preserved by the process of successfully detecting the intersection.

ＲＡＮＳＡＣ手順は、ラインファインダがポイントをラインにフィッティングできる多様な技術の１つであることに留意されたい。代替的な実施形態において、候補ポイントはそれらの間の変位セットに従って選択でき、或いは画像は（例えば）網羅的サーチ技術を用いて処理できる。従って、本明細書中で用いられているようにＲＡＮＳＡＣ技術への参照は、広く多様な類似のポイントフィッティング技術を含むものと解するべきである。 It should be noted that the RANSAC procedure is one of the various techniques that allows the line finder to fit points to the line. In an alternative embodiment, candidate points can be selected according to the set of displacements between them, or images can be processed using (eg) exhaustive search techniques. Therefore, references to RANSAC techniques as used herein should be construed to include a wide variety of similar point fitting techniques.

このシステム及び方法の追加の機能を設けることができる。これらは混合極性に対するサポート、投影区域幅の自動計算、多視点ライン検出のサポート、及び歪みのない入力画像により光学歪みを除去できるようにすることを含む。これらの機能について以下に説明する。 Additional functionality of this system and method can be provided. These include support for mixed polarities, automatic calculation of projected area widths, support for multi-view line detection, and the ability to remove optical distortion with a distortion-free input image. These functions will be described below.

更に図１４－図１６の例を参照すると、例示的な実施形態のライン検出システム及び方法は、一般に検出されたエッジ間のコントラストに対して（それぞれ）標準的な明から暗、暗から明及び択一極性設定をサポートする。加えてこのシステム及び方法は明から暗と暗から明の両特徴が同一のラインに現れる混合極性設定（図１７）もサポートできる。全４種類の設定のライン検出結果が次の図に示されている。例示的な実施形態において、システム及び方法は、反対極性のエッジポイントを包含する１本のラインの検出を可能にする混合極性設定を含んでいてもよい。これは、１本のラインのすべてのエッジポイントがいずれかの極性であるが１つの極性に限られている慣用的な「択一」極性設定とは異なる。混合極性設定は、幾つかある応用の中でも（例えば）較正プレートの明暗格子縞に用いられると有利であり得る。 Further referring to the example of FIGS. 14-16, the line detection systems and methods of the exemplary embodiments are generally (respectively) standard light to dark, dark to light and for contrast between detected edges. Supports alternative polarity settings. In addition, the system and method can also support a mixed polarity setting (FIG. 17) in which both light-to-dark and dark-to-light features appear on the same line. The line detection results for all four settings are shown in the following figure. In an exemplary embodiment, the system and method may include a mixed polarity setting that allows the detection of a single line containing edge points of opposite polarity. This is different from the conventional "alternative" polarity setting where all edge points in a line are of either polarity but are limited to one polarity. The mixed polarity setting may be advantageous when used for light-dark grid fringes of (eg) calibration plates, among other applications.

ユーザは、ライン検出の改善されたシフト不変を選択できる。そのような場合に、エッジポイント抽出装置は結果の安定性を改善するために実質的にオーバーラップした投影区域を使用する。これらの区域がオーバーラップしていないと、対象としているピクセルは画像がシフトしたとき投影区域の外に出る可能性があり、その結果としてライン検出結果におけるシフト不変が乏しいものとなる。オーバーラップした投影区域は、対象としているピクセルが投影区域によって連続的にカバーされることを保証する。オーバーラップした投影区域が使用された場合、増分計算を可能な低レベル最適化と共に実行できる。 The user can select an improved shift immutability for line detection. In such cases, the edge point extractor uses substantially overlapping projection areas to improve the stability of the results. If these areas do not overlap, the pixel of interest may move out of the projected area when the image shifts, resulting in poor shift invariance in the line detection results. The overlapping projection areas ensure that the pixels of interest are continuously covered by the projection area. If overlapping projection areas are used, incremental calculations can be performed with possible low-level optimizations.

ユーザは、取得した画像及び／又は撮像された表面の特定部分をライン特徴の解析から除外するマスクを設けることができる。これは表面が興味のない既知のライン特徴（例えば他のメカニズムによって解析されるバーコード、テキスト、及びラインを検出しようとするタスクと密接な関係がないその他の構造）を含む場合に望ましいことがある。従ってエッジポイント抽出装置は、画像内の「無関係」区域はマスクアウトでき「関係」区域はマスクインされる画像マスキングをサポートできる。そのようなマスキングが起きたら、例示的に検出されたラインのカバレージスコアはマスク内に入るエッジポイントの数に従って再重み付される。 The user can provide a mask that excludes the acquired image and / or a specific portion of the captured surface from the analysis of line features. This is desirable if the surface contains known line features that are of no interest (eg, barcodes, texts analyzed by other mechanisms, and other structures that are not closely related to the task of trying to detect the line). be. Therefore, the edge point extractor can support image masking in which "irrelevant" areas in the image can be masked out and "relational" areas are masked in. When such masking occurs, the coverage score of the exemplary detected line is reweighted according to the number of edge points that fall within the mask.

図１８の模範的な画像区域１８００を参照すると、画像マスクが存在する場合のカバレージスコアと、そのようなカバレージスコアに及ぼす画像マスキングの効果を示す。エッジポイント抽出装置は、画像内の「無関係」区域をマスクアウトできる画像マスキングをサポートする。図示されているように、検出されたライン１８１０は（「関係」マスク区域１８２０に基づき）関係エッジポイントによって特徴付けられる。そのような関係エッジポイントは、ライン１８１０に対する関係エッジポイントインライア１８３０と、ライン１８１０に対する無関係エッジポイントアウトライア１８４０からなる。ライン１８１０上の無関係エッジポイント１８５０は、この例で示されているようにマスクの関係区域１８２０の間にあり、たとえインライアとしてライン上にあってもカバレージスコア計算に含まれていない。ライン１８１０に沿ったエッジポイントに対する潜在的箇所１８６０も、図示のように決定されている。これらの潜在的箇所は、検出されたポイントの間隔に基づいて予測できる間隔で既知のポイントの間に位置決めされている。例示的に、検出されたラインのカバレージスコアは、マスク内に入るエッジポイントの数に従って再重み付される。これによりカバレージスコアは、次の通り修正される。
カバレージスコア＝ラインに対する関係エッジポイントインライアの数／（ラインに対する関係エッジポイントインライアの数＋ラインに対する関係エッジポイントアウトライア＋関係エッジポイントの潜在的箇所の数）。
Referring to the exemplary image area 1800 of FIG. 18, coverage scores in the presence of image masks and the effect of image masking on such coverage scores are shown. The edge point extractor supports image masking that can mask out "irrelevant" areas in an image. As shown, the detected lines 1810 are characterized by relational edge points (based on the "relationship" mask area 1820). Such a relational edge point consists of a relational edgepoint inlier 1830 for line 1810 and an irrelevant edgepoint outlier 1840 for line 1810. The irrelevant edge points 1850 on line 1810 are between the mask's relevant areas 1820 as shown in this example and are not included in the coverage score calculation even if they are on the line as inliers. Potential points 1860 for edge points along line 1810 have also been determined as shown. These potential locations are positioned between known points at predictable intervals based on the intervals between the detected points. Illustratively, the coverage score of the detected lines is reweighted according to the number of edge points that fall into the mask. As a result, the coverage score will be revised as follows.
Coverage score = number of relational edgepoint inliers for the line / (number of relational edgepoint inliers for the line + relational edgepoint outliers for the line + number of potential points of relational edgepoint).

本明細書に記載したシステム及び方法に従うライン検出プロセスを実行した後で、検出されたラインはユーザが（（例えば）ＧＵＩによって）指定する分類基準に基づいて種々の方法で分類できる。ユーザは、内部分類手段、例えばインライアカバレージスコア、強度又はコントラストから選択できる。ユーザは内部分類手段、例えば符号付き距離又は相対角度からも選択できる。外部分類手段を用いる場合、ユーザは検出されたラインの外部分類手段を計算するための参照ラインセグメントを指定できる。 After performing the line detection process according to the systems and methods described herein, the detected lines can be classified in various ways based on the classification criteria specified by the user (eg, by the GUI). The user can choose from internal classification means, such as in-line coverage score, intensity or contrast. The user can also choose from internal classification means, such as signed distances or relative angles. When using the external classification means, the user can specify a reference line segment for calculating the external classification means of the detected line.

一般的に上述したように、このシステム及び方法は多視野角（ＭＦＯＶ）オーバーロードを含むことができ、異なる視野からの画像のベクトルをプロセスに入れることができる。画像はすべて較正に基づいて共通クライアント座標空間にあるべきである。上記のように、この機能性は、複数のカメラを使って単一の部品の部分領域をキャプチャーする応用シナリオにおいて極めて有益であり得る。エッジポイントは勾配情報を保持するので、視野におけるギャップの間に投影されるライン特徴は、（両ＦＯＶにおける勾配が各ＦＯＶで与えられたラインの向き及びアライメントに匹敵するとき）依然として解像され得る。ｓｄｒｔ As generally mentioned above, this system and method can include multi-viewing angle (MFOV) overloads and can incorporate vectors of images from different fields of view into the process. All images should be in a common client coordinate space based on calibration. As mentioned above, this functionality can be extremely useful in application scenarios where multiple cameras are used to capture a subregion of a single component. Since the edge points retain the gradient information, the line features projected between the gaps in the field of view can still be resolved (when the gradient in both FOVs is comparable to the orientation and alignment of the lines given in each FOV). .. sdrt

特に、システム及び方法は、非直線ひずみを取り除いてひずみが重大でないことを保証するために、歪みの除去を要求しない（即ち画像が歪んでいないことを要求しない）（歪みのない画像を可能にする）。画像が歪んでいなければ、システム及び方法は依然として候補エッジポイントを検出し、非線形変換を通してポイント位置と勾配ベクトルをマッピングできる。 In particular, systems and methods do not require distortion removal (ie, do not require the image to be undistorted) (enable distortion-free images, in order to remove non-linear distortion and ensure that the distortion is not significant. do). If the image is not distorted, the system and method can still detect candidate edge points and map point positions and gradient vectors through non-linear transformations.

ＩＩＩ．ラインラベル付けトレーニングインタフェースとランタイムプロセス III. Line labeling training interface and run-time process

再び図１を参照すると、ビジョンシステムプロセス（プロセッサ）１３０は更に、以下に説明するトレーニング時とランタイムに使用されるラベルインタフェース及び関連するプロセス（プロセッサ）１３６を含む。加えて、ビジョンシステムプロセス（プロセッサ）は、ニューラルネットプロセス（プロセッサ）１３８を含み、又はこれと界接しており、ニューラルネットプロセス（プロセッサ）１３８は以下に説明するようにラベルプロセス１３６と界接するプロセス（プロセッサ）１３７から画像データ及び関連する分類子を受け取る。ラベルインタフェースプロセス（プロセッサ）１３６は、トレーニング時に動作してユーザに特定の（通常はテキスト／英数字による）記述（本明細書では「ラベル」又は「タグ」と呼ぶ）を、オブジェクトの画像内の関心のあるラインと関連付ける。これらのプロセス（プロセッサ）は、ツール結果で検出されたラインに自動的にラベル付けする追加の能力を提供することによって上述のライン検出ツールの機能を強化する。 Referring again to FIG. 1, the vision system process (processor) 130 further includes a label interface and related process (processor) 136 used during training and runtime as described below. In addition, the vision system process (processor) includes or borders the neural net process (processor) 138, which is the process that borders the label process 136 as described below. (Processor) Receives image data and related classifiers from 137. The label interface process (processor) 136 operates during training to provide a user-specific (usually text / alphanumerical) description (referred to herein as a "label" or "tag") in an image of an object. Associate with the line of interest. These processes (processors) enhance the capabilities of the line detection tool described above by providing the additional ability to automatically label the lines detected in the tool results.

図１９を参照すると、上記のようなライン検出プロセスをトレーニングするための手順１９００を示している。トレーニング手順１９００は、関心のあるラインに対するユーザ提供のラベル（通常はテキスト形式及び／又は英数字形式）を含んでいる。ステップ１９１０で、ユーザはオブジェクトのトレーニング画像を検査する。トレーニング画像は通常はモデルオブジェクトの実際の画像であるが、部分的又は全体的にＣＡＤ又は他の合成方法によって生成されてもよい。ユーザは関心のある画像内のライン、例えばタブレットやスマートフォンのカバーガラスのエッジなどを識別する。関心のあるラインを記述するために「内側ハウジング内側エッジ」、「外側ハウジング内側エッジ」などのラベルを作成することができる。ユーザは（可能なラベルのリストから）アクセスするか、又はラインを定義する一連の用語を作成し、これらは画像のトレーニングに使用するためのデータベースに保存される（ステップ１９２０）。 Referring to FIG. 19, a procedure 1900 for training a line detection process as described above is shown. Training procedure 1900 includes a user-provided label (usually in text and / or alphanumerical format) for the line of interest. At step 1910, the user inspects the training image of the object. The training image is usually an actual image of the model object, but may be partially or wholly generated by CAD or other compositing methods. The user identifies a line in the image of interest, such as the edge of the cover glass of a tablet or smartphone. Labels such as "inner housing inner edge", "outer housing inner edge" can be created to describe the line of interest. The user either accesses (from a list of possible labels) or creates a set of terms that define the line, which are stored in a database for use in image training (step 1920).

ユーザは、ビジョンシステムによる検査中にオブジェクトの１以上のトレーニング画像を取得するかアクセスして、それらの上でトレーニングプロセス（プロセッサ）を実行する（ステップ１９３０）。トレーニングプロセスは、上述したライン検出ツールの操作を含む。ツールはユーザ設定パラメータを使用して、各トレーニング画像内に自動的に複数のラインを検出する。 The user acquires or accesses one or more training images of the object during inspection by the vision system and performs a training process (processor) on them (step 1930). The training process involves the operation of the line detection tool described above. The tool uses custom parameters to automatically detect multiple lines within each training image.

図２０も参照すると、ユーザインタフェースディスプレイ画面２０００が示されている。ディスプレイ画面２０００は、ビジョンシステムによる検査中のオブジェクトのトレーニング画像を示すウィンドウ２０１０を含む。画像は一般に強調表示された印２０２０、２０２２、２０２４、２０２６、２０２８及び２０２９によって示される一連の検出された線を含む。ディスプレイは、特定のライン、例えば以下に記すようなラベルを付けるためにユーザによってクリックされたかフラグが立てられた「内側ハウジング内側エッジ」を（多くの場合異なる色で）強調表示する。印２０２２－２０２６は、ビジョンシステムタスクにとって特に関心／関連性のない及びクリックされなかったラインに関係する。一般に本明細書で定義される「ライン」は本質的に画像内の線形ステップエッジであり、これにラインファインダは数学的ラインフィットを結果として返す。要素２０３０（これもタスクに対する関連性のあるエッジあり、トレーニング中にユーザによってラベル付される）の場合は、２つの関連するステップエッジ（２０２８と２０２９参照）が要素２０３０の両側に示されている。 Also with reference to FIG. 20, the user interface display screen 2000 is shown. The display screen 2000 includes a window 2010 showing a training image of the object being inspected by the vision system. The image contains a series of detected lines generally indicated by the highlighted marks 2020, 2022, 2024, 2026, 2028 and 2029. The display highlights a particular line, eg, the "inner housing inner edge" that was clicked or flagged by the user to label it as described below (often in a different color). Marks 2022-2026 relate to lines that are not of particular interest / relevance to the vision system task and are not clicked. Generally, a "line" as defined herein is essentially a linear step edge in an image, to which the line finder returns a mathematical line fit as a result. For element 2030, which also has relevant edges for the task and is labeled by the user during training, two related step edges (see 2028 and 2029) are shown on both sides of element 2030. ..

ユーザは、定義されたラベル２０４２、２０４４、２０４６及び２０４８を含むメニュー２０４０にアクセスする。ユーザは、カーソル２０５０を使用して所望のラベル又は他のインタフェース構成要素をクリックすることができ、次に検出された関心のあるライン（ライン２０３０）をクリックして特定のライン上にラベルを設定する（ステップ１９４０）。検出されたすべてのラインにラベルを付ける必要はなく、ユーザが所望する関連性のあるラインのみに付ければよいことに留意されたい。１以上の関連性のあるラインが画像から欠落している場合は、そのラインに関連付けられているラベルは割り当てられないままである。 The user accesses menu 2040, which includes the defined labels 2042, 2044, 2048 and 2048. The user can use the cursor 2050 to click on the desired label or other interface component and then click on the detected line of interest (line 2030) to set the label on a particular line. (Step 1940). Note that not all detected lines need to be labeled, only the relevant lines desired by the user. If one or more related lines are missing from the image, the label associated with that line remains unassigned.

トレーニング画像のセットのライン結果をラベル付けした後、ツールがトレーニングされ、当該データはオブジェクト／ビジョンシステムタスクに関して適切なデータベースに保存される（ステップ１９５０）。それに続いてトレーニングされたツールが画像上で実行されると、ツールは複数のラインを検出するだけでなく、検出された各ラインに自動的にユニークなラベルを割り当てる（或いは検出されたラインがユーザのアプリケーションと関連性がない場合はラベルを付けない）。これにより、ユーザは検出された各ラインの識別情報を決定するためにライン結果を（例えばスクリプトコードで）後処理する必要がなくなる。 After labeling the line results for the set of training images, the tool is trained and the data is stored in the appropriate database for the object / vision system task (step 1950). When the subsequently trained tool is run on the image, the tool not only detects multiple lines, but also automatically assigns a unique label to each detected line (or the detected line is the user). Do not label if it is not relevant to your application). This eliminates the need for the user to post-process the line results (eg in script code) to determine the identification information for each detected line.

（任意選択の）ステップ１９６０によると、識別された（ラベル付けされた）ラインは、ランタイムに画像のライン特徴の処理及びスコアリングのためにニューラルネットワークツールに提供される。パラメータは、ユーザに提供されたトレーニングインタフェースの外部のニューラルネットワークに提供され、例えば画像内のライン特徴を検索するための最適化されたパラメータとして事前にプログラムすることができる。従ってユーザはトレーニング時にラインファインダ結果にテキストラベルを割り当てる責任のみある。ニューラルネットワークは、上記のライン検出ツールによって返された確率スコア候補ライン特徴に使用される。より具体的には、トレーニング時にラインがトレーニングウィンドウ２０１０で検出され、ユーザがラベル付けしたいラインの名前を選択したら、これらのラベル付きラインの各々に対してニューラルネットワーク分類子も作成される。分類子は、ユーザがトレーニングインタフェース２０００で定義して適用したラベルと同じ（又は類似の）名前を採用できる。例えばユーザがラインを選択してラベル「内側ハウジング内側エッジ」を選択すると、プロセスは同じ名前の分類子を作成して、現在の画像をライン特徴ベクトルと共に分類子に追加する。 According to (optional) step 1960, the identified (labeled) lines are provided to the neural network tool at runtime for processing and scoring line features of the image. The parameters are provided to a neural network outside the training interface provided to the user and can be pre-programmed, for example, as optimized parameters for retrieving line features in the image. Therefore, the user is only responsible for assigning text labels to the linefinder results during training. The neural network is used for the probability score candidate line features returned by the line detection tool described above. More specifically, once the lines are detected in the training window 2010 during training and the user selects the name of the line they want to label, a neural network classifier is also created for each of these labeled lines. The classifier can adopt the same (or similar) name as the label defined and applied by the user in Training Interface 2000. For example, if the user selects a line and selects the label "Inner Housing Inner Edge", the process creates a classifier with the same name and adds the current image to the classifier along with the line feature vector.

入力画像内からライン特徴候補を抽出するために、カスタマイズされた又は在来技術に従う適切なプログラミングで、様々な市販のニューラルネットワークツールを採用できる。上述のライン検出プロセスが、様々なライン検出ツール及び画像から検出されたラインを供給する技術の例であることは明らかであろう。 Various off-the-shelf neural network tools can be employed to extract line feature candidates from the input image, with appropriate programming according to customized or conventional techniques. It will be clear that the line detection process described above is an example of various line detection tools and techniques for supplying lines detected from images.

ここで図２１のランタイム手順２１００を参照すると、オブジェクトのランタイム画像が先行の取得プロセスから取得及び／又は提供される（ステップ２１１０）。この画像はビジョンシステム及び関連するプロセッサ１３０に渡され、プロセッサ１３０は関連するラベルを含むトレーニングされたモデルに基づいて上述したライン検出ツール１３４を操作する（ステップ２１２０）。ライン検出ツールは、ラベルの有無にかかわらずすべての検出されたラインを返す。次に、ステップ２１３０で、トレーニング時に作成されたニューラルネットワークが実行され、確率スコアマップが取得される。確率スコアマップは、あるピクセルが、ニューラルネットワークがトレーニングされた特徴ベクトルに対応するかどうかのマップである。この確率スコアマップは画像と同じサイズである。次に検出された各ラインは定期的にサンプリングされ、スコアマップからツールごとに確率スコアが統合される。次にステップ２１４０において、各ラベルに対してどのラインが最大確率スコアを持つかに基づいてラベルがラインに割り当てられる。ステップ２１５０において、ライン検出ステップの結果は関連するラベルと確率スコアと共に保存されて、ユーザに表示され、及び／又は部品検査（合否）、ロボット制御など他の下流の利用タスクのために使用される。 With reference to the run-time procedure 2100 of FIG. 21, a run-time image of the object is obtained and / or provided from the preceding acquisition process (step 2110). This image is passed to the vision system and the associated processor 130, which operates the line detection tool 134 described above based on the trained model including the associated label (step 2120). The line detection tool returns all detected lines with or without labels. Next, in step 2130, the neural network created during training is executed and the probability score map is acquired. A probability score map is a map of whether a pixel corresponds to a trained feature vector in a neural network. This probability score map is the same size as the image. Each line detected next is sampled periodically and the probability score is integrated for each tool from the score map. Then, in step 2140, labels are assigned to the lines based on which line has the maximum probability score for each label. In step 2150, the result of the line detection step is stored with the relevant label and probability score and displayed to the user and / or used for other downstream utilization tasks such as part inspection (pass / fail), robot control, etc. ..

図２２は、（図２０のトレーニングオブジェクトに基づく）模範的ランタイムオブジェクト上のランタイム結果のディスプレイを示しており、関連性のあるライン（ステップエッジ）が強調表示されている。追加の関連性のあるライン（例えば２２２８）は、トレーニングステップでトレーニングされた他の関心のあるラインに対して表示されており、ライン特徴２３１０を包囲している。幾つかの関連性のないラインも強調表示されている（２２２０、２２２２、２２２４）。ユーザは関連性のないラインの１つをクリックした（強調表示２２２０）が、これはそれぞれの情報ウィンドウ２２３０に「タグなし」を表示し、対応する確率スコアは０である。逆に、図２３では、図２２のディスプレイ２２００と同じ結果を示すディスプレイ２３００は、「内側ハウジング内側エッジ」のラベルを付けた関連性のあるライン２２１０に対する情報ボックス２３２０を提供する。これはトレーニング時からのユーザラベル付きライン特徴を表し、そのようなラインの確率スコアは０．６３５であり、これは検出されたラインがどちらかと言えば正しいラベル付きラインであることを意味する。 FIG. 22 shows a display of run-time results on an exemplary run-time object (based on the training object of FIG. 20), with relevant lines (step edges) highlighted. Additional relevant lines (eg, 2228) are displayed relative to other lines of interest trained in the training step and surround the line feature 2310. Some irrelevant lines are also highlighted (2220, 2222, 2224). The user clicked on one of the irrelevant lines (highlight 2220), which displays "untagged" in each information window 2230 with a corresponding probability score of 0. Conversely, in FIG. 23, the display 2300, which shows the same results as the display 2200 of FIG. 22, provides an information box 2320 for the relevant line 2210 labeled "inner housing inner edge". This represents a user-labeled line feature from the time of training, with a probability score of 0.635 for such lines, which means that the detected lines are rather correct labeled lines.

例示的に、上記のニューラルネット分類子は、ラインセグメントを定義する特徴と共に入力として画像（ピクセルデータ）を受け取る。ニューラルネット分類子の出力は画像セットであり、そこでは単一の画像内の各ピクセルが、対応する入力ピクセルがトレーニングされたラインセグメントと一致することの確実性である。出力画像の数は、分類子がトレーニングされたラインセグメントの数と同じである。ネットワークが再現するようにトレーニングされた所望の出力画像は、空間確率分布のバイナリ又はグレースケール表現であることができ、高確率の狭い稜線はライン又は他のトレーニングされたパターンの高勾配エッジに対応している。ランタイムに、分類子は．入力画像を受け取り、ニューラルネットがトレーニングされたラインセグメント現在のラベル／タグと関連している可能性があると結論する領域を強調する出力画像セットを生成する。 Illustratively, the neural net classifier described above receives an image (pixel data) as input along with features that define a line segment. The output of the neural net classifier is an image set, where each pixel in a single image is certain that the corresponding input pixel matches the trained line segment. The number of output images is the same as the number of line segments trained by the classifier. The desired output image trained to reproduce the network can be a binary or grayscale representation of the spatial probability distribution, with high probability narrow ridges corresponding to high gradient edges of lines or other trained patterns. is doing. At runtime, the classifier is. Takes the input image and produces an output image set that highlights the area where the neural net concludes that it may be associated with the trained line segment current label / tag.

代替的には、分類子は統計的に訓練されていてもよい。統計的に訓練された分類子への入力は、カレントラインセグメントとその隣のラインセグメントとの間の関係を描写する測定された特性（例；隣接するラインの距離、相対角度、等）又はラインセグメントの近傍における画像の算出された特性（ラインセグメントに接する１Ｄ強度画像プロジェクション、強度ヒストグラム統計値、等）を伴った、カレントラインセグメントの測定された諸特性（例；極性、位置、角度、等）を有する特徴ベクトルとして供給されたものであってもよい。したがって、本明細書中、「分類子」という用語は、ラベルを生成するニューラルネット分類子又は統計的に訓練された分類子を参照することができる。この用語は、Ｋ－近傍（Ｋ－ＮＮ）分類子及び／又はプロセス（プロセッサ）を参照することもできる。統計的分類子及び／又はＫ－ＮＮ分類子が動作する場合、確率スコア又は確率マップの出力は手順１９００及び２１００からオミットされることもあるし、また、インターフェイスのラベル／タグディスプレイの一部として提供されない可能性もある。しかし、そういった分類子であってもラベルプロセスを公的に改良することができる。 Alternatively, the classifier may be statistically trained. The input to the statistically trained classifier is a measured characteristic (eg, distance between adjacent lines, relative angle, etc.) or line that describes the relationship between the current line segment and the adjacent line segment. Measured characteristics of the current line segment (eg, polarity, position, angle, etc.) with calculated characteristics of the image in the vicinity of the segment (1D intensity image projection in contact with the line segment, intensity histogram statistics, etc.). ) May be supplied as a feature vector. Accordingly, in the present specification, the term "classifier" can refer to a neural net classifier or a statistically trained classifier that produces a label. The term can also refer to K-nearest neighbor (K-NN) classifiers and / or processes (processors). If the statistical classifier and / or the K-NN classifier works, the output of the probability score or probability map may be omitted from steps 1900 and 2100 and as part of the interface label / tag display. It may not be provided. However, even such classifiers can publicly improve the labeling process.

ＩＶ．結論 IV. Conclusion

システム、及び方法及び種々の代替実施形態／改良に従って提供されるラインファインダは、多様な条件の下で多数のライン特徴を決定するための効果的且つ堅牢なツールであることは明らかであろう。概して、ライン特徴を検出するために使用する場合にシステム及び方法は画像内に検出されるべきラインの最大数に特別制限はない。検出されたラインは、推定された正しさを決定でき、それによってライン検出プロセスの汎用性と堅牢性が増加するようにラベルを付けて分類することができる。 It will be clear that the line finder provided according to the system and method and various alternative embodiments / improvements is an effective and robust tool for determining a large number of line features under a variety of conditions. In general, systems and methods when used to detect line features have no particular limitation on the maximum number of lines that should be detected in an image. The detected lines can be labeled and classified so that the estimated correctness can be determined, thereby increasing the versatility and robustness of the line detection process.

以上、本発明の例示的な実施形態を詳細に説明した。本発明の精神と範囲を逸脱することなく種々の改変及び追加を行うことができる。上述した種々の実施形態の各々の特徴は、関連する新しい実施形態において多数の特徴の組み合わせを提供するのに適する限り、別の記載された実施形態の特徴と組み合わされてよい。更に、上に本発明の装置と方法の多数の別個の実施形態を記したが、ここに記載されたものは本発明の原理の応用を例示したものに過ぎない。例えば本明細書中で使用される「プロセス」及び／又は「プロセッサ」という言葉は広く電子ハードウェア及び／又はソフトウェアをベースとする多様な機能及びコンポーネント（代替として機能的「モジュール」又は「エレメント」と呼ぶことがある）を含むものと解釈されるべきである。更に、図示されたプロセス又はプロセッサは他のプロセス及び／又はプロセッサと組み合わせ、又は種々のサブプロセス又はサブプロセッサに分割されてよい。そのようなサブプロセス及び／又はサブプロセッサは、本明細書に記載された実施形態に従って多様に組み合わせることができる。同様に、本明細書中の何らかの機能、プロセス及び／又はプロセッサは、プログラム命令の非一時的コンピュータ可読媒体からなる電子ハードウェア、ソフトウェア、或いはハードウェアとソフトウェアの組み合わせを用いて実施できることが明確に想定されている。更に、本明細書で使用される様々な方向及び／又は向きを表わす用語、例えば、「垂直」、「水平」、「上」、「下」、「底部」、「頂部」、「側部」、「前部」、「後部」、「左」、「右」及びこれに類するものは、相対的な表現法として用いられているに過ぎず、重力の作用方向など固定した座標系を基準とした絶対的な向きを表わすものではない。従ってこの記述は例示としてのみ受け取られるべきであり、本発明の範囲を別途制限することを意味するものではない。加えて、与えられた測定、値又は特徴に関して「実質的に」又は「近似的に」という言葉が用いられている場合、それは所期の結果を達成するための通常の操作範囲内にある量を指しているが、システムに許容された誤差の範囲内の固有の不正確さや誤りに起因するある程度のばらつきを含む（例えば１－５パーセント）。従ってこの説明は例示の方法によるものであり、本発明の範囲を別途制限することを意味するものではない。 The exemplary embodiments of the present invention have been described in detail above. Various modifications and additions can be made without departing from the spirit and scope of the present invention. Each feature of the various embodiments described above may be combined with the features of another described embodiment as long as it is suitable to provide a combination of multiple features in the relevant new embodiment. Further, although a number of distinct embodiments of the devices and methods of the invention have been described above, those described herein merely illustrate the application of the principles of the invention. For example, the terms "process" and / or "processor" as used herein are broadly based on electronic hardware and / or software and various functions and components (alternatively functional "modules" or "elements". Should be construed as including). Further, the illustrated process or processor may be combined with other processes and / or processors or divided into various subprocesses or subprocessors. Such subprocesses and / or subprocessors can be combined in various ways according to the embodiments described herein. Similarly, it is expressly stated that any function, process and / or processor herein can be performed using electronic hardware, software, or a combination of hardware and software consisting of non-temporary computer-readable media of program instructions. It is supposed. In addition, terms used herein to describe various directions and / or orientations, such as "vertical," "horizontal," "top," "bottom," "bottom," "top," and "side." , "Front", "rear", "left", "right" and the like are only used as relative expressions and are based on a fixed coordinate system such as the direction of gravity. It does not represent the absolute orientation. Therefore, this description should be taken as an example only and does not imply limiting the scope of the invention separately. In addition, when the word "substantially" or "approximately" is used with respect to a given measurement, value or feature, it is an amount within the normal operating range to achieve the desired result. However, it includes some inherent inaccuracies and errors within the margin of error allowed by the system (eg 1-5 percent). Therefore, this description is by way of example and does not imply limiting the scope of the invention separately.

特許請求の範囲 Claims

Claims

A system for detecting line features in images acquired based on one or more cameras.
With a vision system processor;
With an interface related to the vision system processor that allows the creation of individual labels for the relevant lines identified by the line detection process in the training image of the object;
With a run-time line detection process that identifies the lines in the captured image;
With a neural net process that uses a classifier based on the label to determine the probability map of the line features for the label;
With a run-time result generation process that provides labels and probability scores for at least one related line;
The above system.

The vision system of claim 1, wherein the run-time result generation process provides a probability score for irrelevant lines.

The vision system of claim 1, wherein the result generation process includes an interface that highlights a line and provides a probability score associated with the highlighted line.

The vision system according to claim 1, wherein the probability score map is similar in size to the acquired image.

The vision system of claim 1, wherein the classifier process uses at least one of a neural net classifier and a statistically trained classifier.

The line detection process has a processor that receives image data of the scene including line features, and includes an edge point extractor.
The edge point extraction device is
(A) Calculate the gradient vector field from the image data and
(B) The gradient vector field is projected onto a plurality of gradient projection subregions.
(C) A plurality of edge points are detected in each of the gradient projection subregions based on the projected gradient data.
The line finder will generate multiple lines that match the edge points extracted from the image.
The vision system according to claim 1.

The line finder includes manipulating a process based on RANSAC to fit inlier edge points to new lines and iteratively define lines from outlier edge points relative to predefined lines. The system according to claim 6.

The system of claim 6, wherein the projection of the gradient field is directed along a set direction in response to the expected orientation of one or more features or line features.

The system of claim 6, wherein the gradient field projection defines a grain size based on a Gaussian kernel.

The edge point extraction device is arranged so as to detect a plurality of gradient intensity maximum values in each of the gradient projection subregions, and the gradient intensity maximum values are each specified as some of the plurality of edge points, and the position vector is used. And the system of claim 6, described by a gradient vector.

The line finder is a metric based on the distance of at least one edge point from at least one candidate line and the angle difference between the gradient direction of at least one edge point and the normal direction of at least one candidate line. Arranged to determine a match between at least one edge point of a plurality of extracted edge points and at least one candidate line of a plurality of detected lines by calculating. Item 6. The system according to Item 6.

A system for detecting line features in images acquired based on one or more cameras.
With a vision system processor;
With an interface related to the vision system processor that allows the creation of individual labels for the relevant lines identified by the line detection process in the training image of the object;
With a run-time line detection process that identifies the lines in the captured image;
With a statistical classifier that produces labels for interfaces based on the lines identified by the line detection process;
The above system.

A system for detecting line features in images acquired based on one or more cameras.
With a vision system processor;
With an interface related to the vision system processor that allows the creation of individual labels for the relevant lines identified by the line detection process in the training image of the object;
With a run-time line detection process that identifies the lines in the captured image;
With a K-NN classifier that produces labels for interfaces based on the lines identified by the line detection process;
The above system.

In a method for detecting line features in an image acquired based on one or more cameras.
With steps to provide an interface related to the vision system processor that allows the creation of individual labels for the relevant lines identified by the line detection process in the training image of the object;
With the steps of identifying the detected lines in the captured image by the run-time line detection process;
With the step of generating a label for at least one related detected line with a classifier;
How to have.

The classifier comprises at least one neural net classifier, and the method further comprises the at least one neural net classifier based on the label to determine a probability map of the line features detected for the label. 14. The method of claim 14, comprising a step to use and a step to generate a probability score for irrelevant lines.

15. The method of claim 15, wherein the generated step highlights the detected lines in an interface and provides a probability score associated with the highlighted lines.

14. The method of claim 14, wherein the classifier is at least one of a neural net classifier, a statistically trained classifier, and a K-NN classifier.

The line detection process has a processor that receives image data of the scene including line features, and includes an edge point extractor.
The edge point extraction device is
(A) Calculate the gradient vector field from the image data.
(B) The gradient vector field is projected onto a plurality of gradient projection subregions.
(C) Detect multiple edge points in each gradient projection subregion based on the projected gradient data and calculate multiple lines matching the edge points extracted from the image.
The method according to claim 14.

The calculation step involves manipulating a RANSAC-based process to fit an inlier edge point to a new line, iteratively defining a line from an outlier edge point to a predefined line. 18. The method of claim 18.

18. The method of claim 18, wherein the projection of the gradient field is oriented along a direction set according to the expected orientation of one or more line features.

The edge point extractor detects a plurality of gradient intensity maximums in each of the gradient projection subregions, each of which is identified as some of the plurality of edge points and is described by a position vector and a gradient vector. The method according to claim 18.

The line detection process is based on the distance of at least one edge point from at least one candidate line and the angular difference between the gradient direction of at least one edge point and the normal direction of at least one candidate line. 18. Claim 18, wherein the calculation of the metric determines a match between at least one edge point of the plurality of extracted edge points and at least one candidate line of the plurality of detected lines. the method of.