JP2004509389A

JP2004509389A - Method and apparatus for ground inference with distributed embedded processing elements

Info

Publication number: JP2004509389A
Application number: JP2002508682A
Authority: JP
Inventors: ハワード，マイク; ペイトン，デービッド; ホフ，ブルース; リー，クレイグ; デイリー，マイク
Original assignee: HRL Laboratories LLC
Current assignee: HRL Laboratories LLC
Priority date: 2000-07-10
Filing date: 2001-07-10
Publication date: 2004-03-25
Also published as: WO2002005142A2; EP1299823A2; AU2002218791A1; WO2002005142A3

Abstract

環境内埋め込み分散型ネットワークを形成する複数のエージェントを用いて、前記物理的環境の特性を計算するための方法と装置が提供される。前記方法は、始動エージェント２００を確定するステップと、隣接エージェント２０２に対して蓄積原価価値を含む信号を送信するステップと、局所情報２０４により前記蓄積原価価値を増強するために各隣接エージェントにおいて前記信号をプロセスするステップとを含む。複数の信号が受信される場合は、新しい信号２０６を生成するための最良の蓄積原価価値をいずれが有しているのかを確定し、前記隣接エージェントをその後始動エージェント２０８として取り扱い、また、隣接エージェント２０８に前記新しい信号を送信し、また、メモリー２１０に最良の増強された原価価値を保持する。方法にはさらに、最短パス計算を使用してパスを確定するステップと、２つの基準エージェント間にある１つのパス上にエージェントを整列させるために二重勾配を使用するステップと、チョーク点上にエージェントを発見し、また収束させるステップとが含まれる。Methods and apparatus are provided for calculating characteristics of the physical environment using a plurality of agents forming a distributed network embedded in the environment. The method includes determining an initiating agent 200, transmitting a signal including an accumulated cost value to an adjacent agent 202, and transmitting the signal at each adjacent agent to enhance the accumulated cost value with local information 204. Processing. If more than one signal is received, determine which has the best accumulated cost value to generate a new signal 206, then treat the neighbor agent as the initiating agent 208, and The new signal is sent to 208 and the memory 210 retains the best enhanced cost value. The method further includes determining the path using a shortest path calculation, using a double gradient to align the agents on one path between the two reference agents, Discovering and converging the agent.

Description

【０００１】
【連邦に帰属するべき権利についての記載】
本発明は、ＤＡＲＰＡ　ＩＴＯ契約第Ｎ６６００１−９９−Ｃ−８５１４号、「フェモロン・ロボティックス（Ｐｈｅｒｏｍｏｎｅ　Ｒｏｂｏｔｉｃｓ）」について交わした米国政府との契約内で執り行った研究に関連しており、米国政府は本発明に一定の権利を有する。
【優先権主張】
本出願は、「大規模ロボット群に対する緊急運動制御（Ｅｍｅｒｇｅｎｔ　Ｍｏｖｅｍｅｎｔ　Ｃｏｎｔｒｏｌ　ｆｏｒ　Ｌａｒｇｅ　Ｇｒｏｕｐ　ｏｆ　Ｒｏｂｏｔｓ）」と題されて２０００年７月１０日に米国において仮出願された第６０／２１７，２３２号、および「分布埋込み式プロセシング素子による地面推論（Ｔｅｒｒａｉｎ　Ｒｅａｓｏｎｉｎｇ　ｗｉｔｈ　Ｄｉｓｔｒｉｂｕｔｅｄ　Ｅｍｂｅｄｄｅｄ　Ｐｒｏｃｅｓｓｉｎｇ　Ｅｌｅｍｅｎｔｓ）」と題されて２０００年７月１０日に米国において出願された第６０／２１７，２２６号に対する
優先権の恩恵を主張する。
【０００２】
【発明の属する技術分野】
本発明は、物理的環境の特性を計算するという分野に関する。さらに詳細には、この開示は環境内埋め込み分散型ネットワークを形成する複数のエージェントを使用して物理学的環境の特性を計算するための方法を提示する。
【０００３】
【従来の技術及び発明が解決しようとする課題】
パスプラニングと地面分析に対するアプローチの諸方法に関して言えば、過去数十年にわたり、地面の諸特徴を含む内部マップ上でオペレーションを行う単独または並列プロセッサソリューションが関心を集めてきた。伝統的には、地面分析は２つの形態のうちの１つの形態に関わっていた：１）上空飛行により地面情報を収集する単一の移動可能なエージェント（例えば、ロボット）または少数のエージェント、および／または事前構築したデジタルマップを使用してパスプラニングまたは他の地面関連意思決定を手助けする、２）幾何計算のために有用な事前構築デジタルマップとして、地面／環境データベース（以下にあるようなシミュレーションの統合部分）を参照しつつシミュレートされた地面埋め込み実体（例えば、シミュレートされた戦場における軍用車両またはコンピュータゲームにおけるシミュレートされた敵）、のうちの１つに関わっていた。
【０００４】
典型的な計算には、目的地への最短パスの確定、地面において高い視認性を有する位置の確定、および公知の敵の位置からの隠されているルートの確定が含まれる。ルートプラニングと地面論理へのこうした伝統的なアプローチでは、該当するデータは、機械により読取り可能なマップを作り上げるために、現実世界からとられる。局部的なセンシングに基づく地面／環境論理は、環境に関する情報に対しては非常に限定されたアクセスしか提供しない。
【０００５】
自然界からの解決策によって環境のマップを事前構築すればずっと多くの情報が提供され、また、ロボット自体のセンサからの情報により更新されうる。こうしたマップは通常は地面特性の配列によって表現され、それにはトラバース性とブロック化されたルートを含み、ある特定の地面パスを横切って移動する困難さと経費を表わす対応した配列に換算することができる。
【０００６】
その原価配列は然るのち解析され、そのマップ上の指定された地点間の最小原価パスを示す新たなマップを得ることができる。この伝統的なアプローチにおいては、前記エリアの環境上の特性をセンシングし、中央点にデータを送信し、マップを生成するという個別でまた明確な諸ステップが、全ての最小原価パス計算が実施される前に行われなければならない。その結果、前記環境が急速に変化していったり、センシングとマップ生成がタイミング良く実行することができない状況の中では、最新の情報を取り入れた最小原価パスを確定することは多くの場合困難である。
【０００７】
空間をマッピングするという現行の手段には、探索パターンで環境を移動していき、生データまたは部分的に処理されたデータを中央演算リソースに送り戻すいくつかの強力な移動可能なロボットの使用が含められる。前記中央リソースはマップを編集し、その後にそのマップ上でオペレーションを行う。環境に対する劇的な変化は、前記ロボットがその探索を反復するまではとり上げられることはない。
【０００８】
ある種の通信ネットワークでは、通信目的のための最低原価パスを確定するための計算を行う（例えば、送信エネルギーまたは信号再送信による遅滞を最小にする）。こうしたアプローチの方法は通信問題に特に関連しており、そうしたアプローチの方法は、移動性に影響を与えるのではないかと考えられる通信効率と環境特性との間の対応関係を引き出そうと試みることはしない。こうした諸アプローチの方法のいくつかに該当する情報は米国特許第５，５６１，７９０号と第５，２３３，６０４号に見出すことができる。
【０００９】
従って、隣接しているエージェントとの局所的な通信のみを必要とする数多くの小さくて、安価なエージェントを使用して、ある環境のエリア全体にわたってまたはそこを通る最適パスを確定するための手段を提供することが望ましいこととなる。１つのパスに沿って蓄積された非局所的な特性は各エージェントごとに局所的な表現を提供するやり方で、信号が複数のエージェントを通して一連のホップをすることによりパス全体にわたって送信されるであろう。エージェントにおける情報は、前記パスに沿って別の信号を送信することにより、直ちに更新することができよう。
【００１０】
【発明を解決するための手段】
本明細書の中で開示されている本発明は、物理的環境内に配置されている複数のエージェントを使用して物理的環境の特性を計算するための方法と装置を提供する。配置されている各エージェントは、通信能力と、前記通信能力と結合されているプロセッサと、前記プロセッサと接続されているメモリーと、を含む。前記複数のエージェントは、エージェント間通信の物理的特性を活用して、物理的環境の非局所特性の分散型表現を形成する。
【００１１】
本発明の前記方法は、少なくとも１つの始動エージェントのトリガーを引くステップ；前記始動エージェントから前記始動エージェントに隣接しているエージェントへ信号を送信するステップ；前記始動エージェントと各隣接しているエージェントとの間のそれぞれのパスに沿った局所特性に基づく原価価値によって、前記隣接しているエージェントのそれぞれにおいて信号をプロセスするステップ；受信された信号から送信するべき新しい信号を選択するステップ；前記隣接しているエージェントを前記プロセスされた信号の始動エージェントとして取り扱い、かつ前記送信およびプロセスするステップを反復するステップ；各エージェントにおいては、受信した信号に基づく物理的環境の非局所特性に関する情報を表わす少なくとも１つの蓄積原価価値を局所的に保持するステップを含む。
【００１２】
したがって、始動エージェントから前記複数のエージェントの中の各エージェントへ信号が伝搬して行き、そのエージェントから前記始動エージェントへの最良パスに関する質の尺度を提供するために蓄積原価価値が各エージェントに保存される。
【００１３】
前記原価価値は簡易ホップカウントとして表現され、各ホップごとに前記始動エージェントから複数のエージェントにわたって前記信号伝搬が反復され得る。前記蓄積原価価値は典型的には、受信した信号から送信するべき新しい信号を確定するのに使用される。
【００１４】
さらに、原価価値は過去の信号から保持され、現行の信号からの原価価値と比較して　信号をプロセスする原価価値を確定することが可能である。前記信号は典型的には視覚的パスのラインに沿って渡されていき、それが、１つのパスに沿った前記トラバース性に関する一般的な表現を提供する。
【００１５】
前記エージェントはまた、特定の信号が１つの通信パスを越えて受信されたその方向を保存するための手段を前記エージェントに提供することにより増強されうるものであり、また、２つのエージェント間の異なるパスについての原価価値は、最良原価を有するパスに対応する方向とともに保存することが可能である。
【００１６】
さらに、前記原価価値は前記エージェントのメモリに保持されるとその時刻が記録されように出来、一番最近になって保持された原価価値は前記信号をプロセスする際に使用されるように指定され、また前記原価価値は所定の時間が経過した後にメモリーから取り除かれる。前記原価価値はまた時間経過にわたって最良から最悪まで順位付けが可能であり前記最良原価価値は前記信号をプロセスする際に使用される。
【００１７】
この場合、前記信号をプロセスする際に使用されるように指定されている原価価値のその時刻記録の期限が切れて、前記原価価値がメモリーから取り除かれると、その次に最良となっている原価価値が前記信号をプロセスする際に使用されるように指定される。
【００１８】
本発明のもう１つの実施態様では、前記エージェントは整列を行うように構成することが可能であり、そこでは、２つのエージェントが、基準エージェントとして行動するように、前記複数のエージェントの中から選択され；前記複数のエージェントの中のエージェント間の最適原価パスを表現したベクトルから成る原価勾配と、キャッシュ基準エージェントが生成され；前記ベクトルは、前記エージェントに対する運動ベクトルを作り出すために各エージェントにおいて合計され；そして、前記エージェントは前記基準エージェント間にある１つのパスに沿って前記エージェントを整列させるために、前記運動ベクトルに沿って移動される。
【００１９】
前記整列ステップでは、例えば、侵入者の検出などのトリガーを引く出来事の発生に際してトリガーを引くことができる。一旦前記エージェントが１つのパスに沿って整列すると、それらエージェントは、収束エージェントを選択することにより、また、他のエージェントを前記収束エージェントに向かうように促すことにより、１つのチョーク点に収束されることが可能であり、結果として前記収束エージェントの近くでエージェントの局所クラスタリングができる。
【００２０】
エージェントにはまた、前記原価価値の生成のためのプロセッサに付加的な情報を提供するためにそれらプロセッサと接続されるセンサを組み込むことが可能である。例えば煙検出器は、１つのパスに沿って前記原価価値の中に、あるエリアの煙の程度を因数化するのに使用することができる。
【００２１】
本発明とともに使用される用途で設計されているエージェントは、ソフトウェアの形態で本発明の実行に使用される論理を組み込むことが可能であり、または、前記論理はハードウェアにエンコードすることができる。
本発明は、物理的環境の諸特性を計算する分野に関する。
以下の説明は、通常の技術を有する同業者であれば、本発明を作り、使用し、またそれを特定のアプリケーションのコンテクストに組み込むことを可能にするよう呈示される。異なるアプリケーションにおける多様な使用法と同様にさまざまな変更も熟練した同業者には直ちに明快であり、また本明細書に定義されている一般的な原則は、実施態様の広範な範囲に対して適用しうるものである。
【００２２】
したがって、本発明は呈示されている前記実施態様に対して限定されることが意図されているわけではなく、本明細書で開示されている原則と新規な特徴と合致しているもっとも広範な範囲に対して認容されるべきものである。
【００２３】
基準となる作業フレームを提供するために、最初に説明と請求項において使用される用語の解説が読者向けの中心となる情報源として提供される。次にその具体的な詳細を展開するのに先立って、概念的な理解を付与するために、簡単な紹介が本発明の説明的な記述の形態で提供される。
【００２４】
【用語解説】
本発明の具体的な詳細を説明する前に、本明細書ならびに請求項において使用されているさまざまな用語の集中一覧個所を提供することは有益である。
【００２５】
エージェント−本明細書の中で説明されているように、エージェントという用語は、そのもっとも一般的な意味においては、プロセッサと、通信能力と、メモリとを含むユニットを意味している。エージェントにはまた、運動のための１つまたはそれ以上の手段と、前記エージェント配置システムにおける他の物体に対するその方向を検出するための手段（コンパス、指向性送信機／受信機、その他など）と、距離測定装置（レーザー、赤外線、超音波、またはレーダーレンジファインダーなど）と、１つまたはそれ以上のセンサを含めることもできる。
【００２６】
さらにもっと分かり易い用語で言えば、エージェントは、例えば、赤外線送信／受信機と一組のホイールを備えたロボットでありうるものであり、そこでは、前記赤外線送信／受信機は前記ロボットと他のロボットとの間の距離と方向を計測するのに使用される。さらに複雑な状況では、同じロボットにはまた、無線通信を送信し、また受信するためのアンテナと、環境のさまざまな状況（温度、視認性、他など）を感知するためのセンサ群と、飛行用の翼やプロペラなどの代替的な運動手段とが含まれうる。
【００２７】
チョーク点−ある種の状況では、１つの点に向かって一群のエージェントを動かすことが望ましい場合がある。例えば、侵入者が１つのエリア内をあちこち動いている場合、複数のエージェントを前記侵入者の近傍に送り込むことが可能である。一方、狭い回廊があるエリアでは、侵入者の通過を制限するのが望ましい場合がある。
【００２８】
したがって、エージェントは前記侵入者の動きを制限するために、前記侵入者のパスに沿った回廊の近傍に送られ得る。このように、チョーク点は一般的には１つの選択されたエージェントの周りにある一群のエージェントの収束点であり、通路を塞ぐ、または、１つの選択された局所において密に一群のエージェントを配置するのに使用することが出来る。
【００２９】
通信能力−本明細書の中で使用されている通信能力という用語は一般的に、エージェントとその局所隣接しているものとの間の通信を容易にする局所通信メカニズムを言う。特定の環境に対して適用可能ないずれの通信メカニズムを使用しても良い。
【００３０】
例えば、清浄な大気中では、指向性赤外線通信メカニズムは、その指向性のため、特に有用である場合がある。他方、水または液体環境では、音波通信メカニズムがより有用である場合がある。
【００３１】
原価価値−原価価値という用語は、エージェント間にあるパスに沿っての１つまたはそれ以上の物理的特性の尺度を意味する。例えば、非常に簡単なケースでは、原価価値は、１つのエージェントからもう１つのエージェントに信号を送ることができたかどうかの尺度となることが可能であり、その場合は、ホップカウントを前記原価価値として機能させることができるであろう。
【００３２】
一方、より複雑な原価価値は（例えば）いくつかの物理的特性の重み付けされた合計として展開されて良く、以下に限定される訳ではないがその例として、信号送信の質、温度、風速、エージェント間のパスの角度が含まれる。特定の実施態様の必要に応じて多くの原価価値スキームを展開するができる。
【００３３】
局所原価価値という用語は、２つのエージェント間にある前記通信パスに沿った局所環境条件の尺度を言うことが意図されている。蓄積原価価値という用語は、２つ以上のエージェントの間にある前記パスに沿った非局所環境条件の尺度を言うことが意図されている。局所原価価値と蓄積原価価値という用語がここで定義されているが、この用語の正確な適用はまたその用語の使われ方から推論することが可能である。
【００３４】
また、始動エージェントにより送信される信号に対して応答する複数のエージェントの中のそれぞれのエージェントで生成される原価価値は、前記複数のエージェントの全体にわたる原価勾配パターンを形成する。指向性情報と組み合わせて、前記原価勾配パターンは、方向付けされたグラフの多くの特徴を有し、また、方向付けされたグラフを使用して実施される多くの計算は前記複数のエージェントに適用することが可能である。
【００３５】
局所−直接（通常は見通し線）通信およびあるエージェントのセンシング範囲内にあるエージェントは局所にあると考えられ、そのエージェントの隣接物と呼ばれる。１つの任意のエージェントに対する局所（隣接）エージェントの数は、１つの任意エリアに存在するエージェントの数によって変化しうる。送信出力を保存するために局所エージェント間でのみ信号を使用することが望ましい。
【００３６】
環境の特性に関してそれが使用される場合については、１つのエージェントに対して局所的である諸特性は前記エージェントにより感知することができる諸特性である。１つのエージェントに対する非局所物理的／環境諸特性は一般的には、前記エージェントが直接感知することができないそうした物理的／環境諸特性を言う。
【００３７】
移動性を提供するための手段−環境の中で動き回ることが必要な移動性を提供するために（運動が望ましいものと仮定して）、エージェントは、移動性を容易にする何らかを必要とする。前記エージェントが使用される環境に適当なものであれば、どんな公知の制御可能な運動メカニズムでも使用することが可能である。
【００３８】
例えば、地上では、エージェントは典型的には脚またはホイールを有するものであり、一方空中ではエージェントはプロペラおよび翼を有することであろう。他方、液体環境では、前記エージェントにはプロペラとフィンが装備され、宇宙空間では、前記エージェントはジェットまたはロケットが装備されることであろう。移動性を提供するための手段は、前記プロセッサに結合され、また前記プロセッサにより制御される。
【００３９】
センサ−他のエージェントからの距離と方向を検出するための手段に加えて、エージェントはエージェントの状況に関するものとともに、その環境に関する情報を収集するためのそれらプロセッサに結合されるセンサを備えることが可能である。本発明のエージェントに使用することが可能なセンサの実施例には、以下の例に限定はされないが、煙探知器と、音声検出器、視覚センサ、地震検出器、熱／赤外線検出器、化学品感知装置、および衛星利用測位システム（ＧＰＳ）センサが含まれる。
【００４０】
センサ情報は、１つのパスに沿って前記原価価値の中に組み込むことが可能であり、または、チョーク点に対する収束などの前記複数のエージェントのさまざまな挙動のトリガーを引くのに使用することができる。
【００４１】
【序言】
本発明は、一般的には、エリアについての地形計算を行うため、１つのエリアに分布している複数のエージェントを使用するための手段を提供する。諸特性はもっとも近い隣接しているものの間の通信のパターンを作り出すことにより確定することが可能であり、そこでは供給元エージェントは、その隣接しているエージェントのそれぞれに、蓄積原価価値を含む信号を送信する。
【００４２】
各隣接しているエージェントは、前記供給元からの前記信号を受信し、また、前記供給元と前記それぞれの隣接しているエージェントとの間の前記物理的諸特性に基づいて蓄積原価価値を修正し、また、前記修正原価価値を組み込んでいるその隣接しているエージェントに新しい信号を送信する。
【００４３】
前記信号は複数のエージェントを経て伝搬して行くので、単独のエージェントは、その隣接している２つ以上のエージェントから蓄積原価価値を有する信号を受信することができる。特定の実施態様によっては、単独のエージェントは受信したものの中で最良の蓄積原価価値を確定し、さらに前記最良蓄積原価価値を修正して新たな信号の中に組み込む。
【００４４】
または、エージェントは前記蓄積原価価値を全て修正して、最良の蓄積原価価値を確定し、新たな信号の中に組み込むことが出来る。前記蓄積原価価値は、環境の非局所特性をあらわす（例えば、前記受信エージェントは直接感知することはできない物理的環境の特性）。修正後は、前記エージェントは、最良蓄積原価価値を保存し、また、その隣接しているエージェントへ送信するための新たな信号を生成する。
【００４５】
さらに明確に分かりやすい実施例として、複数のエージェントは、火災（または、地雷、侵入者などの他の物体）に対する探索を行うのに利用することができるであろう。ユーザが特定の場所で待機している間に、あるエージェントが火災の検出をすると、複数のエージェントの全体にわたり前記検出エージェントからの信号が伝搬され、前記ユーザへ伝えられる。
【００４６】
蓄積原価価値を有する複数パスは、前記複数のエージェントの中のいずれの任意の場所でも有用であるため、前記ユーザは前記火災に対する最適パスを提供される（例えば、もっとも迅速で、もっとも安全、など）。最良パスは、それに沿って前記信号が前記ユーザに行くルート途中で最も少数のホップを行ったパスとして簡単に指定することができる（前記距離の概算確定として）。一方で、最良パスは、前記パスに沿って前記エージェントに搭載したセンサから導き出されたものなどの場合は、より複雑な情報の機能であっても良い。
【００４７】
一般的には、一旦ある始動エージェントが複数のエージェントの全体にわたって信号を伝搬しはじめると、前記信号は、前記始動エージェントと各受信エージェントとの間の各信号ポップで修正される。その修正は、最新の伝達ホップにわたって物理的環境の特性に関する情報を組み込むという目的を有し、ホップカウントを更新するといった簡単なもの、またはセンサ情報または局所計算の結果を加えた複雑なものでありうる。
【００４８】
各パスに沿っての物理的環境に関する前記情報は、前記パスの品質の尺度を簡単に提供することが出来、また前記パスにおける個々のホップにおける固有の特徴を確定するための分離可能な部分を含めることが可能である。いずれの場合でも、前記蓄積信号は、最良パスの質を具体的に表現する。始動エージェントと受信エージエンとの間の信号の伝搬は、基本的には、各ステップで、前記パスに沿って非局所的特性の局所的概要を提供するために、前記始動エージェントと前記受信エージェントとの間の前記パス全体にわたって起こる分散型計算である。
【００４９】
あるパスに沿っての環境特性に関する詳細情報のレベルは、前記特定の実施態様により広範に変化しうる。例えば、前記環境特性を測定するもっとも簡単な方法の１つは、送信が成功裡に行われたパスに沿って前記始動エージェントと前記受信エージェントの間の前記ホップのカウントとともに、２つのエージェントの間の信号を伝達する能力の存在または不在を利用することである。
【００５０】
少しばかりより複雑な方法は、前記ホップに沿って前記バスの質の尺度として１つのホップにわたって受信した信号の強さを使用することである。これにより、距離を確定するさらに正確な方法が与えられる。その上さらに複雑な諸方法には、前記物理的環境特性を確定するために前記エージエントに搭載されているセンサを使用し、また、それら諸特性を前記信号の中に組み込むことである。
【００５１】
この意味では、前記信号の中に組み込まれた前記蓄積原価価値は、前記送信パス全体にわたって前記諸特性の重み付けされた合計として機能し得る。１つのエージェントは前記始動エージェントから前記受信エージェントへのルート途中で複数の信号を受信しても良い。この場合、前記エージェントはその隣接しているエージェントに対して再送信するためにもっとも好適なパスを表現する信号を選択することになる。
【００５２】
本質的には、１つのエージェントともう１つのエージェントとの間のホップに沿っている前記パスの質に関する尺度は、原価価値であるとして考えることが可能であり、そこでは、その原価が低ければ低いほど、そのパスはより良いということになる。前記ホップカウントからその点まで１つのパスの質が簡単に確定される場合は、一番最近のホップに沿った原価価値は、前記信号が前記パスを越えることができたかどうかを示すバイナリーオン／オフ関数である。複数パスの前記蓄積原価価値は、最良パスを確定するために比較される。
【００５３】
再び、簡単なホップカウント実施例では、前記信号が成功裡に送信される各ホップに対しては、もっとも低い蓄積原価価値を有するパスが最適と考えられるので（もっとも短い、または、もっとも低いホップ、パスがもっとも望ましいと仮定して）前記蓄積原価価値は１だけ増加される。原価価値の概念は、他のセンサとともに使用されている他の実施態様に敷衍されうるものであり、そこでは、前記蓄積原価価値は、環境の多くの態様が潜在的に組み込まれているさらに複雑な尺度である場合もある。変化が起こり易い環境では、原価価値は、ある一定の時間量の後に期限が切れるように指定され、おそらく次の最良のより古い（履歴の）原価価値と取り替えられる可能性がある。
【００５４】
２つのエージェントの間にあるパスの質を簡単に確定することに加えて、２つのエージェントが選択されて、前記複数のエージェントの全体をわたる信号の送信が同時に始められる場合がある。好ましくは、前記２つのエージェントのそれぞれがユニークで識別可能なやり方で信号を送信する（例えば、異なる周波数上で、異なる符号を付けてなど）。前記信号を受信する前記複数のエージェントはそれぞれ、前記２つのエージェントのそれぞれに対する最良パスの質と方向を表現するベクトルを展開し、前記２つのエージェントの間にあるパス上にラインアップさせるために、前記２つのベクトルに従って動く。
【００５５】
前記２つのエージェントの選択はユーザにより明示的に行われうるものであり、または、トリガーを引く出来事の発生時に、パスに沿っている１つのエージェントから第２の（所定の）エージェントまでメッセージを伝搬することにより行いうるものである。次に、他のエージェントに対する収束点として行動するように、もう１つのエージェントが前記ラインアップされたエージェントの中から選択され、また、前記パス上にラインアップされた前記エージェントは前記収束点に向かうよう促される。この挙動の有用性は火災が建物の１つのエリアで検出される場合では例証されている。
【００５６】
火災を検出するエージェントは、前記火災の近くにある狭い回廊を見つけ出すために他のエージェントに対して信号を送る。一旦狭い回廊が発見されると（おそらく最大の運動制限に直面しているものを確定するために近くにあるエージェントを問いただすことにより）、前記回廊の反対側にある別のエージェントへの回廊を通るパスが確定される。次に、前記パスに沿っているもう１つのエージェントが信号を送信するために選択されるが、他のエージェントは前記パスに沿ってラインアップするように動いていく。前記狭い回廊にある前記エージェントはその後、前記パスに沿っている火炎遮断エージェントに対して前記回廊に移動するよう信号を送り、そこでは、それらエージェントは前記建物の他の部分から前記火災を遮断するという試みの中で、限定量の火災遮断化学物質を適用することができる。
【００５７】
さて、本発明の前記一般的な性質がすでに説明されたので、本発明の特異性は図面を参照して説明する。
【００５８】
【発明を実施するための最良の形態】
最初に、本発明により使用される１つの典型的なエージェントの記述と説明図が提示される。次に、始動エージェントともう１つの任意に選択されたエージェントからの最良パス経路を測定するために、前記エージェントを使用し、また、チョーク点上に収束する際に他のエージェントを手助けするために基準エージェントとして２つのエージェントを使用する前記諸方法が、いくつかのフローチャートを参照して論じられる。最後に、エージェント配置のさまざまな態様を図式的に示すために説明図が提示される。
【００５９】
本発明により使用される典型的なエージェントが図１に図示されている。図示されているように、前記エージェント１００には、アンテナとして図示されている、信号を送信し、受信するための通信ケーパビリティ１０２が含まれる。内部には、前記エージェント１００には、１組のトラック１０４またはホイール１０６などの移動性を提供するための手段を制御するだけではなく、受信した信号と前記環境に関するデータをプロセスするためのメモリーとプロセッサが含まれる。
【００６０】
前記エージェント１００にはまた、金属検出器１０８と、ＧＰＳ受信機１１０と、赤外線検出器１１２と、ガスクロマトグラフィ１１４とを含む、図示されている非限定実施例である１つまたはそれ以上のセンサが含まれうる。エージェント１００は、特定の環境内での使用に適応され、または、環境の特定の特性の検出に適応することができる。前記エージェント１００は、物理的環境の特性を計算し、また、前記物理的環境の非局所的特性の分散型表現を展開するのに使用される。
【００６１】
エージェントは、各エージェント１００が少なくとも１つの他のエージェント１００の通信範囲内にあり、任意のエージェントの局所的に隣接しているエージェントとは前記任意のエージェントの直接（非中継）通信内にあるエージェント１００として定義されている。複数のエージェント１００は、全体として、通信プロセッサの分散型ネットワークを形成する。
【００６２】
好ましくは、前記エージェント１００は、所望されるエリアの全体を通してそれらエージェントがほぼ均等に間隔を置いて配置されるようにされ、その能力を最大化して前記エリアを効果的に網羅する。前記エージェントの配置は、特定の環境に望ましい手段ならどんな手段でも可能である。
【００６３】
例えば、前記エージェントが移動可能なものである場合には、前記エージェント１００をそれぞれ他のエージェント１００から等距離にある場所に確実に移動させるためにさまざまなアルゴリズムが使用され得る。前記エージェントが移動可能なものではない場合には、それらエージェントは半無作為なやり方で（例えば、航空機の外にそれらを落とす）分散されて良く、またはそれらエージェントはより注意深く位置決めされても良い。ハイブリッド移動状況マップもまた存在し、そこでは、エージェント１００の１群の一部のみが移動可能である。
【００６４】
本発明の目標は、１つの始動エージェントから前記複数のエージェントの中の他のいずれの１つのエージェントに対してでもその最適パスを確定することである。図２は、少なくとも１つの始動エージェント２００を確定するステップからはじめて、前記最適パスを確定する方法のフロー図であり、そこでは、あるエージェントが前記複数のエージェントを通して伝播される信号を送信し始める。前記始動エージェントは、前記エージェントのユーザからのコマンドにより明示的に決定することが可能である。
【００６５】
例えば、前記ユーザは、それから信号を伝搬しはじめる１つのエージェントを選択するようにレーザーポインターまたは無線通信装置などの遠隔制御を使用することができる。代替的には、前記始動エージェントは、１つの出来事により、暗黙的に決定することができる。例えば、前記エージェントには、煙検出器などのセンサが含まれ、また、エージェントによる煙の検出時には、前記検出エージェントは前記始動エージェントとして指定することができる。１つのエージェントが始動エージェントとして確定されるその正確なメカニズムは特別に重要なことではない。
【００６６】
前記始動エージェントが確定された後、前記始動エージェントは送信ステップ２０２において、隣接しているエージェントに対して信号を送信する。先に述べているように、隣接しているエージェントはいずれかの任意のエージェントの通信範囲内にあるエージェントであると考えられる。一般的には、前記複数のエージェントの１つ１つそれぞれは少なくとも１つの隣接しているエージェントを有することとなる。
【００６７】
前記信号はその後、前記隣接しているエージェントにおいて受信され、また、信号プロセシングステップ２０４において、前記始動エージェントと前記隣接しているエージェントとの間にある前記それぞれのパスと関連している原価価値によりプロセスされる。前記原価価値は、前記パスに沿っている前記ホップカウントなどの前記パスに沿っている送信関連局所特性、または、前記パスによりもたらされる前記送信の質に基づくものであって良い（簡単なケースでは、前記原価価値は、視覚的通信パスのラインに沿って、エージェントが通信することができるかどうかに基づいたものであって良い）。
【００６８】
または、前記原価価値は、温度、湿度、煙の程度などの前記エージェントの近くにある環境の局所的な特性に基づいたものである場合もある。実質的には、前記環境のいずれかの局所的な特性は、前記原価価値の定まったものの中に組み込まれる。さらに、複数の特性の重み付けされた組み合わせを用いることが可能である。
【００６９】
次に、信号を受信している各エージェントは、前記受信した信号に基づき、送信する新たな信号を決定する。前期決定ステップ２０６は特定の実施態様に依存して、前記信号プロセシングステップ２０４の前か、または後かのいずれかで、行うことができる。送信する前記新たな信号に関する決定は一般的には、前記エージェントの隣接しているものから受信した信号の原価価値の比較に基づいている。
【００７０】
例えば、目標が信号を伝搬する準備に当って始動エージェントからのもっとも低いホップカウントパスを決定することであった場合は、前記パスに沿ったエージェントは、もっとも小さなホップカウントを有する信号を決定するために、その隣接しているものから受信した信号を比較することになる。前記エージェントはその後に、前記ホップカウントを増加させ、また、前記増加されたホップカウントを示す新しい信号をその隣接しているものに対して送信することになる。
【００７１】
送信する新たな信号を決定した後、エージェントは本質的に、処理ステップ２０８にある新たな始動エージェント様なものになり、また、その隣接しているものに対して新たな信号を送信する。これらステップは、前記複数のエージェント全体にわたって１つの信号を伝搬するために反復される。
【００７２】
信号は伝搬されると、蓄積原価価値は保持ステップ２１０にある各エージェントにおけるメモリーの中に保持される。前記蓄積原価価値は、各エージェントにおいて、前記始動エージェントに向かって１つのパスに沿って非局所的特徴の局所的な概要を提供し、また、１つの信号の逆伝搬を防ぐためのメカニズムを提供する。例えば、多くの構成においては、エージェントは全方向に向けて一斉発信することが可能である。したがって、第１のエージェントがその現行の原価価値を表わす信号を送信した後、隣接しているエージェントは、前記信号を受信し、前記隣接しているエージェントにおける原価価値を表わす新たな信号を送信することになる。
【００７３】
前記隣接しているエージェントからの送信は全方向性であるので、前記新たな信号は、前記第１のエージェントにより受信されることとなる。前記隣接しているエージェントの原価価値（これは前記第１のエージェントのそれから導き出されたものは、前記第１のそれよりも望ましくはないものと仮定すると、前記第１のエージェントはその原価価値を保持し、前記隣接しているエージェントの前記原価価値を利用することはしない。したがって、始動エージェントからの信号の伝搬は、前記始動エージェントからは一般的には離れていくように方向付けられる。
【００７４】
いずれかの任意のエージェントにおける蓄積原価価値は、信号が始動エージェントからのルート途中で横切ったパスに関する品質の蓄積尺度を表現する。このように、前記蓄積原価価値は、前記始動エージェントといずれかの任意のエージェント間のパスに沿って非局所的な特性（前記エージェントに対する）の局所的な表現（前記エージェントにおける）として機能する。
【００７５】
いくつかのケースでは、受信した信号から送信する新たな信号を確定する際にいずれを使用するを決定するために、保持原価価値をある信号から受信した原価価値と比較することが望ましい場合がある。このようにして、各エージェントにおいて受信された最良蓄積原価価値は保持され、また、毎回信号は前記エージェントに通され、前記受信蓄積原価価値は、いずれが高いのかを決定するために、前記保持原価価値と比較することが可能である。このことが起こるためには、同じ始動エージェントが周期的に、複数のエージェント全体にわたって伝搬されるべき信号を再送信することが想定される。
【００７６】
これは例えば、前記複数のエージェントにより網羅されているエリアにおける変化を追跡するために、各エージェントにおいて前記蓄積原価価値を継続的に更新するために更新プロセスの一部であると望ましい場合かもしれない。
【００７７】
前記信号プロセシングステップ２０４にはまた、時間経過により前記原価価値の期限が切れるようなメカニズムを提供するために、時刻記録の使用が含まれても良い。原価価値が生成されメモリーの中に保持されると、それらは時刻を記録される場合がある。最も近い時点で時刻記録され保持されている原価価値は、信号をプロセスする際に使用することができる。
【００７８】
しかし、所定の時間が経過した後、前記保持されていた原価価値はメモリーから除去される。原価価値の期限が切れることを使用することにより、原価価値情報は一定の時間的な期間にわたって保持されるが、それはまた一定の時間的な量が経過した後には期限が切れて、そのシステムから使われなくなった情報をフラッシュすることを確実に行うことによる１つの長所がもたらされる。
【００７９】
メモリーにおける最終原価価値を単に保存することに加えて、一定の順序で順位付けされた（またはアクセスされた）原価価値とともに、複数の原価価値がスタック様構成でメモリーの中に保存される。前記原価価値の順序は、例えば、最高から最低へ、またはその反対にまたは、何らかの他の順位付け基準を使用することが可能である。
【００８０】
このシナリオでは、新たに受信した原価価値がその時刻も記録され、また、メモリーに保存される。現行の原価価値が期限切れになると、メモリーに保存されていた次の最良原価価値がそれが期限切れになるまで使用され、原価価値がメモリーの中に残っているかぎり、そのようにして続けられていく。このメカニズムは、メモリーに保存されている原価価値が、消去される前に準減衰挙動に追随することを可能にする。
【００８１】
前記エージェントは、信号が受信された方向を確定するための手段を装備することが可能である。これは、複数の指向性赤外線トランシーバーまたは複数の指向性無線受信機などの指向性通信メカニズムを使用することにより達成することが可能である。そのように装備されたエージェントは、隣接しているエージェントへの単一パスに関する方向と原価価値の両方を保存することが可能である。始動エージェントへの前記最適パスを確定するために、最良原価価値を有するものに対する隣接エージェントによる探索ではなくてむしろ各エージェントは、前記パスに沿って次のエージェントに対する方向を保存することができる。
【００８２】
信号が受信された方向はまた、前記原価価値の中に因子化して入れることもできる。例えば、特定の向かい先へ導入されるパスは、他の向かい先へ導入されるものよりもより望ましくない場合が有り得る。したがって、前記方向は、ある一定方向にあるパスの原価を増加させるかまたは減少させるかのいずれかのために、原価価値の中に因数化して入れることができる。前記エージェント１００はまた、位置と方向を確定するためのメカニズムを、衛星測位システム（ＧＰＳ）受信機と方向確定を容易にする手助けとなるコンパスなどをワールド座標系の中に組み込むことができる。
【００８３】
複数の始動エージェントを使用することも可能であり、また、それぞれに対するパスは、どれがより良好であるかを確定するために、比較することが可能である。例えば、エージェントがいくつかの地雷を有していることが知られている１つのエリア全体を通して分散されている場合は、１つまたはそれ以上のエージェントの近くにある地雷の検出に基づいて信号の伝搬開始のトリガーを引くことが望ましい場合がある。各エージェントはその後に、原価価値を有する信号の伝搬を開始することが可能である。
【００８４】
各エージェントにおいては、前記始動エージェントのそれぞれから開始される信号の原価価値は、それぞれへの最適パスだけではなく、総体的な最適パスもまた確定するために比較可能である。その結果として、複数の地雷がエージェントの中に存在している可能性があっても、いずれかの地雷への最適パスを確定することが可能である。この場合、複数のエージェントの中の各エージェントにおいて、各始動エージェントへのパスに沿った原価価値を示すために、ユニークな識別子を使用することが望ましい場合がある。
【００８５】
各始動エージェントへのパスの原価価値はその後、それらの中の最良パスを決定するために各エージェントにおいて比較することが可能である。前記原価価値は、各始動エージェントへの各パスに対して個別に受け渡しされていき、または、それら原価価値は比較され、最良のもののみが受け渡しされていく。言い換えれば、前記複数のエージェントの中の各エージェントは各始動エージェントの前記パスに関する原価価値を受信し、また保持もすることが可能であり、または、受信した最良の原価価値のみを保持することが可能である。
【００８６】
任意に選択されたエージェントからある始動エージエンへのパスを簡単に識別することに加えて、２つのエージェント間にあるパスを利用することが望ましい場合がありうる。例えば、２つの基準エージェント間にある１つのパスに沿ってエージェントをラインアップさせることが望ましい場合がある。１つのパスに沿ってエージェントをラインアップするために利用される諸ステップを示すフロー図が図３に図示されている。
【００８７】
最初に、基準エージェント確定ステップ３００において、基準エージェントとして行動するように、複数のエージェントの中から２つのエージェントを決定することが必要である。次に、１つの信号が前記基準エージェントのそれぞれから送信される。前期信号の送信とパスベクトルの生成は一般的には原価勾配生成ステップ３０２として示される。前記パスベクトルは基本的に複数のエージェントによってカバーされるエリア全体にわたって、かつ始動エージェントの方向に向かって原価価値勾配を生成する。
【００８８】
前記パスベクトルは、したがって運動ベクトル生成ステップ３０６において、ある運動ベクトルを作り出すために利用される。単に合計を作り出すこと以外のメカニズムが使用することができるとは言っても、一般的には、各運動ベクトルは前記パスベクトルの合計を表わす。エージェントは、それらがエージェント収束ステップ３０８において前記２つの基準エージェント間にある１つのパスに沿って位置付けされるまでは継続的に更新される前記運動ベクトルに沿って運動することが促される。
【００８９】
前記２つの基準エージェント間にある前記パスに前記エージェントが動いて近づけば近づくほど、前記運動ベクトルは小さくなり、前記パスベクトル構成成分はどんどん近づき、反対側に対向するものになる。始動エージェントの指定に関しては、基準エージェントの指定は、ユーザコマンドにより明示的に、またはエージェント検出出来事により暗黙的にかのいずれかにより行うことが可能である。
【００９０】
一旦前記エージェントが２つの基準エージェント間にあるパスに沿ってラインアップされると、他のエージェントの行動が可能となる。例えば、前記２つのエージェント間にあるパスに沿ってチョーク点に収束することが前記エージェントにとっては望ましい場合がある。具体的には前記エージェントが１つの建物内部の火災を検出し、また、前記火災を止めるのに最良の可能なやり方で火災抑制剤をデリバリーするように設計される場合、前記火災抑制剤の影響を最大限のものとするために、前記建物の物理的構造部材を使用することが望ましいかもしれない。
【００９１】
最大限の効果を達成するためには、通路が狭いことは、火災抑制剤をデリバリーする観点からは最適である。したがって、狭い通路のいずれかの側に存在する２つの基準エージェントを決めて、前記他のエージェントを、前記基準エージェントの間にある１つのパス上にラインアップさせることが可能である。
【００９２】
次に、動きに対する最小自由度を有するエージェントを決定するために、前記パスに沿ってラインアップさせたエージェントに探索させることが望ましいことになる。そのエージェントは、前記通路のもっとも狭い部分にあると仮定することができ、また、前記他のエージェントは、そのエージェントに向かうように促すことができ、したがって、結果的には、前記収束エージェントの近くにいるエージェントの局所クラスタリングを生じる。前記エージェントはその後、前記狭い通路の中にその火災抑制剤を最大の効果をもってデリバリーして入れることができる。
【００９３】
視覚的通信システムのラインを使用する場合は、前記パスのもっとも狭い部分の検出はまた、隣接しているエージェントからの少数の送信を受信する前記パスに沿って前記エージェントを決定することにより、または、もっとも小さい角度範囲にわたる送信を受信している前記エージェントを決定することにより達成することができるであろう。
【００９４】
【本発明のさらなる形態】
本発明のさまざまな態様を図示するいくつかの説明図が、その使用法に関するさらに完全な理解を提供するために提示される。最初に、地面推論に対する従来のマップをベースにした諸アプローチの方法と、本発明を対照させている図が図４に図示される。
【００９５】
次に、出発エージェントから標的へのパスを確定するために、パスに沿って原価価値を使用する説明が図５に図示される。次に、２つの異なる標的を図示するための２つの明確な信号型の使用法に関する１つの実施例が図６に示されている。２つの基準エージェント間にある１つのパスに沿っているエージェントの整列と、チョーク点への収束が図７と図８にそれぞれ図示されている。
【００９６】
さて、図４を参照すると、図４（ａ）は、中央演算リソース４０４に戻る生データまたは部分的にプロセスされたデータを送信する探索パターンにおける環境４０２を通って移動していくいくつかの強力な移動可能なロボットまたはセンサ４００を使用することにより、空間をマッピングするための従来からの手段を図示している。中央演算リソース４０４はその後、マップ４０６を編集し、また、それから、他のオペレーションのために前記マップを利用することができる。
【００９７】
前記環境に対する動的変化は、前記ロボット４００が探索を繰り返すまでは、取り上げられることはない。それとは対照的に、図４（ｂ）に図示されているように、本発明によりもたらされた技術は、前記マップを具体化し、簡単なエージェント１００の集合の全体にわたってそのマップを広げており、そのそれぞれがその局所性の地面の特徴を確定している。最短経路、ブロックされている経路、偶発性プランを含む大域特性が、エラーに強い分散型のやり方で計算することができ、前記複数のエージェント１００の各メンバーがその結果に貢献していている。
【００９８】
本発明を利用すると、エージェント１００の分散型群が、センシングと計算タスクの両方を同時に行うために環境内に埋め込まれる。先に述べたように、前記エージェント１００はさまざまな手段により、無作為にか、または特定配置により、前記環境の中に置くことが可能である。ある環境の全体にわたって広がったエージェント１００を使用する長所には、故障許容、前記環境における動的変化に対する応答性が含まれる。
【００９９】
また、本発明は、いくつかの非常に高価なロボットよりも、１つの環境の全体に拡がる数多くの比較的小さくて安価なエージェントの使用をもたらす。このように、数多くの簡単なエージェント１００をそこで動的変化を検出するために、非常に簡単な個別の計算能力を備えて、１つの環境の中で用いることが出来る。多くのエージェント１００が使用されるため、単一エージェント１００の損失は、総体的な計算能力についてはほとんど影響を与えず、したがって、故障許容の高い程度のものを保証することができる。
【０１００】
さらに、通信は、もっとも近くに隣接しているエージェント間のみで必要とされ、そのため、高出力通信装置の必要はない。情報はメッセージを中継することによりユーザ４０８に戻されるか、または前記ユーザが前記環境を移動していくと、前記ユーザは近くのロボットからデータを得ることができる。さらに、前記エージェント１００は、１つの環境の全体にわたって積極的に広がって環境を捜索することを可能とするために、運動を提供するための手段を装備することが可能である。
【０１０１】
複数の小さなプロセッサを前記環境の全体に広がらせることにより、計算が分離され、また平行にすることができる程度にまで計算速度における改善を可能にするということは留意すべき重要なことである。既存の地面データベースを使用するパスプラニングは、単一のコンピュータでＯ（ｎｌｏｇｎ）の時間かかるが、ネットワークでは最悪でもＯ（ｎ）の時間である。最良の地面点、そこでは点の質は局所的に決定することができるものであり、それを見出すには、単一のプロセッサではＯ（ｎ）時間かかるが、ネットワークではＯ（ｌｏｇｎ）時間であり、その場合は、その時間の大半はその結果を整理するためのものである。したがって、大量のデータでは、分散型ネットワークにある複数のプロセッサを使用することにより、計算時間を有意に減少させる。
【０１０２】
出発エージェントから標的へのパスを確定するために、パスに沿った原価価値の使用法に関する説明図がその次に図５に図示されている。本発明を使って１つの建物の内部の出発点５０２から標的５００への最良パスを確定することが可能である。図示されている実施例では、その適用は１つの建物内部の下検分であり、また、標的の位置を決めることである。
【０１０３】
図５（ａ）は、前記標的５００を見つけ出しているいくつかのメンバーとともに、建物５０４の中に散開した一群のエージェント１００を図示している一種の地形図である。前記エージェント１００は、前記建物５０４の中のほぼ均一な空間取りを達成していたことを特記しておく。この場合、前記エージェント１００には、前記標的を検出するために適当なセンサが含められる。
【０１０４】
例えば、前記標的５００が侵入した人間である場合は、センサは体温、動き、雑音を検出するために設けることが可能である。標的までの距離をコード化しているその原価価値を有するメッセージ伝搬様勾配を図示している説明図が図５（ｂ）に示されている。前記標的Ｔ５００を検出する際には、前記標的Ｔ５００にもっとも近いエージェントである前記始動エージェント５０６は、１００としいうゼロ原価価値を有する信号を送信する。この実施例は、１００でスタートし、前記始動エージェント５０６から離れると、各ホップ毎に減少していく原価価値を図示している。
【０１０５】
前記原価価値は、前記始動エージェント５０６から、各ホップ毎に非常に容易に増加させることができ、または、先に述べたように、局所センサ測定を表わす因数により変化させることができるであろう。これは、送信エージェントまでの距離でありうるものであり、受信されたメッセージの信号の強さから推定することができるであろう。
【０１０６】
本実施例では、壁やドアなどの内部構造がその構造表示の中に組み込まれているため、エージェント間の視覚的通信ラインの使用が望ましい。始動エージェント５０６の視覚ライン内にある隣接しているエージェントは、前記始動エージェント５０６からの信号を受信し、原価価値を減少させ、前記減少させた原価価値を組み込んでいる新しい信号を生成し、それらがあたかも前記始動エージェントであるかのように、前記新しい信号を送信する。
【０１０７】
前記ラインに沿って全てのエージェントが同じ原価価値を示している等強度原価価値ラインが、１００でスタートし、また、前記始動エージェント５０６から前記出発点５０２の一番近くにある前記エージェントまで９２に減少している前記ラインにより示されている。これは、複数のエージェントの全体にわたる原価価値がいかにして勾配様マップを形成するのかを示している。この場合、ラインは、標的から伝搬して離れていく信号の勾配を示しており、一方、原価価値の伝搬はＤｉｊｋｓｔｒａの最短パスアルゴリズムにおける距離情報の伝搬と同一である。前記視覚的通信ラインは建物内のトラバース性に固有にリンクされていて、原価価値は、標的までのほぼ同様のトラバース距離を反映している。等強度ライン上にある全てのエージェントは検出された標的Ｔ５００からはほぼ等距離にある。前期原価価値勾配を上昇するエージェントを辿っていくことにより、標的へのもっとも効率的なパスを発見することが期待できる。
【０１０８】
受信したメッセージが発せられた方向を検出するための手段が前記エージェント１００に設けられることは望ましい場合もある。新しいメッセージが受信されるときにはいつでも、原価価値は更新され、また、メッセージの供給元への対応する方向が保存される。この情報を使用すると、各エージェントにとっては最良のものが標的に向かっていく局所方向を示すことが可能となる。
【０１０９】
原価価値メッセージに基づく発信物に対する方向の信号を送信し、また、前記標的への方向付けられたパスを作り出すエージェントを図示する図が図５（ｃ）に示されている。前記出発点５０２に始まり、前記標的Ｔ５００まで前記ユーザを導いていく１つのパスに繋がる前記情報がユーザに提供され、一方、前記標的位置と前記建物５０４内の通路についての情報が組み込まれる。
【０１１０】
２つの異なる標的を検出するための２つの明確な信号タイプの使用法に関する１つの実施例が図６に図示されている。図６には２つの明確な信号のタイプが使われているが、各信号タイプが明確な原価価値勾配を形成するものであれば、信号タイプのいずれの数のものも使用することが可能である。前記原価価値勾配には、各エージェントと標的または複数の標的との間の距離と方向が含まれる。２つのメッセージタイプを使用することにより、各エージェント１００には、どの標的が一番近いのかを確定することが可能になることが図６（ａ）に図示されている。エージェント６００、６０２、６０４、６０８はそれぞれ、第１標的６１０または第２標的６１２のどれが一番近いのかを検出する。
【０１１１】
このことにより、ユーザはどの標的が一番近いのかを確定することが可能になる。この場合、エージェント６００は第１標的６１０を検出する。その結果生じる第１標的６１０までの距離は、前記他のエージェント６０２、６０４、６０６、６０８のそれぞれにより計算される。同時に、エージェント６０８は第２標的６１２を検出する。その結果生じる第２標的６１２までの距離もまた前記他のエージェント６０２、６０４、６０６、６０８のそれぞれにより計算される。
【０１１２】
明確なメッセージタイプは各標的用に使用することが可能である。ホップに関しては、前記距離は各エージェント毎に示され、スラッシュで区切られていて、第１距離は第１標的６１０からの距離を示しており、また、第２距離は第２標的６１２からの距離を示している。各標的に対する明確なメッセージタイプを使用すると、どの標的がそのエージェントに一番近いかを測定するために各エージェントにおける前記原価価値を容易に比較することができる。
【０１１３】
代替的なものとして、各エージェントが一番近くの標的までの距離を示している一つのメッセージタイプを使用して図示している図が図６（ｂ）に示されている。この場合、同じメッセージタイプが両方の標的に対して使用することが可能であり、また、各エージェントは前記一番近いエージェントまでの距離のみを保存することになる。図６（ｂ）に図示されているように、一番近い標的までの方向もまた見つけることができるように、上に論じられているように、指向性情報を保持しているメッセージスキームを使用することは助けとなる。
【０１１４】
２つの基準エージェント間にあるパスに沿っているエージェントの整列と、チョーク点への収束が図７と図８にそれぞれ図示されている。図７に図示されているように、出発エージェント７００と標的エージェント７０２は、基準エージェントとして指定されている。前記基準エージェントはそれぞれ、他のエージェントに対して明確なメッセージタイプを発している。前記他のエージェント７０４は前記基準エージェントのそれぞれに向かうパスベクトルを生成する。
【０１１５】
前記パスベクトルはその後、前記他のエージェント７０４のそれぞれに対する運動ベクトル、これは前記出発エージェント７００と前記標的エージェント７０２との間にある１つのパス７０６上にラインアップさせるために、これに沿って前記他のエージェント７０４が移動することができるものであるが、を生成するのに使用される。矢印は前記エージェント７０４と前記基準エージェントとの間の前記パスベクトルを表わすことを記しておく。
【０１１６】
前記エージェント７０４が前記パス７０６に向かって移動していくと、前記エージェント７０４が前記パスに到達し、また、前記出発エージェント７００と前記標的エージェント７０２に向かうのとは反対の方向に、前記パス７０６に平行に前記ベクトルは整列されるまでには前記パスベクトルは更新される。
【０１１７】
図８に提示されている説明図は、チョーク点に沿って収束させるようエージェントを準備するために、基準エージェントを選択することと、それらの間にある１つのパスに沿っている他のエージェントを整列させることを利用している。チョーク点に沿う収束は前記基準エージェント間にある１つのパスに沿って前記エージェントを最初に整列させることをせずに行うことが可能であるが、前記パスに沿った整列により前記パスに沿って非常に高い密度でエージェントを供給し、適切なチョーク点の選択のために、前記パスに沿う諸特性のより精密な検出を可能にしている。
【０１１８】
図８（ａ）に図示されているように、出発点８００にあるエージェントと標的点８０２にあるエージェントは基準エージェントとして指定され、一方、それらの間にあるパス８０４は、点線により示されている。この場合、前記エージェント１００は、前記エージェント１００上で矢印により示されているように、それらに一番近い前記バスに沿う１つの点に向かって移動する。
【０１１９】
図８（ｂ）では、前記エージェントは前記パス８０４に沿ってほとんど全部が整列している。この場合、前記出発点と前記標的点との間にある一番狭いエリアでのブロッキングは、ブロッキングエリアが前記標的点に一番近いので望ましいものであると考えられる。
【０１２０】
したがって、２つのチョーク点８０６と８０８が指定され、また前記エージェントはそれぞれに対して収束する。この場合の前記チョーク点を決定するのに使用される基準は、（１）回廊（エリア）の狭さと、（２）標的点に対してブロックする前記回廊の近さであることは留意すべき点である。
【０１２１】
したがって、エージェントが１つも前記エリアには収まらなくなるまでチョーク点８０６上に最初にエージェントは収束し、その後チョーク点８０８上に収束する。もう１つの狭いパスエリア８１０は前記出発点８００と前記標的点８０２との間にある前記パスに沿ってあるわけではなく、そこにはエージェントは収束しない。収束のためのエリアは多くの異なる基準に基づいて多くの異なるやり方で選択することが可能である。例えば、狭さは、エリアをブロッキングするための基準でありうるが、急勾配はエリアをブロッキングすること逆らった基準となりうる。したがって、一旦１つのエリアが十分に傾斜している場合は、それが狭いという事実は問題ではなくなり、またブロックされるべき必要もなくなる場合もある。
【０１２２】
本発明の組込みに関連する全ての論理は典型的には、エージェントのメモリーに保存されているソフトウェアの形態で具体化され、またプロセッサにより運用される。しかし、本発明の前記論理はソフトウェアを使用するというよりもハードウェアにも固定されうる。前記ソフトウェアのさらに完全な説明と、その好ましい実施態様を提供するために、コンパクトディスクが、本明細書には、添付書類として提供されており、また、本明細書に述べられている機能のいくつかを実施するソフトウェアが含まれている。
【図面の簡単な説明】
本発明のこうした特徴ならびに他の特徴と態様および長所は、以下の説明と、付随の請求項と、添付されている図面とに注目すればより良く理解されることであろう。すなわち、
【図１】図１は、本発明に関連して使用される典型的なエージェントの１つの実施例を図解したものである；
【図２】図２は、本発明の前記方法における前記ステップを図示するフロー図であり、少なくとも１つの始動エージェントに対する最良原価価値パスを確定するための方法を示す；
【図３】図３は、２つの基準エージェントの間の１つのパスに沿ってエージェントを整列させるのに利用される前記方法の付加的なステップを図示するフロー図である；
【図４】図４（ａ）は、地面推論の通常のマップをベースにしたアプローチの方法を図示している説明図である；
図４（ｂ）は、図４（ａ）に図示されている通常のアプローチの方法と対照させて、本発明の前記地面推論アプローチの方法を図示している説明図である；
【図５】図５（ａ）は、いくつかのメンバーがその標的を捜して、１つの建物の中に散在しているエージェントの一群を図示している地形図である；
図５（ｂ）は、標的に対する距離をコード化してその原価価値を伴う勾配として、伝搬されているメッセージを図示している説明図である；
図５（ｃ）は、原価価値メッセージに基づく前記標的に対する方向を通信し、標的に対するパスの方向を創生するエージェントを示した図である；
【図６】図６（ａ）は、各エージェントがどの標的がもっとも近いのかを確定するのを可能にする２つのメッセージのタイプを使用することを示している図である；
図６（ｂ）は、各エージェントがもっとも近くにある標的に対する方向を示すのに１つのメッセージタイプを使用することを示した図である；
【図７】図７は、２つの基準エージェント間にある１つのパスに沿ってエージェントを整列させるためにパスベクトルを使用することを示す説明図である；
【図８】図８（ａ）は、２つの基準エージェント間にある１つのパスを作り出すこと、ならびに、前記パスに沿ってチョーク点に他のエージェントを収束させることを準備するために、前記パスに他のエージェントを促すようなパスベクトルから運動ベクトルを生成することを示す説明図である；
図８（ｂ）は、一旦前記エージェントが前記パスに沿って整列した場合の基準エージェント間のパスを示す説明図である；
図８（ｃ）は、前記パスに沿ってチョーク点にエージェントが収束した後の前記エージェントの説明図である；
【符号の説明】
１００：エージェント、１０２：通信能力、１０４：１組のトラック
１０６：１組のホィール、１０８：金属検出器、１１０：ＧＰＳ受信機
１１２：赤外線検出器、１１４：ガスクロマトグラフィ、
４００：移動可能なロボットまたはセンサ、４０２：環境、
４０４：中央演算リソース、４０６：マップ、４０８：ユーザ
５００：標的、５０２：出発点、５０４：ビルディング、
５０６：始動エージェント、
６００、６０２，６０４，６０６，６０８：エージェント、
６１０：第１標的、６１２：第２標的、
７００：出発エージェント、７０２：出発エージェント、
７０４：他のエージェント、７０６：パス、
８００：出発点、９０２：標的点、８０４：パス、[0001]
[Statement of rights to be attributed to the federal]
The present invention relates to research conducted within a contract with the United States Government for a DARPA ITO Agreement No. N66001-99-C-8514, "Pharmone Robotics", The invention has certain rights.
[Priority claim]
No. 60 / 217,232, filed in the United States on July 10, 2000, entitled "Emergent Movement Control for Large Group of Robots", and No. 60 / 217,226, filed in the United States on July 10, 2000, entitled "Terrain Reasoning with Distributed Embedded Processing Elements", entitled "Terrain Reasoning with Distributed Embedded Processing Elements."
Claim the benefits of priority.
[0002]
TECHNICAL FIELD OF THE INVENTION
The invention relates to the field of calculating properties of a physical environment. More specifically, this disclosure presents a method for calculating characteristics of a physical environment using multiple agents forming a distributed network embedded within the environment.
[0003]
Problems to be solved by the prior art and the invention
When it comes to path planning and approaches to ground analysis, over the past few decades, there has been interest in single or parallel processor solutions that operate on internal maps containing ground features. Traditionally, ground analysis has involved one of two forms: 1) a single mobile agent (eg, a robot) or a small number of agents that collect ground information by flying over; And / or use pre-built digital maps to assist in path planning or other ground related decisions. 2) As pre-built digital maps useful for geometric calculations, ground / environment databases (simulations as below) (For example, a simulated ground vehicle or a simulated enemy in a computer game).
[0004]
Typical calculations include determining the shortest path to a destination, determining a location with high visibility on the ground, and determining a hidden route from a known enemy location. In such traditional approaches to route planning and ground logic, relevant data is taken from the real world to create a machine-readable map. Ground / environment logic based on local sensing provides very limited access to information about the environment.
[0005]
Pre-building a map of the environment with solutions from the natural world provides much more information and can be updated with information from the robot's own sensors. Such maps are usually represented by an array of ground characteristics, including traversability and blocked routes, which can be translated into a corresponding array representing the difficulty and cost of traveling across a particular ground path. .
[0006]
The cost array can then be analyzed to obtain a new map showing the minimum cost path between designated points on the map. In this traditional approach, the separate and distinct steps of sensing the environmental characteristics of the area, sending data to a central point, and generating a map, are performed when all minimum cost path calculations are performed. Must be done before As a result, in situations where the environment is changing rapidly or sensing and map generation cannot be performed in a timely manner, it is often difficult to determine the minimum cost path incorporating the latest information. is there.
[0007]
Current means of mapping space include the use of several powerful mobile robots that navigate the environment in a search pattern and send raw or partially processed data back to a central processing resource. Included. The central resource edits the map and then performs operations on the map. Dramatic changes to the environment are not captured until the robot repeats its search.
[0008]
In some communication networks, calculations are performed to determine the lowest cost path for communication purposes (eg, to minimize delay due to transmission energy or signal retransmission). These approaches are particularly relevant to telecommunications issues, and they do not attempt to derive a correspondence between communication efficiency and environmental characteristics that might affect mobility. . Information relevant to some of these approaches can be found in U.S. Patent Nos. 5,561,790 and 5,233,604.
[0009]
Thus, a number of small, inexpensive agents that require only local communication with adjacent agents are used to provide a means for determining the optimal path over or through an area of an environment. It would be desirable to provide. The non-local characteristics accumulated along one path are signals transmitted across the path by making a series of hops through multiple agents in a manner that provides a local representation for each agent. Would. Information at the agent could be updated immediately by sending another signal along the path.
[0010]
[Means for Solving the Invention]
The invention disclosed herein provides a method and apparatus for calculating characteristics of a physical environment using a plurality of agents located within the physical environment. Each deployed agent includes a communication capability, a processor coupled to the communication capability, and a memory connected to the processor. The plurality of agents form a distributed representation of the non-local characteristics of the physical environment by utilizing the physical characteristics of the inter-agent communication.
[0011]
Triggering at least one initiating agent; sending a signal from the initiating agent to an agent adjacent to the initiating agent; communicating between the initiating agent and each adjacent agent Processing signals at each of said neighboring agents according to a cost value based on local characteristics along respective paths between; selecting a new signal to transmit from the received signals; Treating an agent as a triggering agent of the processed signal and repeating the transmitting and processing steps; at least one agent representing information about a non-local property of a physical environment based on the received signal. The storage cost value includes the step of locally retained.
[0012]
Thus, a signal propagates from the initiating agent to each of the plurality of agents, and the accumulated cost value is stored at each agent to provide a measure of quality for the best path from the agent to the initiating agent. You.
[0013]
The cost value is expressed as a simple hop count, and the signal propagation may be repeated from the initiating agent to multiple agents at each hop. The accumulated cost value is typically used to determine a new signal to be transmitted from the received signal.
[0014]
In addition, the cost value is retained from past signals and can be compared to the cost value from current signals to determine the cost value of processing the signal. The signal is typically passed along the lines of a visual path, which provides a general expression of the traversability along one path.
[0015]
The agent may also be augmented by providing the agent with a means for preserving the direction in which a particular signal has been received over one communication path, and the difference between the two agents The cost value for a path can be stored with the direction corresponding to the path with the best cost.
[0016]
Further, when the cost value is stored in the agent's memory, the time can be recorded, and the most recently stored cost value is specified to be used in processing the signal. And the cost value is removed from memory after a predetermined time has elapsed. The cost value can also be ranked from best to worst over time, and the best cost value is used in processing the signal.
[0017]
In this case, when the time value of the cost value specified to be used in processing the signal expires and the cost value is removed from memory, the next best cost A value is specified to be used in processing the signal.
[0018]
In another embodiment of the invention, the agents can be configured to perform an alignment, wherein two agents select from among the plurality of agents to act as reference agents. Generating a cost gradient comprising a vector representing an optimal cost path between agents among the plurality of agents and a cache-based agent; the vectors are summed at each agent to create a motion vector for the agent. And the agent is moved along the motion vector to align the agent along one path between the reference agents.
[0019]
In the aligning step, for example, a trigger can be pulled upon occurrence of a triggering event such as detection of an intruder. Once the agents are aligned along one path, they are converged to one choke point by selecting a convergence agent and prompting other agents to head towards the convergence agent. Is possible, resulting in local clustering of agents near the convergence agent.
[0020]
The agents can also incorporate sensors connected to the processors to provide additional information to the processors for generation of the cost value. For example, a smoke detector can be used to factor the extent of smoke in an area into the cost value along one path.
[0021]
Agents designed for use with the present invention can incorporate the logic used to implement the present invention in the form of software, or the logic can be encoded in hardware.
The invention relates to the field of calculating properties of a physical environment.
The following description is presented to enable one of ordinary skill in the art to make, use, and incorporate the invention in the context of a particular application. Various modifications as well as various uses in different applications will be readily apparent to those skilled in the art, and the general principles defined herein will apply to a wide range of embodiments. It is possible.
[0022]
Accordingly, the present invention is not intended to be limited to the embodiments shown, but is to be accorded the widest scope consistent with the principles and novel features disclosed herein. Should be tolerated.
[0023]
To provide a reference work frame, a description of terms used in the description and claims is first provided as a central source for the reader. A brief introduction is then provided in the form of an illustrative description of the invention to provide a conceptual understanding prior to elaborating its specific details.
[0024]
[Glossary]
Before describing the specific details of the present invention, it is instructive to provide a centralized listing of various terms used in the specification and claims.
[0025]
Agent-As described herein, the term agent, in its most general sense, refers to a unit that includes a processor, communication capabilities, and memory. The agent also includes one or more means for movement and means for detecting its orientation relative to other objects in the agent placement system (compass, directional transmitter / receiver, etc.). , A distance measuring device (such as a laser, infrared, ultrasonic, or radar range finder) and one or more sensors.
[0026]
In even more straightforward terms, an agent can be, for example, a robot with an infrared transmitter / receiver and a set of wheels, where the infrared transmitter / receiver is connected to the robot and other robots. Used to measure the distance and direction to the robot. In more complex situations, the same robot may also have antennas to transmit and receive wireless communications, sensors to sense various conditions in the environment (temperature, visibility, etc.), flight, Alternative means of movement, such as wings and propellers for the vehicle, may be included.
[0027]
Choke point-In certain situations, it may be desirable to move a group of agents toward a point. For example, if an intruder is moving around in one area, multiple agents can be sent in the vicinity of the intruder. On the other hand, in areas with narrow corridors, it may be desirable to restrict the passage of intruders.
[0028]
Thus, an agent may be sent near a corridor along the intruder's path to limit the intruder's movement. Thus, a choke point is generally a convergence point of a group of agents around one selected agent, blocking a passage or placing a group of agents densely at one selected locality. Can be used to
[0029]
Communication Capability—The term communication capability as used herein generally refers to a local communication mechanism that facilitates communication between an agent and its local neighbors. Any communication mechanism applicable to a particular environment may be used.
[0030]
For example, in a clean atmosphere, a directional infrared communication mechanism may be particularly useful because of its directivity. On the other hand, in a water or liquid environment, sonic communication mechanisms may be more useful.
[0031]
Cost value-The term cost value refers to a measure of one or more physical characteristics along a path between agents. For example, in a very simple case, the cost value can be a measure of whether one agent was able to signal another agent, in which case the hop count would be the cost value. Could function as
[0032]
On the other hand, more complex cost values may be developed (for example) as a weighted sum of several physical properties, including, but not limited to, signal transmission quality, temperature, wind speed, Includes the angle of the path between the agents. Many cost value schemes can be deployed as needed for a particular implementation.
[0033]
The term local cost value is intended to refer to a measure of local environmental conditions along the communication path between two agents. The term accumulated cost value is intended to refer to a measure of non-local environmental conditions along the path between two or more agents. Although the terms local cost value and accumulated cost value are defined herein, the exact application of this term can also be inferred from its use.
[0034]
Also, the cost value generated at each of the plurality of agents responsive to the signal transmitted by the initiating agent forms a cost gradient pattern across the plurality of agents. Combined with the directional information, the cost gradient pattern has many features of the oriented graph, and many calculations performed using the oriented graph apply to the plurality of agents. It is possible to do.
[0035]
Local-Agents that are within direct (usually line-of-sight) communication and the sensing range of an agent are considered to be local and are called its neighbors. The number of local (adjacent) agents for one arbitrary agent may vary depending on the number of agents present in one arbitrary area. It is desirable to use signals only between local agents to conserve transmission power.
[0036]
For the case where it is used with respect to the properties of the environment, the properties that are local to one agent are the properties that can be sensed by said agent. Non-local physical / environmental properties for an agent generally refer to those physical / environmental properties that the agent cannot directly sense.
[0037]
Means for Providing Mobility-To provide mobility that needs to move around in the environment (assuming movement is desirable), agents need something to facilitate mobility. I do. Any known controllable movement mechanism can be used, as appropriate for the environment in which the agent is to be used.
[0038]
For example, on the ground, agents would typically have legs or wheels, while in the air they would have propellers and wings. On the other hand, in a liquid environment, the agent would be equipped with propellers and fins, and in space, the agent would be equipped with a jet or rocket. Means for providing mobility are coupled to and controlled by the processor.
[0039]
Sensors-In addition to means for detecting distance and direction from other agents, agents can include sensors associated with their status as well as sensors coupled to their processors to collect information about their environment. It is. Examples of sensors that can be used in the agents of the present invention include, but are not limited to, smoke detectors, audio detectors, visual sensors, seismic detectors, heat / infrared detectors, An article sensing device and a satellite based positioning system (GPS) sensor are included.
[0040]
Sensor information can be incorporated into the cost value along one path, or can be used to trigger various behaviors of the plurality of agents, such as convergence to a choke point .
[0041]
[Introduction]
The present invention generally provides a means for using a plurality of agents distributed in an area to perform terrain calculations on the area. Properties can be determined by creating a pattern of communication between the nearest neighbors, where the source agent sends a signal to each of its neighbors that includes the accumulated cost value. Send
[0042]
Each adjacent agent receives the signal from the source and modifies the accumulated cost value based on the physical characteristics between the source and the respective adjacent agent. And sends a new signal to its neighboring agent incorporating the modified cost value.
[0043]
As the signal propagates through multiple agents, a single agent can receive a signal having an accumulated cost value from two or more of its neighbors. In some particular embodiments, a single agent determines the best accumulated cost value among those received, and further modifies the best accumulated cost value into a new signal.
[0044]
Alternatively, the agent can modify all of the accumulated cost values to determine the best accumulated cost value and incorporate it into the new signal. The accumulated cost value represents a non-local characteristic of the environment (eg, a characteristic of the physical environment that cannot be directly sensed by the receiving agent). After modification, the agent saves the best accumulated cost value and generates a new signal to send to its neighbors.
[0045]
As a clearer example, multiple agents could be used to conduct a search for fires (or other objects such as mines, intruders, etc.). If an agent detects a fire while the user is waiting at a specific location, a signal from the detection agent is propagated throughout the plurality of agents and transmitted to the user.
[0046]
The user is provided with an optimal path to the fire (e.g., the fastest, most secure, etc.) since multiple paths with accumulated cost value are useful anywhere in the multiple agents. ). The best path can easily be designated as the path along which the signal has taken the fewest hops along the route to the user (as an estimate of the distance). On the other hand, if the best path is derived from a sensor mounted on the agent along the path, it may be a function of more complicated information.
[0047]
Generally, once a triggering agent begins to propagate a signal across multiple agents, the signal is modified at each signal pop between the triggering agent and each receiving agent. The modifications have the purpose of incorporating information about the characteristics of the physical environment across the latest conveyed hops, and can be as simple as updating the hop count, or as complex as adding sensor information or local computation results. sell.
[0048]
The information about the physical environment along each path can easily provide a measure of the quality of the path and provide a separable part to determine the unique characteristics at each hop in the path. It is possible to include. In any case, the stored signal specifically represents the quality of the best path. Propagation of the signal between the initiating agent and the receiving agent is basically performed by the initiating agent and the receiving agent at each step to provide a local overview of the non-local characteristics along the path. Is a distributed computation that occurs over the entire path between
[0049]
The level of detail information about environmental characteristics along a path may vary widely depending on the particular implementation. For example, one of the easiest ways to measure the environmental properties is to count the number of hops between the initiating agent and the receiving agent along the path of a successful transmission between two agents. Is to take advantage of the presence or absence of the ability to transmit signals.
[0050]
A slightly more complicated method is to use the strength of the signal received over one hop as a measure of the quality of the bus along the hop. This provides a more accurate way of determining the distance. Still more complex methods are to use sensors mounted on the agent to determine the physical environmental properties and to incorporate those properties into the signal.
[0051]
In this sense, the stored cost value embedded in the signal may serve as a weighted sum of the properties across the transmission path. One agent may receive a plurality of signals on the route from the starting agent to the receiving agent. In this case, the agent will select the signal representing the most suitable path to retransmit to its neighbors.
[0052]
In essence, the measure of the quality of the path along the hop between one agent and another agent can be considered as cost value, where the cost is low The lower, the better the path. If the quality of a path is easily determined from the hop count to that point, the cost value along the most recent hop is a binary on / off indicating whether the signal was able to cross the path. Off function. The accumulated cost values of multiple passes are compared to determine the best pass.
[0053]
Again, in a simple hop count embodiment, for each hop for which the signal is transmitted successfully, the path with the lowest accumulated cost value is considered optimal (the shortest or lowest hop, The accumulated cost value is increased by one (assuming the path is most desirable). The concept of cost value can be extended to other embodiments that are used with other sensors, where the accumulated cost value is more complex than many aspects of the environment are potentially embedded. It may be a good measure. In a volatile environment, the cost value may be designated to expire after a certain amount of time, possibly replacing the next best older (historical) cost value.
[0054]
In addition to simply determining the quality of the path between two agents, two agents may be selected and transmission of signals across the plurality of agents may be initiated simultaneously. Preferably, each of the two agents transmits signals in a unique and identifiable manner (eg, on different frequencies, with different signs, etc.). Each of the plurality of agents receiving the signal develops a vector representing the quality and direction of the best path for each of the two agents, and to line up on the path between the two agents, Move according to the two vectors.
[0055]
The selection of the two agents can be made explicitly by the user, or when a triggering event occurs, propagate the message from one agent along the path to a second (predetermined) agent. It can be done by doing. Next, another agent is selected from the lined-up agents to act as a convergence point for another agent, and the agents lined up on the path go to the convergence point You will be prompted to: The usefulness of this behavior is illustrated in the case where a fire is detected in one area of a building.
[0056]
The agent detecting the fire signals other agents to find a narrow corridor near the fire. Once a narrow corridor is discovered (perhaps by asking a nearby agent to determine what is facing the greatest motion limitation), pass through the corridor to another agent on the other side of the corridor The path is determined. Next, another agent along the path is selected to send a signal, while the other agents move to line up along the path. The agents in the narrow corridor then signal a flame shut-off agent along the path to move to the corridor, where they shut off the fire from the rest of the building In that attempt, a limited amount of fire-blocking chemicals can be applied.
[0057]
Now that the general nature of the invention has been described, the specificity of the invention will be described with reference to the drawings.
[0058]
BEST MODE FOR CARRYING OUT THE INVENTION
First, a description and illustration of one exemplary agent used by the present invention is presented. Then, use the agent to measure the best path path from the initiating agent and another arbitrarily selected agent, and to help other agents in converging on the choke point. The above methods of using two agents as reference agents will be discussed with reference to some flowcharts. Finally, an illustration is presented to graphically illustrate various aspects of agent placement.
[0059]
A typical agent used in accordance with the present invention is illustrated in FIG. As shown, the agent 100 includes a communication capability 102 for transmitting and receiving signals, shown as an antenna. Internally, the agent 100 has a memory for processing received signals and data about the environment, as well as controlling the means for providing mobility, such as a set of trucks 104 or wheels 106. Processor included.
[0060]
The agent 100 also includes one or more sensors, which are non-limiting examples shown, including a metal detector 108, a GPS receiver 110, an infrared detector 112, and a gas chromatography 114. May be included. The agent 100 may be adapted for use within a particular environment or may be adapted to detect particular characteristics of the environment. The agent 100 is used to calculate properties of the physical environment and develop a distributed representation of the non-local properties of the physical environment.
[0061]
An agent may be such that each agent 100 is within communication range of at least one other agent 100 and an agent that is within direct (non-relaying) communication of any agent with any locally adjacent agent of that agent. It is defined as 100. The plurality of agents 100 form a distributed network of communication processors as a whole.
[0062]
Preferably, the agents 100 are arranged such that they are substantially evenly spaced throughout the desired area, maximizing its ability to effectively cover the area. The placement of the agent can be by any means desired for the particular environment.
[0063]
For example, if the agent is mobile, various algorithms may be used to ensure that each agent 100 is moved to a location equidistant from the other agents 100. If the agents are not mobile, they may be dispersed in a semi-random manner (eg, drop them out of the aircraft), or they may be positioned more carefully. There is also a hybrid travel situation map, where only a portion of a group of agents 100 can travel.
[0064]
The goal of the present invention is to determine the optimal path from one initiating agent to any one of the agents. FIG. 2 is a flow diagram of a method for determining the optimal path, beginning with the step of determining at least one initiating agent 200, wherein an agent begins transmitting signals propagated through the plurality of agents. The initiating agent can be explicitly determined by a command from the user of the agent.
[0065]
For example, the user can use a remote control such as a laser pointer or wireless communication device to select one agent from which to begin propagating the signal. Alternatively, the initiator can be determined implicitly by one event. For example, the agent includes a sensor such as a smoke detector, and when detecting smoke by the agent, the detection agent may be designated as the starting agent. The exact mechanism by which one agent is established as the initiating agent is not particularly important.
[0066]
After the initiating agent has been determined, the initiating agent sends a signal to neighboring agents in a transmitting step 202. As mentioned earlier, an adjacent agent is considered to be an agent within the communication range of any given agent. Generally, each one of the plurality of agents will have at least one adjacent agent.
[0067]
The signal is then received at the neighboring agent, and in a signal processing step 204 according to the cost value associated with the respective path between the initiating agent and the neighboring agent. Be processed. The cost value may be based on transmission-related local characteristics along the path, such as the hop count along the path, or the quality of the transmission provided by the path (in a simple case, , The cost value may be based on whether the agent can communicate along the line of the visual communication path).
[0068]
Alternatively, the cost value may be based on local characteristics of the environment near the agent, such as temperature, humidity, degree of smoke, and the like. In essence, any local characteristics of the environment are incorporated into the fixed cost value. Further, it is possible to use a weighted combination of multiple properties.
[0069]
Next, each agent receiving the signal determines a new signal to be transmitted based on the received signal. The determining step 206 can be performed either before or after the signal processing step 204, depending on the particular implementation. The decision on the new signal to send is generally based on a comparison of the cost value of the signal received from the agent's neighbors.
[0070]
For example, if the goal was to determine the lowest hop count path from the initiating agent in preparation for propagating the signal, then the agent along the path would determine the signal with the lowest hop count. Then, the signals received from the adjacent ones are compared. The agent will then increase the hop count and send a new signal to the neighbor indicating the increased hop count.
[0071]
After determining the new signal to send, the agent essentially becomes like a new initiating agent in process step 208, and sends a new signal to its neighbors. These steps are repeated to propagate one signal across the plurality of agents.
[0072]
As the signal is propagated, the accumulated cost value is held in memory at each agent in a holding step 210. The accumulated cost value provides, at each agent, a local overview of non-local features along one path towards the initiating agent and also provides a mechanism to prevent back propagation of one signal I do. For example, in many configurations, agents can broadcast in all directions. Thus, after the first agent sends a signal representing its current cost value, the neighboring agent receives the signal and sends a new signal representing the cost value at the neighboring agent. Will be.
[0073]
Since the transmission from the adjacent agent is omni-directional, the new signal will be received by the first agent. Assuming that the cost value of the adjacent agent (which is derived from that of the first agent is less desirable than that of the first agent, the first agent It does not hold and does not make use of the cost value of the neighboring agent, so the propagation of signals from the initiating agent is generally directed away from the initiating agent.
[0074]
The accumulated cost value at any given agent represents an accumulated measure of quality for the path that the signal traversed along the route from the initiating agent. In this way, the accumulated cost value acts as a local representation (at the agent) of a non-local characteristic (to the agent) along the path between the initiating agent and any agent.
[0075]
In some cases, it may be desirable to compare the retained cost value with the received cost value from one signal to determine which to use in determining a new signal to transmit from the received signal . In this way, the best accumulated cost value received at each agent is retained, and each time a signal is passed to the agent, the received accumulated cost value is determined by the retained cost value to determine which is higher. It is possible to compare with value. For this to happen, it is assumed that the same initiating agent periodically retransmits the signal to be propagated across multiple agents.
[0076]
This may be desirable, for example, as part of an update process to continually update the accumulated cost value at each agent to track changes in the area covered by the plurality of agents. .
[0077]
The signal processing step 204 may also include the use of a time stamp to provide a mechanism for the cost value to expire over time. As cost values are generated and held in memory, they may be time stamped. The most recently timed and kept cost value can be used in processing the signal.
[0078]
However, after a predetermined period of time, the stored cost value is removed from memory. By using cost value expiration, cost value information is retained over a period of time, but it also expires after a certain amount of time has passed and the system One advantage is provided by ensuring that information that is no longer used is flushed.
[0079]
In addition to simply storing the final cost value in memory, multiple cost values are stored in memory in a stack-like configuration, with the cost values ranked (or accessed) in a certain order. The order of the cost values can be used, for example, from highest to lowest or vice versa, or some other ranking criterion.
[0080]
In this scenario, the newly received cost value is also recorded at that time and stored in memory. When the current cost value expires, the next best cost value stored in memory is used until it expires, and so on as long as the cost value remains in memory. . This mechanism allows the cost value stored in memory to follow the quasi-decay behavior before being erased.
[0081]
The agent can be equipped with means for determining the direction in which the signal was received. This can be achieved by using a directional communication mechanism such as directional infrared transceivers or directional radio receivers. An agent so equipped can store both direction and cost value for a single pass to an adjacent agent. To determine the optimal path to the initiating agent, rather than searching by neighbors for the one with the best cost value, each agent can save the direction to the next agent along the path.
[0082]
The direction in which the signal was received can also be factored into the cost value. For example, a path introduced to a particular destination may be less desirable than one introduced to another destination. Thus, the direction can be factored into the cost value to either increase or decrease the cost of a path in a certain direction. The agent 100 can also incorporate a mechanism for determining position and direction into the world coordinate system, such as a satellite positioning system (GPS) receiver and a compass to help facilitate direction determination.
[0083]
It is also possible to use multiple initiators, and the paths for each can be compared to determine which is better. For example, if agents are dispersed throughout an area known to have several mines, the signal may be detected based on the detection of mines near one or more agents. It may be desirable to trigger the start of propagation. Each agent can then initiate the propagation of a signal having a cost value.
[0084]
At each agent, the cost value of the signal initiated from each of the initiating agents can be compared to determine not only the optimal path to each, but also the overall optimal path. As a result, even if a plurality of mines may exist in the agent, it is possible to determine the optimum path to any one of the mines. In this case, it may be desirable for each agent in the plurality of agents to use a unique identifier to indicate a cost value along a path to each initiating agent.
[0085]
The cost value of the paths to each initiating agent can then be compared at each agent to determine the best path among them. The cost values are passed individually to each path to each initiating agent, or the cost values are compared and only the best is passed. In other words, each agent in the plurality of agents may receive and also retain a cost value for the path of each initiating agent, or may retain only the best cost value received. It is possible.
[0086]
In addition to easily identifying the path from an arbitrarily selected agent to an initiating agent, it may be desirable to utilize a path between two agents. For example, it may be desirable to have agents line up along one path between two reference agents. A flow diagram illustrating the steps used to line up agents along one path is illustrated in FIG.
[0087]
First, in the reference agent determination step 300, it is necessary to determine two agents from among a plurality of agents to act as reference agents. Next, one signal is transmitted from each of the reference agents. The transmission of the signal and the generation of the path vector are generally shown as a cost gradient generation step 302. The path vector essentially creates a cost value gradient over the area covered by the agents and towards the initiating agent.
[0088]
The path vector is therefore used in a motion vector generation step 306 to create a motion vector. In general, each motion vector represents the sum of the path vectors, although mechanisms other than simply producing a sum can be used. Agents are prompted to move along the continuously updated motion vector until they are positioned along one path between the two reference agents in an agent convergence step 308.
[0089]
The closer the agent moves to the path between the two reference agents, the smaller the motion vector is and the closer the path vector component is to the opposite side. Regarding the designation of the initiating agent, the designation of the reference agent can be done either explicitly by a user command or implicitly by an agent detection event.
[0090]
Once the agent is lined up along the path between the two reference agents, other agents can take action. For example, it may be desirable for the agents to converge to a choke point along a path between the two agents. Specifically, if the agent is designed to detect a fire inside one building and deliver the fire suppressant in the best possible way to stop the fire, the effects of the fire suppressant It may be desirable to use the physical structural components of the building to maximize
[0091]
In order to achieve maximum effectiveness, narrow passages are optimal from a fire suppression agent delivery standpoint. Thus, it is possible to determine two reference agents that are present on either side of the narrow passage and to line up the other agent on one path between the reference agents.
[0092]
Next, it would be desirable to have agents lined up along the path search to determine the agent with the least degree of freedom for movement. The agent can be assumed to be in the narrowest part of the passage, and the other agent can be encouraged to head for the agent, and consequently close to the convergence agent Causes local clustering of agents in the cluster. The agent can then deliver the fire suppressant into the narrow passage with maximum effect.
[0093]
When using the lines of a visual communication system, detecting the narrowest part of the path may also be by determining the agent along the path that receives a small number of transmissions from neighboring agents, or Could be achieved by determining the agent receiving the transmission over the smallest angular range.
[0094]
[Further form of the present invention]
Several illustrations illustrating various aspects of the invention are presented to provide a more thorough understanding of how to use them. First, a diagram contrasting the present invention with methods of conventional map-based approaches to ground inference is illustrated in FIG.
[0095]
Next, an illustration of using cost values along a path to determine a path from a departure agent to a target is illustrated in FIG. Next, one example of the use of two distinct signal types to illustrate two different targets is shown in FIG. The alignment of the agents along one path between the two reference agents and the convergence to the choke point are illustrated in FIGS. 7 and 8, respectively.
[0096]
Referring now to FIG. 4, FIG. 4 (a) illustrates some powerful movements through the environment 402 in a search pattern that sends raw or partially processed data back to the central processing resource 404. A conventional means for mapping a space by using a mobile robot or sensor 400 is illustrated. Central processing resource 404 can then edit map 406 and then use the map for other operations.
[0097]
Dynamic changes to the environment will not be picked up until the robot 400 repeats the search. In contrast, as illustrated in FIG. 4 (b), the technique provided by the present invention embodies the map and spreads the map over the entire set of simple agents 100. , Each of which defines its local ground feature. Global properties, including shortest paths, blocked paths, and contingency plans, can be calculated in an error-tolerant and distributed manner, with each member of the plurality of agents 100 contributing to the result.
[0098]
Utilizing the present invention, a distributed group of agents 100 is embedded in the environment to perform both sensing and computational tasks simultaneously. As mentioned above, the agent 100 can be placed in the environment by various means, either randomly or in a specific arrangement. Advantages of using the agent 100 spread throughout an environment include fault tolerance and responsiveness to dynamic changes in the environment.
[0099]
The present invention also provides for the use of a number of relatively small and inexpensive agents that spread throughout an environment, rather than some very expensive robots. Thus, a number of simple agents 100 can be used in one environment with very simple individual computing power to detect dynamic changes there. Since many agents 100 are used, the loss of a single agent 100 has little effect on the overall computing power and can therefore guarantee a high degree of fault tolerance.
[0100]
In addition, communication is only required between the closest neighbors, so there is no need for a high power communication device. Information can be returned to the user 408 by relaying a message, or as the user moves through the environment, the user can obtain data from nearby robots. In addition, the agent 100 can be equipped with means for providing exercise to enable it to actively spread throughout one environment and search for the environment.
[0101]
It is important to note that having multiple small processors spread throughout the environment allows for an improvement in computational speed to the extent that computations can be separated and parallelized. Path planning using existing ground databases takes O (nlogn) time on a single computer, but at worst O (n) time on a network. The best ground point, where the quality of the point can be determined locally, takes O (n) time on a single processor to find it, but O (logn) time on the network. Yes, and in that case most of the time is dedicated to organizing the results. Thus, for large amounts of data, using multiple processors in a distributed network significantly reduces computation time.
[0102]
An illustration of the use of cost value along the path to determine the path from the departure agent to the target is then shown in FIG. With the present invention, it is possible to determine the best path from a starting point 502 inside a building to a target 500. In the illustrated embodiment, the application is a preview inside one building and to locate the target.
[0103]
FIG. 5 (a) is a type of topographic map illustrating a group of agents 100 scattered throughout a building 504, with some members finding the target 500. FIG. It should be noted that the agent 100 has achieved a substantially uniform spacing within the building 504. In this case, the agent 100 includes a suitable sensor for detecting the target.
[0104]
For example, if the target 500 is an invading human, sensors can be provided to detect body temperature, movement, and noise. An illustration depicting a message propagation-like gradient having its cost value encoding the distance to the target is shown in FIG. 5 (b). Upon detecting the target T500, the initiating agent 506, the agent closest to the target T500, sends a signal with a zero cost value of 100. This embodiment illustrates the cost value starting at 100 and decreasing with each hop away from the initiating agent 506.
[0105]
The cost value could be increased very easily from the initiating agent 506 at each hop or, as mentioned earlier, could be varied by a factor representing the local sensor measurement. This could be the distance to the sending agent and could be estimated from the signal strength of the received message.
[0106]
In this embodiment, it is desirable to use a visual communication line between agents, since the internal structure, such as a wall or door, is incorporated into the structural display. Adjacent agents within the visual line of the starting agent 506 receive the signal from the starting agent 506, reduce the cost value, and generate new signals incorporating the reduced cost value. Send the new signal as if it were the initiator.
[0107]
An equal-strength cost value line along which the agent shows all the same cost value, starting at 100 and from 92 to the agent closest to the starting point 502 from the starting agent 506 This is indicated by the line decreasing. This illustrates how the overall cost value of multiple agents forms a gradient-like map. In this case, the line shows the slope of the signal propagating away from the target, while the cost value propagation is the same as the distance information propagation in Dijkstra's shortest path algorithm. The visual communication line is uniquely linked to traversability within the building, and the cost value reflects a similar traverse distance to the target. All agents on the isointensity line are approximately equidistant from the detected target T500. By following the agent whose cost value gradient rises in the previous term, we can expect to find the most efficient path to the target.
[0108]
It may be desirable for the agent 100 to be provided with means for detecting the direction in which the received message was emitted. Whenever a new message is received, the cost value is updated and the corresponding direction to the message source is saved. Using this information, it is possible for each agent to indicate the local direction in which the best is towards the target.
[0109]
A diagram illustrating an agent transmitting a direction signal to an originator based on a cost value message and creating a directed path to the target is shown in FIG. 5 (c). Starting at the starting point 502, the information is provided to the user leading to a single path leading the user to the target T500, while incorporating information about the target location and a passage in the building 504.
[0110]
One example of the use of two distinct signal types to detect two different targets is illustrated in FIG. Although two distinct signal types are used in FIG. 6, any number of signal types can be used as long as each signal type forms a distinct cost value gradient. is there. The cost value gradient includes the distance and direction between each agent and the target or targets. FIG. 6A illustrates that using two message types allows each agent 100 to determine which target is closest. Each of the agents 600, 602, 604, 608 detects which of the first target 610 or the second target 612 is closest.
[0111]
This allows the user to determine which target is closest. In this case, the agent 600 detects the first target 610. The resulting distance to the first target 610 is calculated by each of the other agents 602, 604, 606, 608. At the same time, the agent 608 detects the second target 612. The resulting distance to the second target 612 is also calculated by each of the other agents 602, 604, 606, 608.
[0112]
Distinct message types can be used for each target. For hops, the distances are shown for each agent, separated by a slash, the first distance indicates the distance from the first target 610, and the second distance is the distance from the second target 612. Is shown. Using a distinct message type for each target, the cost value at each agent can be easily compared to determine which target is closest to that agent.
[0113]
Alternatively, a diagram is shown in FIG. 6 (b), where each agent is shown using one message type indicating the distance to the nearest target. In this case, the same message type can be used for both targets, and each agent will only store the distance to the closest agent. As shown in FIG. 6 (b), use a message scheme holding directional information, as discussed above, so that the direction to the nearest target can also be found. Doing it will help.
[0114]
The alignment of the agents along the path between the two reference agents and the convergence to the choke point are illustrated in FIGS. 7 and 8, respectively. As shown in FIG. 7, departure agent 700 and target agent 702 have been designated as reference agents. Each of the reference agents emits a distinct message type to other agents. The other agents 704 generate path vectors for each of the reference agents.
[0115]
The path vector is then the motion vector for each of the other agents 704, along which the line vector is to be lined up on one path 706 between the departure agent 700 and the target agent 702. The other agents 704 can move, but are used to create Note that the arrow represents the path vector between the agent 704 and the reference agent.
[0116]
As the agent 704 moves toward the path 706, the agent 704 arrives at the path and the path 706 moves in a direction opposite to the directions of the departure agent 700 and the target agent 702. The path vector is updated until the vectors are aligned in parallel to.
[0117]
The illustration presented in FIG. 8 illustrates the selection of reference agents to prepare agents to converge along a choke point, and the selection of other agents along one path between them. Utilize aligning. The convergence along the choke point can be performed without first aligning the agents along one path between the reference agents, but by aligning along the path along the path. It supplies the agent at a very high density, allowing more precise detection of the characteristics along the path for the selection of an appropriate choke point.
[0118]
As shown in FIG. 8 (a), the agent at the starting point 800 and the agent at the target point 802 are designated as reference agents, while the path 804 between them is indicated by a dotted line. . In this case, the agent 100 moves toward a point along the bus closest to them, as indicated by the arrow on the agent 100.
[0119]
In FIG. 8B, almost all of the agents are aligned along the path 804. In this case, blocking in the narrowest area between the starting point and the target point is considered desirable because the blocking area is closest to the target point.
[0120]
Thus, two choke points 806 and 808 are specified, and the agent converges on each. It should be noted that the criteria used to determine the choke point in this case are (1) the narrowness of the corridor (area) and (2) the proximity of the corridor to block against the target point. Is a point.
[0121]
Thus, the agent first converges on choke point 806 until no more agents fit in the area and then converges on choke point 808. Another narrow path area 810 is not along the path between the starting point 800 and the target point 802, where the agent does not converge. The area for convergence can be selected in many different ways based on many different criteria. For example, narrowness can be a criterion for blocking an area, while steep slopes can be a criterion against blocking an area. Thus, once an area is sufficiently sloping, the fact that it is narrow is not a problem, and may not need to be blocked.
[0122]
All logic associated with the incorporation of the present invention is typically embodied in the form of software stored in the agent's memory and operated by a processor. However, the logic of the present invention may be fixed in hardware rather than using software. To provide a more complete description of the software and a preferred embodiment thereof, a compact disc is provided herein as an appendix, and some of the features described herein are provided. Or software that performs this.
[Brief description of the drawings]
These and other features and aspects and advantages of the present invention will become better understood with regard to the following description, appended claims, and accompanying drawings where: FIG. That is,
FIG. 1 illustrates one embodiment of a typical agent used in connection with the present invention;
FIG. 2 is a flow diagram illustrating the steps in the method of the present invention, illustrating a method for establishing a best cost value path for at least one initiating agent;
FIG. 3 is a flow diagram illustrating additional steps of the method used to align agents along one path between two reference agents;
FIG. 4 (a) is an illustration illustrating a method of a conventional map-based approach to ground inference;
FIG. 4 (b) is an illustration illustrating the method of the ground inference approach of the present invention, in contrast to the conventional approach illustrated in FIG. 4 (a);
FIG. 5 (a) is a terrain map showing a group of agents scattered within a building with several members searching for their targets;
Figure 5 (b) is an illustration depicting the message being encoded as a gradient with its cost value, encoding the distance to the target;
FIG. 5 (c) illustrates an agent communicating directions to the target based on a cost value message and creating a path direction to the target;
FIG. 6 (a) shows that each agent uses two message types that allow it to determine which target is closest;
FIG. 6 (b) shows that each agent uses one message type to indicate the direction to the nearest target;
FIG. 7 is an illustration showing the use of a path vector to align agents along one path between two reference agents;
FIG. 8 (a) illustrates the creation of one path between two reference agents and the preparation of the other agents to converge to a choke point along the path, FIG. 5 is an explanatory diagram showing that a motion vector is generated from a path vector that prompts another agent to be executed;
FIG. 8B is an explanatory diagram showing a path between reference agents once the agents are arranged along the path;
FIG. 8 (c) is an illustration of the agent after the agent has converged to a choke point along the path;
[Explanation of symbols]
100: agent, 102: communication capability, 104: one set of trucks
106: 1 set of wheels, 108: metal detector, 110: GPS receiver
112: infrared detector, 114: gas chromatography,
400: movable robot or sensor, 402: environment,
404: central processing resource, 406: map, 408: user
500: target, 502: starting point, 504: building,
506: startup agent,
600, 602, 604, 606, 608: Agent,
610: first target, 612: second target,
700: departure agent, 702: departure agent,
704: another agent, 706: path,
800: starting point, 902: target point, 804: pass,

Claims

A method for calculating a characteristic of the physical environment using a plurality of agents located in a physical environment, each agent comprising: a communication capability; a processor coupled to the communication capability; A method comprising utilizing a physical property of inter-agent communication to form a distributed representation of a non-local property of the physical environment, comprising: a memory coupled to a processor;
a. Triggering at least one initiator agent;
b. Sending a signal from the initiating agent to an agent adjacent to the initiating agent;
c. Processing the signal at each of the neighboring agents by a cost value based on local characteristics along a respective path between the initiating agent and each neighboring agent;
d. Selecting a new signal to be transmitted from the received signals;
e. Treating the neighboring agent as a triggering agent of the processed signal and repeating the transmitting and processing steps; and f. At each agent locally retaining at least one accumulated cost value representing information about non-local properties of the physical environment based on the received signal;
The method comprising:

2. The method of calculating characteristics of a physical environment according to claim 1, wherein, in the processing step, the cost value is a hop count, wherein an accumulated hop count from the initiating agent is generated at each agent. 2. The method of claim 1, wherein in the processing step, the hop count is increased at each agent.

2. A method for calculating a characteristic of a physical environment as claimed in claim 1, comprising using the accumulated cost value in a determining step for determining the new signal to be transmitted from the received signal. Item 2. The method according to Item 1.

4. The method of calculating a property of a physical environment as set forth in claim 3, wherein a signal is periodically propagated from the initiating agent across the plurality of agents to determine the cost value for processing the signal. 4. The method of claim 3, including comparing the cost value retained from a past signal with the cost value from a current signal to do so.

4. The method of calculating characteristics of a physical environment as recited in claim 3, wherein the communication path between the agents is a visual path line, thereby transmitting signals across the visual path line. 4. The method of claim 3, wherein the method is used to provide a representation of a traversing presence along the path.

4. A method for calculating a characteristic of a physical environment as set forth in claim 3, wherein the agent includes means for storing the direction in which a particular signal was received over a communication path. 4. The method of claim 3, further comprising: comparing cost values for different paths between two agents; and storing a direction of the path with a best cost.

5. The method of calculating a property of a physical environment according to claim 4, wherein the step of processing the signal further comprises:
a. Sub-step of clocking the cost value when the cost value is held in memory;
b. Specifying a most recently retained time-recorded cost value to be used in processing the signal; and c. Removing the time-recorded cost value from memory after a predetermined time has elapsed;
5. The method according to claim 4, comprising the additional substep of:

8. A method for calculating a characteristic of a physical environment as claimed in claim 7, wherein a plurality of cost values are recorded and ranked from best to worst over time in retaining the cost values in memory. The time value of the time value of the cost value designated for use in processing the signal has expired; and the cost value of the cost value has been specified for use in processing the signal. 8. The method of claim 7, wherein the value is removed from memory, and the next best cost value is designated for use in processing the signal.

The method of calculating a property of a physical environment as recited in claim 1, wherein the method further comprises an agent alignment step, wherein the agent alignment step includes:
a. Sub-step of selecting two agents from the plurality of agents to act as reference agents;
b. Generating a cost gradient comprising a vector representation of an optimal cost path between each agent in the plurality of agents and each reference agent;
c. Summing the vector at each agent to create a motion vector for the agent; and d. Moving the agent along the motion vector to align the agent along a path between the reference agents;
The method of claim 1, comprising:

10. The method of calculating a property of a physical environment as recited in claim 9, wherein the agent alignment step includes triggering upon occurrence of a triggering event.

10. The method of calculating a property of a physical environment according to claim 9, wherein the agent alignment step comprises:
a. Selecting a convergence agent; and b. Sub-steps of prompting other agents towards the convergence agent, resulting in local clustering of agents near the convergence agent;
10. The method of claim 9, comprising:

2. The method of calculating a property of a physical environment according to claim 1, wherein the agent further comprises at least one sensor connected to the processor, wherein the sensor provides an output and is used in the signal processing step. The method of claim 1, wherein the performed cost value is developed using the output of the sensor.

13. The method of calculating a property of a physical environment as recited in claim 12, wherein the cost value is used in the determining step to determine a new signal to transmit from the received signal. Item 13. The method according to Item 12.

14. The method of calculating a property of a physical environment as recited in claim 13, wherein a signal including a current signal and at least one past signal is periodically propagated from the initiating agent across the plurality of agents. 14. The method of claim 13, further comprising comparing a cost value retained from a past signal with a cost value from the current signal to determine a cost value for processing the signal. Said method.

14. A method for calculating a property of a physical environment as set forth in claim 13, wherein the agent includes means for storing the direction in which a particular signal was received over a communication path. 14. The method of claim 13, further comprising: comparing cost values for different paths between the two agents; and storing a direction of the path with the best cost.

The method of calculating a property of a physical environment as recited in claim 14, wherein the step of processing the signal further comprises:
a. Sub-step of clocking the cost value when the cost value is held in memory;
b. Specifying a most recently retained time-recorded cost value to be used in processing the signal; and c. Removing the time-recorded cost value from memory after a predetermined time has elapsed;
15. The method of claim 14, comprising the additional substep of

17. The method of calculating a characteristic of a physical environment as recited in claim 16, wherein a plurality of cost values are recorded over time when retaining the cost values in memory; and wherein the time-recorded cost values are removed from memory. 17. The method of claim 16 wherein, when done, another historical cost value is selected to adjust the accumulated cost value message.

14. The method of calculating a characteristic of a physical environment as set forth in claim 13, wherein the communication path between the agents is a visual path line, whereby signals over the line of the visual path are transmitted. 14. The method of claim 13, wherein transmitting comprises using to provide a representation of a traversing presence along the path.

The method of calculating a property of a physical environment as recited in claim 12, wherein the method further comprises an agent alignment step, wherein the agent alignment step includes:
a. Sub-step of selecting two agents from the plurality of agents to act as reference agents;
b. Generating a cost gradient comprising a vector representation of an optimal cost path between each agent in the plurality of agents and each reference agent;
c. Summing the vector at each agent to create a motion vector for the agent; and d. Moving the agent along the motion vector to align the agent along a path between the reference agents;
13. The method according to claim 12, comprising:

20. The method of calculating characteristics of a physical environment as recited in claim 19, wherein the agent aligning step includes triggering upon the occurrence of a triggering event.

21. The method of calculating a property of a physical environment according to claim 20, wherein the agent alignment step comprises:
a. Selecting a convergence agent; and b. Sub-steps of prompting other agents towards the convergence agent, resulting in local clustering of agents near the convergence agent;
21. The method of claim 20, comprising:

An agent used among a plurality of agents located in the physical environment to calculate properties of the physical environment:
a. Communication capability for communicating with other locally spaced agents from among the plurality of agents;
b. Generating a local cost value from the physical characteristics along the path between the agent and another locally spaced agent; and generating the other local value to generate an accumulated cost value. Combining each non-local cost value and the local cost value received in communication with each of the regularly spaced agents to determine the best of the accumulated cost values; A processor coupled with communication capabilities to generate new signals incorporating the best accumulated cost value for transmission to locally spaced agents;
c. A memory connected to the processor for retaining a best accumulated cost value, whereby the plurality of agents are connected to each other to form a distributed representation of a non-local characteristic of the physical environment. Utilizing the physics of communication, at least one of the agents in the plurality of agents is designated as a triggering agent, and the signal is such that the signal is propagated across the plurality of agents and moves away from the triggering agent The agent that has been updated to incorporate the best accumulated cost value, whereby the best path to the initiating agent is saved at each agent in the plurality of agents.

23. An agent used between a plurality of agents located in the physical environment to calculate a property of the physical environment according to claim 22, wherein the non-local cost value is a hop from an initiating agent. A count, wherein the local cost value is an increment of the hop count, and the accumulated cost value is a sum of the local cost value and the best non-local cost value received at the agent to produce an incremental hop count. 23. The agent of claim 22, comprising summing

23. An agent used among a plurality of agents located in the physical environment to calculate a property of the physical environment according to claim 22, wherein the accumulated cost value is calculated from the received signal. 23. The agent of claim 22, including using the processor to determine the best accumulated cost value to send to other locally spaced agents.

25. An agent used between a plurality of agents located in the physical environment to calculate characteristics of the physical environment as set forth in claim 24, wherein the signal is from an initiating agent to the plurality of agents. To determine the best accumulated cost value to be propagated periodically and stored in the memory of the agent to be transmitted to other locally spaced agents. 25. The agent of claim 24, comprising comparing to a stored cost value generated from a newly received signal.

25. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 24, wherein the communication bus between the agents is a visual agent. Transmission of a signal across the line of the visual path to generate a traversable accumulated cost value indication along the path from the initiating agent to the agent. 25. The agent of claim 24, including using in a processor.

25. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 24, wherein the agent is further connected to the memory. Means for determining the direction in which a particular signal is being received throughout the communication path, wherein the direction in which the signal having the best cost value was received is stored in the memory. 25. The agent of claim 24, comprising:

26. An agent used among a plurality of agents located within the physical environment to calculate properties of the physical environment as set forth in claim 25, wherein the processor further comprises: Record the time of the accumulated cost value when retained, specify the accumulated cost value recorded the most recently retained time used in processing the signal, and after a predetermined time has elapsed 26. The agent of claim 25, operative to remove accumulated time value recording the time from memory.

29. An agent used between a plurality of agents located in said physical environment to calculate properties of the physical environment as set forth in claim 28, wherein the plurality of accumulated cost values over time. The processor generates another accumulated cost value from memory to be selected to adjust the accumulated cost value message when removing the accumulated cost value recorded in memory and the time recorded from memory. 29. The agent of claim 28, comprising causing the agent to:

23. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment according to claim 22, further comprising: an agent connected to the processor. 23. The system of claim 22, wherein at least one sensor is included and the sensor provides an output, and wherein the local cost value is developed by the processor using the output of the sensor. Said agent.

31. An agent used among a plurality of agents located in the physical environment to calculate properties of the physical environment according to claim 30, wherein the accumulated cost value is the received signal. 31. The agent of claim 30, including using in the processor to determine the best accumulated cost value to send to other locally spaced agents from .

32. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 31, wherein the signal is from a start agent to the plurality of agents. The accumulated cost value stored in the memory of the agent for transmission to other locally spaced agents. 32. The agent of claim 31, comprising comparing to a stored cost value generated from a newly received signal to determine a cost value.

32. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 31, wherein the communication bus between the agents is visual. Transmitting a signal across the line of the visual path to generate a traversable accumulated cost value indication along one path from the initiating agent to the agent. 32. The agent of claim 31, including being used in the processor of the agent.

32. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 31, wherein the agent is further connected to the memory. Means for determining the direction in which a particular signal is being received throughout the communication bus, wherein the direction in which the signal having the best cost value was received is stored in the memory. 32. The agent of claim 31, comprising:

33. An agent used between a plurality of agents located in the physical environment to calculate properties of the physical environment as set forth in claim 32, wherein the processor further stores the cost value in memory. Record the time of the accumulated cost value when retained, specify the accumulated cost value recorded the most recently retained time used in processing the signal, and after a predetermined time has elapsed 33. The agent of claim 32, operable to remove from the memory the accumulated cost value that recorded the time.

An agent used between a plurality of agents located in said physical environment to calculate properties of the physical environment as set forth in claim 35, wherein the plurality of accumulated cost values over time. The processor generates another accumulated cost value from memory to be selected to adjust the accumulated cost value message when removing the accumulated cost value recorded in memory and the time recorded from memory. 36. The agent of claim 35, comprising causing the agent to: