JP2012205242A

JP2012205242A - Electronic device and information transfer system

Info

Publication number: JP2012205242A
Application number: JP2011070358A
Authority: JP
Inventors: Masamitsu Yanagihara; 政光柳原; Tetsuya Yamamoto; 哲也山本; Masahiro Nei; 正洋根井; Satoru Hagiwara; 哲萩原; Isao Totsuka; 功戸塚; Tomoyuki Matsuyama; 知行松山; Masaichi Sekiguchi; 政一関口
Original assignee: Nikon Corp
Current assignee: Nikon Corp
Priority date: 2011-03-28
Filing date: 2011-03-28
Publication date: 2012-10-22

Abstract

PROBLEM TO BE SOLVED: To provide an electronic device capable of appropriately controlling a sound device. SOLUTION: The electronic device comprises: an acquisition device which acquires an imaging result from at least one imaging device capable of capturing an image including a subject; a controller which controls a sound device installed outside the imaging area of the imaging device, according to the imaging result obtained by the imaging device; and a detection device which detects movement information on the subject based on the imaging result. The controller controls the sound device based on the result obtained by the detection device.

Description

The present invention relates to an electronic device and an information transmission system.

Voice guidance devices that guide a user by voice have been proposed (see, for example, Patent Document 1).

Patent Document 1: JP 2007-45565 A

However, conventional voice guidance devices have the problem that the voice is difficult to hear except from a specific location.

The present invention has been made in view of the above problem, and an object thereof is to provide an electronic device and an information transmission system capable of appropriately controlling an audio device.

An electronic device of the present invention comprises an acquisition device (25) that acquires an imaging result from at least one imaging device (11) capable of capturing an image including a subject, and a control device (25) that controls an audio device (12, 13) provided outside the imaging range of the imaging device in accordance with the imaging result of the imaging device.

In this case, the electronic device may include a detection device (25) that detects movement information of the subject based on the imaging result of the at least one imaging device, and the control device may control the audio device based on the detection result of the detection device. Further, in this case, the control device may control the audio device to warn the subject when it determines, based on the movement information detected by the detection device, that the subject is about to move out of a predetermined area or has moved out of the predetermined area.

In the electronic device of the present invention, the control device may control the audio device when the at least one imaging device images a person different from the subject. The audio device may include a directional speaker. The electronic device may also include a drive control device (25) that adjusts the position and/or orientation of the audio device. In this case, the drive control device may adjust the position and/or orientation of the audio device in accordance with the movement of the subject.

In the electronic device of the present invention, the at least one imaging device may include a first imaging device and a second imaging device, and the first and second imaging devices may be arranged so that part of the imaging range of the first imaging device overlaps part of the imaging range of the second imaging device.

The audio device may include a first audio device provided in the imaging range of the first imaging device and a second audio device provided in the imaging range of the second imaging device, and the control device may control the second audio device when the first audio device is located behind the subject. In this case, the audio device may include a first audio device having a first microphone and a first speaker provided in the imaging range of the first imaging device, and a second audio device having a second speaker provided in the imaging range of the second imaging device, and the control device may control the second speaker when the first imaging device images the subject and a person different from the subject. The control device may also control the first microphone to collect the subject's voice when the first imaging device images the subject.

The electronic device of the present invention may include a tracking device (25) that tracks the subject using the imaging result of the imaging device. The tracking device uses the imaging device to acquire an image of a specific part of the subject and sets that image as a template. When tracking the subject, the tracking device identifies the specific part of the subject using the template and updates the template with a new image of the identified specific part.

In this case, the imaging device may include a first imaging device and a second imaging device whose imaging range overlaps part of the imaging range of the first imaging device. When the first and second imaging devices can image the subject simultaneously, the tracking device may acquire position information of the specific part of the subject imaged by one imaging device, identify the region corresponding to that position information in the image captured by the other imaging device, and use the image of the identified region as the template for the other imaging device. The tracking device may also determine that the subject is in an abnormal state when the size information of the specific part changes by a predetermined amount or more.
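The template-based tracking described above (match the specific part against a template, then replace the template with the newly matched image) can be illustrated with a small sketch. This is only an illustration under stated assumptions: frames are small grayscale images given as lists of lists, matching uses a sum of squared differences, and the function names `best_match`, `crop`, and `track` are hypothetical. The patent text does not specify the matching measure.

```python
def best_match(frame, template):
    """Return the (row, col) of the top-left corner where the template
    fits the frame best, using the sum of squared differences (assumed measure)."""
    th, tw = len(template), len(template[0])
    fh, fw = len(frame), len(frame[0])
    best_pos, best_score = None, float("inf")
    for r in range(fh - th + 1):
        for c in range(fw - tw + 1):
            score = sum(
                (frame[r + i][c + j] - template[i][j]) ** 2
                for i in range(th)
                for j in range(tw)
            )
            if score < best_score:
                best_pos, best_score = (r, c), score
    return best_pos


def crop(frame, pos, height, width):
    """Cut a height x width patch out of the frame at the given position."""
    r, c = pos
    return [row[c:c + width] for row in frame[r:r + height]]


def track(frames, initial_template):
    """Track the specific part through the frames, refreshing the template
    with the newly matched patch after every frame."""
    template = initial_template
    positions = []
    for frame in frames:
        pos = best_match(frame, template)
        positions.append(pos)
        # Update step: the matched region becomes the next template.
        template = crop(frame, pos, len(template), len(template[0]))
    return positions
```

Refreshing the template after each frame lets the tracker follow gradual changes in the appearance of the specific part, at the cost of possible drift if a match is wrong.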

An information transmission system of the present invention is an information transmission system (100) comprising at least one imaging device (11) capable of capturing an image including a subject, an audio device (12, 13) provided outside the imaging range of the imaging device, and the electronic device (20) of the present invention.

To make the present invention easier to understand, the above description uses the reference numerals of the drawings that represent one embodiment. However, the present invention is not limited to this; the configuration of the embodiment described later may be modified as appropriate, and at least part of it may be replaced with other components. Furthermore, components whose arrangement is not specifically limited need not be arranged as disclosed in the embodiment and can be placed at any position where they can perform their function.

The electronic device and the information transmission system of the present invention have the effect of enabling appropriate control of an audio device.

FIG. 1 is a block diagram showing the configuration of a guidance system according to one embodiment.
FIG. 2 is a diagram showing the specific configuration of the imaging device.
FIG. 3 is a perspective view showing the audio unit.
FIG. 4 is a hardware configuration diagram of the main body unit.
FIG. 5 is a functional block diagram of the main body unit.
FIG. 6(a) is a graph showing the relationship between the distance from the front focal point of the wide-angle lens system to the head of the imaged person (subject) and the size of the image (head portion), and FIG. 6(b) is a graph obtained by converting the graph of FIG. 6(a) to height above the floor.
FIG. 7 is a graph showing the rate of change of the image size.
FIGS. 8(a) and 8(b) are diagrams schematically showing how the head size changes with the subject's posture.
FIG. 9 is a diagram showing how the size of the image of the subject's head on the image sensor changes with the subject's position.
FIG. 10 is a diagram schematically showing the relationship between one section of the office and the imaging regions of the imaging devices provided in that section.
FIG. 11 is a diagram (part 1) for explaining the subject tracking process.
FIG. 12 is a diagram (part 2) for explaining the subject tracking process.
FIG. 13 is a diagram (part 3) for explaining the subject tracking process.
FIGS. 14(a) and 14(b) are diagrams (part 1) for explaining the tracking process when four subjects (subjects A, B, C, and D) move within the one section of FIG. 10.
FIGS. 15(a) to 15(c) are diagrams (part 2) for explaining the tracking process when four subjects (subjects A, B, C, and D) move within the one section of FIG. 10.
FIG. 16 is a diagram for explaining a method of controlling the directional speakers when the guide units are arranged along a passage (corridor).
FIG. 17 is a flowchart showing the guidance process in the guidance system.

A guidance system according to one embodiment will now be described in detail with reference to FIGS. 1 to 17. FIG. 1 is a block diagram showing the configuration of the guidance system 100. The guidance system 100 can be installed in offices, commercial facilities, airports, stations, hospitals, museums, and the like; in this embodiment, the case where the guidance system 100 is installed in an office is described as an example.

As shown in FIG. 1, the guidance system 100 includes a plurality of guide units 10a, 10b, ..., a card reader 88, and a main body unit 20. Although FIG. 1 shows two guide units 10a and 10b, the number can be set according to the installation location. For example, FIG. 16 shows four guide units 10a to 10d installed in a passage. The guide units 10a, 10b, ... all have the same configuration. In the following, an arbitrary one of the guide units 10a, 10b, ... is referred to as the guide unit 10.

The guide unit 10 includes an imaging device 11, a directional microphone 12, a directional speaker 13, and a driving device 14.

The imaging device 11 is installed on the ceiling of the office and mainly images the heads of people in the office. In this embodiment, the ceiling height of the office is 2.6 m. That is, the imaging device 11 images a person's head and the like from a height of 2.6 m.

As shown in FIG. 2, the imaging device 11 includes a wide-angle lens system 32 with a three-group configuration, a low-pass filter 34, an image sensor 36 such as a CCD or CMOS sensor, and a circuit board 38 that drives and controls the image sensor. Although not shown in FIG. 2, a mechanical shutter is provided between the wide-angle lens system 32 and the low-pass filter 34.

The wide-angle lens system 32 includes a first group 32a having two negative meniscus lenses, a second group 32b having a positive lens, a cemented lens, and an infrared cut filter, and a third group 32c having two cemented lenses. A diaphragm 33 is disposed between the second group 32b and the third group 32c. In this embodiment, the wide-angle lens system 32 has an overall focal length of 6.188 mm and a maximum angle of view of 80°. The wide-angle lens system 32 is not limited to the three-group configuration; for example, the number of lenses in each group, the lens configuration, the focal length, and the angle of view can be changed as appropriate.

As an example, the image sensor 36 measures 23.7 mm × 15.9 mm and has 4000 × 3000 pixels (12 million pixels). That is, the size of one pixel is 5.3 μm. However, an image sensor with a different size and pixel count may be used as the image sensor 36.
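The pixel size quoted above can be checked from the sensor dimensions and pixel counts. The sketch below is a simple arithmetic illustration; the function name `pixel_pitch_um` is hypothetical. Under this calculation the 5.3 μm figure corresponds to the 15.9 mm side divided by 3000 pixels, while the 23.7 mm side divided by 4000 pixels gives roughly 5.9 μm.

```python
SENSOR_WIDTH_MM = 23.7    # from the text
SENSOR_HEIGHT_MM = 15.9   # from the text
PIXELS_X = 4000           # from the text
PIXELS_Y = 3000           # from the text


def pixel_pitch_um(sensor_mm, pixels):
    """Pixel pitch in micrometres along one side of the sensor."""
    return sensor_mm / pixels * 1000.0


total_pixels = PIXELS_X * PIXELS_Y
pitch_along_height = pixel_pitch_um(SENSOR_HEIGHT_MM, PIXELS_Y)  # about 5.3 um
pitch_along_width = pixel_pitch_um(SENSOR_WIDTH_MM, PIXELS_X)    # about 5.9 um
```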

In the imaging device 11 configured as described above, the light beam entering the wide-angle lens system 32 passes through the low-pass filter 34 and strikes the image sensor 36, and the circuit board 38 converts the output of the image sensor 36 into a digital signal. An image processing control unit (not shown) that includes an ASIC (Application Specific Integrated Circuit) then applies image processing such as white balance adjustment, sharpness adjustment, gamma correction, and gradation adjustment to the digitized image signal, and compresses the image using JPEG or a similar format. The image processing control unit transmits the JPEG-compressed still image to the control unit 25 (see FIG. 5) of the main body unit 20.

The imaging region of the imaging device 11 overlaps the imaging region of the imaging device 11 in the adjacent guide unit 10 (see imaging regions P1 to P4 in FIG. 10). This point is described in detail later.

The directional microphone 12 collects sound arriving from a specific direction (for example, from the front) with high sensitivity. A super-directional dynamic microphone, a super-directional condenser microphone, or the like can be used.

The directional speaker 13 includes an ultrasonic transducer and transmits sound only in a limited direction.

The driving device 14 drives the directional microphone 12 and the directional speaker 13 either together or separately.

In this embodiment, as shown in FIG. 3, the directional microphone 12, the directional speaker 13, and the driving device 14 are provided in an integrated audio unit 50. Specifically, the audio unit 50 includes a unit main body 16 that holds the directional microphone 12 and the directional speaker 13, and a holding unit 17 that holds the unit main body 16. The holding unit 17 rotatably holds the unit main body 16 on a rotation shaft 15b extending in the horizontal direction (the X-axis direction in FIG. 3). The holding unit 17 is provided with a motor 14b that forms part of the driving device 14, and the unit main body 16 (that is, the directional microphone 12 and the directional speaker 13) is driven in the pan direction (horizontal swing) by the rotational force of the motor 14b. The holding unit 17 is also provided with a rotation shaft 15a extending in the vertical direction (Z-axis direction), and the rotation shaft 15a is rotated by a motor 14a (fixed to the ceiling of the office) that forms part of the driving device 14. The unit main body 16 (that is, the directional microphone 12 and the directional speaker 13) is thereby driven in the tilt direction (swing in the vertical direction (Z-axis direction)). A DC motor, a voice coil motor, a linear motor, or the like can be used as the motors 14a and 14b.

The motor 14a can drive the directional microphone 12 and the directional speaker 13 within a range of about 60° to 80° in each of the clockwise and counterclockwise directions from the state in which they face straight down (−90°). The drive range is limited in this way because, when the audio unit 50 is mounted on the office ceiling, a person's head may be directly below the audio unit 50 but is not expected to be directly beside it.

In this embodiment, the audio unit 50 and the imaging device 11 of FIG. 1 are separate units. However, the configuration is not limited to this, and the entire guide unit 10 may be integrated into a single unit and mounted on the ceiling.

Returning to FIG. 1, the card reader 88 is installed, for example, at the office entrance and reads the ID cards held by people who are permitted to enter the office.

The main body unit 20 processes information (data) input from the guide units 10a, 10b, ... and the card reader 88, and performs overall control of the guide units 10a, 10b, ... and the card reader 88. FIG. 4 shows the hardware configuration of the main body unit 20. As shown in FIG. 4, the main body unit 20 includes a CPU 90, a ROM 92, a RAM 94, a storage unit (here, an HDD (Hard Disk Drive) 96a and a flash memory 96b), an interface unit 97, and the like. The components of the main body unit 20 are connected to a bus 98. The interface unit 97 is an interface for connecting to the imaging device 11, the driving device 14, and other parts of the guide unit 10. Various connection standards such as wireless/wired LAN, USB, HDMI, and Bluetooth (registered trademark) can be used for the interface.

In the main body unit 20, the CPU 90 executes a program stored in the ROM 92 or the HDD 96a to realize the functions of the units shown in FIG. 5, namely the voice recognition unit 22, the voice synthesis unit 23, and the control unit 25. FIG. 5 also shows the storage unit 24, which is realized by the flash memory 96b of FIG. 4.

The voice recognition unit 22 performs voice recognition based on feature values of the sound collected by the directional microphone 12. The voice recognition unit 22 has an acoustic model and a dictionary function and uses them to perform voice recognition. The acoustic model stores acoustic features, such as phonemes and syllables, of the spoken language to be recognized. The dictionary function stores phonological information on the pronunciation of each word to be recognized. The voice recognition unit 22 may be realized by the CPU 90 executing commercially available voice recognition software (a program). Voice recognition technology is described in, for example, Japanese Patent No. 4587015 (JP 2004-325560 A).

The voice synthesis unit 23 synthesizes the voice to be emitted (output) by the directional speaker 13. Voice synthesis can be performed by generating phoneme-level speech segments and concatenating them. In principle, with consonants denoted C (Consonant) and vowels denoted V (Vowel), feature parameters and speech segments are stored for small basic units such as CV, CVC, and VCV, and these are concatenated while controlling pitch and duration to synthesize speech. Voice synthesis technology is described in, for example, Japanese Patent No. 3727885 (JP 2003-223180 A).

The control unit 25 controls the main body unit 20 and also the guidance system 100 as a whole. For example, the control unit 25 stores the JPEG-compressed still images transmitted from the image processing control unit of the imaging device 11 in the storage unit 24. Based on the images stored in the storage unit 24, the control unit 25 also controls which of the plurality of directional speakers 13 is used to guide a specific person (subject) in the office.

According to the distance to the adjacent guide unit 10, the control unit 25 controls the driving of the directional microphone 12 and the directional speaker 13 so that the sound collection range and the sound output range overlap at least those of the adjacent guide unit 10. The control unit 25 also drives the directional microphone 12 and the directional speaker 13, and sets the sensitivity of the directional microphone 12 and the volume of the directional speaker 13, so that voice guidance can be provided over a range wider than the imaging range of the imaging device 11. This is because the subject may be given voice guidance using the directional microphone 12 and the directional speaker 13 of a guide unit 10 whose imaging device is not imaging the subject.

The control unit 25 also acquires the card information of the ID card read by the card reader 88 and, based on the employee information and other data stored in the storage unit 24, identifies the person who held the ID card over the card reader 88.

The storage unit 24 stores a correction table (described later) for correcting detection errors caused by distortion of the optical system of the imaging device 11, employee information, images captured by the imaging device 11, and the like.

Next, imaging of the subject's head by the imaging device 11 is described in detail. FIG. 6(a) is a graph showing the relationship between the distance from the front focal point of the wide-angle lens system 32 to the head of the imaged person (subject) and the size of the image (head portion). FIG. 6(b) is a graph obtained by converting the graph of FIG. 6(a) to height above the floor.

As described above, the focal length of the wide-angle lens system 32 is 6.188 mm. Assume that the diameter of the subject's head is 200 mm. When the distance from the front focal point of the wide-angle lens system 32 to the subject's head is 1000 mm (that is, when a person 160 cm tall is standing upright), the diameter of the image of the subject's head formed on the image sensor 36 of the imaging device 11 is 1.238 mm. If the subject's head drops by 300 mm, so that the distance from the front focal point of the wide-angle lens system 32 to the subject's head becomes 1300 mm, the diameter of the image of the subject's head formed on the image sensor is 0.952 mm. In this case, a 300 mm change in head height changes the size (diameter) of the image by 0.286 mm (23.1%).

Similarly, when the distance from the front focal point of the wide-angle lens system 32 to the subject's head is 2000 mm (when the subject is half-crouching), the diameter of the image of the subject's head formed on the image sensor 36 of the imaging device 11 is 0.619 mm. If the subject's head drops by 300 mm from there, the diameter of the image of the subject's head formed on the image sensor is 0.538 mm. In this case, a 300 mm change in head height changes the size (diameter) of the head image by 0.081 mm (13.1%). Thus, in this embodiment, the change (rate of change) in the size of the head image becomes smaller as the distance from the front focal point of the wide-angle lens system 32 to the subject's head increases.
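The image sizes quoted above are consistent with the Newtonian magnification relation, in which the image size equals the focal length times the object size divided by the distance from the front focal point. The following sketch reproduces the figures under that assumption; the function names are hypothetical and lens distortion is ignored. Note that the unrounded percentage for the 2000 mm case is about 13.0%; the 13.1% in the text follows from dividing the rounded values 0.081 and 0.619.

```python
FOCAL_LENGTH_MM = 6.188    # from the text
HEAD_DIAMETER_MM = 200.0   # assumed head diameter, from the text


def head_image_diameter_mm(distance_mm):
    """Image diameter on the sensor for a head at the given distance
    from the front focal point (Newtonian relation: f * D / x)."""
    return FOCAL_LENGTH_MM * HEAD_DIAMETER_MM / distance_mm


def size_change_for_drop(distance_mm, drop_mm=300.0):
    """Return (before, after, absolute change, relative change) of the
    image diameter when the head moves drop_mm further from the lens."""
    before = head_image_diameter_mm(distance_mm)
    after = head_image_diameter_mm(distance_mm + drop_mm)
    change = before - after
    return before, after, change, change / before


for start_mm in (1000.0, 2000.0):
    before, after, change, rate = size_change_for_drop(start_mm)
    print(f"{start_mm:.0f} mm: {before:.3f} mm -> {after:.3f} mm, "
          f"change {change:.3f} mm ({rate * 100:.1f}%)")
```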

In general, the height difference among adults is about 300 mm, and the difference in head size is an order of magnitude smaller than that, but height difference and head-size difference tend to satisfy a certain relationship. The subject's height can therefore be estimated by comparing a standard head size (for example, 200 mm in diameter) with the size of the imaged subject's head. In addition, the ears are generally located about 150 mm to 200 mm below the top of the head, so the height of the subject's ears can also be estimated from the head size. People are usually standing when they enter the office. Thus, if the imaging device 11 installed near the reception desk captures the head image and the subject's height and ear height are estimated at that point, the distance from the front focal point of the wide-angle lens system to the subject can thereafter be determined from the size of the subject's head image. This makes it possible to determine the subject's posture (standing, half-crouching, or fallen) and changes in posture while preserving the subject's privacy. When the subject has fallen, the ears can be estimated to be about 150 mm to 200 mm from the top of the head toward the feet. By using the position and size of the head imaged by the imaging device 11 in this way, the position of the ears can be estimated even if, for example, the ears are hidden by hair.
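As a rough illustration of the estimation described above, the sketch below inverts the image-size relation to recover the distance to the head, converts it to a head height, and classifies the posture. Several points are assumptions rather than details from the text: the front focal point is taken to be at the 2.6 m ceiling height, the posture thresholds (`standing_ratio`, `fallen_ratio`) are hypothetical values chosen only for the example, and all function names are illustrative.

```python
FOCAL_LENGTH_MM = 6.188      # from the text
CEILING_HEIGHT_MM = 2600.0   # from the text; front focal point assumed at this height
STANDARD_HEAD_MM = 200.0     # standard head diameter, from the text


def distance_to_head_mm(image_diameter_mm, head_diameter_mm=STANDARD_HEAD_MM):
    """Distance from the front focal point to the head, from the image size."""
    return FOCAL_LENGTH_MM * head_diameter_mm / image_diameter_mm


def head_top_height_mm(image_diameter_mm, head_diameter_mm=STANDARD_HEAD_MM):
    """Height of the head above the floor under the ceiling assumption."""
    return CEILING_HEIGHT_MM - distance_to_head_mm(image_diameter_mm, head_diameter_mm)


def ear_height_range_mm(head_top_mm):
    """Ear height range for an upright subject: 150 to 200 mm below the head top."""
    return head_top_mm - 200.0, head_top_mm - 150.0


def classify_posture(head_top_mm, standing_head_top_mm,
                     standing_ratio=0.85, fallen_ratio=0.30):
    """Classify posture from the current head height relative to the height
    measured while standing. The thresholds are hypothetical."""
    ratio = head_top_mm / standing_head_top_mm
    if ratio >= standing_ratio:
        return "standing"
    if ratio >= fallen_ratio:
        return "half-crouching"
    return "fallen"
```

With these assumed thresholds, a subject measured at 1600 mm when entering and later observed at a head height of 600 mm would be classified as half-crouching, which matches the 2000 mm distance case in the text.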

FIG. 7 is a graph showing the rate of change in the size of the head image. It shows the rate of change in image size when the position of the subject's head changes by 100 mm from the value on the horizontal axis. As FIG. 7 shows, when the distance from the front focal point of the wide-angle lens system 32 to the subject's head increases by 100 mm from 1000 mm, the image size changes by as much as 9.1%. Therefore, even if two subjects have the same head size, they can easily be distinguished by height as long as their heights differ by about 100 mm. In contrast, when the distance from the front focal point of the wide-angle lens system 32 to the subject's head increases by 100 mm from 2000 mm, the image size changes by 4.8%. Although this is smaller than in the 1000 mm case, it is still large enough to readily identify a change in posture of the same subject.
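The two percentages quoted for FIG. 7 can be reproduced with a simple model in which the image size is inversely proportional to the distance from the front focal point. The sketch below is illustrative only: the inverse-proportional model and the function name `image_size_change_rate` are assumptions, not something the specification states.

```python
def image_size_change_rate(distance_mm: float, shift_mm: float = 100.0) -> float:
    """Fractional decrease in image size when the head moves shift_mm farther away.

    Assumption (not from the source): image size is inversely proportional
    to the distance from the front focal point of the lens.
    """
    return 1.0 - distance_mm / (distance_mm + shift_mm)


for d in (1000.0, 2000.0):
    # Print the change rate as a percentage with one decimal place.
    print(f"{d:.0f} mm -> {image_size_change_rate(d) * 100:.1f} %")
```

Under this model the rate falls as the subject moves farther from the lens, which is why a 100 mm shift is easier to detect at 1000 mm than at 2000 mm.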

In this way, the imaging result of the imaging device 11 of this embodiment makes it possible to detect the distance from the front focal point of the wide-angle lens system 32 to the subject from the size of the subject's head image. Using this detection result, the control unit 25 can determine the subject's posture (standing upright, crouching, or lying down) and changes in posture. This point is described in more detail with reference to FIGS. 8A and 8B.

FIGS. 8A and 8B schematically show how the size of the head image changes with the subject's posture. When the imaging device 11 is mounted on the ceiling and images the subject's head as shown in FIG. 8B, the head is imaged large, as shown in FIG. 8A, when the subject is standing upright like the subject on the left of FIG. 8B, and small, as shown in FIG. 8A, when the subject is lying down like the subject on the right of FIG. 8B. When the subject is crouching, like the subject in the center of FIG. 8B, the head image is smaller than when standing and larger than when lying down. In this embodiment, therefore, the control unit 25 can determine the subject's state by detecting the size of the subject's head image in the images transmitted from the imaging device 11. Because the subject's posture and changes in posture are determined from the head image, privacy is better protected than when the determination uses the subject's face or whole body.
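The posture decision described here can be sketched as a small classifier. Everything numeric in it is an assumption made for illustration: the inverse-proportional relation between image size and distance, the 0.85 and 0.40 thresholds, and the function names do not appear in the specification.

```python
def head_height_mm(image_size_px: float, ref_size_px: float,
                   ref_distance_mm: float, camera_height_mm: float) -> float:
    """Estimate the head height above the floor from the head image size.

    Assumes image size is inversely proportional to the lens-to-head distance,
    calibrated by a reference image taken at a known distance.
    """
    distance_mm = ref_distance_mm * ref_size_px / image_size_px
    return camera_height_mm - distance_mm


def classify_posture(head_mm: float, stature_mm: float) -> str:
    """Classify posture from the head height relative to the known stature.

    The thresholds are hypothetical values chosen only for this sketch.
    """
    ratio = head_mm / stature_mm
    if ratio >= 0.85:
        return "standing"
    if ratio >= 0.40:
        return "crouching"
    return "lying"


# Reference: a 1600 mm tall subject standing under a camera at 2600 mm
# gives a 100 px head image at a distance of 1000 mm.
for size_px in (100.0, 70.0, 50.0):
    h = head_height_mm(size_px, 100.0, 1000.0, 2600.0)
    print(size_px, round(h), classify_posture(h, 1600.0))
```

Only the head size enters the decision, which matches the privacy argument in the text: no face or whole-body features are needed.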

Note that FIGS. 6A, 6B, and 7 show graphs for the case where the subject is at a low field angle of the wide-angle lens system 32 (directly below the wide-angle lens system 32). When the subject is instead at a peripheral field angle of the wide-angle lens system 32, the image may be affected by distortion that depends on the viewing angle toward the subject. This is described in detail below.

FIG. 9 shows how the size of the subject's head image formed on the image sensor 36 changes with the subject's position. The center of the image sensor 36 is assumed to coincide with the optical axis of the wide-angle lens system 32. Even when the subject is standing upright, the size of the head image captured by the imaging device 11 differs, because of distortion, between the case where the subject stands directly below the imaging device 11 and the case where the subject stands away from it. When the head is imaged at position p1 in FIG. 9, the imaging result yields the size of the image captured by the image sensor 36, the distance L1 from the center of the image sensor 36, and the angle θ1 from the center of the image sensor 36. Likewise, when the head is imaged at position p2 in FIG. 9, the imaging result yields the size of the image captured by the image sensor 36, the distance L2 from the center of the image sensor 36, and the angle θ2 from the center of the image sensor 36. The distances L1 and L2 are parameters representing the distance between the front focal point of the wide-angle lens system 32 and the subject's head, and the angles θ1 and θ2 are parameters representing the viewing angle of the wide-angle lens system 32 toward the subject. The control unit 25 corrects the size of the captured image based on the distances L1 and L2 and the angles θ1 and θ2. In other words, it corrects the image so that, when the subject is in the same posture, the size of the image captured at position p1 of the image sensor 36 is substantially equal to the size of the image captured at position p2. In this way, the embodiment can accurately detect the subject's posture regardless of the positional relationship between the imaging device 11 and the subject (the distance to the subject and the viewing angle toward the subject). The parameters used for this correction (a correction table) are assumed to be stored in the storage unit 24.

The imaging interval of the imaging device 11 is set by the control unit 25. The control unit 25 can change the imaging frequency (frame rate) between the time period in which many people are likely to be in the office and other time periods. For example, when the control unit 25 determines that the current time falls within the period in which many people are likely to be in the office (for example, from 9:00 a.m. to 6:00 p.m.), it captures one still image per second (32,400 images per day); when it determines that the current time falls outside that period, it captures one still image every 5 seconds (6480 images per day). The captured still images may be stored temporarily in the storage unit 24 (flash memory 96b); the imaging data for each day, for example, may then be saved to the HDD 96a and subsequently erased from the storage unit 24.
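The time-dependent imaging interval can be sketched as follows. The 9:00 to 18:00 window and the 1 s and 5 s intervals are the example values from the text; the function and constant names are hypothetical.

```python
from datetime import time

# Example busy period from the text: 9:00 a.m. to 6:00 p.m.
BUSY_START = time(9, 0)
BUSY_END = time(18, 0)


def capture_interval_s(now: time) -> int:
    """Return the still-image interval in seconds for the given time of day."""
    return 1 if BUSY_START <= now < BUSY_END else 5


def frames_in_period(start: time, end: time, interval_s: int) -> int:
    """Number of still images captured between start and end on the same day."""
    seconds = (end.hour - start.hour) * 3600 + (end.minute - start.minute) * 60
    return seconds // interval_s


print(capture_interval_s(time(10, 30)))                 # interval during the busy period
print(frames_in_period(BUSY_START, BUSY_END, 1))        # frames captured in the busy period
```

At one frame per second, the nine-hour busy period corresponds to 9 × 3600 = 32,400 still images, which matches the figure given for that period.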

Video may be captured instead of still images. In that case, video may be captured continuously, or short clips of about 3 to 5 seconds may be captured intermittently.

Next, the imaging region of the imaging device 11 will be described.

FIG. 10 schematically shows, as an example, the relationship between one section 43 in the office and the imaging regions of the imaging devices 11 installed in that section 43. In FIG. 10, four imaging devices 11 are assumed to be installed in the section 43 (only their imaging regions P1, P2, P3, and P4 are illustrated). The section is assumed to measure 256 m² (16 m × 16 m). Each of the imaging regions P1 to P4 is assumed to be circular and to overlap the imaging regions adjacent to it in the X direction and the Y direction. For convenience of explanation, FIG. 10 shows the four parts obtained by dividing the section into four (corresponding to the imaging regions P1 to P4, respectively) as divided portions A1 to A4. If the wide-angle lens system 32 has an angle of view of 80° and a focal length of 6.188 mm, the ceiling height is 2.6 m, and the subject's height is 1.6 m, the imaging region is the interior of a circle with a radius of 5.67 m (about 100 m²) centered directly below the wide-angle lens system 32. Because each of the divided portions A1 to A4 measures 64 m², each divided portion can be contained in the imaging region P1 to P4 of the corresponding imaging device 11, and the imaging regions of the imaging devices 11 can partly overlap.
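The coverage figures in this paragraph can be checked with a short calculation. This sketch assumes that the 80° angle of view is measured from the optical axis (a half angle), which is the reading consistent with the stated 5.67 m radius; the function name is hypothetical.

```python
import math


def imaging_radius_m(half_angle_deg: float, ceiling_m: float, stature_m: float) -> float:
    """Radius of the circular imaging region at head height.

    Assumes the lens is at ceiling height and the angle is measured
    from the optical axis, which points straight down.
    """
    return (ceiling_m - stature_m) * math.tan(math.radians(half_angle_deg))


radius = imaging_radius_m(80.0, 2.6, 1.6)
area = math.pi * radius ** 2
# A divided portion is an 8 m x 8 m square; its corners lie at half the diagonal.
half_diagonal = 8.0 * math.sqrt(2.0) / 2.0

print(f"radius = {radius:.2f} m, area = {area:.0f} m^2")
print(f"covers 8 m x 8 m square: {radius >= half_diagonal}")
```

The half diagonal of an 8 m square is about 5.66 m, slightly less than the imaging radius, so a camera centered over a divided portion covers it entirely with a small margin left for overlap.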

FIG. 10 illustrates the overlap of the imaging regions P1 to P4 as seen from the object side. However, the imaging regions P1 to P4 are the regions from which light enters the wide-angle lens system 32, and not all of the light entering the wide-angle lens system 32 reaches the rectangular image sensor 36. In this embodiment, therefore, the imaging devices 11 should be installed in the office so that the imaging regions P1 to P4 of the adjacent image sensors 36 overlap. Specifically, each imaging device 11 may be provided with an adjustment mechanism for its mounting (for example, a slot, an oversized adjustment hole, or a shift optical system that adjusts the imaging position), and the mounting position of each imaging device 11 may be determined by adjusting the overlap while visually checking the images captured by the respective image sensors 36. If, for example, the divided portion A1 shown in FIG. 10 coincided exactly with the imaging region of the image sensor 36, the images captured by the respective imaging devices 11 would fit together exactly without overlapping. However, considering the freedom needed when mounting the imaging devices 11 and the possibility that mounting heights differ because of ceiling beams or the like, it is preferable to make the imaging regions P1 to P4 of the image sensors 36 overlap as described above.

The amount of overlap can be set based on the size of a person's head. For example, if the circumference of the head is taken to be 60 cm, the overlapping region should be large enough to contain a circle about 20 cm in diameter. If it is sufficient for only part of the head to fall within the overlapping region, the region may instead be sized to contain a circle about 10 cm in diameter. Setting the overlap to this extent makes adjustment easier when the imaging devices 11 are mounted on the ceiling, and in some cases the imaging regions of the imaging devices 11 can be made to overlap without any adjustment.

Next, the process of tracking a subject using the guide unit 10 (imaging device 11) will be described with reference to FIGS. 11 to 13. FIG. 11 schematically shows a subject entering the office.

First, the processing performed when a subject enters the office will be described with reference to FIG. 11. As shown in FIG. 11, a subject entering the office holds his or her ID card 89 over the card reader 88. The card information acquired by the card reader 88 is transmitted to the control unit 25. The control unit 25 identifies the subject who presented the ID card 89 based on the acquired card information and the employee information stored in the storage unit 24. A subject who is not an employee presents a guest card issued at the general reception, the guard station, or the like, and is therefore identified as a guest.

Once the subject has been identified as described above, the control unit 25 images the subject's head using the imaging device 11 of the guide unit 10 installed above the card reader 88. The control unit 25 then cuts out the image portion presumed to be the head from the image captured by the imaging device 11 and registers it in the storage unit 24 as a reference template.

Methods for extracting the image portion presumed to be a head from an image captured by the imaging device 11 include, for example:
(1) registering templates of head images of multiple subjects in advance and extracting the head portion by pattern matching against these images; and
(2) extracting a circular portion of the expected size as the head portion.

Before the head portion is extracted, the subject may be imaged from the front by a camera installed near the card reader in order to predict where in the imaging region of the imaging device 11 the head will be imaged. In that case, the position of the subject's head may be predicted from the result of face recognition on the camera image, or a stereo camera, for example, may be used as the camera to predict the position of the subject's head. This allows the head portion to be extracted with high accuracy.

The subject's height is assumed to be registered in the storage unit 24 in advance, and the control unit 25 associates that height with the reference template. When the subject is a guest, the height is measured by, for example, the camera described above that images the subject from the front, and that height is associated with the reference template.

The control unit 25 also creates templates in which the magnification of the reference template is changed (composite templates) and stores them in the storage unit 24. As composite templates, the control unit 25 creates templates whose head size corresponds to what the imaging device 11 would capture when the height of the head changes in steps of, for example, 10 cm. When creating the composite templates, the control unit 25 takes into account the relationship between the optical characteristics of the imaging device 11 and the imaging position at which the reference template was acquired.
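As a rough illustration of how the magnifications for the composite templates might be derived, the sketch below computes one scale factor per 10 cm step of head height. It assumes that image size is inversely proportional to the lens-to-head distance and ignores lens distortion, which the control unit 25 would additionally take into account; the function name and the 2600 mm camera height are hypothetical.

```python
def composite_scales(ref_head_height_mm: int, camera_height_mm: int,
                     step_mm: int = 100, min_height_mm: int = 0) -> dict:
    """Map each head height to the magnification applied to the reference template.

    Assumption (not from the source): image size is inversely proportional
    to the distance between the lens and the head, with no distortion.
    """
    ref_distance = camera_height_mm - ref_head_height_mm
    scales = {}
    height = ref_head_height_mm
    while height >= min_height_mm:
        # A lower head is farther from the ceiling camera, so its image is smaller.
        scales[height] = ref_distance / (camera_height_mm - height)
        height -= step_mm
    return scales


scales = composite_scales(ref_head_height_mm=1600, camera_height_mm=2600)
for height in (1600, 1100, 600):
    print(height, round(scales[height], 3))
```

Each scale factor would then be used to resize the reference template image before pattern matching, so that the best-matching scale also indicates the current head height.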

Next, tracking by a single imaging device 11 immediately after the subject enters the office will be described with reference to FIG. 12. After the subject enters the office, the control unit 25 starts continuously acquiring images from the imaging device 11, as shown in FIG. 12. The control unit 25 then performs pattern matching between the continuously acquired images and the reference template (or the composite templates), extracts the portion (head portion) whose score exceeds a predetermined reference value, and obtains the subject's position (the height and the two-dimensional position on the floor plane) from the extracted portion. Suppose that the score exceeds the predetermined reference value when image α in FIG. 12 is acquired. The control unit 25 then takes the position of image α as the subject's position, uses image α as the new reference template, and creates composite templates from the new reference template.

Thereafter, the control unit 25 tracks the subject's head using the new reference template (or composite templates). Each time the subject's position changes, it takes the image obtained at that time (for example, image β in FIG. 12) as the new reference template and creates composite templates from it (that is, it updates the reference template and the composite templates). During such tracking, the size of the head may suddenly decrease; in other words, the magnification of the composite template used for pattern matching may change sharply. In such a case, the control unit 25 may determine that an abnormality has occurred, such as the subject having fallen.
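The abnormality check described above, a sudden drop in the magnification of the best-matching composite template, can be sketched as a comparison between consecutive frames. The 0.7 ratio and the function name are assumptions chosen for illustration; the specification does not give a threshold.

```python
from typing import Optional, Sequence


def detect_sudden_shrink(scales: Sequence[float], drop_ratio: float = 0.7) -> Optional[int]:
    """Return the index of the first frame whose best-matching scale falls
    below drop_ratio times the previous frame's scale, or None if there is none.

    drop_ratio is a hypothetical threshold, not a value from the source.
    """
    for i in range(1, len(scales)):
        if scales[i] < scales[i - 1] * drop_ratio:
            return i
    return None


# Best-matching template scale per frame: the head image shrinks abruptly at frame 3.
print(detect_sudden_shrink([1.0, 0.98, 1.0, 0.55, 0.55]))
# A gradual change, such as slowly crouching, does not trigger the check.
print(detect_sudden_shrink([1.0, 0.95, 0.9, 0.85]))
```

Comparing consecutive frames rather than using an absolute size keeps the check independent of the subject's stature.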

Next, the hand-over process between two imaging devices 11 (the process of switching the reference template and the composite templates) will be described with reference to FIG. 13.

As a premise, suppose that the subject is located between two imaging devices 11 (in the overlapping portion of the imaging regions described above), as shown in FIG. 13, and that the control unit 25 is detecting the position of the subject's head with one (the left) imaging device 11. Suppose also that the reference template at this time is image β in FIG. 13. In this case, the control unit 25 calculates, based on the position of the subject's head, where in the imaging region of the other (right) imaging device 11 the head will be imaged. The control unit 25 then takes the image at the position where the head should be imaged in the imaging region of the other (right) imaging device 11 (image γ in FIG. 13) as the new reference template and generates composite templates from it. In subsequent tracking using the right imaging device 11, tracking proceeds as in FIG. 12 while the reference template (image γ) is updated.
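The hand-over step can be sketched as a coordinate conversion: the head position known from one camera is expressed in floor coordinates and then mapped into the neighboring camera's local frame. The `Camera` class, its fields, and the camera centers used below are hypothetical, and distortion is ignored.

```python
import math
from dataclasses import dataclass
from typing import Optional, Sequence, Tuple


@dataclass(frozen=True)
class Camera:
    """Hypothetical ceiling camera with a circular imaging region on the floor plane."""
    name: str
    cx_m: float
    cy_m: float
    radius_m: float

    def sees(self, x_m: float, y_m: float) -> bool:
        return math.hypot(x_m - self.cx_m, y_m - self.cy_m) <= self.radius_m

    def to_local(self, x_m: float, y_m: float) -> Tuple[float, float]:
        """Offset of a floor position from the point directly below this camera."""
        return (x_m - self.cx_m, y_m - self.cy_m)


def handover_target(current: Camera, cameras: Sequence[Camera],
                    x_m: float, y_m: float) -> Optional[Tuple[Camera, Tuple[float, float]]]:
    """Find another camera that also sees the head and where the head appears in it."""
    for cam in cameras:
        if cam is not current and cam.sees(x_m, y_m):
            return cam, cam.to_local(x_m, y_m)
    return None


left = Camera("left", 4.0, 4.0, 5.67)
right = Camera("right", 12.0, 4.0, 5.67)
print(handover_target(left, [left, right], 8.0, 4.0))
```

The local offset returned for the second camera indicates where to cut out the new reference template (image γ in the text) from that camera's image.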

By performing the processing described above and updating the reference template as needed, the subject can be tracked within the office.

Next, the tracking process when four subjects (subjects A, B, C, and D) move within the section 43 of FIG. 10 will be described with reference to FIGS. 14 and 15. During the tracking process, the control unit 25 updates the reference template as needed, as in FIGS. 12 and 13.

FIG. 14A shows the state at time T1. FIGS. 14B to 15C show the states after time T1 (at times T2 to T5).

At time T1, subject C is in the divided portion A1, and subjects A and B are in the divided portion A3. The imaging device 11 with the imaging region P1 images the head of subject C, and the imaging device 11 with the imaging region P3 images the heads of subjects A and B.

Next, at time T2, the imaging device 11 with the imaging region P1 images the heads of subjects B and C, and the imaging device 11 with the imaging region P3 images the heads of subjects A and B.

In this case, the control unit 25 recognizes, from the imaging results of the imaging devices 11 at times T1 and T2, that subjects A and C are moving in the left-right direction of FIG. 14B and that subject B is moving in the up-down direction of FIG. 14B. Subject B is imaged by two imaging devices 11 at time T2 because subject B is in the portion where the imaging regions of the two imaging devices 11 overlap. In the state of FIG. 14B, the control unit 25 performs the hand-over process of FIG. 13 (switching the reference template and the composite templates between the two imaging devices 11) for subject B.

Next, at time T3, the imaging device 11 with the imaging region P1 images the heads of subjects B and C, the imaging device 11 with the imaging region P2 images the head of subject C, the imaging device 11 with the imaging region P3 images the head of subject A, and the imaging device 11 with the imaging region P4 images the heads of subjects A and D.

In this case, at time T3 (FIG. 15A), the control unit 25 recognizes that subject A is at the boundary between the divided portions A3 and A4 (moving from A3 to A4), that subject B is in the divided portion A1, that subject C is at the boundary between the divided portions A1 and A2 (moving from A1 to A2), and that subject D is in the divided portion A4. In the state of FIG. 15A, the control unit 25 performs the hand-over process of FIG. 13 (switching the reference template and the composite templates between the two imaging devices 11) for subjects A and C.

Similarly, at time T4 (FIG. 15B), the control unit 25 recognizes that subject A is in the divided portion A4, subject B is in the divided portion A1, subject C is in the divided portion A2, and subject D is between the divided portions A2 and A4. In the state of FIG. 15B, the control unit 25 performs the hand-over process of FIG. 13 (switching the reference template and the composite templates between the two imaging devices 11) for subject D. At time T5 (FIG. 15C), the control unit 25 recognizes that subject A is in the divided portion A4, subject B is in the divided portion A1, subject C is in the divided portion A2, and subject D is in the divided portion A2.

In this embodiment, because the imaging regions of the imaging devices 11 partly overlap as described above, the control unit 25 can recognize the position and direction of movement of each subject. The control unit 25 can thus track each subject in the office continuously and with high accuracy.

Next, the method by which the control unit 25 controls the directional speakers 13 will be described with reference to FIG. 16. FIG. 16 illustrates the case where the guide units 10 are arranged along a passage (corridor), and the regions indicated by dash-dot lines represent the imaging ranges of the imaging devices 11 of the respective guide units 10. In FIG. 16 as well, the imaging ranges of adjacent imaging devices 11 are assumed to overlap.

In this embodiment, when the subject moves from position K1 toward position K4 (in the +X direction) as shown in FIG. 16, the control unit 25 provides voice guidance to the subject using the directional speaker 13 of the guide unit 10a while the subject is at position K1 (see the thick solid arrow extending from the guide unit 10a).

When the subject is at position K2, on the other hand, the control unit 25 provides voice guidance not from the guide unit 10a, whose imaging device 11 is imaging the subject (see the thick dashed arrow extending from the guide unit 10a), but from the directional speaker 13 of the guide unit 10b, whose imaging device 11 is not imaging the subject (see the thick solid arrow extending from the guide unit 10b).

The directional speakers 13 are controlled in this way for the following reason. When the subject is moving in the +X direction, voice guidance from the directional speaker 13 of the guide unit 10a would reach the subject from behind the ears, whereas if the control unit 25 adjusts the orientation of the directional speaker 13 of the guide unit 10b and provides the guidance from there, the guidance reaches the subject from in front of the ears. In other words, when the subject is moving in the +X direction, selecting a directional speaker 13 located farther in the +X direction than the subject allows voice guidance to be delivered from in front of the subject's face. The control unit 25 may also select a directional speaker 13 so that the guidance is delivered from the subject's side. That is, the control unit 25 only needs to select a directional speaker 13 that avoids delivering voice guidance from behind the subject's ears.
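The selection rule in this paragraph, preferring a speaker located ahead of the subject in the direction of travel, can be sketched in one dimension along the corridor. The function name and the speaker coordinates below are hypothetical.

```python
from typing import Optional, Sequence


def select_speaker(subject_x_m: float, direction: int,
                   speaker_xs_m: Sequence[float]) -> Optional[int]:
    """Return the index of the nearest speaker ahead of the subject along X.

    direction is +1 for movement in the +X direction and -1 for the -X direction.
    Returns None when no speaker lies ahead of the subject.
    """
    ahead = [
        (abs(x - subject_x_m), i)
        for i, x in enumerate(speaker_xs_m)
        if (x - subject_x_m) * direction > 0
    ]
    return min(ahead)[1] if ahead else None


# Hypothetical speaker positions for guide units 10a to 10d along the corridor.
speakers = [0.0, 5.0, 10.0, 15.0]
print(select_speaker(2.0, +1, speakers))  # subject past 10a, moving in +X
print(select_speaker(7.0, -1, speakers))  # subject moving in -X
```

With the subject just past the first guide unit and moving in the +X direction, the sketch picks the second unit, so the sound arrives from in front of the subject rather than from behind.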

When the subject is at position K4, the control unit 25 provides voice guidance to the subject using the directional speaker 13 of the guide unit 10d. The directional speakers 13 are controlled in this way because, if voice guidance were provided at position K4 using the directional speaker 13 of the guide unit 10c (see the thick dashed arrow extending from the guide unit 10c), the guidance might be overheard by another person near the subject.

In this embodiment, the control unit 25 thus selects, based on the imaging result of at least one imaging device 11, a directional speaker 13 with which the voice guidance is unlikely to be overheard by others. Even when another person is nearby, as at position K4, the subject may make an inquiry through a directional microphone 12. In such a case, the words spoken by the subject may be picked up using the directional microphone 12 of the guide unit 10c that is imaging the subject (the directional microphone 12 closest to the subject). However, this is not a limitation; the control unit 25 may instead pick up the subject's words using a directional microphone 12 located in front of the subject's mouth.

Each guide unit 10 may be started (powered on) only when needed. For example, once the guide unit 10a has imaged a visitor and it has been determined that the visitor is moving toward the +X side in FIG. 16, the guide unit 10b adjacent to the guide unit 10a may be started. In this case, the guide unit 10b need only have started before the visitor reaches the area where the imaging range of the imaging device 11 of the guide unit 10a overlaps that of the imaging device 11 of the guide unit 10b. The guide unit 10a may power off, or enter an energy-saving mode (standby mode), once it can no longer image the visitor.
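The on-demand start-up described above can be illustrated with a minimal sketch. It assumes guide units arranged along the X axis with known imaging ranges; the `GuideUnit` class, the coordinate values, and the function name are hypothetical and do not appear in the specification.

```python
from dataclasses import dataclass


@dataclass
class GuideUnit:
    name: str
    x_min: float  # start of the imaging range along X (hypothetical, metres)
    x_max: float  # end of the imaging range along X
    powered: bool = False


def update_power(units, visitor_x, moving_positive_x):
    """Keep on the units that currently image the visitor plus the next unit
    in the direction of travel; put every other unit on standby."""
    seeing = [i for i, u in enumerate(units) if u.x_min <= visitor_x <= u.x_max]
    keep_on = set(seeing)
    step = 1 if moving_positive_x else -1
    for i in seeing:
        j = i + step
        if 0 <= j < len(units):
            keep_on.add(j)  # pre-start the neighbour before the overlap is reached
    for i, u in enumerate(units):
        u.powered = i in keep_on
    return [u.name for u in units if u.powered]


units = [
    GuideUnit("10a", 0.0, 4.0),
    GuideUnit("10b", 3.0, 7.0),
    GuideUnit("10c", 6.0, 10.0),
    GuideUnit("10d", 9.0, 13.0),
]
```

With these illustrative ranges, a visitor at x = 1 moving toward +X keeps 10a and 10b powered; once the visitor is visible only to 10b, unit 10a drops to standby and 10c is started.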

The audio unit 50 shown in FIG. 2 may also be provided with a drive mechanism capable of moving the unit main body 16 in the X-axis and Y-axis directions. In this case, using the drive mechanism to move the directional speaker 13 to a position from which sound can be output from in front of (or beside) the subject, or to a position from which the sound will not be overheard by others, makes it possible to reduce the number of directional speakers 13 (audio units 50).

Although FIG. 16 shows guide units 10 arranged along a single axis (the X-axis direction), the same control can be performed when guide units 10 are additionally arranged along the Y-axis direction.

Next, the processing and operation of the guidance system 100 of the present embodiment will be described in detail with reference to FIG. 17. FIG. 17 is a flowchart showing the guidance processing that the control unit 25 performs for a subject. In the present embodiment, guidance processing for a visitor (subject) who comes to an office is described as an example.

In the process of FIG. 17, the control unit 25 first performs reception processing in step S10. Specifically, when a visitor arrives at the reception (see FIG. 11), the control unit 25 captures an image of the visitor's head with the imaging device 11 of the guide unit 10 provided on the ceiling near the reception, and generates a reference template and a composite template. The control unit 25 also identifies, from information registered in advance, the areas the visitor is permitted to enter, and announces the meeting location from the directional speaker 13 of the guide unit 10 near the reception. In this case, the control unit 25 causes the voice synthesis unit 23 to synthesize voice guidance such as “XX, who is in charge, is waiting for you in the fifth reception room, so please proceed down the corridor”, and outputs the synthesized voice from the directional speaker 13.

Next, in step S12, the control unit 25 tracks the visitor by imaging the visitor's head with the imaging devices 11 of the plurality of guide units 10, as described with reference to FIGS. 12 to 15. During tracking, the reference template is updated as needed, and a composite template is likewise created as needed.

Next, in step S14, the control unit 25 determines whether the visitor has left through the reception. If the determination is affirmative, the entire process of FIG. 17 ends; if it is negative, the process proceeds to step S16.

Next, in step S16, the control unit 25 determines whether the visitor needs guidance. For example, the control unit 25 determines that guidance is needed when the visitor approaches a branch point on the way to the fifth reception room (such as a position where the visitor needs to turn right). The control unit 25 also determines that guidance is needed when, for example, the visitor asks a question such as “Where is the restroom?” toward the directional microphone 12 of a guide unit 10, or when the visitor has stood still for a predetermined time (for example, about 3 to 10 seconds).
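The three example conditions in step S16 can be sketched as a single predicate. This is only an illustration: the `VisitorState` fields and the 2 m branch radius are assumptions made for the example, while the 3-second stop threshold is the lower end of the range given in the text.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class VisitorState:
    distance_to_branch_m: float      # distance to the next branch point on the route
    pending_question: Optional[str]  # recognized speech from the microphone, if any
    stationary_s: float              # seconds the visitor has been standing still


def guidance_needed(state, branch_radius_m=2.0, stop_threshold_s=3.0):
    """Return True if any of the example conditions of step S16 holds."""
    near_branch = state.distance_to_branch_m <= branch_radius_m  # assumed radius
    asked = bool(state.pending_question)
    stalled = state.stationary_s >= stop_threshold_s
    return near_branch or asked or stalled
```

A caller would evaluate this predicate once per tracking update and proceed to steps S20 to S26 only when it returns True.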

Next, in step S18, the control unit 25 determines whether guidance is necessary. If the determination in step S18 is negative, the process returns to step S14; if it is affirmative, the process proceeds to step S20.

In step S20, the control unit 25 confirms the visitor's direction of travel based on the imaging result of the imaging device 11 and estimates the position of the ears (the position of the front of the face). The ear position can be estimated from the height associated with the person (subject) identified at the reception. If no height is associated with the subject, the ear position may instead be estimated from a height derived from, for example, the size of the head imaged at the reception or an image of the subject captured from the front at the reception.

Next, in step S22, the control unit 25 selects, based on the visitor's position, the directional speaker 13 that is to output the voice. As described with reference to FIG. 16, the control unit 25 selects a directional speaker 13 that is located in front of or beside the subject's ears and in a direction from which the voice guidance is not at risk of being overheard by another person near the subject.
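The two selection criteria of step S22 (not behind the ears, and not overheard by a bystander) can be illustrated with a simple 2D geometric sketch. The beam model below (a straight corridor of fixed half-width that overshoots the subject by a fixed distance), the coordinates, and all function names are assumptions for illustration; the specification does not define how the overhearing risk is computed.

```python
import math


def in_front_or_beside(subject, heading, speaker):
    """True if the speaker is not behind the subject (dot product >= 0)."""
    vx, vy = speaker[0] - subject[0], speaker[1] - subject[1]
    return vx * heading[0] + vy * heading[1] >= 0.0


def beam_hits(speaker, subject, other, beam_radius=0.5, overshoot=1.0):
    """True if `other` stands inside the assumed beam from speaker to subject."""
    dx, dy = subject[0] - speaker[0], subject[1] - speaker[1]
    length = math.hypot(dx, dy)
    if length == 0.0:
        return math.dist(other, subject) <= beam_radius
    ux, uy = dx / length, dy / length
    ox, oy = other[0] - speaker[0], other[1] - speaker[1]
    along = ox * ux + oy * uy          # distance along the beam axis
    across = abs(ox * uy - oy * ux)    # distance off the beam axis
    return 0.0 <= along <= length + overshoot and across <= beam_radius


def select_speaker(subject, heading, speakers, others):
    """Pick the nearest speaker that is in front of or beside the subject and
    whose beam reaches no bystander; return None if no speaker qualifies."""
    candidates = [
        (math.dist(pos, subject), name)
        for name, pos in speakers.items()
        if in_front_or_beside(subject, heading, pos)
        and not any(beam_hits(pos, subject, o) for o in others)
    ]
    return min(candidates)[1] if candidates else None


speakers = {"10a": (0.0, 0.0), "10b": (4.0, 0.0), "10c": (8.0, 0.0), "10d": (12.0, 0.0)}
```

In this toy layout, a subject at (2, 0) walking toward +X is assigned the speaker of 10b rather than 10a, which would be behind the ears. A subject at (10, 0) with a bystander at (8.8, 0) is assigned 10d, because the assumed beam from 10c would pass through the bystander.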

Next, in step S24, the control unit 25 adjusts the positions of the directional microphone 12 and the directional speaker 13 with the driving device 14 and sets the volume (output) of the directional speaker 13. Here, the control unit 25 detects the distance between the visitor and the directional speaker 13 of the guide unit 10b based on the imaging result of the imaging device 11 of the guide unit 10a, and sets the volume of the directional speaker 13 based on the detected distance. When the control unit 25 determines from the imaging result of the imaging device 11 that the visitor is walking straight ahead, it adjusts the directional microphone 12 and the directional speaker 13 in the tilt direction with the motor 14a (see FIG. 3). When the control unit 25 determines from the imaging result that the visitor has turned a corner in the corridor, it adjusts the directional microphone 12 and the directional speaker 13 in the pan direction with the motor 14b (see FIG. 3).
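One way to picture the adjustment in step S24 is to derive pan and tilt angles and an output gain from the positions of the speaker and of the subject's estimated ear. The sketch below is an assumption-laden illustration, not the method of the specification: the coordinate convention, the function name, and in particular the gain rule (6 dB more per doubling of distance, as for a free-field point source) are all hypothetical, since the text states only that the volume is set from the detected distance.

```python
import math


def aim_and_gain(speaker_xyz, ear_xyz, ref_distance_m=1.0):
    """Return (pan_deg, tilt_deg, gain_db) for a ceiling speaker aimed at the ear.

    pan is measured in the floor plane from the +X axis; tilt is the angle
    below horizontal. The gain rule is an assumed inverse-distance compensation.
    """
    dx = ear_xyz[0] - speaker_xyz[0]
    dy = ear_xyz[1] - speaker_xyz[1]
    drop = speaker_xyz[2] - ear_xyz[2]  # how far the ear is below the speaker
    horizontal = math.hypot(dx, dy)
    pan_deg = math.degrees(math.atan2(dy, dx))
    tilt_deg = math.degrees(math.atan2(drop, horizontal))
    distance = math.hypot(horizontal, drop)
    gain_db = 20.0 * math.log10(max(distance, ref_distance_m) / ref_distance_m)
    return pan_deg, tilt_deg, gain_db
```

For a visitor walking straight along the corridor beneath a row of speakers, only the tilt term changes from frame to frame, which matches the use of the motor 14a in that case; the pan term becomes relevant once the visitor turns a corner.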

Next, in step S26, the control unit 25 gives the visitor guidance or a warning in the state adjusted in step S24. For example, when the visitor reaches a branch point at which a right turn is required, the control unit 25 provides voice guidance such as “Please turn right.” When the visitor has said something such as “Where is the restroom?”, the control unit 25 causes the voice recognition unit 22 to recognize the voice input from the directional microphone 12, and causes the voice synthesis unit 23 to synthesize a voice that guides the visitor to the nearest restroom within the areas the visitor is permitted to enter. The control unit 25 then outputs the synthesized voice from the directional speaker 13. When the visitor has entered (or is about to enter) an area the visitor is not permitted to enter (a security area), the control unit 25 issues voice guidance (a warning) such as “Please refrain from entering this area” through the directional speaker 13. Because the present embodiment uses the directional speaker 13, voice guidance can be delivered appropriately only to the person who needs it.

After step S26 is completed, the process returns to step S14, and the subsequent processing continues until the visitor leaves through the reception. As a result, even when a visitor comes to the office, no one needs to spend time escorting the visitor, and the visitor can be prevented from entering a security area or the like. In addition, because the visitor does not need to carry a sensor, the visitor is not inconvenienced.
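The control flow of FIG. 17 (steps S10 to S26) can be summarized as a loop. The sketch below only records which steps run; the observation dictionaries, the `left_reception` key, and the two callbacks are hypothetical placeholders, and tracking (S12) is treated as ongoing in the background with one observation consumed per pass through S14.

```python
def run_guidance(observations, needs_guidance, announce):
    """Walk through the FIG. 17 flow and return the sequence of steps taken."""
    steps = ["S10", "S12"]               # reception processing, then start tracking
    for obs in observations:
        steps.append("S14")              # has the visitor left through the reception?
        if obs.get("left_reception", False):
            return steps                 # affirmative: the whole process ends
        steps.append("S16/S18")          # is guidance needed?
        if not needs_guidance(obs):
            continue                     # negative: back to S14
        steps.extend(["S20", "S22", "S24"])  # estimate ears, pick speaker, aim and set volume
        announce(obs)
        steps.append("S26")              # guidance or warning, then back to S14
    return steps
```

Passing a list of three observations (nothing to do, visitor at a branch point, visitor leaving) exercises each path of the flowchart once.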

As described in detail above, according to the present embodiment, the control unit 25 acquires an imaging result from at least one imaging device 11 capable of capturing an image including the subject and, according to the acquired imaging result, controls a directional speaker 13 provided outside the imaging range of that imaging device 11. Thus, even in a case where outputting sound from the directional speaker 13 provided within the imaging range of the imaging device 11 would deliver the sound from behind the subject's ears and make it hard to hear, outputting the sound from a directional speaker 13 provided outside the imaging range makes the sound easier for the subject to hear. Likewise, when another person is near the subject and might overhear the sound, outputting it from a directional speaker 13 provided outside the imaging range reduces the chance of it being overheard. In short, the directional speaker 13 can be controlled appropriately.

According to the present embodiment, the control unit 25 also detects movement information (such as position) of the subject based on the imaging result of at least one imaging device 11 and controls the directional speaker 13 based on the detection result, so the directional speaker 13 can be controlled appropriately in accordance with the subject's movement information.

According to the present embodiment, the control unit 25 issues a warning to the subject from the directional speaker 13 when it determines, based on the subject's movement information, that the subject is about to move outside the predetermined area (outside the security area) or has moved outside it. This makes it possible to prevent the subject from moving outside the predetermined area without human intervention.
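The two warning conditions (the subject is about to leave the predetermined area, or has already left it) can be sketched by extrapolating the subject's position. The rectangular area, the linear extrapolation, the 2-second look-ahead, and the return labels are assumptions made for illustration; the specification does not say how the prediction is made.

```python
def area_warning(pos, velocity, area, lookahead_s=2.0):
    """Return "outside" if the subject has left the permitted rectangle,
    "leaving" if linear extrapolation puts the subject outside within the
    look-ahead time, and None otherwise.

    area is (x_min, y_min, x_max, y_max); velocity is in the same units per second.
    """
    x_min, y_min, x_max, y_max = area

    def inside(p):
        return x_min <= p[0] <= x_max and y_min <= p[1] <= y_max

    if not inside(pos):
        return "outside"
    predicted = (pos[0] + velocity[0] * lookahead_s, pos[1] + velocity[1] * lookahead_s)
    return None if inside(predicted) else "leaving"
```

Either non-None result would trigger the warning from the directional speaker 13 in this sketch.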

According to the present embodiment, the control unit 25 controls the directional speaker 13 when the imaging device 11 has imaged a person other than the subject, so the directional speaker 13 can be controlled appropriately so that the sound is not heard by that other person.

According to the present embodiment, the driving device 14 adjusts the position and/or posture of the directional speaker 13, so the sound output direction of the directional speaker 13 can be set to an appropriate direction (one in which the subject can easily hear the sound).

According to the present embodiment, the driving device 14 adjusts the position and/or posture of the directional speaker 13 in accordance with the subject's movement, so the sound output direction of the directional speaker 13 can be kept appropriate even while the subject moves.

According to the present embodiment, adjacent imaging devices 11 are arranged so that their imaging areas overlap, so the subject can be tracked using the adjacent imaging devices 11 even when the subject moves across the boundary between their imaging areas.

According to the present embodiment, the control unit 25 uses an image of the head portion captured by the imaging device 11 as a reference template and, when tracking the subject, identifies the subject's head portion using the reference template and updates the reference template with the new image of the identified head portion. By updating the reference template in this way, the control unit 25 can track a moving subject appropriately even when the appearance of the head in the image changes.
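The identify-then-update cycle for the reference template can be illustrated with a minimal sketch on small grayscale grids. The matching criterion used here (exhaustive search minimizing the sum of squared differences) is a stand-in chosen for the example; the specification does not state which matching measure the control unit 25 uses, and the function names are hypothetical.

```python
def ssd(patch, template):
    """Sum of squared differences between two equally sized 2D grids."""
    return sum((p - t) ** 2 for pr, tr in zip(patch, template) for p, t in zip(pr, tr))


def crop(frame, top, left, height, width):
    """Cut a height x width window out of a 2D grid."""
    return [row[left:left + width] for row in frame[top:top + height]]


def track_and_update(frame, template):
    """Find the window of `frame` that best matches `template`, then return
    its (row, col) position and the window itself as the updated template."""
    h, w = len(template), len(template[0])
    _, r, c = min(
        (ssd(crop(frame, r, c, h, w), template), r, c)
        for r in range(len(frame) - h + 1)
        for c in range(len(frame[0]) - w + 1)
    )
    return (r, c), crop(frame, r, c, h, w)


frame = [
    [0, 0, 0, 0],
    [0, 0, 9, 8],
    [0, 0, 7, 9],
    [0, 0, 0, 0],
]
position, new_template = track_and_update(frame, [[9, 9], [9, 9]])
```

Here the head patch is found at row 1, column 2, and the returned window replaces the old template, so gradual changes in the head's appearance are absorbed frame by frame.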

According to the present embodiment, when the subject can be imaged simultaneously by a plurality of imaging devices, the control unit 25 acquires position information of the subject's head portion as imaged by one imaging device and, within the image captured by another imaging device, uses the image of the region where the head portion is located as the reference template for that other imaging device. Determining the reference template in this way allows the subject to be tracked appropriately with a plurality of imaging devices even when the head images acquired by the two imaging devices differ (for example, an image β of the back of the head and an image γ of the front of the head).

According to the present embodiment, the control unit 25 determines that the subject is in an abnormal state when the size information of the head portion changes by a predetermined amount or more, so an abnormality of the subject can be detected while the subject's privacy is protected.
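The abnormality check can be sketched as a threshold on the change in apparent head size between two observations. The use of a relative change and the 30% threshold are assumptions for illustration; the text says only that the size information changes by a predetermined amount or more.

```python
def head_size_abnormal(previous_px, current_px, max_relative_change=0.3):
    """Return True when the apparent head size changed by at least the given
    fraction of its previous value (the threshold is a hypothetical choice)."""
    if previous_px <= 0:
        raise ValueError("previous head size must be positive")
    return abs(current_px - previous_px) / previous_px >= max_relative_change
```

Because only the size of the head region is compared, the check does not need a recognizable image of the face, which is consistent with the privacy point made above.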

According to the present embodiment, the control unit 25 acquires the imaging result of an imaging device 11 capable of capturing an image including the subject, and adjusts the position and/or posture of the directional speaker 13 based on size information of the subject (such as ear position, height, and distance from the imaging device 11) detected from the acquired imaging result. The position and posture of the directional speaker 13 can therefore be adjusted appropriately, which makes the sound output from the directional speaker 13 easier for the subject to hear.

According to the present embodiment, the control unit 25 sets the output (volume) of the directional speaker 13 based on the distance between the subject and the imaging device 11, which makes the sound output from the directional speaker 13 easier for the subject to hear.

According to the present embodiment, the control unit 25 provides voice guidance through the directional speaker 13 according to the subject's position, so appropriate voice guidance (or a warning) can be given when, for example, the subject is at a branch point or is in or near a security area.

According to the present embodiment, the control unit 25 corrects the subject's size information based on the positional relationship between the subject and the imaging device 11, which suppresses detection errors caused by distortion in the optical system of the imaging device 11.

In the above embodiment, the imaging device 11 images the subject's head portion; however, the imaging device 11 may instead image the subject's shoulders. In that case, the ear position may be estimated from the height of the shoulders.

In the above embodiment, the directional microphone 12 and the directional speaker 13 are combined into a single unit; however, they may instead be provided separately. Furthermore, a non-directional microphone (for example, a zoom microphone) may be used in place of the directional microphone 12, and a non-directional speaker may be used in place of the directional speaker 13.

In the above embodiment, the guidance system 100 is deployed in an office and guidance processing is performed when a visitor comes to the office; however, the system is not limited to this use. For example, the guidance system 100 may be deployed on the sales floor of a supermarket or department store and used to guide customers to particular sales areas. Similarly, the guidance system 100 may be deployed in a hospital or the like and used to guide patients. For example, when a person undergoes multiple examinations, as in a comprehensive medical checkup, the subject can be guided, improving the efficiency of diagnosis, payment processing, and similar tasks. The guidance system 100 can also be used for guidance in places where quiet is required, such as museums, movie theaters, and concert halls. Moreover, because there is no risk of the voice guidance being overheard by others, the subject's personal information can be protected. Where staff are present at the site in which the guidance system 100 is deployed, the system may provide voice guidance to a subject who needs it and also notify the staff that a subject needs guidance.

In the above embodiment, a card reader 88 is provided at the office reception to identify a person about to enter the office; however, the person may instead be identified by a biometric authentication device using fingerprints, voice, or the like, or by a PIN entry device.

The embodiment described above is a preferred example of the present invention. The invention is not limited to it, however, and various modifications can be made without departing from the gist of the invention.

DESCRIPTION OF SYMBOLS
11 Imaging device
12 Directional microphone
13 Directional speaker
20 Main body
25 Control unit
100 Guidance system

Claims

An electronic device comprising:
an acquisition device that acquires an imaging result from at least one imaging device capable of capturing an image including a subject; and
a control device that controls an audio device provided outside an imaging range of the imaging device according to the imaging result of the imaging device.

The electronic device according to claim 1, further comprising a detection device that detects movement information of the subject based on an imaging result of the at least one imaging device,
wherein the control device controls the audio device based on a detection result of the detection device.

The electronic device according to claim 2, wherein the control device controls the audio device to warn the subject when it is determined, based on the movement information detected by the detection device, that the subject will move outside a predetermined region or that the subject has moved outside the predetermined region.

The electronic device according to claim 1, wherein the control device controls the audio device when the at least one imaging device images a person different from the subject.

The electronic apparatus according to claim 1, wherein the audio device includes a directional speaker.

The electronic apparatus according to claim 1, further comprising a drive control device that adjusts a position and / or posture of the audio device.

The electronic device according to claim 6, wherein the drive control device adjusts a position and / or posture of the audio device according to the movement of the subject.

The electronic device according to any one of claims 1 to 7, wherein the at least one imaging device includes a first imaging device and a second imaging device, and
the first and second imaging devices are arranged such that a part of the imaging range of the first imaging device and a part of the imaging range of the second imaging device overlap.

The electronic device according to claim 8, wherein the audio device includes a first audio device provided in the imaging range of the first imaging device and a second audio device provided in the imaging range of the second imaging device, and
the control device controls the second audio device when the first audio device is located behind the subject.

The electronic device according to claim 8, wherein the audio device includes a first audio device having a first microphone and a first speaker provided in the imaging range of the first imaging device, and a second audio device having a second speaker provided in the imaging range of the second imaging device, and
the control device controls the second speaker when the first imaging device images the subject and a person different from the subject.

The electronic device according to claim 10, wherein the control device collects the voice of the subject by controlling the first microphone when the first imaging device images the subject.

The electronic device according to claim 1, further comprising a tracking device that tracks the subject using the imaging result of the imaging device,
wherein the tracking device acquires an image of a specific portion of the subject using the imaging device and uses the image of the specific portion as a template, and, when tracking the subject, identifies the specific portion of the subject using the template and updates the template with a new image of the identified specific portion.

The electronic device according to claim 12, wherein the imaging device includes a first imaging device and a second imaging device having an imaging range that overlaps a part of the imaging range of the first imaging device, and
the tracking device, when the first imaging device and the second imaging device can simultaneously image the subject, acquires position information of the specific portion of the subject imaged by one of the imaging devices, specifies, within the image captured by the other imaging device, the region corresponding to the position information of the specific portion, and uses the image of the specified region as the template for the other imaging device.

The electronic device according to claim 12 or 13, wherein the tracking device determines an abnormality of the target person when the size information of the specific portion fluctuates by a predetermined amount or more.

An information transmission system comprising:
at least one imaging device capable of capturing an image including a subject;
an audio device provided outside the imaging range of the imaging device; and
the electronic device according to claim 1.